39450
Three ways of understanding the world
> But I want to make a different point. I think Gobry is right that there's something special about controlled experiments, whether or not you want to restrict the word "science" to mean only that. But there are other ways of systematically understanding the world. In fact, I think there are 3 big ones:
51 minutes ago
Is economics a science?
> While I was in Norway to give a talk about macroeconomics, an interdisciplinary group at the University of Oslo also invited me to give a talk about whether economics is a science or not. That's an impossible question, of course, since there's no official definition of what "a science" is. But I did have some thoughts on the matter. Here are the slides from the talk:
economics 
52 minutes ago
#c0ffee is the color
> What real words are actually valid CSS HEX colors? Parsing an English dictionary for entries containing only the letters ABCDEF and limiting the result to words of exactly 6 or 3 letters length (#FFFFFF or #FFF) gives us some interesting results. Use STRICT to limit the list to 100% valid HEX words as opposed to the default output which contains 0157 disguised as OIST (limited/ intuitive 1337). This interface uses #coffee, #teases and #facade, the latter because #faeces rendered a tad too pale. You should listen to this while browsing the colors. Mouseclick copies the HEX code to clipboard.
color  reference  funny 
52 minutes ago
InspiroBot
generates quotes that feel all too real.
from twitter_favs
4 hours ago
Twitter
generates quotes that feel all too real.
from twitter_favs
4 hours ago
Install node & npm on Mac OS X with Homebrew
> Installing node through Homebrew can cause problems with npm for globally installed packages. To fix it quickly, use the solution below. An explanation is also included at the end of this document.
npm  homebrew 
5 hours ago
dplyr 0.7.0
> I’m pleased to announce that dplyr 0.7.0 is now on CRAN! (This was dplyr 0.6.0 previously; more on that below.) dplyr provides a “grammar” of data transformation, making it easy and elegant to solve the most common data manipulation challenges. dplyr supports multiple backends: as well as in-memory data frames, you can also use it with remote SQL databases. If you haven’t heard of dplyr before, the best place to start is the Data transformation chapter in R for Data Science.
data  R  dplyr 
5 hours ago
Police-reported hate crime in Canada, 2015
This Juristat article examines the nature and extent of police-reported hate crime in Canada. Key topics include motivations for hate crime (e.g. race/ethnicity, religion, and sexual orientation), types of offences, geographical comparisons and accused/victim characteristics. The article uses data from the Incident-based Uniform Crime Reporting Survey which gathers data from police records.
canada  data  crime:hate 
5 hours ago
master
fingertools - Ad-hoc programs I had lying around
7 hours ago
pandas-dev/pandas
pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
python  data 
7 hours ago
pandas 2.0 Design Documents — pandas 2.0 Design Docs 0.1 documentation
> These are a set of documents, based on discussions started in December 2015, to assist with discussions around changes to Python pandas’s internal design intended to better accommodate the evolving needs of the growing Python data userbase and to help ensure that pandas remains a relevant and important project in the future.
>
> We also will use this place to collect ideas for things to remove (such as deprecated features, but possibly other things) from the library that don’t necessarily depend on the internal implementation.
python  data 
7 hours ago
nteract/nteract
nteract - :notebook: Desktop notebook app + packages
data 
7 hours ago
jupyterlab_table
- A Jupyter Lab extension for rendering tabular data
data 
7 hours ago
Table Schema
> A simple format to declare a schema for tabular data. The schema is designed to be expressible in JSON.
data 
7 hours ago
Pandas - What's new and what's coming
> Pandas
>
> What's new and what's coming
>
> Joris Van den Bossche, PyParis, June 12, 2017
>
> https://github.com/jorisvandenbossche/talks/
python  pandas 
7 hours ago
Twitter
COME IN HERE AND SAY THAT AGAIN I'LL FUCK YR SHIT UP DRY BOY. LEG-HAVIN ASS LAND BITCH
from twitter_favs
8 hours ago
Twitter
Great news from our fed counterparts at this AM - Fed $$ for FinchWLRT
from twitter_favs
11 hours ago
DataSciencePython
> This repo contains a curated list of Python tutorials for Data Science, NLP and Machine Learning.
python  data 
12 hours ago
Introduction to the Windows Command Line with PowerShell
> This tutorial will introduce you to the basics of Windows PowerShell, the standard command-line interface for Windows computers. If you are a Mac or Linux user, you should check out the Bash introduction instead. If you are already familiar with using Bash, you may be able to get started with PowerShell just by looking at the table at the end of this lesson.
>
> The tutorial is divided into two main sections. In the first section, “Getting Started,” you will learn to do basic desktop tasks like creating and opening files and folders using PowerShell. In the second section, “Doing More,” you will get a glimpse of some of the features that make work on the command line particularly efficient, and learn enough of the basics to be able to explore further on your own. You will also get set up to run Python scripts from the command line.
>
> This tutorial was written for PowerShell 5.0. If you are using an earlier version, you may encounter a few small syntax differences which you should be able to overcome with a little help from a search engine.
PowerShell  tutorial  @followup 
12 hours ago
Designing Data-Intensive Applications
> As software engineers, we need to build applications that are reliable, scalable and maintainable in the long run. We need to understand the range of available tools and their trade-offs. For that, we have to dig deeper than buzzwords.
>
> This book will help you navigate the diverse and fast-changing landscape of technologies for storing and processing data. We compare a broad variety of tools and approaches, so that you can see the strengths and weaknesses of each, and decide what’s best for your application.
book  data  database 
12 hours ago
Log-structured storage
> This morning I’m reading Designing data-intensive applications by Martin Kleppmann.
>
> I’m only a couple chapters in, but it’s already definitely the best thing about databases I’ve ever read. It’s doing an amazing job of
>
> introducing totally new-to-me concepts (like “log-structured storage”)
> explaining what terms like “ACID” mean in a rigorous and clear way (turns out that the “C” in ACID stands for “consistency”, but has nothing to do with either linearizability or “eventual consistency”, it’s actually about maintaining application-level invariants. It’s really helpful to know that there are actually 5 completely unrelated uses of the word “consistency” when talking about databases). He’s also very up front about “this word is used very inconsistently in practice, be careful!”
> explaining how the concepts in the book relate to real-world databases like PostgreSQL, MySQL, Redis, Cassandra, and many many more
> giving references (just the first chapter on storage engines has 65 amazing-looking references), if you want to learn more and go deeper

BOOK: http://dataintensive.net/
book  data  database 
12 hours ago
tenacity
> Tenacity is an Apache 2.0 licensed general-purpose retrying library, written in Python, to simplify the task of adding retry behavior to just about anything. It originates from a fork of retrying.
>
> The simplest use case is retrying a flaky function whenever an Exception occurs until a value is returned.

Documentation: https://www.amazon.ca/Poetic-Edda-Stories-Norse-Heroes/dp/1624663567
python  testing 
12 hours ago
Arranging subplots with ggplot2
> For my recently published paper, I produced not-so-standard figures that show the two step decomposition used in the analysis. Have a look:
ggplot2  R  maps  data:visualization 
12 hours ago
Twitter
This is my favorite so far. I love this AI!
from twitter_favs
12 hours ago
Learning By Shipping
> Many of us have been using the dev builds of iOS 11 and MacOS High Sierra this week. I wanted to share some thoughts on what I think are some of the important advances.
> I have attended WWDC for many years, sometimes as a partner (working on Office), sometimes as competitors (working on Windows), and sometimes just as an interested developer (grad school). There are always a range of emotions coming out of the event. In this era of massively global Apple, every event is galactic in scale yet it is super important to keep in mind that this is still their developer conference.
Apple 
12 hours ago
Z (1969 film)
> Z is a 1969 Algerian-French epic political thriller film directed by Costa-Gavras, with a screenplay by Gavras and Jorge Semprún, based on the 1966 novel of the same name by Vassilis Vassilikos. The film presents a thinly fictionalized account of the events surrounding the assassination of democratic Greek politician Grigoris Lambrakis in 1963. With its satirical view of Greek politics, its dark sense of humor, and its downbeat ending, the film captures the outrage about the military dictatorship that ruled Greece at the time of its making.
>
> Z
>
> Theatrical release poster
> Directed by Costa-Gavras
> Produced by
> Jacques Perrin
> Ahmed Rachedi
> Screenplay by
> Jorge Semprún
> Costa-Gavras
> Based on Z
> by Vassilis Vassilikos
> Starring
> Jean-Louis Trintignant
> Yves Montand
> Irene Papas
> Jacques Perrin
> Music by Mikis Theodorakis
> Cinematography Raoul Coutard
> Edited by Françoise Bonnot
> Production
> company
> Office National pour le Commerce et l'Industrie Cinématographique
> Reggane Films
> Valoria Films
> Distributed by Cinema V (US)
> Release date
> 26 February 1969 (France)
> May 1969 (Cannes)
> Running time
> 127 minutes[1]
> Country
> Algeria
> France
> Language
> French
> Russian
> English
> Box office $14.3 million[2]
> The film stars Jean-Louis Trintignant as the investigating magistrate (an analogue of Christos Sartzetakis who later served as president of Greece from 1985 to 1990). International stars Yves Montand and Irene Papas also appear, but despite their star billing have very little screen time. Jacques Perrin, who co-produced, plays a key role as a photojournalist. The film's title refers to a popular Greek protest slogan (Greek: Ζει, IPA: [ˈzi]) meaning "he lives," in reference to Lambrakis.
movies 
13 hours ago
Zulip
> Every conversation in Zulip has a topic, so it’s easy to keep conversations straight. Are your coworkers discussing a software bug and the content of your website at the same time? No problem.
chat 
13 hours ago
Unconf projects 4: cityquant, notary, packagemetrics, pegax
> Continuing our series of blog posts (day 1, day 2, day 3) this week about unconf 17.
R 
13 hours ago
Digital Archives Database Project
> View thousands of Métis historical documents, genealogical information, Hudson's Bay Company records, Church missionary records, and personal Métis accounts. The database is currently offering 109262 records for public and institutional browsing. Log in or sign up for access to the search engine.
archives  history  Canada 
13 hours ago
A Year of rOpenSci's Unconf
> The rOpenSci Unconf is an annual gathering of individuals who are interested in and excited about using R to perform and promote open science. Some have described it as a hackathon, but I think that’s a mischaracterization. Though a great deal of code is written in a short period of time, a substantial proportion of the attendees make large and important contributions while hardly writing any code. The Unconf is not a competition, and it’s intensely social and collaborative.
R  rOpenSci  conference 
13 hours ago
pyparsing 2.2.0
> The pyparsing module is an alternative approach to creating and executing
> simple grammars, vs. the traditional lex/yacc approach, or the use of
> regular expressions. The pyparsing module provides a library of classes
> that client code uses to construct the grammar directly in Python code.
>
> Here is a program to parse "Hello, World!" (or any greeting of the form
> "<salutation>, <addressee>!"):
>
> from pyparsing import Word, alphas
> greet = Word( alphas ) + "," + Word( alphas ) + "!"
> hello = "Hello, World!"
> print hello, "->", greet.parseString( hello )
>
> The program outputs the following:
>
> Hello, World! -> ['Hello', ',', 'World', '!']
>
> The Python representation of the grammar is quite readable, owing to the
> self-explanatory class names, and the use of '+', '|' and '^' operator
> definitions.
>
> The parsed results returned from parseString() can be accessed as a
> nested list, a dictionary, or an object with named attributes.
>
> The pyparsing module handles some of the problems that are typically
> vexing when writing text parsers:
> - extra or missing whitespace (the above program will also handle
> "Hello,World!", "Hello , World !", etc.)
> - quoted strings
> - embedded comments
>
> The .zip file includes examples of a simple SQL parser, simple CORBA IDL
> parser, a config file parser, a chemical formula parser, and a four-
> function algebraic notation parser. It also includes a simple how-to
> document, and a UML class diagram of the library's classes.

Documentation: http://pythonhosted.org/pyparsing/
python  python:text 
13 hours ago
pyparsing 1.5.7
This is an OLD version.

> The pyparsing module is an alternative approach to creating and executing
> simple grammars, vs. the traditional lex/yacc approach, or the use of
> regular expressions. The pyparsing module provides a library of classes
> that client code uses to construct the grammar directly in Python code.
>
> Here is a program to parse "Hello, World!" (or any greeting of the form
> "<salutation>, <addressee>!"):
>
> from pyparsing import Word, alphas
> greet = Word( alphas ) + "," + Word( alphas ) + "!"
> hello = "Hello, World!"
> print hello, "->", greet.parseString( hello )
>
> The program outputs the following:
>
> Hello, World! -> ['Hello', ',', 'World', '!']
>
> The Python representation of the grammar is quite readable, owing to the
> self-explanatory class names, and the use of '+', '|' and '^' operator
> definitions.

Documentation: http://pythonhosted.org/pyparsing/
python  python:text  @followup 
13 hours ago
undebt
> Undebt is a fast, straightforward, reliable tool for performing massive, automated code refactoring used @Yelp. Undebt lets you define complex find-and-replace rules using standard, straightforward Python that can be applied quickly to an entire code base with a simple command.
>
> To learn about what Undebt is and why we created it, check out our post on the Yelp Engineering Blog.

PyPI: https://pypi.python.org/pypi/undebt

Blog Post: https://engineeringblog.yelp.com/2016/08/undebt-how-we-refactored-3-million-lines-of-code.html
python  programming:refactoring  @followup 
13 hours ago
2017-Winter
> The UBC Scientific Software Seminar is inspired by Software Carpentry and its goal is to help students, graduates, fellows and faculty at UBC develop software skills for science.
python  machine_learning  UBC  Software_Carpentry 
13 hours ago
Coding is not “fun,” it’s technically and ethically complex
> Programming computers is a piece of cake. Or so the world’s digital-skills gurus would have us believe. From the non-profit Code.org’s promise that “Anybody can learn!” to Apple chief executive Tim Cook’s comment that writing code is “fun and interactive,” the art and science of making software is now as accessible as the alphabet.
> Unfortunately, this rosy portrait bears no relation to reality. For starters, the profile of a programmer’s mind is pretty uncommon. As well as being highly analytical and creative, software developers need almost superhuman focus to manage the complexity of their tasks. Manic attention to detail is a must; slovenliness is verboten. Attaining this level of concentration requires a state of mind called being “in the flow,” a quasi-symbiotic relationship between human and machine that improves performance and motivation.
education  education:computing  programming 
16 hours ago
Let’s call It “Computer Science” AND “Programming”: The fat line where most people will be
> Who knows why meme’s start, but one of the big ones in the Computing Education blogosphere today is “Let’s not call it ‘Computer Science’ if we really mean ‘Computer Programming.'”  Based on Neil Brown’s excellent response, I suspect that it’s the UK “Computing at Schools” effort that is leading to this question.  If you’re going to define a computer science curriculum, you’re going to have to define “computer science.”  Both Neil and Alfred Thompson do a great job of helping us define kinds of computing and understand the goals of the different kinds of curricula.
> I’m more interested in the assumptions in the original blog about what the “regular people” are going to do.  Jason Gorman claims that 99% of people involved in computing are just “users.”  “I believe that what’s needed is a much more rounded computing education for the 99%, with IT blending seemlessly and ubiquitously into everyday lessons as well as home life.”  They need computational thinking, but not programming, argues Jason.  Jason sees that only 1% of students should get programming. “For the remaining 1%, of whom some might become software developers, we need programming in schools (and out of school). Lots of it. ”
education  education:computing  programming 
16 hours ago
We need to separate Computing for All from Software Development: Claims that coding is not “fun,” it’s technically and ethically complex
> The problem with the article linked below is that Code.org and the author mean two different things by the word “programming.”  Programming can be fun, insightful, sloppy, small, and still useful without demanding “superhuman focus” and “manic attention to detail.”  This is an issue I’ve talked about with respect to the thick line between programmer and user where most people will be in the future. I’m teaching an ethics course this summer — building software for others is technically and ethically complex, as the author states.  But building software as an end-user, as a hobbyist, as a scientist or engineer exploring an idea?  We need a different word.
education  education:computing 
16 hours ago
Twitter
TIL about uniscribe, a super nifty unicode string de-constructor

h/t
from twitter_favs
20 hours ago
Twitter
In other words, kids, practice safe secs.
from twitter_favs
23 hours ago
Twitter
"two," whispers Rowling, eyes unfocused, hair wild. in the distance, sirens. she wipes blood from her face. "two ha…
from twitter_favs
23 hours ago
Data visualization workshop: basics in ggplot2
> ggplot(data=data, aes(x=Region, y=Seal)) + geom_bar(aes(fill=Sex), stat="identity", position=position_dodge())
data:visualization  data  ggplot2  R 
yesterday
gh-pages
> YorkU.DataViz.2017-6-20
data:visualization  data  R  ggplot2 
yesterday
gh-pages
> ggplot(data, aes(Long,Lat)) + geom_polygon(data=w, colour="grey50", fill="grey50", aes(x=long, y=lat))+coord_map()+ ylim(67.5, 76.5) + xlim(-142,-100)+ geom_point(data=data, aes(Long, Lat))+labs(title="New title")+geom_text(aes(x=-127, y=68.5, label="N.W.T."), size=3, , color = "black")+geom_text(aes(x=-135, y=75, label="Beaufort\nSea"), size=4, color = "darkblue", fontface = "italic")+geom_text(aes(x=-110, y=71, label="Victoria\nIs."), size=3,color = "black")+geom_text(aes(x=-121, y=73, label="Banks\nIs."), size=3, , color = "black")+geom_text(aes(x=-111.5, y=75.75, label="Viscount\nMelville Is."), size=3, , color = "black")
data:visualization  data  R 
yesterday
Lay out panels in a grid
> facet_grid forms a matrix of panels defined by row and column facetting variables. It is most useful when you have two discrete variables, and all combinations of the variables exist in the data.
data  R  ggplot2 
yesterday
Leaflet for R
> Leaflet is one of the most popular open-source JavaScript libraries for interactive maps. It’s used by websites ranging from The New York Times and The Washington Post to GitHub and Flickr, as well as GIS specialists like OpenStreetMap, Mapbox, and CartoDB.
>
> This R package makes it easy to integrate and control Leaflet maps in R.
javascript  R 
yesterday
Package ‘tmap’
Description Thematic maps are geographical maps in which spatial data distributions are visual- ized. This package offers a flexible, layer-based, and easy to use approach to create the- matic maps, such as choropleths and bubble maps.
ggplot2  R 
yesterday
tmap in a nutshell
> With the tmap package, thematic maps can be generated with great flexibility. The syntax for creating plots is similar to that of ggplot2. The add-on package tmaptools contains tool functions for reading and processing shape files.
R 
yesterday
Package 'ggmap'
Description A collection of functions to visualize spatial data and models
on top of static maps from various online sources (e.g Google Maps and Stamen Maps). It includes tools common to those tasks, including functions for geolocation and routing.
R 
yesterday
A Layered Grammar of Graphics
A grammar of graphics is a tool that enables us to concisely describe the components of a graphic. Such a grammar allows us to move beyond named graphics (e.g., the “scat- terplot”) and gain insight into the deep structure that underlies statistical graphics. This article builds on Wilkinson, Anand, and Grossman (2005), describing extensions and refinements developed while building an open source implementation of the grammar of graphics for R, ggplot2.
The topics in this article include an introduction to the grammar by working through the process of creating a plot, and discussing the components that we need. The gram- mar is then presented formally and compared to Wilkinson’s grammar, highlighting the hierarchy of defaults, and the implications of embedding a graphical grammar into a programming language. The power of the grammar is illustrated with a selection of examples that explore different components and their interactions, in more detail. The article concludes by discussing some perceptual issues, and thinking about how we can build on the grammar to learn how to create graphical “poems.”
Supplemental materials are available online.
yesterday
..level.. in ggplot2 contour plot
> From Hadley Wickham's A Layered Grammar of Graphics, page 21, the .. .. is used because the aesthetic (in this case, the levels of the contours) is not present in the original dataset, but instead is calculated by the contour statistic.
ggplot2  R 
yesterday
what does ..level.. mean in ggplot::stat_density2d
> the stat_ functions compute new values and create new data frames. this one creates a data frame with a level variable. you can see it if you use ggplot_build vs plotting the graph
data  R  ggplot2 
yesterday
The {do.call} function
> R has an interesting function called do.call. This function allows you to call any R function, but instead of writing out the arguments one by one, you can use a list to hold the arguments of the function. While it may not seem useful on the surface, a simple example will help to show how powerful do.call is.
R 
yesterday
Twitter
I was about to comment on this but my excellent intern swiftly whacked me on the head with a 2 by 4, as per instruc…
from twitter_favs
yesterday
Twitter
Thank you to . and . for literally the best panel in comics ever:
from twitter_favs
yesterday
Twitter
This isn't a syntax error in python, but I wish it was!
This simple typo can cause subtle hard to find bugs. 😡
from twitter_favs
yesterday
Canada's Housing Bubble Will Burst
Canadian home sales fell the most in five years last month. That didn’t stop an increase in prices, which were up 18 percent nationwide from a year earlier. When you consider that most houses are leveraged assets, this represents huge gains for homeowners.

While leverage can help boost performance on the way up, it becomes very dangerous on the way down. Leverage can turn even the best investments into poor ones when things go wrong, as losses are amplified. Equity can get wiped out pretty quickly on an overleveraged asset.

Canadian real estate has been on fire for years. The housing price data there has made the U.S. real estate market during the boom of the mid-2000s look mild.

The Federal Reserve Bank of Dallas puts out a global housing price index for more than 20 countries every quarter. Using this data, I looked at the real house price index data for Canada and compared it with the same data in the U.S. going back to 1975.
2 days ago
Twitter
If Pacman eats a yellow ghost he dies.
from twitter_favs
2 days ago
« earlier      
1password 5by5 @followup ableg academic amazon android app app:hazel app:text_editor app_net apple apple:app_store applescript art backup bash blockchain:bitcoin blog blogging book books books:science_fiction brad_delong canada canada:census:2011 canada:census:2011:nhs car cdnecon cdnpoli cea2014 census cindy_phan cirpa2016 coffee comics cybersummit16 cycling data data:machine_learning data:visualization datascience design development dnd docker doctorwho drm dropbox ebooks econometrics econometrics:simulation economics economics:canada economics:debt economics:labour edu:canada edu:canada:pse edu:canada:pse:ucalgary edu:grad_school edu:pse edu:pse:canada edu:tuition education edward_snowden email encryption environment epri_data_list facebook finance foursquare funny games:board games:rpg games:rpg:dnd games:video gender gender:tech ggplot2 git github gmail google gradearnings heqco highered his history homebrew howto icloud imported instapaper internet ios ip:copyright ip:copyright:canada:c-32 ip:copyright:piracy ip:patents ipad iphone itunes javascript jeff_atwood john_siracusa keyboard_maestro kindle latex leopard library linux mac machine_learning macos mail.app maps markdown math merlin_mann microsoft mobile mountain_lion movies music netflix nsa oc_transpo omnifocus open_data opensource os_x ottawa pandas pdf perl photo photography pinboard podcast pokemongo politics politics:canada politics:toronto politics:toronto:rob_ford politics:us politics:us:trump privacy productivity programming pycon pyconca2016 pydata python r raspberry_pi regex religion research rss rstats ruby safari science security security:computer security:password software software_carpentry star_trek star_wars startup stata stata:intro statistics statistics_canada stem steve_jobs stevenuniverse stuff surveillance technology telecom:canada:crtc telecom:canada:rogers telephone terminal text textexpander toronto transit transit:toronto ttc tv tv:doctor_who tv:game_of_thrones tv:general tv:lost twitter ubb unix via video vim web wikileaks windows wireless work worthwhile_canadian_initiative writing yits yyc zombies

Copy this bookmark:



description:


tags: