Preferential Rents in New York City - ProPublica Data Store
In 2003, lawmakers in New York State passed a law that in effect allowed landlords to bypass annual limits on rent increases for their rent-stabilized apartments. Owners could raise rents by more than the annual limits if they registered a high rent — often high above existing market rates -- but charged tenants a lower, “preferential” rent. Preferential rents are not regulated and can be raised up to the registered rate upon lease renewal. Today, more than 250,000 New York City apartments feature these rents.
19 hours ago
The Platinum Patients - The Atlantic
Each year, 1 in every 20 Americans racks up just as much in medical bills as another 19 combined. This critical five percent of the U.S. population is key to solving the nation's health care spending crisis.
data-visualization  padjo 
Laurence Tratt: What Challenges and Trade-Offs do Optimising Compilers Face?
After immersing oneself in a research area for several years, it's easy to feel like the heir to a way of looking at the world that is criminally underrated: why don't more people want what I want? This can be a useful motivation, but it can also lead to ignoring the changing context in which most research sits. So, from time to time, I like to ask myself the question: “what if everything I believe about this area is wrong?” It’s a simple question, but hard to answer honestly. Mostly, of course, the only answer my ego will allow me is “I can’t see anything I could be wrong about”. Sometimes I spot a small problem that can be easily fixed. Much more rarely, I've been forced to acknowledge that there’s a deep flaw in my thinking, and that the direction I’m heading in is a dead-end. I’ve twice had to face this realisation, each time after a gradual accumulation of unpleasant evidence that I've tried to ignore. While abandoning several years work wasn’t easy, it felt a lot easier than trying to flog a horse I now knew to be dead.
Gene name errors are widespread in the scientific literature | Genome Biology | Full Text
The spreadsheet software Microsoft Excel, when used with default settings, is known to convert gene names to dates and floating-point numbers. A programmatic scan of leading genomics journals reveals that approximately one-fifth of papers with supplementary Excel gene lists contain erroneous gene name conversions.
Human error costs TransAlta $24-million on contract bids - The Globe and Mail
A slip of the hand in a computer spreadsheet for bidding on electricity transmission contracts in New York will cost TransAlta Corp. $24-million (U.S.), wiping out 10 per cent of the company's profit this year.

The error, although costly, was bizarrely simple. Someone preparing the electronic file of bids that TransAlta submitted at the end of April misaligned the rows of information in the spreadsheet. High bids intended for certain transmission paths were instead made for lower-demand routes -- meaning that TransAlta overpaid for transmission contracts, as well as buying more capacity than it intended in certain cases.

"It was literally a cut-and-paste error in an Excel spreadsheet that we did not detect when we did our final sorting and ranking of bids prior to submission," said Steve Snyder, TransAlta's president.
Excel Error by a Cleary Gottlieb Associate Alters Lehman Asset Deal
A first-year associate at Cleary Gottlieb Steen & Hamilton made an Excel reformatting error that mistakenly added 179 contracts to an agreement to buy Lehman Brothers assets, according to a motion filed with a New York bankruptcy court on Friday.

Cleary Gottlieb Steen & Hamilton represents Barclays Capital Inc. in its agreement to buy the assets. Its motion (PDF posted by Above the Law) says the law firm was working under a tight deadline when the mistake was made, Computerworld reports.

The motion seeks to exclude the 179 contracts from the purchase agreement.

Above the Law was the first to report the error. Its story said the mistake happened at around 11:30 p.m. on Sept. 18 when a second-year associate asked a first-year associate to reformat an Excel document of critical Lehman contracts to be assumed by Barclays.

The associate resized the rows when converting the spreadsheet into a PDF document, causing “hidden” contracts in the spreadsheet to be exposed. The spreadsheet, which had been e-mailed to the law firm by a Lehman representative, contained nearly 1,000 rows and more than 24,000 individual cells.
Researchers Finally Replicated Reinhart-Rogoff, and There Are Serious Problems. - Roosevelt Institute
In 2010, economists Carmen Reinhart and Kenneth Rogoff released a paper, “Growth in a Time of Debt.” Their “main result is that…median growth rates for countries with public debt over 90 percent of GDP are roughly one percent lower than otherwise; average (mean) growth rates are several percent lower.” Countries with debt-to-GDP ratios above 90 percent have a slightly negative average growth rate, in fact.
models/object_detection at master · tensorflow/models
Creating accurate machine learning models capable of localizing and identifying multiple objects in a single image remains a core challenge in computer vision. The TensorFlow Object Detection API is an open source framework built on top of TensorFlow that makes it easy to construct, train and deploy object detection models. At Google we’ve certainly found this codebase to be useful for our computer vision needs, and we hope that you will as well.

tensorflow  computer-vision 
6 days ago
Inside the Algorithm That Tries to Predict Gun Violence in Chicago - The New York Times

Gun violence in Chicago has surged since late 2015, and much of the news media attention on how the city plans to address this problem has focused on the Strategic Subject List, or S.S.L.

The list is made by an algorithm that tries to predict who is most likely to be involved in a shooting, either as perpetrator or victim. The algorithm is not public, but the city has now placed a version of the list — without names — online through its open data portal, making it possible for the first time to see how Chicago evaluates risk.

We analyzed that information and found that the assigned risk scores — and what characteristics go into them — are sometimes at odds with the Chicago Police Department’s public statements and cut against some common perceptions.
datasets  compciv  socrata 
9 days ago
Making the Internet Archive’s full text search faster.
This article describes how we made the full-text organic search faster at the Internet Archive — without scaling horizontally — allowing our users to search in just a few seconds across our collection of 35 million documents containing books, magazine, newspapers, scientific papers, patents and much more.

By organic search we mean the “method for entering one or a plurality of search items in a single data string into a search engine. Organic search results are listings on search engine results pages that appear because of their relevance to the search terms.” (1).

The relevance should be scored on the search query matches for every document. In other words, if the search query has a perfect match in some of our documents, we want to return these documents; otherwise, we want to return the documents containing a part of the query or one of its subset.
14 days ago
Comey hearing – Cable news coverage, chyrons - Washington Post
Coverage of former FBI director James B. Comey’s testimony looked about the same across cable news channels. A closeup of a senator forming a question, a wide show of the room — there’s just not much to show on TV. But the ALL CAPS text in the bar at screen bottom differentiates networks, exposing what they want viewers to take away from the hearing.
journalism  news  politics  best 
14 days ago
Crime and Punishment in Chicago
Crime and Punishment in Chicago is an index of data sources surrounding this criminal justice system as it is in Chicago. We track data sources from the commission of the crime all the way to prison. We aggregate sources of data, provide insight into how this data is generated, discuss how to get it, and expose what data is unavailable.
crime  data  reference  padjo  compciv 
15 days ago
Testing from the ground up
Tests are pieces of code that check if your main code works. I write tests to catch bugs when I refactor. I write tests to force myself to think through and handle edge cases. I write tests to show the users of my project that my code does what I say it does.
testing  READTODO 
16 days ago
Failure to warn: Hundreds died while taking an arthritis drug, but nobody alerted patients
In a review of millions of reports to the FDA involving more than 100 drugs approved since 2010, Actemra stood out. It showed that Actemra patients had experienced an unusually large number of serious side effects that didn’t appear on the drug’s warning label.

The initial review was performed for STAT by Advera Health Analytics in Santa Rosa, Calif., a company that collects and curates drug-related complaints to the FDA Adverse Events Reporting System, known as FAERS. The company then provided comparison data for all major rheumatoid arthritis drugs.

STAT’s analysis of that data, including more than 13,500 FAERS reports on Actemra, showed higher than expected numbers of several serious problems when compared to competing drugs. These included the blockbusters Humira and Remicade, which have many more users.
fda  padjo  drugs  investigations 
16 days ago
Comma Separated Vulnerabilities
This post introduces Formula Injection, a technique for exploiting ‘Export to Spreadsheet’ functionality in web applications to attack users and steal spreadsheet contents. It also details a command injection exploit for Apache OpenOffice and LibreOffice that can be delivered using this technique.
security  programming  excel 
17 days ago
Learnings from analyzing my compromised server (Linode)
Some one from Linode further fueled it by giving me a $10 promo code. Talk about pushing things.
Fast forward some hours. I have a 4GB centOS 7 box up and running on a Singapore datecenter. After some failed attempts, got my ansible-playbook to run on the remote machine which installed ovirt-engine on it.
Happy period quickly turning to a bad one
Everything is fine and dandy and I am watching a basketball match with my friends after office hours. 4 minutes left to the final whistle, Cracking match on, everyone is playing like a pro. A very close call between the two teams, but one gets the better of the other one.
Returning back home. I get this buzz on my phone. Turns out it’s an email from Linode. Daym. I thought was I billed already?
29 days ago
How Facebook Moved 20 Billion Instagram Photos Without You Noticing | WIRED
But Krieger says Instagram doesn’t really share data with the wider Facebook operation. He indicates, for instance, that Facebook can’t use your behavior on Instagram to target ads on Facebook proper—or vice versa. “One of the things we had to do was really silo the information, which ends up being important for privacy and other reasons,” he says.
5 weeks ago
For 'New York Times' Obit Writers, 'Death Is Never Solicitous Of A Deadline' : NPR
This is FRESH AIR. I'm Terry Gross. Our guest today - a veteran journalist who spent years writing evocative profiles of people they typically never met. Margalit Fox and Bruce Weber have each written more than a thousand obituaries for The New York Times. Their subjects have ranged from celebrities and politicians to, as you'll soon hear, the inventor of Stove Top stuffing. Fox and Weber are among those featured in a new documentary about The Times obituary department called "Obit." It's directed by Vanessa Gould.
5 weeks ago
Few Police Officers Have Been Convicted in High-Profile Cases Involving Deaths of Blacks
We looked at 15 high-profile cases from the last three years that rose to national prominence and increased racial tensions, often prompting protests around the country.

In some of the cases, the police offered an explanation for their actions, but raw videos led many to conclude that their actions were unjustified.

Officers were indicted or charged in eight of the cases. Trials are pending in five cases. Just two cases have resulted in guilty pleas or convictions so far.
5 weeks ago
In Conversation with... Charles Ornstein | AHRQ Patient Safety Network
CO: Good reporters have a variety of sources of information. Reporters who routinely cover the hospitals in their communities should be constantly looking at state inspection reports, lawsuit data about payouts, their Joint Commission accreditation, and how they're doing with their training program. All these types of things should be on the radar screen and monitored regularly. But nothing can replace talking to employees in the facilities and the patients that receive care there—trying to get an on-the-ground perspective. It's important to be out there as a reporter. You need to get to as many people as you can, handing out your business card and urging people to call you. Even the best hospitals in the country have problems, and there are problems that they can fix. But if employees have raised their concerns internally and the problems are not fixed and they feel like they cannot get traction, that's really the time to start calling the media. There's nothing like a call from a reporter to get things moving.
5 weeks ago
POI Homicide Database Project - Home
Our project explores un- and under-reported incidents of police officer involved (POI) homicides, both justified and unjustified. To fill gaps found in existing government and local databases pertaining to POI homicides, we deploy participatory action research methods through community involvement in mining and analyzing social media data related to these incidents. Through these methods, social media information operates in concert with publicly available government and local databases to create a clearer representation of the lived realities of communities experiencing police homicides in the United States. Los Angeles County is our first community of study.
police  data  compciv 
5 weeks ago
10-Year-Old Shames Boston Globe into Updating Its Database of Murder Victims
Kristen Lartey was a 22-year-old recent graduate of St. John’s University, active in youth ministries and running her own nail design business when she and two friends were shot to death on a Sunday evening in August 2012. That’s according to a story in the Boston Globe a few days after the tragic murder.
journalism  data-journalism  compciv 
5 weeks ago
Journalist, 9, Responds to Her Critics and Becomes a Media Star
A 9-year-old journalist from Selinsgrove, Pa., became a media sensation this week, with articles in The Washington Post and The Guardian and an appearance on “Good Morning America,” with more to come.

It all started when Hilde Kate Lysiak, the publisher of and a reporter for The Orange Street News, a monthly newspaper (circulation 200), broke the news about a killing in her neighborhood on Saturday.
journalism  best 
5 weeks ago
Steven Rich: I failed again at getting my colleagues to use an actual spreadsheet for data. One day...
.@Fahrenthold & I are tracking every event where somebody rented a room at Mar-a-Lago since the election. Are we missing any? Help us out!
5 weeks ago
A16Z AI Playbook
Artificial Intelligence (AI) is a set of computer science techniques that, as Stanford professor Andrew Ng is fond of saying, gives your software super powers.

Building on our Primer on Artificial Intelligence, this microsite is intended to help newcomers (both non-technical and technical) begin exploring what's possible with AI. We've met with hundreds of Fortune 500 / Global 2000 companies, startups, and government policy makers asking: "How do I get started with artificial intelligence?" and "What can I do with AI in my own product or company?"

This site is designed as a resource for anyone asking those questions, complete with examples and sample code to help you get started.
5 weeks ago
Prison tattoos: A statistical analysis of the art on convicts’ bodies | The Economist
If people’s ethnicity and sex determines their tattoos, can the same be said of their types of crime? Using data from the FDOC, The Economist built a series of statistical models to predict the likelihood criminals had committed particular crimes based on their demographic traits and choices of tattoos (see table).

6 weeks ago
FDA Calendar • BioPharmCatalyst
Biotech Catalyst and FDA Calendar for your biotech stock investing. Use our tools on your road to profit in the stock market. Biotech stocks with key binary events/catalysts - FDA Approval/PDUFA, Advisory Committee and Phase 2 & 3 trial data releases dates.

Refer to the Biotech Historical Catalyst Calendar for completed biotech catalysts.

The catalyst date is the LATEST expected date for the catalyst to occur by. For example, if the catalyst date is 12/31/17, the event is expected to occur BY 12/31/17. It does NOT necessarily mean that the catalyst will occur ON this date. In most cases it will occur BEFORE.
6 weeks ago
Google Guidelines for Raters

Though each task is brief, a rater's work isn't easy. Before they begin at Leapforce, all raters must pass a series of rigorous exams to make sure they understand the 160-page book of guidelines that Google provides to raters. "It's hard to pass," one rater told Ars. "I have referred nine people to this job. Every one of them failed the exam."
google  search  seo  teaching  guide 
7 weeks ago
Google rater fired after speaking to Ars about work conditions | Ars Technica
Last week we reported on the lives of Google raters, people whose job is to provide Google with data on the usefulness of its algorithms. The 10 anonymous raters we spoke with were all contractors at Leapforce, a staffing firm that provided rater services to Google. Yesterday, one of those raters, Kyle Martin Medeiros, was fired by Leapforce for an unspecified "breach of contract." Every Leapforce rater signs a contract that includes a broad NDA.
journalism  algorithms 
7 weeks ago
Hell hath no fury like The New York Times scorned by Hollywood - Columbia Journalism Review
At the very least, The Post has occasioned some wry inside humor. “I am looking,” quips Neil Lewis, a Times correspondent for 24 years, “for help in writing a screenplay about how the Times broke the Watergate story.”
case-study  journalism 
7 weeks ago
« earlier      
a-b-testing academic advice ai algorithm algorithms amazon analysis analytics angular angularjs animation api apis apple apps architecture art article aws backbone bash bayesian best big-data bioinformatics book bots business c caching campaign-finance census cheatsheet cli clinicaltrials clojure code colors command-line compciv compilers computer computer-science computer-vision computing course crime crypto css d3 data data-analysis data-journalism data-mining data-munging data-science data-sharing data-visualization database databases datajournalism datasets ddj death-data debugging deep-learning deployment design design-example devops digital-humanities django drugs education elections email engineering essay excel facebook fakenews finance flux foia framework funny game game-dev games gaming git github golang google government graphics guide hack hacking hadoop hardware hash haskell health history howto html html5 http image-processing infographic interactive interesting internet introduction investigations ios java javascript journalism jquery json judicial-system language learning linux lisp mac machine-learning map-reduce mapping maps marketing math medicine mobile mongodb music mysql netsec network neural-networks news nlp nodejs nosql nyc nylist object-oriented ocr oop open-data opencv optimization osx padjo pandas papers patterns performance photography police politics postgres prisons privacy programming publicrecords punctuation python r rails react reactjs reference regex research ruby rust scalability science scraping search security semitechnical seo server server-ops shell spam spreadsheets sql standards startups statistics syllabus tdd teaching tensorflow testing text text-mining tools transparency tutorial twitter typography ui unicode unix ux video vim visualization visualizations web web-design web-development web-scraping writing wtfviz

Copy this bookmark: