ajohnson1200 + data 42
W. Edwards Deming - Wikipedia, the free encyclopedia
may 2011 by ajohnson1200
Quote: "In God we trust; all others must bring data."
data
bigdata
research
culture
management
may 2011 by ajohnson1200
dl.google.com/googleio/2010/app-engine-data-pipelines.pdf
april 2011 by ajohnson1200
Some interesting ideas around scheduled tasks / pipelining.
data
bigdata
processing
tasks
april 2011 by ajohnson1200
language agnostic - What are the lesser known but cool data structures ? - Stack Overflow
march 2011 by ajohnson1200
Fun stuff in here. via @joshua schachter
algorithms
data
programming
engineering
march 2011 by ajohnson1200
Facebook | The Full Stack, Part I
december 2010 by ajohnson1200
Great blog post. Quote: "... People who develop broad skills also tend to develop a good mental model of how different layers of a system behave. This turns out to be especially valuable for performance & optimization work. No one can know everything about everything, but you should be able to visualize what happens up and down the stack as an application does its thing. An application is shaped by the requirements of its data, and performance is shaped by how quickly hardware can throw data around."
performance
facebook
engineering
data
education
december 2010 by ajohnson1200
google-refine - Project Hosting on Google Code
november 2010 by ajohnson1200
Cool tool for working with large, unrefined data sets.
bigdata
data
infoviz
information
november 2010 by ajohnson1200
Expedia on how one extra data field can cost $12m | Sales & Marketing | silicon.com
november 2010 by ajohnson1200
"After we realised that we just went onto the site and deleted that field - overnight there was a step function [change], resulting in $12m of profit a year, simply by deleting a field. We have found 50 or 60 of these kinds of things by using analytics and paying attention to the customer."
analytics
data
ux
abtesting
datamining
november 2010 by ajohnson1200
A Data-driven Look at the Realtime Web
october 2010 by ajohnson1200
Some inside baseball looks at bit.ly data / flow.
bit.ly
bigdata
data
twitter
realtime
october 2010 by ajohnson1200
JavaScript grid editor: I want to be Excel « Eltit Golb
january 2010 by ajohnson1200
A breakdown of about 15 different browser-based Excel-like jQuery modules.
javascript
grid
spreadsheet
excel
data
ui
jquery
datagrid
january 2010 by ajohnson1200
Facebook | Facebook Data Team: Distributed Data Analysis at Facebook
december 2009 by ajohnson1200
Today, Facebook counts 29% of its employees (and growing!) as Hive users. More than half (51%) of those users are outside of Engineering. They come from distinct groups like User Operations, Sales, Human Resources, and Finance. Many of them had never used a database before working here. Thanks to Hive, they are now all data ninjas who are able to move fast and make great decisions with data.
facebook
hadoop
analytics
hive
data
bigdata
december 2009 by ajohnson1200
Turning Predictions into Opportunities - O'Reilly Radar
november 2009 by ajohnson1200
Big data is being democratised, but there's a lot of unmet need in businesses around data warehousing. The typical solution is to build a data warehouse team around a product like Oracle, but I've heard plenty of business people grizzling about the result. They want answers, they don't want the headaches and lag that a data warehouse involves. Big Data (or Cloud Analytics or whatever) may be the opportunity to figure out a new minimum viable product for these folks, and offer it without the "data warehouse" baggage. This might be back end, might be UIs, might be visualisation, but all of these have a lot of room for improvement.
trends
datamining
data
analytics
hadoop
mapreduce
november 2009 by ajohnson1200
AppleInsider | Apple iPhone eats up 50% share of all mobile data traffic globally
november 2009 by ajohnson1200
AdMob notes that Apple advanced its lead in smartphone traffic share from 43% last month to an even 50%. Symbian slipped from 29% to 25%, while third place Android grew from 10% to 11%. RIM's share fell slightly from 8% to 7%, Windows Mobile dropped from 5% to 3%.
iphone
mobile
apple
data
android
phone
traffic
november 2009 by ajohnson1200
Hadoop, Pig, and Twitter (NoSQL East 2009)
november 2009 by ajohnson1200
Twitter + terabytes of information + datamining = pig.
hadoop
twitter
pig
mapreduce
analytics
presentation
datamining
data
november 2009 by ajohnson1200
How Google and Facebook are using R : Dataspora Blog
october 2009 by ajohnson1200
Quote: "[Facebook] ... found that activity at three months was predicted by variables related to three classes of behavior: (i) how often a user was reached out to by others, (ii) frequency of third party application use, and (iii) what Itamar termed “receptiveness” — related to how forthcoming a user was on the site."
statistics
google
analytics
facebook
datamining
analysis
data
october 2009 by ajohnson1200
Palantir Government | Palantir Technologies
september 2009 by ajohnson1200
Palantir Government is a platform for information analysis. Palantir was designed for environments where the fragments of data that an analyst combines to tell the larger story are spread across a vast set of starting material. Palantir provides flexible tools to import and model data, intuitive constructs to search against this data, and powerful techniques to iteratively define and test hypotheses. With Palantir, an analyst can go all the way from an initial tasking to a final product in hours or days, rather than weeks.
government
information
infoviz
data
september 2009 by ajohnson1200
PostRank™ Data Services // Home
august 2009 by ajohnson1200
Another business built around access to real-time data.
data
api
realtime
datamining
social
august 2009 by ajohnson1200
The Nike Experiment: How the Shoe Giant Unleashed the Power of Personal Metrics
june 2009 by ajohnson1200
Quote: "The gist of the idea is that people change their behavior—often for the better—when they are being observed (which is why it's sometimes called the observer effect). Those workers at Western Electric didn't build more relays because there was more or less light or because they had more or fewer breaks. The Hawthorne effect posits that they built more relays simply because they knew someone was keeping track of how many relays they built."
apple
nike
data
stats
health
hawthorne-effect
observer-effect
metrics
june 2009 by ajohnson1200
Why Email Clients Need to Change
june 2009 by ajohnson1200
Quote: ".. But if inboxes don’t fundamentally change in order to adapt to their new role as the keeper of myriad transactions across the entire web, they’ll be obsolete."
email
data
ideas
inbox
filtering
communication
june 2009 by ajohnson1200
Churchill Club Top Ten Tech Trends on Flickr - Photo Sharing!
may 2009 by ajohnson1200
Ann Winblad: The unstructured data deluge creates the next great information leaders. Every click, message, tweet is rich data amassed at exponential rates. Gartner predicts enterprise data growth of 650 percent in five years; 80 percent of that data will be unstructured.
data
bigdata
datamining
storage
enterprise
may 2009 by ajohnson1200
Alex Payne | Life As A Series of Queues
december 2008 by ajohnson1200
Quote: "My suspicion is that there’s a market in making each and every one of those queues smaller, if not making them disappear entirely." Bigger market: being able to prioritize across queues.
data
productivity
queueing
queue
december 2008 by ajohnson1200
Delighting with Data » Talks » tomtaylor.co.uk
july 2008 by ajohnson1200
Making things talk, mostly with twitter, in 1st person.
data
twitter
visualization
development
hacks
design
july 2008 by ajohnson1200
How to do data portability (Scripting News)
may 2008 by ajohnson1200
Quote: "The best way to achieve data portability is to just do it." Couldn't agree more.
data
database
open
dataportability
davewiner
may 2008 by ajohnson1200
How I Learned to Stop Worrying and Love Using a Lot of Disk Space to Scale | High Scalability
may 2008 by ajohnson1200
A list of some of the seemingly crazy things you have to do if you're going to use BigTable.
architecture
cluster
database
data
sharding
scaling
performance
bigtable
scalability
may 2008 by ajohnson1200
JavaScript Information Visualization Toolkit (JIT) at noumena
may 2008 by ajohnson1200
JS based charting including treemaps, hyperbolic trees and space trees.
ajax
charting
cool
data
visualization
javascript
may 2008 by ajohnson1200
Yahoo! Developer Network: Yahoo! Weather
april 2008 by ajohnson1200
Free weather data in RSS format for "... your own web site or client application" but then the fine print says only personal / non-commercial. Bollocks.
weather
yahoo
api
feed
feeds
rss
data
april 2008 by ajohnson1200
sioc-project.org | Semantically-Interlinked Online Communities
april 2008 by ajohnson1200
Description: SIOC provides a Semantic Web ontology for representing rich data from the Social Web in RDF.
communities
community
data
semantic
foaf
ontology
rdf
sioc
semanticweb
april 2008 by ajohnson1200
Where to Find Open Data on the Web - ReadWriteWeb
april 2008 by ajohnson1200
Great list (and comments) of freely available data sources. When is someone going to make the weather forecaster datasource free so that I don't have to pay accuweather or weather.com for forecasts / current weather?
data
free
database
resources
open
april 2008 by ajohnson1200
reddit.com: ask reddit: what other visualization tools do you know besides processing, graphviz and nodebox?
april 2008 by ajohnson1200
References to some other cool Java viz tools.
data
graph
nodebox
opensource
statistics
visualization
april 2008 by ajohnson1200
Datawocky: More data usually beats better algorithms
april 2008 by ajohnson1200
Quote: "... adding more, independent data usually beats out designing ever-better algorithms to analyze an existing data set."
algorithms
bigdata
data
database
databases
algorithm
netflix
datamining
april 2008 by ajohnson1200
Persai Research
january 2008 by ajohnson1200
Feed corpus: 124,460 unique RSS/Atom feeds
persai
feeds
rss
research
data
bigdata
january 2008 by ajohnson1200
Web Data Mining, book by Bing Liu
january 2008 by ajohnson1200
Exploring Hyperlinks, Contents and Usage Data
data
datamining
research
book
books
january 2008 by ajohnson1200
(theinfo)
january 2008 by ajohnson1200
"... a site for large data sets and the people who love them"
bigdata
data
megadata
datasets
visualization
analytics
analysis
january 2008 by ajohnson1200
Google Admits "Data is the Intel Inside"
december 2007 by ajohnson1200
Quote: "As the applications become apparent, the data will be valuable in new ways, and the company with the most data wins."
megadata
data
datamining
google
december 2007 by ajohnson1200
Joe Gregorio | BitWorking | Megadata Follow-up
december 2007 by ajohnson1200
Quote: "... just like using more methods and pushing information into the headers gives more information to the network, by denormalizing you are implicitly giving more information to the database, and that 'extra information' makes things run faster."
databases
denormalization
megadata
scalability
data
database
december 2007 by ajohnson1200
Joe Gregorio | BitWorking | ETech '07 Summary - Part 2 - MegaData
december 2007 by ajohnson1200
Quote: "... we need a new kind of data store, a new kind of SQL, something that does for storing and querying large amounts of data what SQL did for normalized data."
bigdata
datamining
dataveillance
analytics
storage
database
data
megadata
december 2007 by ajohnson1200
Well-formed data | Elastic lists
october 2007 by ajohnson1200
Very cool combination of infoviz and browsing.
classification
data
filtering
information
navigation
visualization
october 2007 by ajohnson1200
Google Analytics: The goggles, they do nothing! | Archives | codablog | Coda Hale
march 2007 by ajohnson1200
Great rant on pie charts and 3D graphs. Quote: "... if it’s a simple dataset, boil it down to the essentials. If there are two numbers which add up to 100%, you don’t need to tell me both, and you certainly don’t need to draw me a picture of it."
infoviz
information
graphing
graph
visualization
google
charting
chart
data
analytics
march 2007 by ajohnson1200
Dare Obasanjo aka Carnage4Life - Updated: XML Has Too Many Architecture Astronauts
january 2007 by ajohnson1200
JSON is a better fit for Web services that power Web mashups and AJAX widgets due to the fact that it is essentially serialized Javascript objects which makes it fit better client side scripting which is primarily done in Javascript.
json
javascript
client
xml
data
browser
january 2007 by ajohnson1200
visualcomplexity.com | A visual exploration on mapping complex networks
july 2006 by ajohnson1200
Graph / infoviz p0rn. I could look at these all day long.
visualization
infoviz
maps
mapping
art
data
information
stats
wow
july 2006 by ajohnson1200
Data Analytics :: Mozilla Add-ons :: Add Features to Mozilla Software
may 2006 by ajohnson1200
Very cool Firefox plugin that basically gives you Excel graphing / spreadsheet abilities in your browser.
analytics
data
firefox
plugins
may 2006 by ajohnson1200