dataset   8890

« earlier    

100+ Interesting Data Sets for Data Science - Data Science Central
Read full list if you find these examples interesting.

How about reading other people’s emails? Ever wanted to do that, but can’t be bothered to train l33t hacking skills (and never mind the legality of it)? (Okay, this one I have thought about.) Well, I’ve got you covered. Check out the Enron corpus. It contains more than half a million emails from about 150 users, mostly senior management of Enron, organized into folders. Wikipedia calls it “unique in that it is one of the only publicly available mass collections of ‘real’ emails easily available for study.” Business idea: figure out what sort of information gets leaked in the emails that will later harm the execs at trial or whatever, then build a software system to automatically mine those out of real email. Either sell it to law enforcement or to corporate executives as the finest cover-your-ass email system.

Wondering what the internet really cares about? Well, I don’t know about that, but you could answer an easier question: What does Reddit care about? Someone has scraped the top 2.5 million Reddit posts and then placed them on GitHub. Now you can figure out (with data!) just how much Redditors love cats. Or how about a data backed equivalent of
dataset  datos 
4 days ago by Wentrukona
AI-Driven Company Data Injected into Your Decision-Making |
With our database of 108+ million companies, unique data points, and AI expertise, we’ll solve any problem related to company data by injecting AI-driven data into your decision-making.
SMB  dataset 
10 days ago by clehene

« earlier    

related tags

2018  3d  academic  aclu  aesthetics  ai  amazon  analysis  angle  anime  api  appstore  archaeology  article  attributes  audio  autoencoder  awesome  baseline  big_data  bigdata  bio  biotech  bom  book  books  bounding  cancer  car  cars  celebrity  civil-liberty  climate  co-occurence  collection  color  commentary  computer-vision  consent  content  corpora  crawl  crawler  criminal-justice  criminology  crop  cropping  crypto  csv  data-sets  data  data_mining  data_science  database  datamining  datascience  datasets  datathon  dating  datos  deep-learning  deeplearning  degrees-of-freedom  demographics  demography  differential-privacy  directory  download  dragnet  drawing  dropbox  duplicate-content  dvcs  dweb  economics  economy  eda  election  ethical-algorithms  example-data  exploratory-data-analysis  extract  extraction  face  facial-recognition  facialrecognition  fashion  flickr  free  frequency  gan  generative-adversarial-networks  genetics  genomics  geographical  github  gnxp  golang  google  government  graphs  gretl  handwriting  happy  healthcare  howto  huge-data-the-biggest  identity  image-aesthetics  image-captioning  image-saliency  image  imagerecognition  images  instagram  intel  international  ipfs  jeffbezos  kaggle  kindle  kinship  language  law  law_enforcement  legibility  leviathan  linked_data  list  machine-learning  machine_learning  machinelearning  macroecometrics  macroeconomics  management  managerial-state  matching  material  measurement  meteorology  methodology  mfglabs  microsoft  microsoftresearch  mit  ml  moana  model  moz  multi  myki  network  nlp  nyc  ocr  open_data  opendata  opensource  paper  placement  poetics  police  population  postgres  poverty  preprint  privacy  programming  proposal  protocol  public  python  r-project  recognition  reddit  reference  regulation  rekognition  research  resource  risk  rotation  sales  saliency  scale  science  scitariat  scrape  scraping  security  sentiment  skip-thought  smb  social  spreading  stanford  state-of-art  stations  statistics  storage  study  surveillance  tdf  technology  testing  textanalysis  the-watchers  theano  torrent  tour-de-france  train-stations  train  trained  training  trees  trends  tutorial  tweets  twitter  ufo  unintended-consequences  university-ad-hoc  usa  vae  variational  venmo  versioncontrol  vision  visualisation  visualise  visualization  visualizer  volo-avolo  weather  web  whole-partial-many  wikipedia  world  worldbank 

Copy this bookmark: