otlib + datamining   100

Data mining with WEKA, Part 1: Introduction and regression
Data mining is the talk of the tech industry, as companies are generating millions of data points about their users and looking for a way to turn that information into increased revenue. Data mining is a collective term for dozens of techniques to glean information from data and turn it into something meaningful. This article will introduce you to open source data-mining software and some of the most common techniques to interpret data.
ibm  datamining  ml  machienlearning 
september 2018 by otlib
Using Scrapy to Build your Own Dataset – Towards Data Science – Medium
When I first started working in industry, one of the things I quickly realized is sometimes you have to gather, organize, and clean your own data. For this tutorial, we will gather data from a…
scrapy  datamining  data  python  tutorial  web-scraping  dataset  open  research  webdev 
september 2017 by otlib
Top 28 Cheat Sheets for Machine Learning, Data Science, Probability, SQL & Big Data
This article provides 28 cheat sheets for machine learning,data science,probability,SQL & big data.You will find cheat sheets for various tools & techniques
cheatsheet  datamining  mysql  python  datascience 
february 2017 by otlib
On Being a Data Puppet — Medium
The following is drawn from fragments of talks at Next15, Emerce eDay 2014, combined with recent adventures in expatriat…
data  consumerism  datamining  usa  europe 
december 2015 by otlib
import.io | Web Data Platform & Free Web Scraping Tool
Turn the web into data, today. Transform any website into a table of data or an API in minutes without even writing any code with our free app
data  api  scraping  datamining  json  web  tools  scrape  parse  tool 
may 2015 by otlib
The joyless world of data-driven startups — Medium
Everyone tells early stage startups to use data for big strategic decisions. But does that really work, and whatever hap…
data  bigdata  startup  decisionmaking  business  intuition  contrarian  culture  datamining  data_analytics 
march 2015 by otlib
Algorithms Every Data Scientist Should Know: Reservoir Sampling | Apache Hadoop for the Enterprise | Cloudera
Cloudera offers enterprises a powerful new data platform built on the popular Apache Hadoop open-source software package.
data  science  programming  datamining  algorithms  computer  science  blog  reservoir  sampling  statistics 
july 2013 by otlib
« earlier      
per page:    204080120160

related tags

500px  aaaareview  aggregator  ai  algorithm  algorithms  amazon  analysis  analytics  apache  api  Apriori  arxiv  association  aws  bash  bayesian  beautifulsoup  behavioral  bigdata  blog  blog-post  book  books  business  BusinessIntelligence  by:programisto  c++  career  careers  challenges  cheatsheet  cheatsheets  cities  classification  classifier  clean  cloud  cloudcomputing  clustering  codegolf  comments  commerce  competition  computer  computer-science  computerscience  computing  consumerism  content  contest  contrarian  cool  critique  crowdsourcing  cs  csv  culture  curl  curriculum  data  data-analysis  data-cleaning  data-mining  data-proc  data-science  data-scientist  dataanalysis  database  datamining  dataprocessing  datascience  datascientist  dataset  data_analytics  data_mining  dating  db  dbadmin  decisionmaking  deep-learning  deeplearning  demographics  development  dl  ebook  ebooks  ec2  echelonCompetitors  ecommerce  economics  email  etf  etl  europe  everquest  excel  experience  facebook  fico  finance  firehose  flowchart  foros  forum  free  github  goodreads  google  graphql  grep  gutneberg  hackernews  hadoop  have-read  hbr  hedge  hedgefund  hiring  hn  howto  hustle  ibm  ideas  ieee  imp  information  interesting  internet  intuition  investing  iphone  ir  java  Jesse  job  jobs  Johnson  json  julia  jupyter  kaggle  kettle  knowledge  language  lda  leaderboard  learning  lecture  lectures  library  linkedin  linux  list  log  lumt  mac  machienlearning  machine  machine-learning  machinelearning  machine_learning  magentodevelopersindia  magentodevelopmentindia  mahout  mapreduce  market  marketing  mashup  massive  math  matlab  metrics  mind  mining  ml  mmorpg  mobile  mondrian  money  movies  mturk  music  mysql  nature  nerds  netflix  netflixprize  networks  news  newsmetrics  newyorktimes  nips2012  nlp  notebook  numerai  numerics  nyt  okcupid  online  onlinedating  ontology  open  opendata  opensource  opera  opinion  orange  oreilly  overfitting  pageviews  pandas  papers  parse  patterns  pdf  pentaho  personalization  photo  politics  portfolio  postgresql  powershell  prediction  predictive  privacy  prize  probability  programming  proprietary  pubsub  pvalues  python  python-url-scaping-beautifulsoup  quant  r  random  react  recognition  recommendation  recommendations  reddit  reference  relay  research  reservoir  retrieval  reuters  rlanguage  RobertsPaige  rproject  rstatistics  ruby  saas  salary  sampling  scale  science  scientificmethod  scrape  scraper  scraping  scrapy  search  semantic  semanticweb  SEO  service  shazam  shell  shipping  skills  slides  sna  social  socialmedia  socialscience  socialsoftware  sociology  software  soup  splunk  stanford  startup  statistics  Statistikk  stats  stock  streaming  study  submissions  supply  surveillance  targeting  technical  telco  text  textmining  tf-idf  tfidf  theory  thomson  tool  toolkit  tools  Topology  toread  trading  trends  tutorial  tutorials  twitter  ucsd  usa  via:zite  video  videolectures  videos  visualization  weather  web  web-scraping  web2.0  webdev  webscraping  wget  wiki  wikipedia 

Copy this bookmark: