dataengineering   112

Know your Latencies – 97 Things – Medium
Every data system has three characteristics that uniquely identify it: the size of the data, the recency of the data, and the latency of queries on that data. You are probably familiar with the…
data:engineering  dataengineering 
19 days ago by ese
adilkhash/Data-Engineering-HowTo: A list of useful resources to learn Data Engineering from scratch
A list of useful resources to learn Data Engineering from scratch - adilkhash/Data-Engineering-HowTo
dataengineering  programming 
4 weeks ago by intoverflow
1, 2, 3...
(1) Integration with Notebooks
(2) Orchestrating & Models
TensorFlow  DataEngineering  Jupyter  Keras  from twitter_favs
april 2019 by cdrago
Why a data scientist is not a data engineer - O'Reilly Media
A few months ago, I wrote about the differences between data engineers and data scientists. I talked about their skills and common starting points. An interesting thing happened: the data scientists started pushing back, arguing that they are, in fact, as skilled as data engineers at data engineering. That was interesting because the data engineers didn’t push back saying they’re data scientists. So, I’ve spent the past few months gathering data and observing the behaviors of data scientists in their natural habitat. This post will offer more information about why a data scientist is not a data engineer.
data  datascience  dataengineering  oreilly 
april 2019 by dlkinney
Why a data scientist is not a data engineer - O'Reilly Media
Interesting... "Why a data scientist is not a data engineer" I think they are both distinct…
tm351  tm358  dataScience  dataEngineering  data  jobs  dataJobs 
april 2019 by psychemedia
