data-collection   153

« earlier    

Irish State told to delete ‘unlawful’ data on 3.2m citizens
This is amazing:
The State has been told it must delete data held on 3.2 million citizens, which was gathered as part of the roll-out of the Public Services Card, as there is no lawful basis for retaining it.

In a highly critical report on its investigation into the card, the Data Protection Commission found there was no legal reason to make individuals obtain the card in order to access State services such as renewing a driving licence or applying for a college grant. [...]

Helen Dixon, the Data Protection Commissioner, told The Irish Times that forcing people to obtain such a card for services other than those provided by the department was “unlawful from a data-processing point of view”.
psc  ireland  politics  data-privacy  privacy  data-collection  dpo  dpc 
8 days ago by jm
The Point of Collection
> The below-the-surface work of a particular data set is joined to the reasons and means that created it and the relationships running through those reasons and means.
data-collection  bias  archives 
april 2019 by tarakc02
Amazon Dark Patterns | Hacker News
Prompted by this post, I wanted to check what happened to the 1-star review I have left 6 months ago. (Product worked for 3 days and then stopped, and after a replacement, the same thing happened). Sure enough, I have 0 comment in my profile, and I just checked, it has also disappeared from the product page.

This is shady as hell, because I am 100% sure I wrote this review. I even wrote it twice, once on and once translated on a local amazon site. This is slightly infuriating.
amazon  dark.patterns  all-your-data-are-belong-to-us  privacy  data-collection 
july 2018 by MarcK
Respectful Collection of Demographic Data
Demographic data may be critical to your mission as a community center, legally required diversity disclosure of a corporation, or an idle curiosity of a blogger to understand their followers.

Whatever your reason, this article establishes some guidelines for respectful design of your form and language for collecting demographic data.
data-collection  demographic-data  form-fields  ux-writing  content-strategy 
july 2018 by marysbutler
Garbage In, Garbage Out: machine learning has not repealed the iron law of computer science / Boing Boing
> The problem is as old as data-processing itself: garbage in, garbage out. Assembling the large, well-labeled datasets needed to train machine learning systems is a tedious job (indeed, the whole point and promise of machine learning is to teach computers to do this work, which humans are generally not good at and do not enjoy). The shortcuts we take to produce datasets come with steep costs that are not well-understood by the industry.

> It's an important lesson for product design, but even more important when considering machine learning's increasing role in adversarial uses like predictive policing, sentencing recommendations, parole decisions, lending decisions, hiring decisions, etc. These datasets are just as noisy and faulty and unfit for purpose as the datasets Warden cites, but their garbage out problem ruins peoples' lives or gets them killed.
machine-learning  bias  algorithmic-bias  training-data  data-collection  labeled-data 
may 2018 by tarakc02
Open Data Kit
Open Data Kit (ODK) is a free and open-source set of tools which help organizations author, field, and manage mobile data collection solutions.
forms  java  android  data-collection 
may 2018 by spl

« earlier    

related tags

***  *  2016  20bn  2fw  accuracy  active-learning  adaptive  advertising  affective-computing  airbnb  algorithmic-bias  all-your-data-are-belong-to-us  amazon  analysis  analytics  android  annotation  api  apple  ar  architecture  archives  area  article  asset-management  atom  babel  bar-graphs  bias  biodiversity  biology  blog  box-and-whisker-plots  budget  business  cacti  charts  circle-graphs  citizen-science  client  climate  clinical-trial  cloud-computing  cloud  coding  collaboration  collecting  collection  common-crawl  competitive-analysis  conservation  content-strategy  content  cost  course  crawl  crime-data  crowdsource  crowdsourcing  cryosphere  current  dark.patterns  dashboard  data-analysis  data-curation  data-displays  data-extraction  data-mining  data-privacy  data-publishing  data-quality  data-sharing  data-sources  data-validation  data-warehousing  data  data_preservation  database  dating  deep-learning  demographic-data  development  digital-humanities  digital-media  digitalcuration  documentation  dpc  dpo  earth-science  ebooks  ec2  ecology  economics  engineering  environment  ethics  event  events  experience-sampling  expertise  extraction  face  facebook  federal  feeds  feminism  finance  financial-plan  financial-planning  form-fields  forms  frequency-distribution  funny  fw  ganglia  geek  geo  git  github  google  government  graphing  hadoop  health-care  health  healthcare  histograms  history  html  humidity  ice  images  indicia  infographics  infrastructure  instapaper  integration  ios  iphone  ireland  java  javascript  jquery  json  jsoneditor  keystroke-logger  kindle  labeled-data  labeling  labour  language  law  life-logging  light  line-graphs  line-plots  linguistics  linux  logging  london  low-resource  machine-learning  magnetic-field  management  map-reduce  maps  math-5  math-6  math-7  mattermark  mechanical-turk  metadata  meteorology  metrics  misleading-graphs  mobile  mongodb  monitoring  motion  munin  nasa  news  nlp  opal  open-data  opensource  papers  parsing  peer-review  people  personal-data  pictographs  police-data  politics  precipitation  privacy  programming  project  protocol  psc  public-health  publishing  pubsubhubbub  python-modules  python  quantified-self  rails  reading  regulation  reliability  research  rss  ruby  s3  saas  san-francisco  scatter-plots  scholarly-communication  science  scraping  search  self-surveillance  sensor  sharing-economy  sharing  social-media  social-networks  social-science  soft  software  sparkline  specification  statistics  stats  stratify  stumbleupon  survey  surveys  sysadmin  tally  tech  temperature  text-manipulation  thesis  time-series  tool  toolkit  tools  top  tracking  training-data  training  travel  tube  twitter  usability  ux-writing  validate  visualization  volunteer  web-pages  web-service  web-services  web  web_development  webdev  webscraping  wireless  xml 

Copy this bookmark: