jm + via:lemonodor   2

Cameras as Traffic Cops – Hacker Noon
Guy trains model on NYC traffic camera video to detect blocked bike lanes and bus stops. Estimates "the number of tickets being given represents less than .0001% of infractions". (via lemonodor)
via:lemonodor  future  tickets  traffic  nyc  cameras  surveillance  bike-lanes  bus-stops  traffic-law 
5 weeks ago by jm
Five Takeaways on the State of Natural Language Processing
Good overview of the state of the art in NLP nowadays. I particularly like word2vec interesting:
Embedding words as real-numbered vectors using a skip-gram, negative-sampling model (word2vec code) was mentioned in nearly every talk I attended. Either companies are using various word2vec implementations directly or they are building diffs off of the basic framework. Trained on large corpora, the vector representations encode concepts in a large dimensional space (usually 200-300 dim).

Quite similar to some tokenization approaches we experimented with in SpamAssassin, so I don't find this too surprising....
word2vec  nlp  tokenization  machine-learning  language  parsing  doc2vec  skip-grams  data-structures  feature-extraction  via:lemonodor 
may 2015 by jm

Copy this bookmark: