feedback loop n-gram analyzer
september 2011 by jm
'a simple parser of ARF compliant FBL complaints, which normalizes the email complaints and generates a 6-tuple n-gram version of the message. These n-grams are stored in a Redis database, keyed by the file in which they can be found. An inverse index also exists that allow you to find all messages containing a particular n-gram word.'
anti-spam
spam
fbl
feedback
filtering
n-grams
similarity
hashing
redis
searching
september 2011 by jm
Dr. Neal Krawetz explains perceptual hashing
june 2011 by jm
ie. TinEye and other "images like this one" search engines. nice explanation
algorithm
images
analysis
programming
dct
hashing
perceptual-hash
tineye
via:hn
image
june 2011 by jm
deeptoad - Project Hosting on Google Code
november 2010 by jm
'a (python) library and a tool to clusterize similar files using fuzzy hashing techniques. This project is inspired by the well known tool ssdeep.' Via Nelson
via:nelson
deeptoad
software
open-source
fuzzy
hashing
from delicious
november 2010 by jm
3 Rules of thumb for Bloom Filters
november 2010 by jm
good to know (via Jeremy)
via:jzawodny
bloom-filters
hashing
algorithms
coding
tips
false-positives
from delicious
november 2010 by jm
Stop using unsafe keyed hashes, use HMAC
october 2009 by jm
why HMAC is more secure than secret-suffix and secret-prefix keyed hashing. good to know
hmac
security
crypto
hashing
md5
hashes
sha256
sha1
from delicious
october 2009 by jm
related tags
algorithm ⊕ algorithms ⊕ analysis ⊕ anti-spam ⊕ bloom-filters ⊕ coding ⊕ crypto ⊕ dct ⊕ deeptoad ⊕ false-positives ⊕ fbl ⊕ feedback ⊕ filtering ⊕ fuzzy ⊕ hashes ⊕ hashing ⊖ hmac ⊕ image ⊕ images ⊕ md5 ⊕ n-grams ⊕ open-source ⊕ perceptual-hash ⊕ programming ⊕ redis ⊕ searching ⊕ security ⊕ sha1 ⊕ sha256 ⊕ similarity ⊕ software ⊕ spam ⊕ tineye ⊕ tips ⊕ via:hn ⊕ via:jzawodny ⊕ via:nelson ⊕Copy this bookmark: