Toolkit for record linkage and deduplication written in Python
python  deduplication  record-linkage  data-wrangling  analytics  etl 
2 days ago
Python3 library for advanced bibliometrics.
bibliometrics  python  analytics 
2 days ago
Blot – a blogging platform with no interface | Hacker News
Neat daemon for hosting a blog in Dropbox or another place for static files.
blogging  cms  publishing  static  markdown 
3 days ago
data-8 - The UC Berkeley Foundations of Data Science
Combines inferential thinking, computational thinking, and real-world relevance.
course  data-science  programming  data-analysis  introduction  practical 
7 days ago
Network of decentralized communities powered by Ethereum.
community  decentralization  ethereum 
11 days ago
Voyager : MARC record export script
Voyager MARC export script with holdings support.
voyager  export  holdings 
12 days ago
Hal Daumé III (2017). A Course in Machine Learning (CIML)
Set of introductory materials that covers most major aspects of modern machine learning (supervised learning, unsupervised learning, large margin methods, probabilistic modeling, learning theory, etc.)
machine-learning  introduction  undergraduate  course  book  opencontent 
16 days ago
Machine Learning Crash Course  |  Google Developers
Practical 20-hour introduction to machine learning fundamentals, with companion TensorFlow exercises.
ai  machine-learning  google  tensorflow 
19 days ago
natasha/yargy: Tiny package for information extraction
Pure Python implementation of an Earley parser for the Russian language.
parser  python  language  russian  nlp 
20 days ago
A library for defensive data analysis, e.g. for making sanity checks in an ETL pipeline.
data-analysis  python  defensive-coding  analytics 
20 days ago
Frictionless Data
Suite of tools for dataset quality control.
data-analysis  pipeline  validation  etl 
21 days ago
Unit tests in IPython notebooks. Supports pytest / unittest.
python  testing  jupyter 
21 days ago
Wired Elements
Neat web components library for creating functional wireframes.
javascript  prototyping  sketch  wireframe  lo-fi 
25 days ago
Modern terminal multiplexer.
linux  screen  terminal  tmux  cli  productivity 
25 days ago
Site and web service for monitoring scientific journals.
journal  monitoring  rss  development  research 
26 days ago
FOIA-based releases of US government documents.
information  politics  journalism  tools  source  research 
28 days ago
Writes selected parts of a MARC XML bibliographic data file into a .csv.gz file.
marcxml  extract  analysis  etl  bibliographic 
4 weeks ago
Data analysis group researching human rights violations.
data-analysis  human-rights  war  monitoring  datamining 
5 weeks ago
Shiny application for inspecting structural topic models
stm  topic-modeling  shiny  r  package 
5 weeks ago
MapAnalyst
For georeferencing and analyzing old maps.
gis  cartography  maps  antique  research  opensource 
6 weeks ago
NCSA Brown Dog
Research data management and preservation, including for unstructured data.
data-management  service  infrastructure  digital-humanities  preservation  archive 
6 weeks ago
elixirschool/homework
A collection of coding exercises to be completed in conjunction with the lessons available on elixirschool.com
webdev  elixir  learning  programming 
6 weeks ago