The advantages of using count() to get N-way frequency tables as data frames in R
Introduction: I recently introduced how to use the count() function in the “plyr” package to produce 1-way frequency tables in R. Several…
r  linguistics  380  tools  data  tables  xtabs  plyr 
april 2016 by jerid.francom
A short note on mapping text
Kristina had a question. So I started puttering. We came up with this. 1. Grab the text of a blog post (but not too much, or do this in a bunch of rounds). 2. Put it at the end of this URL: like so: Ernesto Rivera Gracias was in El Salvador and…
linguistics  tools  mapping  toponyms 
july 2015 by jerid.francom
OpenRefine : A free, open source, power tool for working with messy data
linguistics  tools  software  data  analytics  cleaning 
february 2015 by jerid.francom
Is the Innanet RUINING teh English Language??? ¯\(°_o)/¯
There exists a certain paranoia that the web will somehow destroy the English language as we all start communicating solely in LOLs and smileys. But seen another way, the linguistic tricks we've enlisted to portray attitude and action, tone and meaning through text online are just the natural evolution of the written word—a way to adapt to the absence of facial cues and recreate the quirks of IRL conversation in the contextless vacuum of a chat window.
linguistics  380  150  internet  prescriptivist  evolution  writing  speech 
january 2015 by jerid.francom
Columbia computer science course will bring in the humanities
Students seeking to learn the basics of computer science in the context of the humanities, the social sciences, or economics will now have an opportunity to do just that.
linguistics  corpora  380  computer-science  interdisciplinary  python 
november 2014 by jerid.francom
Linguistic Mapping Reveals How Word Meanings Sometimes Change Overnight | MIT Technology Review
Data mining the way we use words is revealing the linguistic earthquakes that constantly change our language.
linguistics  corpora  nlp  vector-space-models  semantics  language  change  variation 
november 2014 by jerid.francom
Endangered Languages Project
The Endangered Languages Project is a collaborative online platform for sharing knowledge and resources for endangered languages. Join this global effort to conserve linguistic diversity.
language  documentation  maps  linguistics  150  endangeredlanguage 
october 2014 by jerid.francom
Distant reading and the blurry edges of genre. | The Stone and the Shell
There are basically two different ways to build collections for distant reading. You can build up collections of specific genres, selecting volumes that you know belong to them. Or you can take an entire digital library as your base collection, and subdivide it by genre. Most people do it the first way, and having just…
linguistics  corpora  corpus  nlp  genre  380  literature 
october 2014 by jerid.francom
