jerid.francom + linguistics   371

The advantages of using count() to get N-way frequency tables as data frames in R
I recently introduced how to use the count() function in the “plyr” package in R to produce 1-way frequency tables. Several…
r  linguistics  380  tools  data  tables  xtabs  plyr 
april 2016 by jerid.francom
A short note on mapping text
Kristina had a question. So I started puttering. We came up with this. 1. Grab the text of a blog post (but not too much, or do this in a bunch of rounds). 2. Put it at the end of this URL: like so: Ernesto Rivera Gracias was in El Salvador and…
linguistics  tools  mapping  toponyms 
july 2015 by jerid.francom
OpenRefine : A free, open source, power tool for working with messy data
linguistics  tools  software  data  analytics  cleaning 
february 2015 by jerid.francom
Is the Innanet RUINING teh English Language??? ¯\(°_o)/¯
There exists a certain paranoia that the web will somehow destroy the English language as we all start communicating solely in LOLs and smileys. But seen another way, the linguistic tricks we've enlisted to portray attitude and action, tone and meaning through text online are just the natural evolution of the written word—a way to adapt to the absence of facial cues and recreate the quirks of IRL conversation in the contextless vacuum of a chat window.
linguistics  380  150  internet  prescriptivist  evolution  writing  speech 
january 2015 by jerid.francom
Columbia computer science course will bring in the humanities
Students seeking to learn the basics of computer science in the context of the humanities, the social sciences, or economics will now have an opportunity to do just that.
linguistics  corpora  380  computer-science  interdisciplinary  python 
november 2014 by jerid.francom
Linguistic Mapping Reveals How Word Meanings Sometimes Change Overnight | MIT Technology Review
Data mining the way we use words is revealing the linguistic earthquakes that constantly change our language.
linguistics  corpora  nlp  vector-space-models  semantics  language  change  variation 
november 2014 by jerid.francom
Endangered Languages Project
The Endangered Languages Project is a collaborative online platform for sharing knowledge and resources for endangered languages. Join this global effort to conserve linguistic diversity.
language  documentation  maps  linguistics  150  endangeredlanguage 
october 2014 by jerid.francom
Distant reading and the blurry edges of genre. | The Stone and the Shell
There are basically two different ways to build collections for distant reading. You can build up collections of specific genres, selecting volumes that you know belong to them. Or you can take an entire digital library as your base collection, and subdivide it by genre. Most people do it the first way, and having just…
linguistics  corpora  corpus  nlp  genre  380  literature 
october 2014 by jerid.francom

