arabic   4298

« earlier    

Assem's Arabic Stemmer
Welcome to the Arabic Light Stemming Algorithm made for Snowball, it's fast and can be generated in many programming languages (through Snowball).
text  search  Arabic  language 
11 days ago by ironymark
Snowball
Snowball is a small string processing language designed for creating stemming algorithms for use in Information Retrieval. This site describes Snowball, and presents several useful stemmers which have been implemented using it.

The Snowball compiler translates a Snowball script into another language - currently ISO C, C#, Java, Javascript, Python, Rust and Go are supported.
text  Arabic  search  language 
11 days ago by ironymark
Language Analysis | Apache Solr Reference Guide 7.0
Solr provides support for the Light-10 (PDF) stemming algorithm, and Lucene includes an example stopword list.

This algorithm defines both character normalization and stemming, so these are split into two filters to provide more flexibility.

Factory classes: solr.ArabicStemFilterFactory, solr.ArabicNormalizationFilterFactory
Arabic  search  unicode 
11 days ago by ironymark
Lemur Project Home
The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software. The project is best known for its Indri search engine, Lemur Toolbar, and ClueWeb09 dataset. Our software and datasets are used widely in scientific and research applications, as well as in some commercial applications.
language  NLP  search  Arabic  unicode 
11 days ago by ironymark

« earlier    

related tags

1980  1980s  1990s  2011  2017  2018  africa  amytan  anatomy  anshumanpandey  apple  appropriation  arabic_language  art  article  ascii  ask  assimilation  astronomy  bad  beauty  behdad  bicon  biology  blog  book.art  book.history  books  books_in_browsers  bordder  border  bornaizadpanah  cairo  calligraphy  charts  chemistry  chinese  colonization  color  communication  connotation  creativity  critical-editions  culture  decolonization  design  dialects  dictionary  displacement  dohraahmed  duolingo  east  editing  education  egypt  egyptian-arabic  egyptian  eighthclimate  emoji  emojicon  english  esfahbod  facebook  fashion  fonts  for  foreign  foreigncy  free  french  fribidi  funny  game  gazetteer  geometry  google  grammar  graphs  hans-wehr  hardware  harryettemullen  hassan  hebrew  henricorbin  hindi  history  hmong  holosensory  hunusimaginalis  i18n  imaginal  immigration  indesign_resources  indoctrination  infographics  infoviz  international  internationalisation  internet  iqra  islam  islamic-science  islamophobia  jamesbaldwin  junejordan  kashida  kenwhistler  kurdish  kyledacuyan  lang:de  langauge  language-learning  language  languagelearning  languages  later  learning  letters  listenlist  literature  lithography  localization  logic  malcolmx  manga  maps  mar15  markbramhill  marwahelal  meaning  medieval  michaelaerard  middle  middleeast  mobile  mohammedhanif  msa  muhammadnoor  multilingual  national-socialism  netflix  nlp  non-latin  nowhere  nyiakengpuachuehmong  ocr  online  online_courses  ottoman  people  perception  persian  phones  pht101  pkk  place  plain-text  plugin  podcasts  poems  poetry  portraits  preservation  printing  priorities  probability  programming  proofs  psychology  publishing  punctuation  pyd  racism  reference  research  resistance  rohingya  rojava  rtl  script  search  sense  shopping?  slug  software  solmazsharif  spanish  sparklines  standards  statistics  stories  studio  style  subtitles  suheirhammad  syntax  technology  text-shaping  text  text_processing  tibet  tigalari  transformation  translation  travel  typeface  typesetting  typography  uk  unicode  urdu  urls  vegan  vernacular  vim  wagtailsite  web  whiteprivilege  whitesupremacy  wiki  word  words  worldbank  writing  yoast  yoast:  تاكسي   

Copy this bookmark:



description:


tags: