BERT 381
A Visual Guide to Using BERT for the First Time
6 days ago by drmeme
An introduction and visual tutorial on using BERT. Simple enough to get started and advanced enough to give one an understanding of why it works.
bert
nlp
classification
sentiment
The Illustrated BERT, ELMo, and co. (How NLP Cracked Transfer Learning)
6 days ago by drmeme
An overview of how BERT classification works for NLP. Some history. Some connections to other work. Very much worth a read.
bert
nlp
elmo
sentenceclassification
Twitter
6 days ago by typhon
RT @InstitutduGalo: #Galo is waking up with the Cllâssiers association and Fabien Lecuyer! #Bertègn
//
The Gallo language is waking up with the assoc…
Galo
Bert
from twitter_favs
[1906.01502] How multilingual is Multilingual BERT?
9 days ago by arsyed
In this paper, we show that Multilingual BERT (M-BERT), released by Devlin et al. (2018) as a single language model pre-trained from monolingual corpora in 104 languages, is surprisingly good at zero-shot cross-lingual model transfer, in which task-specific annotations in one language are used to fine-tune the model for evaluation in another language. To understand why, we present a large number of probing experiments, showing that transfer is possible even to languages in different scripts, that transfer works best between typologically similar languages, that monolingual corpora can train models for code-switching, and that the model can find translation pairs. From these results, we can conclude that M-BERT does create multilingual representations, but that these representations exhibit systematic deficiencies affecting certain language pairs.
nlp
transfer-learning
multilingual
bert
related tags
adversarial ai albert algorithms algoritmes allennlp analysis api arxiv attention bert blogs branding cade_metz christopherpenn classification cloudcomputing compress compression context coreml dataset deep-learning deeplearning development distillation elasticsearch elmo embeddings entityextraction eric_wallace evaluation explainability exposition fairness fakenews fast.ai fast_ai fastai featuredsnippets fine-tune fine-tuning galo generative german google gp2 gpgpu gpt-2 gpt2 grover healthcare heygoogle howto interpretability intro ir keras language later learning libs linguistics machine-learning machine_learning machinelearning microservices ml mobile model models multilingual muppets neal_lathia netapinotes neural-mt neural-net neuralnetwork news newsletter nlp nlu nn nvidia openai opensource paper pretrained python pytorch qa quantization question_answering ranking research-article research search-intent search sem semantics sentenceclassification sentiment sentimentanalysis seo squad summarization swift tensorflow text textanalysis textgeneration textmining toread transfer-learning transferlearning transformer transformers trends tutorial tutorials video vision washingtonpost wordembedding