- Energy-Based Models for Text.
arxiv
- Recipes for building an open-domain chatbot.
arxiv
- FastWordBug: A Fast Method To Generate Adversarial Text Against NLP Applications.
arxiv
- Rapid Adaptation of BERT for Information Extraction on Domain-Specific Business Documents.
arxiv
- BERT-of-Theseus: Compressing BERT by Progressive Module Replacing.
arxiv
code
- BERTweet: A pre-trained language model for English Tweets.
arxiv
code
- Blank Language Models.
arxiv
- Controlling Computation versus Quality for Neural Sequence Models.
arxiv
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators.
arxiv
code ⭐
- Extending Multilingual BERT to Low-Resource Languages.
arxiv
- Limits of Detecting Text Generated by Large-Scale Language Models.
arxiv
- PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation.
arxiv
- Pretrained Transformers Improve Out-of-Distribution Robustness.
arxiv
- Semantics-aware BERT for Language Understanding.
arxiv
code
- Joint Embedding in Named Entity Linking on Sentence Level.
arxiv
- AmbigQA: Answering Ambiguous Open-domain Questions.
arxiv
- Asking and Answering Questions to Evaluate the Factual Consistency of Summaries.
arxiv
- Conversational Question Answering over Passages by Leveraging Word Proximity Networks.
arxiv
code
- Probing Emergent Semantics in Predictive Agents via Question Answering.
arxiv
- Unsupervised Commonsense Question Answering with Self-Talk.
arxiv
code