Transactions of the Association for Computational Linguistics

Papers
(The median citation count of Transactions of the Association for Computational Linguistics is 3. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
ArticleCitations
SpanBERT: Improving Pre-training by Representing and Predicting Spans545
A Primer in BERTology: What We Know About How BERT Works355
Multilingual Denoising Pre-training for Neural Machine Translation269
Topic Modeling in Embedding Spaces250
How Can We Know What Language Models Know?213
KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation158
Efficient Content-Based Sparse Attention with Routing Transformers133
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks114
What BERT Is Not: Lessons from a New Suite of Psycholinguistic Diagnostics for Language Models111
Sparse, Dense, and Attentional Representations for Text Retrieval91
SummEval: Re-evaluating Summarization Evaluation75
TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages70
A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation66
A Survey on Automated Fact-Checking59
Nested Named Entity Recognition via Second-best Sequence Learning and Decoding49
Compressing Large-Scale Transformer-Based Models: A Case Study on BERT46
ByT5: Towards a Token-Free Future with Pre-trained Byte-to-Byte Models44
oLMpics-On What Language Model Pre-training Captures38
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages36
Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs33
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond32
Machine Learning–Driven Language Assessment31
Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers31
Gender Bias in Machine Translation30
Dealing with Disagreements: Looking Beyond the Majority Vote in Subjective Annotations30
Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension28
BLiMP: The Benchmark of Linguistic Minimal Pairs for English28
Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation28
CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset27
Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals27
Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP26
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering26
Break It Down: A Question Understanding Benchmark25
Soloist: BuildingTask Bots at Scale with Transfer Learning and Machine Teaching25
Measuring and Improving Consistency in Pretrained Language Models25
Theoretical Limitations of Self-Attention in Neural Sequence Models24
An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models23
PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them23
AMR-To-Text Generation with Graph Transformer23
How Can We Know When Language Models Know? On the Calibration of Language Models for Question Answering23
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets22
Extractive Opinion Summarization in Quantized Transformer Spaces22
Syntax-Guided Controlled Generation of Paraphrases22
The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation22
Membership Inference Attacks on Sequence-to-Sequence Models: Is My Data In Your Machine Translation System?21
Data-to-text Generation with Macro Planning20
Relevance-guided Supervision for OpenQA with ColBERT20
Modeling Global and Local Node Contexts for Text Generation from Knowledge Graphs20
SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization19
MasakhaNER: Named Entity Recognition for African Languages17
Canine: Pre-training an Efficient Tokenization-Free Encoder for Language Representation17
Data Weighted Training Strategies for Grammatical Error Correction17
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval16
Best-First Beam Search16
Does Syntax Need to Grow on Trees? Sources of Hierarchical Inductive Bias in Sequence-to-Sequence Networks16
Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension16
Explanation-Based Human Debugging of NLP Models: A Survey15
Why Does Surprisal From Larger Transformer-Based Language Models Provide a Poorer Fit to Human Reading Times?14
Target-Guided Structured Attention Network for Target-Dependent Sentiment Analysis14
Did Aristotle Use a Laptop?A Question Answering Benchmark with Implicit Reasoning Strategies14
Context-aware Adversarial Training for Name Regularity Bias in Named Entity Recognition14
Acoustic-Prosodic and Lexical Cues to Deception and Trust: Deciphering How People Detect Lies14
Multilingual Autoregressive Entity Linking13
An Empirical Survey of Data Augmentation for Limited Data Learning in NLP13
Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics13
PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains13
Unsupervised Quality Estimation for Neural Machine Translation13
Planning with Learned Entity Prompts for Abstractive Summarization12
A Survey on Cross-Lingual Summarization12
Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased Proximities in Word Embeddings12
PERL: Pivot-based Domain Adaptation for Pre-trained Deep Contextualized Embedding Models12
EDITOR: An Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints12
FeTaQA: Free-form Table Question Answering11
ABNIRML: Analyzing the Behavior of Neural IR Models11
TopiOCQA: Open-domain Conversational Question Answering with Topic Switching11
Time-Aware Language Models as Temporal Knowledge Bases11
Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary10
QED: A Framework and Dataset for Explanations in Question Answering10
Sketch-Driven Regular Expression Generation from Natural Language and Examples10
A Graph-based Model for Joint Chinese Word Segmentation and Dependency Parsing10
Task-Oriented Dialogue as Dataflow Synthesis10
Aligning Faithful Interpretations with their Social Attribution10
Sentence Similarity Based on Contexts10
Infusing Finetuning with Semantic Dependencies10
Provable Limitations of Acquiring Meaning from Ungrounded Form: What Will Future Language Models Understand?10
Better Document-Level Machine Translation with Bayes’ Rule9
Improving Dialog Evaluation with a Multi-reference Adversarial Dataset and Large Scale Pretraining9
Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering9
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups9
Phonotactic Complexity and Its Trade-offs9
Adaptive Semiparametric Language Models9
Decontextualization: Making Sentences Stand-Alone8
WikiAsp: A Dataset for Multi-domain Aspect-based Summarization8
Benchmarking Large Language Models for News Summarization8
Efficient Methods for Natural Language Processing: A Survey8
Revisiting Multi-Domain Machine Translation7
Unsupervised Bitext Mining and Translation via Self-Trained Contextual Embeddings7
Differentiable Subset Pruning of Transformer Heads7
Modeling Content and Context with Deep Relational Learning7
Augmenting Transformers with KNN-Based Composite Memory for Dialog7
Revisiting Few-shot Relation Classification: Evaluation Data and Classification Schemes7
In-Context Retrieval-Augmented Language Models7
True Few-Shot Learning with Prompts—A Real-World Perspective7
Syntactic Structure Distillation Pretraining for Bidirectional Encoders7
Transformers for Tabular Data Representation: A Survey of Models and Applications7
It’s not Rocket Science: Interpreting Figurative Language in Narratives6
A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings6
Decoding Brain Activity Associated with Literal and Metaphoric Sentence Comprehension Using Distributional Semantic Models6
Pre-train, Prompt, and Recommendation: A Comprehensive Survey of Language Modeling Paradigm Adaptations in Recommender Systems6
Efficient Long-Text Understanding with Short-Text Models6
Controllable Summarization with Constrained Markov Decision Process6
Compositional Evaluation on Japanese Textual Entailment and Similarity6
Document Summarization with Latent Queries6
Generative Spoken Dialogue Language Modeling6
Improving Candidate Generation for Low-resource Cross-lingual Entity Linking6
Neural Modeling for Named Entities and Morphology (NEMO2)6
Pretraining the Noisy Channel Model for Task-Oriented Dialogue6
Reducing Confusion in Active Learning for Part-Of-Speech Tagging6
Characterizing English Variation across Social Media Communities with BERT6
A Statistical Analysis of Summarization Evaluation Metrics Using Resampling Methods6
Let’s PlayMono-Poly: BERT Can Reveal Words’ Polysemy Level and Partitionability into Senses5
Neural OCR Post-Hoc Correction of Historical Corpora5
Lost in the Middle: How Language Models Use Long Contexts5
♫ MuSiQue: Multihop Questions via Single-hop Question Composition5
Dialogue State Tracking with Incremental Reasoning5
Deciphering Undersegmented Ancient Scripts Using Phonetic Prior5
End-to-end Argument Mining with Cross-corpora Multi-task Learning5
A Computational Framework for Slang Generation5
AMR Similarity Metrics from Principles5
How Much Do Language Models Copy From Their Training Data? Evaluating Linguistic Novelty in Text Generation Using RAVEN5
Czech Grammar Error Correction with a Large and Diverse Corpus5
FaithDial: A Faithful Benchmark for Information-Seeking Dialogue5
Locally Typical Sampling5
Self-supervised Regularization for Text Classification5
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark5
Generate, Annotate, and Learn: NLP with Synthetic Text5
Hallucinations in Large Multilingual Translation Models5
Conversation Graph: Data Augmentation, Training, and Evaluation for Non-Deterministic Dialogue Management4
Morphology Matters: A Multilingual Language Modeling Analysis4
Evaluating Document Coherence Modeling4
Idiomatic Expression Identification using Semantic Compatibility4
ParsiNLU: A Suite of Language Understanding Challenges for Persian4
Hate Speech Classifiers Learn Normative Social Stereotypes4
Hierarchical Mapping for Crosslingual Word Embedding Alignment4
Saturated Transformers are Constant-Depth Threshold Circuits4
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon4
Latent Compositional Representations Improve Systematic Generalization in Grounded Question Answering4
Fact Checking with Insufficient Evidence4
Rank-Aware Negative Training for Semi-Supervised Text Classification4
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision4
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition4
Data-to-text Generation with Variational Sequential Planning4
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation4
Reproducible and Efficient Benchmarks for Hyperparameter Optimization of Neural Machine Translation Systems4
Reducing Conversational Agents’ Overconfidence Through Linguistic Calibration4
Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval4
Structural Persistence in Language Models: Priming as a Window into Abstract Language Representations4
Heterogeneous Supervised Topic Models4
ProoFVer: Natural Logic Theorem Proving for Fact Verification4
Erratum: “BLiMP: The Benchmark of Linguistic Minimal Pairs for English”4
On the Effect of Anticipation on Reading Times3
Learning English with Peppa Pig3
What Does My QA Model Know? Devising Controlled Probes Using Expert Knowledge3
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement3
Relational Memory-Augmented Language Models3
On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method3
Visual Spatial Reasoning3
Word Representation Learning in Multimodal Pre-Trained Transformers: An Intrinsic Evaluation3
Learning Lexical Subspaces in a Distributional Vector Space3
Coreference Resolution through a seq2seq Transition-Based System3
On the Difficulty of Translating Free-Order Case-Marking Languages3
Unsupervised Abstractive Opinion Summarization by Generating Sentences with Tree-Structured Topic Guidance3
Supervised Gradual Machine Learning for Aspect-Term Sentiment Analysis3
An End-to-End Contrastive Self-Supervised Learning Framework for Language Understanding3
Neural Event Semantics for Grounded Language Understanding3
Unsupervised Discourse Constituency Parsing Using Viterbi EM3
Strong Equivalence of TAG and CCG3
Revisiting Negation in Neural Machine Translation3
Testing the Predictions of Surprisal Theory in 11 Languages3
Formal Language Recognition by Hard Attention Transformers: Perspectives from Circuit Complexity3
Consistent Transcription and Translation of Speech3
Word Acquisition in Neural Language Models3
Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference3
Identity-Based Patterns in Deep Convolutional Networks: Generative Adversarial Phonology and Reduplication3
Meta-Learning a Cross-lingual Manifold for Semantic Parsing3
0.039582967758179