Transactions of the Association for Computational Linguistics

Papers
(The median citation count of Transactions of the Association for Computational Linguistics is 4. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
Overcoming Source Object Grounding for Semantic Image Editing539
From Robustness to Improved Generalization and Calibration in Pre-trained Language Models258
KEFT: Knowledge-Enhanced Fine-Tuning for Large Language Models in Domain-Specific Question Answering219
DARE: Diverse Visual Question Answering with Robustness Evaluation218
Persona-Aware Alignment Framework for Personalized Dialogue Generation195
Cross-functional Analysis of Generalization in Behavioral Learning133
How to Select Datapoints for Efficient Human Evaluation of NLG Models?99
Segmentation-Free Streaming Machine Translation99
The Ethics of Automating Legal Actors97
State of What Art? A Call for Multi-Prompt LLM Evaluation96
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection93
Transformers for Tabular Data Representation: A Survey of Models and Applications92
The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation78
Erasure of Unaligned Attributes from Neural Representations77
A Survey of Text Games for Reinforcement Learning Informed by Natural Language75
Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval69
Revisiting Meta-evaluation for Grammatical Error Correction68
T 2 -NER: A Two-Stage Span-Based Framework for Unified Named Entity Recognition with Templates60
A Survey on Automated Fact-Checking59
Bridging the Gap between Synthetic and Natural Questions via Sentence Decomposition for Semantic Parsing58
Learning English with Peppa Pig57
Do Multi-Document Summarization Models Synthesize?56
Retrieval-Pretrained Transformer: Long-range Language Modeling with Self-retrieval55
DEAR: Disentangled Event-Agnostic Representation Learning for Early Fake News Detection52
Uncertainty Estimation and Reduction of Pre-trained Models for Text Regression50
Benchmarking the Generation of Fact Checking Explanations49
Frame Representation Hypothesis: Multi-Token LLM Interpretability and Concept-Guided Text Generation49
Federated Learning for Exploiting Annotators’ Disagreements in Natural Language Processing47
Context-Aware Machine Translation with Source Coreference Explanation45
Learning More from Mixed Emotions: A Label Refinement Method for Emotion Recognition in Conversations43
mtRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems42
Time-Aware Language Models as Temporal Knowledge Bases42
How to Dissect a Muppet: The Structure of Transformer Embedding Spaces39
Few-Shot Multilingual Open-Domain QA from Five Examples37
Scientia Potentia Est—On the Role of Knowledge in Computational Argumentation36
To Diverge or Not to Diverge: A Morphosyntactic Perspective on Machine Translation vs Human Translation34
Adversarial Defense without Adversarial Defense : Enhancing Language Model Robustness via Instance-level Principal Component Removal33
Compositional Evaluation on Japanese Textual Entailment and Similarity32
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets32
Culturally Aware and Adapted NLP: A Taxonomy and a Survey of the State of the Art31
Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation27
Morphology Without Borders: Clause-Level Morphology27
Template-based Abstractive Microblog Opinion Summarization26
An Energy-based Model for Word-level AutoCompletion in Computer-aided Translation26
Communication Drives the Emergence of Language Universals in Neural Agents: Evidence from the Word-order/Case-marking Trade-off26
Toward Robust RALMs: Revealing the Impact of Imperfect Retrieval on Retrieval-Augmented Language Models26
Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition26
ProoFVer: Natural Logic Theorem Proving for Fact Verification25
PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains25
Are Triggers Needed for Document-Level Event Extraction?25
Conformal Prediction for Natural Language Processing: A Survey25
Questions Are All You Need to Train a Dense Passage Retriever25
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale25
Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?25
True Few-Shot Learning with Prompts—A Real-World Perspective22
Prompt Contrastive Transformation: An Enhanced Strategy for Efficient Prompt Transfer in Natural Language Processing21
Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks21
Accurate and Efficient Fine-Tuning of Quantized Large Language Models Through Optimal Balance in Adaptation21
Canine: Pre-training an Efficient Tokenization-Free Encoder for Language Representation19
Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis19
Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models18
Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation17
Navigating the Landscape of Hint Generation Research: From the Past to the Future16
OpenFact: Factuality Enhanced Open Knowledge Extraction16
Retrieve What You Need: A Mutual Learning Framework for Open-domain Question Answering16
InSCIt: Information-Seeking Conversations with Mixed-Initiative Interactions16
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation16
Neuron-level Interpretation of Deep NLP Models: A Survey16
Sense-specific Historical Word Usage Generation15
Efficient Long-Text Understanding with Short-Text Models15
Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models15
Interactive Machine Teaching by Labeling Rules and Instances15
Addressing the Binning Problem in Calibration Assessment through Scalar Annotations14
A Confidence-based Acquisition Model for Self-supervised Active Learning and Label Correction14
Objectifying the Subjective: Cognitive Biases in Topic Interpretations13
Structural Persistence in Language Models: Priming as a Window into Abstract Language Representations13
ABNIRML: Analyzing the Behavior of Neural IR Models13
BharatBBQ: A Multilingual Bias Benchmark for Question Answering in the Indian Context13
Pre-train, Prompt, and Recommendation: A Comprehensive Survey of Language Modeling Paradigm Adaptations in Recommender Systems13
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?13
MENLI: Robust Evaluation Metrics from Natural Language Inference13
Explainable Abuse Detection as Intent Classification and Slot Filling13
TaxoPro: A Plug-In LoRA-based Cross-Domain Method for Low-Resource Taxonomy Completion12
Human Choice Prediction in Language-based Persuasion Games: Simulation-based Off-Policy Evaluation12
Safe Pruning LoRA: Robust Distance-Guided Pruning for Safety Alignment in Adaptation of LLMs12
Learning Fair Representations via Rate-Distortion Maximization12
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization11
How “Real” is Your Real-Time Simultaneous Speech-to-Text Translation System?11
Self-Rationalization in the Wild: A Large-scale Out-of-Distribution Evaluation on NLI-related tasks11
Investigating Critical Period Effects in Language Acquisition through Neural Language Models11
Towards More Realistic Extraction Attacks: An Adversarial Perspective11
PaniniQA: Enhancing Patient Education Through Interactive Question Answering11
NLP Security and Ethics, in the Wild11
TANQ: An Open Domain Dataset of Table Answered Questions11
Adding Chocolate to Mint : Mitigating Metric Interference in Machine Translation11
Modeling Emotion Dynamics in Song Lyrics with State Space Models11
Is My Model Using the Right Evidence? Systematic Probes for Examining Evidence-Based Tabular Reasoning11
Data-driven Parsing Evaluation for Child-Parent Interactions10
xcomet : Transparent Machine Translation Evaluation through Fine-grained Error Detection10
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark10
Time-and-Space-Efficient Weighted Deduction10
Data-to-text Generation with Variational Sequential Planning10
Helpful Neighbors: Leveraging Neighbors in Geographic Feature Pronunciation10
Sub-Character Tokenization for Chinese Pretrained Language Models10
Patchwise Cooperative Game-based Interpretability Method for Large Vision-language Models10
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding9
Visual Spatial Reasoning9
Step-by-Step Unmasking for Parameter-Efficient Fine-Tuning of Large Language Models9
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages9
FeTaQA: Free-form Table Question Answering9
Benchmarking Large Language Models for News Summarization9
Assessing the Capacity of Transformer to Abstract Syntactic Representations: A Contrastive Analysis Based on Long-distance Agreement9
End-to-end Argument Mining with Cross-corpora Multi-task Learning9
Large Language Models Enable Few-Shot Clustering9
Decomposing and Recomposing Event Structure9
Know Your Limits: A Survey of Abstention in Large Language Models9
Evaluating Transformer Models and Human Behaviors on Chinese Character Naming8
Abstractive Meeting Summarization: A Survey8
Direct Speech Translation for Automatic Subtitling8
QAmeleon: Multilingual QA with Only 5 Examples8
Diff-Explainer: Differentiable Convex Optimization for Explainable Multi-hop Inference8
How Abstract Is Linguistic Generalization in Large Language Models? Experiments with Argument Structure8
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation8
On the Effect of Instruction Tuning Loss on Generalization8
QE4PE: Word-level Quality Estimation for Human Post-Editing8
Conformalizing Machine Translation Evaluation7
Scope Ambiguities in Large Language Models7
Visually Grounded Speech Models Have a Mutual Exclusivity Bias7
Can Authorship Representation Learning Capture Stylistic Features?7
The Parallelism Tradeoff: Limitations of Log-Precision Transformers7
CreoleVal: Multilingual Multitask Benchmarks for Creoles7
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision7
♫ MuSiQue: Multihop Questions via Single-hop Question Composition7
A Cross-Linguistic Pressure for Uniform Information Density in Word Order7
Hallucinations in Large Multilingual Translation Models7
Meta-Learning the Difference: Preparing Large Language Models for Efficient Adaptation7
Expectations over Unspoken Alternatives Predict Pragmatic Inferences6
Cultural Adaptation of Recipes6
Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences6
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond6
A Multi-Level Optimization Framework for End-to-End Text Augmentation6
Collective Human Opinions in Semantic Textual Similarity6
A Comparative Approach for Auditing Multilingual Phonetic Transcript Archives6
Chinese Idiom Paraphrasing6
The Emergence of Argument Structure in Artificial Languages6
Robust Dialogue State Tracking with Weak Supervision and Sparse Data6
mGPT: Few-Shot Learners Go Multilingual5
Comparing Humans and Large Language Models on an Experimental Protocol Inventory for Theory of Mind Evaluation (EPITOME)5
Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering5
A Unifying Scheme for Extractive Content Selection Tasks5
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR5
How Much Semantic Information is Available in Large Language Model Tokens?5
Meta-Learning a Cross-lingual Manifold for Semantic Parsing5
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization5
Lost in the Middle: How Language Models Use Long Contexts5
Compositional Generalization in Multilingual Semantic Parsing over Wikidata5
STPar: A Structure-Aware Triaffine Parser for Screenplay Character Coreference Resolution5
Document Summarization with Latent Queries5
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends, and Metrics Analysis5
ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation5
Less is More: Mitigate Spurious Correlations for Open-Domain Dialogue Response Generation Models by Causal Discovery4
FoVer: First-Order Logic Verification for Natural Language Reasoning4
Investigating Reasons for Disagreement in Natural Language Inference4
Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions4
Optimal Transport Posterior Alignment for Cross-lingual Semantic Parsing4
FaithDial: A Faithful Benchmark for Information-Seeking Dialogue4
Relational Memory-Augmented Language Models4
Decision-Oriented Dialogue for Human-AI Collaboration4
KoBBQ: Korean Bias Benchmark for Question Answering4
Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference4
Saturated Transformers are Constant-Depth Threshold Circuits4
Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing4
Sentence Similarity Based on Contexts4
Hate Speech Classifiers Learn Normative Social Stereotypes4
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval4
RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns4
Shared Lexical Items as Triggers of Code Switching4
Can Authorship Attribution Models Distinguish Speakers in Speech Transcripts?4
Do LLMs Exhibit Human-like Response Biases? A Case Study in Survey Design4
Do Text Simplification Systems Preserve Meaning? A Human Evaluation via Reading Comprehension4
Naturalistic Causal Probing for Morpho-Syntax4
0.026962041854858