Computer Speech and Language

Papers
(The TQCC of Computer Speech and Language is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
ArticleCitations
Corpus and unsupervised benchmark: Towards Tagalog grammatical error correction87
A language-agnostic model of child language acquisition87
Stochastic Data-to-Text Generation Using Syntactic Dependency Information80
Automatic detection of behavioural codes in team interactions78
KddRES: A Multi-level Knowledge-driven Dialogue Dataset for Restaurant Towards Customized Dialogue System72
Editorial Board70
Speech enhancement approach for body-conducted unvoiced speech based on Taylor–Boltzmann machines trained DNN43
Seq2Seq dynamic planning network for progressive text generation42
Towards privacy-preserving conversation analysis in everyday life: Exploring the privacy-utility trade-off40
Optimization of modular multi-speaker distant conversational speech recognition39
Room impulse response reshaping-based expectation–maximization in an underdetermined reverberant environment39
Monotonic Gaussian regularization of attention for robust automatic speech recognition33
Unsupervised question-retrieval approach based on topic keywords filtering and multi-task learning32
Identifying offensive memes in low-resource languages: A multi-modal multi-task approach using valence and arousal31
Misogynistic attitude detection in YouTube comments and replies: A high-quality dataset and algorithmic models31
Contextual emotion detection using ensemble deep learning30
Editorial Board30
A method of phonemic annotation for Chinese dialects based on a deep learning model with adaptive temporal attention and a feature disentangling structure29
Multi-branch feature aggregation based on multiple weighting for speaker verification25
Complementary regional energy features for spoofed speech detection24
PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition24
Compositional domain adaptation for automatic speech recognition with headwise selective attention merging21
Improving self-supervised learning model for audio spoofing detection with layer-conditioned embedding fusion21
Predicting accentedness and comprehensibility through ASR scores and acoustic features21
Vishing: Detecting social engineering in spoken communication — A first survey & urgent roadmap to address an emerging societal challenge21
Editorial Board20
A hybrid approach to Natural Language Inference for the SICK dataset20
Augmentative and alternative speech communication (AASC) aid for people with dysarthria20
Editorial Board19
Enhancing analysis of diadochokinetic speech using deep neural networks18
Maximal activation weighted memory for aspect based sentiment analysis18
A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages16
Loanword identification based on web resources: A case study on wikipedia16
Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source16
Combining replay and LoRA for continual learning in natural language understanding16
Combined generative and predictive modeling for speech super-resolution16
Representation learning strategies to model pathological speech: Effect of multiple spectral resolutions16
English–Assamese neural machine translation using prior alignment and pre-trained language model16
Under the hood: Phonemic Restoration in transformer-based automatic speech recognition15
The use of Active Learning systems for stimulus selection and response modelling in perception experiments15
A mobile application using automatic speech analysis for classifying Alzheimer's disease and mild cognitive impairment15
A lightweight approach based on prompt for few-shot relation extraction15
Editorial Board14
Effects of cross-cultural language differences on social cognition during human-agent interaction in cooperative game environments14
Privacy-preserving feature extractor using adversarial pruning for TBI assessment from speech14
Conversations in the wild: Data collection, automatic generation and evaluation14
A novel channel estimate for noise robust speech recognition14
Meta adversarial learning improves low-resource speech recognition14
Named entity recognition using neural language model and CRF for Hindi language13
Evidence and Axial Attention Guided Document-level Relation Extraction13
A bias evaluation solution for multiple sensitive attribute speech recognition13
Compress, Align, and Transfer: A new method for transferring pre-trained language models knowledge to CTC-based speech recognition13
Editorial Board13
Performance assessment of voice conversion models using speech production-based parameters12
Zero-Shot Strike: Testing the generalisation capabilities of out-of-the-box LLM models for depression detection12
FinD: Fine-grained discrepancy-based fake news detection enhanced by event abstract generation12
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer12
MPSA-DenseNet: A novel deep learning model for English accent classification12
Modality fusion using auxiliary tasks for dementia detection12
Tailored design of Audio–Visual Speech Recognition models using Branchformers12
A flexible BERT model enabling width- and depth-dynamic inference12
SecNLP: An NLP classification model watermarking framework based on multi-task learning12
A novel graph kernel algorithm for improving the effect of text classification12
Improved relation extraction through key phrase identification using community detection on dependency trees12
Addressing subjectivity in paralinguistic data labeling for improved classification performance: A case study with Spanish-speaking Mexican children using data balancing and semi-supervised learning11
A computational analysis of transcribed speech of people living with dementia: The Anchise 2022 Corpus11
Towards inclusive automatic speech recognition11
Towards detecting the level of trust in the skills of a virtual assistant from the user’s speech11
Prototypical networks relation classification model based on entity convolution11
Towards decoupling frontend enhancement and backend recognition in monaural robust ASR11
Raw acoustic-articulatory multimodal dysarthric speech recognition11
Neural multi-task learning for end-to-end Arabic aspect-based sentiment analysis11
A tag-based methodology for the detection of user repair strategies in task-oriented conversational agents11
A closer look at reinforcement learning-based automatic speech recognition11
Effective infant cry signal analysis and reasoning using IARO based leaky Bi-LSTM model11
Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement11
Editorial Board11
GenCeption: Evaluate vision LLMs with unlabeled unimodal data11
Improving BERT with local context comprehension for multi-turn response selection in retrieval-based dialogue systems11
Test-retest reliability of acoustic and linguistic measures of speech tasks10
Towards lifelong human assisted speaker diarization10
Continual End-to-End Speech-to-Text translation using augmented bi-sampler10
Multi-task unified model for Chinese aspect-based sentiment analysis10
Multiple time-instances features based approach for reference-free speech quality measurement10
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge10
Adaptive feature extraction for entity relation extraction9
Hate speech and offensive language detection in Dravidian languages using deep ensemble framework9
An automated quality evaluation framework of psychotherapy conversations with local quality estimates9
Deep feature representations and fusion strategies for speech emotion recognition from acoustic and linguistic modalities: A systematic review9
Refining the evaluation of speech synthesis: A summary of the Blizzard Challenge 20239
A neural network approach for speech enhancement and noise-robust bandwidth extension9
DiffATSM: High quality adaptive time-scale modification using diffusion-based post-processing9
End-to-End Speech-to-Text Translation: A Survey9
A physical exertion inspired multi-task learning framework for detecting out-of-breath speech9
Minerva 2 for speech and language tasks8
Morse wavelet transform-based features for voice liveness detection8
Significance of chirp MFCC as a feature in speech and audio applications8
Measuring and implementing lexical alignment: A systematic literature review8
A novel approach to cross-linguistic transfer learning for hope speech detection in Tamil and Malayalam8
Speech self-supervised representations benchmarking: A case for larger probing heads8
Two in One: A multi-task framework for politeness turn identification and phrase extraction in goal-oriented conversations8
Universal constituency treebanking and parsing: A pilot study8
0.11102700233459