Computer Speech and Language

Papers
(The TQCC of Computer Speech and Language is 7. The table below lists the papers at or above that threshold based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
Article | Citations
Room impulse response reshaping-based expectation–maximization in an underdetermined reverberant environment | 181
GEPC: Global embeddings with PID control | 120
KddRES: A Multi-level Knowledge-driven Dialogue Dataset for Restaurant Towards Customized Dialogue System | 120
A language-agnostic model of child language acquisition | 93
Stochastic Data-to-Text Generation Using Syntactic Dependency Information | 93
Corpus and unsupervised benchmark: Towards Tagalog grammatical error correction | 73
Automatic detection of behavioural codes in team interactions | 62
Seq2Seq dynamic planning network for progressive text generation | 59
Misogynistic attitude detection in YouTube comments and replies: A high-quality dataset and algorithmic models | 57
Speech enhancement approach for body-conducted unvoiced speech based on Taylor–Boltzmann machines trained DNN | 57
Improving low-resource machine transliteration by using 3-way transfer learning | 53
Editorial Board | 53
A method of phonemic annotation for Chinese dialects based on a deep learning model with adaptive temporal attention and a feature disentangling structure | 49
Unsupervised question-retrieval approach based on topic keywords filtering and multi-task learning | 49
Monotonic Gaussian regularization of attention for robust automatic speech recognition | 48
Complementary regional energy features for spoofed speech detection | 45
PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition | 39
Multi-branch feature aggregation based on multiple weighting for speaker verification | 39
Verbal fluency in normal aging and cognitive decline: Results of a longitudinal study | 37
Contextual emotion detection using ensemble deep learning | 35
Learning to generate structured queries from natural language with indirect supervision | 30
Editorial Board | 29
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning | 29
Unsupervised sign language validation process based on hand-motion parameter clustering | 29
Maximal activation weighted memory for aspect based sentiment analysis | 27
Improving self-supervised learning model for audio spoofing detection with layer-conditioned embedding fusion | 25
A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages | 24
Editorial Board | 24
A hybrid approach to Natural Language Inference for the SICK dataset | 23
Enhancing analysis of diadochokinetic speech using deep neural networks | 23
Exploring accidental triggers of smart speakers | 23
Adversarial subsequences for unconditional text generation | 23
Perceptions and reactions to conversational privacy initiated by a conversational user interface | 21
Combining replay and LoRA for continual learning in natural language understanding | 20
Representation learning strategies to model pathological speech: Effect of multiple spectral resolutions | 19
An unsupervised approach to detect review spam using duplicates of images, videos and Chinese texts | 19
The use of Active Learning systems for stimulus selection and response modelling in perception experiments | 19
Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source | 19
Symbolic and Statistical Learning Approaches to Speech Summarization: A Scoping Review | 19
English–Assamese neural machine translation using prior alignment and pre-trained language model | 17
A lightweight approach based on prompt for few-shot relation extraction | 17
Loanword identification based on web resources: A case study on wikipedia | 16
Enhancing Arabic aspect-based sentiment analysis using deep learning models | 16
Conversations in the wild: Data collection, automatic generation and evaluation | 15
A mobile application using automatic speech analysis for classifying Alzheimer's disease and mild cognitive impairment | 15
Dialect Identification using Chroma-Spectral Shape Features with Ensemble Technique | 15
Effects of cross-cultural language differences on social cognition during human-agent interaction in cooperative game environments | 14
Unsupervised induction of inflectional families | 14
Editorial Board | 14
Meta adversarial learning improves low-resource speech recognition | 13
A novel channel estimate for noise robust speech recognition | 13
MPSA-DenseNet: A novel deep learning model for English accent classification | 13
Editorial Board | 12
Editorial Board | 12
Zero-Shot Strike: Testing the generalisation capabilities of out-of-the-box LLM models for depression detection | 12
Evidence and Axial Attention Guided Document-level Relation Extraction | 12
Adjustable deterministic pseudonymization of speech | 12
A multi-label emoji classification method using balanced pointwise mutual information-based feature selection | 12
SecNLP: An NLP classification model watermarking framework based on multi-task learning | 12
Named entity recognition using neural language model and CRF for Hindi language | 12
Effective infant cry signal analysis and reasoning using IARO based leaky Bi-LSTM model | 11
A flexible BERT model enabling width- and depth-dynamic inference | 11
A computational analysis of transcribed speech of people living with dementia: The Anchise 2022 Corpus | 11
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network | 11
FinD: Fine-grained discrepancy-based fake news detection enhanced by event abstract generation | 11
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis | 10
Addressing subjectivity in paralinguistic data labeling for improved classification performance: A case study with Spanish-speaking Mexican children using data balancing and semi-supervised learning | 10
Improved relation extraction through key phrase identification using community detection on dependency trees | 10
BERT-hLSTMs: BERT and hierarchical LSTMs for visual storytelling | 10
Towards a unified assessment framework of speech pseudonymisation | 9
Towards inclusive automatic speech recognition | 9
A tag-based methodology for the detection of user repair strategies in task-oriented conversational agents | 9
Prosodic event detection in children’s read speech | 9
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer | 9
Detection of vowel transition regions from Hindi language | 9
Prototypical networks relation classification model based on entity convolution | 9
Editorial Board | 9
Neural multi-task learning for end-to-end Arabic aspect-based sentiment analysis | 9
Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement | 9
Improving BERT with local context comprehension for multi-turn response selection in retrieval-based dialogue systems | 9
Conversation Initiation of Mothers, Fathers, and Toddlers in their Natural Home Environment | 8
Arabic speech recognition by end-to-end, modular systems and human | 8
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge | 8
A closer look at reinforcement learning-based automatic speech recognition | 8
To what extent does content selection affect surface realization in the context of headline generation? | 8
Evaluating voice-assistant commands for dementia detection | 8
Generating identities with mixture models for speaker anonymization | 8
Towards detecting the level of trust in the skills of a virtual assistant from the user’s speech | 8
Automated grapheme-to-phoneme conversion for Central Kurdish based on optimality theory | 8
A review of speaker diarization: Recent advances with deep learning | 8
Hate speech and offensive language detection in Dravidian languages using deep ensemble framework | 7
Adaptive feature extraction for entity relation extraction | 7
A neural network approach for speech enhancement and noise-robust bandwidth extension | 7
Multiple time-instances features based approach for reference-free speech quality measurement | 7
Generative adversarial networks for speech processing: A review | 7
An intention multiple-representation model with expanded information | 7
Towards lifelong human assisted speaker diarization | 7
Test-retest reliability of acoustic and linguistic measures of speech tasks | 7
Refining the evaluation of speech synthesis: A summary of the Blizzard Challenge 2023 | 7
Empirical Mode Decomposition articulation feature extraction on Parkinson’s Diadochokinesia | 7
Adversarial attack and defense strategies for deep speaker recognition systems | 7