Computer Speech and Language

Papers
(The median citation count of Computer Speech and Language is 2. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, from 2021-03-01 to 2025-03-01.)
Article | Citations
Non-negative matrix factorization-based time-frequency feature extraction of voice signal for Parkinson's disease prediction | 141
Building a text retrieval system for the Sanskrit language: Exploring indexing, stemming, and searching issues | 103
Speech enhancement approach for body-conducted unvoiced speech based on Taylor–Boltzmann machines trained DNN | 99
FE-CFNER: Feature Enhancement-based approach for Chinese Few-shot Named Entity Recognition | 90
Towards a unified assessment framework of speech pseudonymisation | 86
A new speech corpus of super-elderly Japanese for acoustic modeling | 65
Addressing subjectivity in paralinguistic data labeling for improved classification performance: A case study with Spanish-speaking Mexican children using data balancing and semi-supervised learning | 54
Stochastic Data-to-Text Generation Using Syntactic Dependency Information | 53
Seq2Seq dynamic planning network for progressive text generation | 53
Optimizing pipeline task-oriented dialogue systems using post-processing networks | 46
Assessing language models’ task and language transfer capabilities for sentiment analysis in dialog data | 46
Improved relation extraction through key phrase identification using community detection on dependency trees | 43
A computational analysis of transcribed speech of people living with dementia: The Anchise 2022 Corpus | 43
ECDG-DST: A dialogue state tracking model based on efficient context and domain guidance for smart dialogue systems | 39
Transfer fine-tuning of BERT with phrasal paraphrases | 38
Automatic detection of pharyngeal fricatives in cleft palate speech using acoustic features based on the vocal tract area spectrum | 36
Adaptive line enhancer for nonstationary harmonic noise reduction | 34
Turn-taking in Conversational Systems and Human-Robot Interaction: A Review | 33
Sentence transition matrix: An efficient approach that preserves sentence semantics | 31
On the use of blind channel response estimation and a residual neural network to detect physical access attacks to speaker verification systems | 29
Glottal features for classification of phonation type from speech and neck surface accelerometer signals | 27
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis | 27
Automatic detection of behavioural codes in team interactions | 25
Learning to extract from multiple perspectives for neural keyphrase extraction | 25
A methodological approach to enable natural language interaction in an Intelligent Tutoring System | 25
Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement | 24
Editorial Board | 21
Editorial Board | 21
Effective infant cry signal analysis and reasoning using IARO based leaky Bi-LSTM model | 21
KddRES: A Multi-level Knowledge-driven Dialogue Dataset for Restaurant Towards Customized Dialogue System | 20
RepSum: A general abstractive summarization framework with dynamic word embedding representation correction | 19
Two evaluations on Ontology-style relation annotations | 18
Editorial Board | 18
GEPC: Global embeddings with PID control | 18
Talking-heads attention-based knowledge representation for link prediction | 17
PLDA inspired Siamese networks for speaker verification | 17
Automatic offline annotation of turn-taking transitions in task-oriented dialogue | 16
A benchmark dataset for Turkish data-to-text generation | 15
Document-level relation extraction with entity mentions deep attention | 15
An effective approach for identifying keywords as high-quality filters to get emergency-implicated Twitter Spanish data | 14
Neural candidate-aware language models for speech recognition | 14
Editorial Board | 14
Tamil Handwritten Character Recognition System using Statistical Algorithmic Approaches | 13
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network | 13
FinD: Fine-grained discrepancy-based fake news detection enhanced by event abstract generation | 13
A language-agnostic model of child language acquisition | 12
Effectiveness of energy separation-based instantaneous frequency estimation for cochlear cepstral features for synthetic and voice-converted spoofed speech detection | 12
Room impulse response reshaping-based expectation–maximization in an underdetermined reverberant environment | 12
Exploring the ability of LLMs to classify written proficiency levels | 12
An investigation of neural uncertainty estimation for target speaker extraction equipped RNN transducer | 12
Taking relations as known conditions: A tagging based method for relational triple extraction | 12
A flexible BERT model enabling width- and depth-dynamic inference | 12
Towards better Chinese-centric neural machine translation for low-resource languages | 12
Deep ad-hoc beamforming | 11
Towards inclusive automatic speech recognition | 11
Incorporating external knowledge for text matching model | 11
An online multi-source summarization algorithm for text readability in topic-based search | 11
Direct enhancement of pre-trained speech embeddings for speech processing in noisy conditions | 11
BERT-hLSTMs: BERT and hierarchical LSTMs for visual storytelling | 10
C-KGE: Curriculum learning-based Knowledge Graph Embedding | 10
A generalized decoding method for neural text generation | 10
A potential relation trigger method for entity-relation quintuple extraction in text with excessive entities | 10
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer | 10
Editorial Board | 9
Hate speech detection on Twitter using transfer learning | 9
A novel word sense disambiguation approach using WordNet knowledge graph | 9
Neural multi-task learning for end-to-end Arabic aspect-based sentiment analysis | 9
MS-Transformer: Introduce multiple structural priors into a unified transformer for encoding sentences | 9
The effect of preference elicitation methods on the user experience in conversational recommender systems | 9
An optimal approach for text feature selection | 8
A review of speaker diarization: Recent advances with deep learning | 8
BERT syntactic transfer: A computational experiment on Italian, French and English languages | 8
Misogynistic attitude detection in YouTube comments and replies: A high-quality dataset and algorithmic models | 8
Cross-lingual multi-speaker speech synthesis with limited bilingual training data | 8
A tag-based methodology for the detection of user repair strategies in task-oriented conversational agents | 8
Prosodic event detection in children’s read speech | 8
Self-segmentation of pass-phrase utterances for deep feature learning in text-dependent speaker verification | 8
Editorial Board | 8
Monotonic Gaussian regularization of attention for robust automatic speech recognition | 8
Exploring intrinsic information content models for addressing the issues of traditional semantic measures to evaluate verb similarity | 8
On significance of constant-Q transform for pop noise detection | 8
Speech intelligibility assessment of dysarthria using Fisher vector encoding | 8
A study of vowel nasalization using instantaneous spectra | 7
Speech recognition using Taylor-gradient Descent political optimization based Deep residual network | 7
Natural language processing for under-resourced languages: Developing a Welsh natural language toolkit | 7
Editorial Board | 7
Rep-MCA-former: An efficient multi-scale convolution attention encoder for text-independent speaker verification | 7
Uncertainty-aware non-autoregressive neural machine translation | 7
Detection of vowel transition regions from Hindi language | 7
A novel approach to unsupervised pattern discovery in speech using Convolutional Neural Network | 7
Contextual emotion detection using ensemble deep learning | 7
Speech emotion recognition in real static and dynamic human-robot interaction scenarios | 6
Generating identities with mixture models for speaker anonymization | 6
Complementary regional energy features for spoofed speech detection | 6
Scale-aware dual-branch complex convolutional recurrent network for monaural speech enhancement | 6
Intelligibility assessment of impaired speech using Regularized self-representation based compact supervectors | 6
Feature learning for efficient ASR-free keyword spotting in low-resource languages | 6
Improving BERT with local context comprehension for multi-turn response selection in retrieval-based dialogue systems | 6
Verbal fluency in normal aging and cognitive decline: Results of a longitudinal study | 6
To what extent does content selection affect surface realization in the context of headline generation? | 6
Multi-branch feature aggregation based on multiple weighting for speaker verification | 6
Training RNN language models on uncertain ASR hypotheses in limited data scenarios | 6
PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition | 6
The limits of the Mean Opinion Score for speech synthesis evaluation | 6
End-to-end neural systems for automatic children speech recognition: An empirical study | 6
Prototypical networks relation classification model based on entity convolution | 6
DAE-NER: Dual-channel attention enhancement for Chinese named entity recognition | 6
A closer look at reinforcement learning-based automatic speech recognition | 5
Joint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances | 5
Bayesian active summarization | 5
A method of phonemic annotation for Chinese dialects based on a deep learning model with adaptive temporal attention and a feature disentangling structure | 5
Language-independent extractive automatic text summarization based on automatic keyword extraction | 5
Unsupervised question-retrieval approach based on topic keywords filtering and multi-task learning | 5
Conversation Initiation of Mothers, Fathers, and Toddlers in their Natural Home Environment | 5
CLIPMulti: Explore the performance of multimodal enhanced CLIP for zero-shot text classification | 5
Morphologically motivated word classes for very large vocabulary speech recognition of Finnish and Estonian | 5
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge | 5
Evaluating factual accuracy in complex data-to-text | 4
Editorial Board | 4
COMPASS: A creative support system that alerts novelists to the unnoticed missing contents | 4
A semi-supervised high-quality pseudo labels algorithm based on multi-constraint optimization for speech deception detection | 4
Test-retest reliability of acoustic and linguistic measures of speech tasks | 4
Speaking to remember: Model-based adaptive vocabulary learning using automatic speech recognition | 4
Deep reinforcement and transfer learning for abstractive text summarization: A review | 4
Replay spoof detection using energy separation based instantaneous frequency estimation from quadrature and in-phase components | 4
Towards detecting the level of trust in the skills of a virtual assistant from the user’s speech | 4
Multipath-guided heterogeneous graph neural networks for sequential recommendation | 4
Analysis of out-of-breath speech for assessment of person’s physical fitness | 4
A novel Chinese–Tibetan mixed-language rumor detector with multi-extractor representations | 4
Pronoun use in preclinical and early stages of Alzheimer's dementia | 4
Enhanced local knowledge with proximity values and syntax-clusters for aspect-level sentiment analysis | 4
GenCeption: Evaluate vision LLMS with unlabeled unimodal data | 4
UniKDD: A Unified Generative model for Knowledge-driven Dialogue | 4
A high-performance speech BioHashing retrieval algorithm based on audio segmentation | 4
Exploring the relationship between children's facial emotion processing characteristics and speech communication ability using deep learning on eye tracking and speech performance measures | 4
Editorial Board | 4
Learning to generate structured queries from natural language with indirect supervision | 4
Editorial Board | 4
End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting | 4
Optimized Binaural Enhancement via attention masking network-based speech separation framework in digital hearing aids | 4
Multi-level embeddings for processing Arabic social media contents | 4
A knowledge-Aware NLP-Driven conversational model to detect deceptive contents on social media posts | 4
Prediction of speech intelligibility with DNN-based performance measures | 4
Improving low-resource machine transliteration by using 3-way transfer learning | 4
An analysis of machine learning models for sentiment analysis of Tamil code-mixed data | 3
Comparison of rule-based and data-driven approaches for syllabification of simple syllable languages and the effect of orthography | 3
Prompting large language models for user simulation in task-oriented dialogue systems | 3
Self-conducted speech audiometry using automatic speech recognition: Simulation results for listeners with hearing loss | 3
Neural referential form selection: Generalisability and interpretability | 3
Though this be hesitant, yet there is method in ’t: Effects of disfluency patterns in neural speech synthesis for cultural heritage presentations | 3
Generating unambiguous and diverse referring expressions | 3
A code-mixed task-oriented dialog dataset for medical domain | 3
MuST-C: A multilingual corpus for end-to-end speech translation | 3
Detecting dementia from speech and transcripts using transformers | 3
Editorial Board | 3
TadaStride: Using time adaptive strides in audio data for effective downsampling | 3
A local dynamic feature selection fusion method for voice diagnosis of Parkinson's disease | 3
A dilemma of ground truth in noisy speech separation and an approach to lessen the impact of imperfect training data | 3
Voice privacy using CycleGAN and time-scale modification | 3
Adversarial attack and defense strategies for deep speaker recognition systems | 3
Enhancing analysis of diadochokinetic speech using deep neural networks | 3
Perceptions and reactions to conversational privacy initiated by a conversational user interface | 3
EEG-dependent automatic speech recognition using deep residual encoder based VGG net CNN | 3
Multiple time-instances features based approach for reference-free speech quality measurement | 3
Cross-lingual transfer learning for relation extraction using Universal Dependencies | 3
A novel and secured email classification using deep neural network with bidirectional long short-term memory | 3
How to make embeddings suitable for PLDA | 3
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks | 3
Cross-language article linking with deep neural network based paragraph encoding | 3
NUVA: A Naming Utterance Verifier for Aphasia Treatment | 3
Automated grapheme-to-phoneme conversion for Central Kurdish based on optimality theory | 3
Maximal activation weighted memory for aspect based sentiment analysis | 3
Hate speech and offensive language detection in Dravidian languages using deep ensemble framework | 3
Generation of Coherent Multi-Sentence Texts with a Coherence Mechanism | 3
Arabic speech recognition by end-to-end, modular systems and human | 3
Editorial Board | 2
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition | 2
A lightweight approach based on prompt for few-shot relation extraction | 2
Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary | 2
Exploring neural models for predicting dementia from language | 2
LSRD-Net: A fine-grained sentiment analysis method based on log-normalized semantic relative distance | 2
Exploring accidental triggers of smart speakers | 2
Copiously Quote Classics: Improving Chinese Poetry Generation with historical allusion knowledge | 2
Speaker anonymization by modifying fundamental frequency and x-vector singular value | 2
Taris: An online speech recognition framework with sequence to sequence neural networks for both audio-only and audio-visual speech | 2
Knowledge-aware audio-grounded generative slot filling for limited annotated data | 2
A comprehensive solution to retrieval-based chatbot construction | 2
Editorial Board | 2
Controlling contents in data-to-document generation with human-designed topic labels | 2
QBSUM: A large-scale query-based document summarization dataset from real-world applications | 2
A classification benchmark for Arabic alphabet phonemes with diacritics in deep neural networks | 2
Adversarial subsequences for unconditional text generation | 2
Enhancing Arabic aspect-based sentiment analysis using deep learning models | 2
Predicting children’s perceived reading proficiency with prosody modeling | 2
Unsupervised sign language validation process based on hand-motion parameter clustering | 2
Bispectral feature speech intelligibility assessment metric based on auditory model | 2
Corrigendum to ‘Unsupervised sign language validation process based on hand-motion parameter clustering’ <Computer Speech & Language Volume 71, January 2022, 101256> | 2
Dereverberation of autoregressive envelopes for far-field speech recognition | 2
You Are What You Write: Author re-identification privacy attacks in the era of pre-trained language models | 2
Generalizing Hate Speech Detection Using Multi-Task Learning: A Case Study of Political Public Figures | 2
A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages | 2
Evaluating voice-assistant commands for dementia detection | 2
Adopting machine translation in the healthcare sector: A methodological multi-criteria review | 2
Integrating frame-level boundary detection and deepfake detection for locating manipulated regions in partially spoofed audio forgery attacks | 2
A neural network approach for speech enhancement and noise-robust bandwidth extension | 2
English–Assamese neural machine translation using prior alignment and pre-trained language model | 2
Local and non-local dependency learning and emergence of rule-like representations in speech data by deep convolutional generative adversarial networks | 2
Attention-based BiLSTM fused CNN with gating mechanism model for Chinese long text classification | 2
Combining background noise and artificial masking to achieve privacy in sound zones | 2
Investigation of self-supervised pre-trained models for classification of voice quality from speech and neck surface accelerometer signals | 2
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning | 2
Towards lifelong human assisted speaker diarization | 2
A hybrid approach to Natural Language Inference for the SICK dataset | 2
Dual Knowledge Distillation for neural machine translation | 2
Improving self-supervised learning model for audio spoofing detection with layer-conditioned embedding fusion | 2