Computer Speech and Language

Papers
(The TQCC of Computer Speech and Language is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech180
A review of speaker diarization: Recent advances with deep learning141
Attention-based BiLSTM fused CNN with gating mechanism model for Chinese long text classification99
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks82
Turn-taking in Conversational Systems and Human-Robot Interaction: A Review81
Deep reinforcement and transfer learning for abstractive text summarization: A review77
Human evaluation of automatically generated text: Current trends and best practice guidelines55
Hate speech detection on Twitter using transfer learning53
Combining context-relevant features with multi-stage attention network for short text classification51
Spoken language interaction with robots: Recommendations for future research46
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition46
Linguistic features and automatic classifiers for identifying mild cognitive impairment and dementia43
Enhancing Arabic aspect-based sentiment analysis using deep learning models43
Adversarial attack and defense strategies for deep speaker recognition systems43
Non-negative matrix factorization-based time-frequency feature extraction of voice signal for Parkinson's disease prediction39
The VoicePrivacy 2020 Challenge: Results and findings37
Hate speech and offensive language detection in Dravidian languages using deep ensemble framework36
Generative adversarial networks for speech processing: A review36
MuST-C: A multilingual corpus for end-to-end speech translation34
Part-of-speech tagging for Arabic tweets using CRF and Bi-LSTM33
BERT syntactic transfer: A computational experiment on Italian, French and English languages31
Emotion recognition in low-resource settings: An evaluation of automatic feature selection methods30
An automatic Alzheimer’s disease classifier based on spontaneous spoken English29
tax2vec: Constructing Interpretable Features from Taxonomies for Short Text Classification28
Advances in subword-based HMM-DNN speech recognition across languages27
Investigations on speech recognition systems for low-resource dialectal Arabic–English code-switching speech27
Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features26
Arabic speech recognition by end-to-end, modular systems and human25
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer25
Analysis and classification of speech sounds of children with autism spectrum disorder using acoustic features24
Detection of replay spoof speech using teager energy feature cues22
BERT-hLSTMs: BERT and hierarchical LSTMs for visual storytelling21
Comprehensive analysis of aspect term extraction methods using various text embeddings21
Vocal tract shaping of emotional speech20
Voice spoofing detection corpus for single and multi-order audio replays20
Evaluating voice-assistant commands for dementia detection20
On the effect of dropping layers of pre-trained transformer models20
Replay spoofing countermeasure using autoencoder and siamese networks on ASVspoof 2019 challenge19
The automatic detection of heart failure using speech signals18
TOP-Rank: A TopicalPostionRank for Extraction and Classification of Keyphrases in Text18
Named entity recognition using neural language model and CRF for Hindi language18
A Korean named entity recognition method using Bi-LSTM-CRF and masked self-attention17
A novel word sense disambiguation approach using WordNet knowledge graph17
Verbal fluency in normal aging and cognitive decline: Results of a longitudinal study17
Transfer fine-tuning of BERT with phrasal paraphrases17
X-vector anonymization using autoencoders and adversarial training for preserving speech privacy17
End-to-end neural systems for automatic children speech recognition: An empirical study17
Improving the potential of Enhanced Teager Energy Cepstral Coefficients (ETECC) for replay attack detection16
Analysis of gender and identity issues in depression detection on de-identified speech16
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network15
Representation transfer learning from deep end-to-end speech recognition networks for the classification of health states from speech14
Cluster-based beam search for pointer-generator chatbot grounded by knowledge14
HOTTEST: Hate and Offensive content identification in Tamil using Transformers and Enhanced STemming14
An analysis of machine learning models for sentiment analysis of Tamil code-mixed data13
A Bayesian end-to-end model with estimated uncertainties for simple question answering over knowledge bases13
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning13
Siamese networks for large-scale author identification13
Dialect Identification using Chroma-Spectral Shape Features with Ensemble Technique12
Towards a unified assessment framework of speech pseudonymisation12
Low resource end-to-end spoken language understanding with capsule networks12
Detecting dementia from speech and transcripts using transformers12
A multi-label emoji classification method using balanced pointwise mutual information-based feature selection12
Language-independent extractive automatic text summarization based on automatic keyword extraction12
Discriminating speech traits of Alzheimer's disease assessed through a corpus of reading task for Spanish language11
Towards inclusive automatic speech recognition11
QBSUM: A large-scale query-based document summarization dataset from real-world applications11
A speaker verification backend with robust performance across conditions11
Towards a speech therapy support system based on phonological processes early detection11
Multilingual and unsupervised subword modeling for zero-resource languages10
Replay spoof detection using energy separation based instantaneous frequency estimation from quadrature and in-phase components10
Deep ad-hoc beamforming10
Perceptions and reactions to conversational privacy initiated by a conversational user interface10
Hybrid-task learning for robust automatic speech recognition10
Replay attack detection using variable-frequency resolution phase and magnitude features10
Speaker anonymization by modifying fundamental frequency and x-vector singular value10
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis10
Acoustic and articulatory analysis and synthesis of shouted vowels9
Exploring neural models for predicting dementia from language9
Automatic speaker independent dysarthric speech intelligibility assessment system9
An online multi-source summarization algorithm for text readability in topic-based search9
Joint emotion label space modeling for affect lexica9
NUVA: A Naming Utterance Verifier for Aphasia Treatment9
Overlapped Speech Detection and speaker counting using distant microphone arrays8
Cross-lingual transfer learning for relation extraction using Universal Dependencies8
Prediction of speech intelligibility with DNN-based performance measures8
Cross-lingual detection of mild cognitive impairment based on temporal parameters of spontaneous speech8
Trends and developments in automatic speech recognition research8
Multi-level embeddings for processing Arabic social media contents8
Replay anti-spoofing countermeasure based on data augmentation with post selection8
A comparison of data augmentation methods in voice pathology detection8
Code-switched automatic speech recognition in five South African languages8
A classification benchmark for Arabic alphabet phonemes with diacritics in deep neural networks8
Natural language processing for under-resourced languages: Developing a Welsh natural language toolkit8
0.057382822036743