Computer Speech and Language

Papers
(The median citation count of Computer Speech and Language is 3. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech180
A review of speaker diarization: Recent advances with deep learning141
Attention-based BiLSTM fused CNN with gating mechanism model for Chinese long text classification99
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks82
Turn-taking in Conversational Systems and Human-Robot Interaction: A Review81
Deep reinforcement and transfer learning for abstractive text summarization: A review77
Human evaluation of automatically generated text: Current trends and best practice guidelines55
Hate speech detection on Twitter using transfer learning53
Combining context-relevant features with multi-stage attention network for short text classification51
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition46
Spoken language interaction with robots: Recommendations for future research46
Enhancing Arabic aspect-based sentiment analysis using deep learning models43
Adversarial attack and defense strategies for deep speaker recognition systems43
Linguistic features and automatic classifiers for identifying mild cognitive impairment and dementia43
Non-negative matrix factorization-based time-frequency feature extraction of voice signal for Parkinson's disease prediction39
The VoicePrivacy 2020 Challenge: Results and findings37
Generative adversarial networks for speech processing: A review36
Hate speech and offensive language detection in Dravidian languages using deep ensemble framework36
MuST-C: A multilingual corpus for end-to-end speech translation34
Part-of-speech tagging for Arabic tweets using CRF and Bi-LSTM33
BERT syntactic transfer: A computational experiment on Italian, French and English languages31
Emotion recognition in low-resource settings: An evaluation of automatic feature selection methods30
An automatic Alzheimer’s disease classifier based on spontaneous spoken English29
tax2vec: Constructing Interpretable Features from Taxonomies for Short Text Classification28
Advances in subword-based HMM-DNN speech recognition across languages27
Investigations on speech recognition systems for low-resource dialectal Arabic–English code-switching speech27
Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features26
Arabic speech recognition by end-to-end, modular systems and human25
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer25
Analysis and classification of speech sounds of children with autism spectrum disorder using acoustic features24
Detection of replay spoof speech using teager energy feature cues22
Comprehensive analysis of aspect term extraction methods using various text embeddings21
BERT-hLSTMs: BERT and hierarchical LSTMs for visual storytelling21
On the effect of dropping layers of pre-trained transformer models20
Vocal tract shaping of emotional speech20
Voice spoofing detection corpus for single and multi-order audio replays20
Evaluating voice-assistant commands for dementia detection20
Replay spoofing countermeasure using autoencoder and siamese networks on ASVspoof 2019 challenge19
TOP-Rank: A TopicalPostionRank for Extraction and Classification of Keyphrases in Text18
Named entity recognition using neural language model and CRF for Hindi language18
The automatic detection of heart failure using speech signals18
Verbal fluency in normal aging and cognitive decline: Results of a longitudinal study17
Transfer fine-tuning of BERT with phrasal paraphrases17
X-vector anonymization using autoencoders and adversarial training for preserving speech privacy17
End-to-end neural systems for automatic children speech recognition: An empirical study17
A Korean named entity recognition method using Bi-LSTM-CRF and masked self-attention17
A novel word sense disambiguation approach using WordNet knowledge graph17
Analysis of gender and identity issues in depression detection on de-identified speech16
Improving the potential of Enhanced Teager Energy Cepstral Coefficients (ETECC) for replay attack detection16
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network15
Cluster-based beam search for pointer-generator chatbot grounded by knowledge14
HOTTEST: Hate and Offensive content identification in Tamil using Transformers and Enhanced STemming14
Representation transfer learning from deep end-to-end speech recognition networks for the classification of health states from speech14
Siamese networks for large-scale author identification13
An analysis of machine learning models for sentiment analysis of Tamil code-mixed data13
A Bayesian end-to-end model with estimated uncertainties for simple question answering over knowledge bases13
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning13
A multi-label emoji classification method using balanced pointwise mutual information-based feature selection12
Language-independent extractive automatic text summarization based on automatic keyword extraction12
Dialect Identification using Chroma-Spectral Shape Features with Ensemble Technique12
Towards a unified assessment framework of speech pseudonymisation12
Low resource end-to-end spoken language understanding with capsule networks12
Detecting dementia from speech and transcripts using transformers12
A speaker verification backend with robust performance across conditions11
Towards a speech therapy support system based on phonological processes early detection11
Discriminating speech traits of Alzheimer's disease assessed through a corpus of reading task for Spanish language11
Towards inclusive automatic speech recognition11
QBSUM: A large-scale query-based document summarization dataset from real-world applications11
Perceptions and reactions to conversational privacy initiated by a conversational user interface10
Hybrid-task learning for robust automatic speech recognition10
Replay attack detection using variable-frequency resolution phase and magnitude features10
Speaker anonymization by modifying fundamental frequency and x-vector singular value10
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis10
Multilingual and unsupervised subword modeling for zero-resource languages10
Replay spoof detection using energy separation based instantaneous frequency estimation from quadrature and in-phase components10
Deep ad-hoc beamforming10
Automatic speaker independent dysarthric speech intelligibility assessment system9
An online multi-source summarization algorithm for text readability in topic-based search9
Joint emotion label space modeling for affect lexica9
NUVA: A Naming Utterance Verifier for Aphasia Treatment9
Acoustic and articulatory analysis and synthesis of shouted vowels9
Exploring neural models for predicting dementia from language9
Replay anti-spoofing countermeasure based on data augmentation with post selection8
Cross-lingual detection of mild cognitive impairment based on temporal parameters of spontaneous speech8
Trends and developments in automatic speech recognition research8
A classification benchmark for Arabic alphabet phonemes with diacritics in deep neural networks8
Natural language processing for under-resourced languages: Developing a Welsh natural language toolkit8
A comparison of data augmentation methods in voice pathology detection8
Code-switched automatic speech recognition in five South African languages8
Cross-lingual transfer learning for relation extraction using Universal Dependencies8
Prediction of speech intelligibility with DNN-based performance measures8
Overlapped Speech Detection and speaker counting using distant microphone arrays8
Multi-level embeddings for processing Arabic social media contents8
Discovering phonetic inventories with crosslingual automatic speech recognition7
English–Assamese neural machine translation using prior alignment and pre-trained language model7
Local and non-local dependency learning and emergence of rule-like representations in speech data by deep convolutional generative adversarial networks7
Determination of glottal closure instants from clean and telephone quality speech signals using single frequency filtering7
Dereverberation of autoregressive envelopes for far-field speech recognition7
A methodological approach to enable natural language interaction in an Intelligent Tutoring System7
Assessing the effect of visual servoing on the performance of linear microphone arrays in moving human-robot interaction scenarios7
Generating unambiguous and diverse referring expressions7
Tamil Handwritten Character Recognition System using Statistical Algorithmic Approaches7
End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting6
Improving low-resource machine transliteration by using 3-way transfer learning6
LIS-Net: An end-to-end light interior search network for speech command recognition6
Paralinguistic and linguistic fluency features for Alzheimer's disease detection6
A code-mixed task-oriented dialog dataset for medical domain6
DAE-NER: Dual-channel attention enhancement for Chinese named entity recognition6
On significance of constant-Q transform for pop noise detection6
Glottal features for classification of phonation type from speech and neck surface accelerometer signals6
Cross-Lingual Text Reuse Detection at sentence level for English–Urdu language pair6
Empirical Mode Decomposition articulation feature extraction on Parkinson’s Diadochokinesia6
Generation of Coherent Multi-Sentence Texts with a Coherence Mechanism6
Feature learning for efficient ASR-free keyword spotting in low-resource languages6
Cross-lingual multi-speaker speech synthesis with limited bilingual training data6
Effectiveness of energy separation-based instantaneous frequency estimation for cochlear cepstral features for synthetic and voice-converted spoofed speech detection6
An unsupervised approach to detect review spam using duplicates of images, videos and Chinese texts6
Maximal activation weighted memory for aspect based sentiment analysis6
Joint speaker diarization and speech recognition based on region proposal networks5
Effects of cross-cultural language differences on social cognition during human-agent interaction in cooperative game environments5
Changes in facial expressions in patients with Parkinson's disease during the phonation test and their correlation with disease severity5
Exploring accidental triggers of smart speakers5
Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement5
GTSO: Gradient tangent search optimization enabled voice transformer with speech intelligibility for aphasia5
Cross-corpora spoken language identification with domain diversification and generalization5
Sequential routing framework: Fully capsule network-based speech recognition5
Improving BERT with local context comprehension for multi-turn response selection in retrieval-based dialogue systems5
Multi-level context features extraction for named entity recognition5
Detection of speech playback attacks using robust harmonic trajectories5
Social media popularity prediction with multimodal hierarchical fusion model5
Generating identities with mixture models for speaker anonymization5
EEG-dependent automatic speech recognition using deep residual encoder based VGG net CNN4
Variational model for low-resource natural language generation in spoken dialogue systems4
Prosodic event detection in children’s read speech4
Towards better Chinese-centric neural machine translation for low-resource languages4
The phonetic footprint of Parkinson’s disease4
Supervised speech separation combined with adaptive beamforming4
Gated dynamic convolutions with deep layer fusion for abstractive document summarization4
Towards sound based testing of COVID-19—Summary of the first Diagnostics of COVID-19 using Acoustics (DiCOVA) Challenge4
An intention multiple-representation model with expanded information4
Self-conducted speech audiometry using automatic speech recognition: Simulation results for listeners with hearing loss4
Exploring the relationship between children's facial emotion processing characteristics and speech communication ability using deep learning on eye tracking and speech performance measures4
MS-Transformer: Introduce multiple structural priors into a unified transformer for encoding sentences4
Learning to extract from multiple perspectives for neural keyphrase extraction4
PQuAD: A Persian question answering dataset4
A three-stage neural model for Arabic Dialect Identification4
A neural network approach for speech activity detection for Apollo corpus4
Classification of stuttering – The ComParE challenge and beyond4
A local dynamic feature selection fusion method for voice diagnosis of Parkinson's disease4
Transfer learning for multimodal dialog4
Conversation Initiation of Mothers, Fathers, and Toddlers in their Natural Home Environment4
Speech enhancement approach for body-conducted unvoiced speech based on Taylor–Boltzmann machines trained DNN4
FinD: Fine-grained discrepancy-based fake news detection enhanced by event abstract generation4
Novel textual features for language modeling of intra-sentential code-switching data4
DORA: Towards policy optimization for task-oriented dialogue system with efficient context4
An automated quality evaluation framework of psychotherapy conversations with local quality estimates4
Distinctive acoustic changes in speech in Parkinson's disease4
Parameterisation of human speech after total laryngectomy surgery4
Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source3
Learning to generate structured queries from natural language with indirect supervision3
Morphologically motivated word classes for very large vocabulary speech recognition of Finnish and Estonian3
An optimal approach for text feature selection3
Neural candidate-aware language models for speech recognition3
On the use of blind channel response estimation and a residual neural network to detect physical access attacks to speaker verification systems3
Hierarchical state recurrent neural network for social emotion ranking3
A deep neural network based correction scheme for improved air-tissue boundary prediction in real-time magnetic resonance imaging video3
Train from scratch: Single-stage joint training of speech separation and recognition3
Causal indicators for assessing the truthfulness of child speech in forensic interviews3
MASS: Multi-task anthropomorphic speech synthesis framework3
Automated grapheme-to-phoneme conversion for Central Kurdish based on optimality theory3
Complementary regional energy features for spoofed speech detection3
A novel approach to unsupervised pattern discovery in speech using Convolutional Neural Network3
Building a text retrieval system for the Sanskrit language: Exploring indexing, stemming, and searching issues3
PLDA inspired Siamese networks for speaker verification3
RepSum: A general abstractive summarization framework with dynamic word embedding representation correction3
Automatic screening of mild cognitive impairment and Alzheimer’s disease by means of posterior-thresholding hesitation representation3
An analysis of observation length requirements for machine understanding of human behaviors from spoken language3
Reference architecture design for computer-based speech therapy systems3
A combined syntactic-semantic embedding model based on lexicalized tree-adjoining grammar3
A study of bias mitigation strategies for speaker recognition3
Knowledge-grounded dialogue modelling with dialogue-state tracking, domain tracking, and entity extraction3
Voice privacy using CycleGAN and time-scale modification3
Multi-branch feature aggregation based on multiple weighting for speaker verification3
Self-segmentation of pass-phrase utterances for deep feature learning in text-dependent speaker verification3
An effective approach for identifying keywords as high-quality filters to get emergency-implicated Twitter Spanish data3
Automatic detection of pharyngeal fricatives in cleft palate speech using acoustic features based on the vocal tract area spectrum3
Stochastic models of glottal pulses from the Rosenberg and Liljencrants-Fant models with unified parameters3
Automatic classification of the severity level of Parkinson’s disease: A comparison of speaking tasks, features, and classifiers3
The predictive capabilities of mathematical models for the type-token relationship in English language corpora3
Refining a deep learning-based formant tracker using linear prediction methods3
Novel textual entailment technique for the Arabic language using genetic algorithm3
0.057297945022583