Computer Speech and Language

Papers
(The median citation count of Computer Speech and Language is 3. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
Article | Citations
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech | 134
A review of speaker diarization: Recent advances with deep learning | 93
Attention-based BiLSTM fused CNN with gating mechanism model for Chinese long text classification | 81
Turn-taking in Conversational Systems and Human-Robot Interaction: A Review | 67
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks | 60
Deep reinforcement and transfer learning for abstractive text summarization: A review | 50
Transfer learning from adult to children for speech recognition: Evaluation, analysis and recommendations | 49
Hate speech detection on Twitter using transfer learning | 43
Human evaluation of automatically generated text: Current trends and best practice guidelines | 40
Spoken language interaction with robots: Recommendations for future research | 39
Combining context-relevant features with multi-stage attention network for short text classification | 38
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition | 38
Non-negative matrix factorization-based time-frequency feature extraction of voice signal for Parkinson's disease prediction | 37
Generalized end-to-end detection of spoofing attacks to automatic speaker recognizers | 37
Enhancing Arabic aspect-based sentiment analysis using deep learning models | 35
Adversarial attack and defense strategies for deep speaker recognition systems | 35
Linguistic features and automatic classifiers for identifying mild cognitive impairment and dementia | 34
Multilingual stance detection in social media political debates | 34
Part-of-speech tagging for Arabic tweets using CRF and Bi-LSTM | 32
Generative adversarial networks for speech processing: A review | 31
Hate speech and offensive language detection in Dravidian languages using deep ensemble framework | 30
MuST-C: A multilingual corpus for end-to-end speech translation | 29
Emotion recognition in low-resource settings: An evaluation of automatic feature selection methods | 28
Advances in subword-based HMM-DNN speech recognition across languages | 26
BERT syntactic transfer: A computational experiment on Italian, French and English languages | 25
tax2vec: Constructing Interpretable Features from Taxonomies for Short Text Classification | 25
Trajectory-based recognition of dynamic Persian sign language using hidden Markov model | 25
An automatic Alzheimer’s disease classifier based on spontaneous spoken English | 25
The VoicePrivacy 2020 Challenge: Results and findings | 24
Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features | 23
Investigations on speech recognition systems for low-resource dialectal Arabic–English code-switching speech | 23
Arabic speech recognition by end-to-end, modular systems and human | 21
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer | 19
A survey on automatic speech recognition systems for Portuguese language and its variations | 18
Detection of replay spoof speech using teager energy feature cues | 18
TOP-Rank: A TopicalPostionRank for Extraction and Classification of Keyphrases in Text | 17
Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification | 17
Voice spoofing detection corpus for single and multi-order audio replays | 16
BERT-hLSTMs: BERT and hierarchical LSTMs for visual storytelling | 16
Analysis and classification of speech sounds of children with autism spectrum disorder using acoustic features | 16
A Korean named entity recognition method using Bi-LSTM-CRF and masked self-attention | 16
Transfer fine-tuning of BERT with phrasal paraphrases | 16
Replay spoofing countermeasure using autoencoder and siamese networks on ASVspoof 2019 challenge | 16
Verbal fluency in normal aging and cognitive decline: Results of a longitudinal study | 16
Analysis of gender and identity issues in depression detection on de-identified speech | 15
Comprehensive analysis of aspect term extraction methods using various text embeddings | 15
Representation transfer learning from deep end-to-end speech recognition networks for the classification of health states from speech | 14
Vocal tract shaping of emotional speech | 14
NEC-TT System for Mixed-Bandwidth and Multi-Domain Speaker Recognition | 14
Improving the potential of Enhanced Teager Energy Cepstral Coefficients (ETECC) for replay attack detection | 14
Evaluating voice-assistant commands for dementia detection | 14
Towards the first Maithili part of speech tagger: Resource creation and system development | 14
Named entity recognition using neural language model and CRF for Hindi language | 14
Sequence labeling to detect stuttering events in read speech | 14
Cluster-based beam search for pointer-generator chatbot grounded by knowledge | 14
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network | 13
A Bayesian end-to-end model with estimated uncertainties for simple question answering over knowledge bases | 13
Assessing Parkinson's disease severity using speech analysis in non-native speakers | 13
Recurrent neural network language generation for spoken dialogue systems | 13
Overview of the seventh Dialog System Technology Challenge: DSTC7 | 13
On the effect of dropping layers of pre-trained transformer models | 13
The automatic detection of heart failure using speech signals | 13
X-vector anonymization using autoencoders and adversarial training for preserving speech privacy | 12
Deep generative variational autoencoding for replay spoof detection in automatic speaker verification | 11
Low resource end-to-end spoken language understanding with capsule networks | 11
A novel word sense disambiguation approach using WordNet knowledge graph | 11
A multi-label emoji classification method using balanced pointwise mutual information-based feature selection | 11
Discriminating speech traits of Alzheimer's disease assessed through a corpus of reading task for Spanish language | 10
Hybrid-task learning for robust automatic speech recognition | 10
Multilingual and unsupervised subword modeling for zero-resource languages | 10
Low-resource text classification using domain-adversarial learning | 10
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning | 10
A speaker verification backend with robust performance across conditions | 9
An analysis of machine learning models for sentiment analysis of Tamil code-mixed data | 9
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE | 9
End-to-end neural systems for automatic children speech recognition: An empirical study | 9
HOTTEST: Hate and Offensive content identification in Tamil using Transformers and Enhanced STemming | 9
Siamese networks for large-scale author identification | 9
Perceptions and reactions to conversational privacy initiated by a conversational user interface | 8
Natural language processing for under-resourced languages: Developing a Welsh natural language toolkit | 8
An online multi-source summarization algorithm for text readability in topic-based search | 8
Dialect Identification using Chroma-Spectral Shape Features with Ensemble Technique | 8
QBSUM: A large-scale query-based document summarization dataset from real-world applications | 8
Exploring neural models for predicting dementia from language | 8
Towards a unified assessment framework of speech pseudonymisation | 8
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis | 8
Joint emotion label space modeling for affect lexica | 8
A classification benchmark for Arabic alphabet phonemes with diacritics in deep neural networks | 8
Replay spoof detection using energy separation based instantaneous frequency estimation from quadrature and in-phase components | 8
Towards a speech therapy support system based on phonological processes early detection | 8
Leveraging Linguistic Context in Dyadic Interactions to Improve Automatic Speech Recognition for Children | 8
Sequential neural networks for noetic end-to-end response selection | 7
Local and non-local dependency learning and emergence of rule-like representations in speech data by deep convolutional generative adversarial networks | 7
Prediction of speech intelligibility with DNN-based performance measures | 7
State gradients for analyzing memory in LSTM language models | 7
Automatic speaker independent dysarthric speech intelligibility assessment system | 7
Speaker anonymization by modifying fundamental frequency and x-vector singular value | 7
Deep ad-hoc beamforming | 7
Determination of glottal closure instants from clean and telephone quality speech signals using single frequency filtering | 7
Cross-lingual transfer learning for relation extraction using Universal Dependencies | 7
Language-independent extractive automatic text summarization based on automatic keyword extraction | 7
Replay anti-spoofing countermeasure based on data augmentation with post selection | 7
Generating unambiguous and diverse referring expressions | 6
Glottal features for classification of phonation type from speech and neck surface accelerometer signals | 6
Cross-lingual detection of mild cognitive impairment based on temporal parameters of spontaneous speech | 6
An unsupervised approach to detect review spam using duplicates of images, videos and Chinese texts | 6
Detecting dementia from speech and transcripts using transformers | 6
Multi-level embeddings for processing Arabic social media contents | 6
Knowledge-Grounded Response Generation with Deep Attentional Latent-Variable Model | 6
Replay attack detection using variable-frequency resolution phase and magnitude features | 6
Dereverberation of autoregressive envelopes for far-field speech recognition | 6
End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting | 6
LIS-Net: An end-to-end light interior search network for speech command recognition | 6
Discovering phonetic inventories with crosslingual automatic speech recognition | 6
Acoustic and articulatory analysis and synthesis of shouted vowels | 6
NUVA: A Naming Utterance Verifier for Aphasia Treatment | 6
Code-switched automatic speech recognition in five South African languages | 5
English–Assamese neural machine translation using prior alignment and pre-trained language model | 5
Two decades into Speaker Recognition Evaluation - are we there yet? | 5
Improving low-resource machine transliteration by using 3-way transfer learning | 5
Cross-Lingual Text Reuse Detection at sentence level for English–Urdu language pair | 5
Assessing the effect of visual servoing on the performance of linear microphone arrays in moving human-robot interaction scenarios | 5
Empirical Mode Decomposition articulation feature extraction on Parkinson’s Diadochokinesia | 5
Feature learning for efficient ASR-free keyword spotting in low-resource languages | 5
Effectiveness of energy separation-based instantaneous frequency estimation for cochlear cepstral features for synthetic and voice-converted spoofed speech detection | 5
Overlapped Speech Detection and speaker counting using distant microphone arrays | 5
Changes in facial expressions in patients with Parkinson's disease during the phonation test and their correlation with disease severity | 5
Paralinguistic and linguistic fluency features for Alzheimer's disease detection | 5
Cross-lingual multi-speaker speech synthesis with limited bilingual training data | 5
Detection of speech playback attacks using robust harmonic trajectories | 4
Context and knowledge aware conversational model and system combination for grounded response generation | 4
Transfer learning for multimodal dialog | 4
Conversation Initiation of Mothers, Fathers, and Toddlers in their Natural Home Environment | 4
Towards sound based testing of COVID-19—Summary of the first Diagnostics of COVID-19 using Acoustics (DiCOVA) Challenge | 4
Distinctive acoustic changes in speech in Parkinson's disease | 4
Cross-corpora spoken language identification with domain diversification and generalization | 4
Generation of Coherent Multi-Sentence Texts with a Coherence Mechanism | 4
Variational model for low-resource natural language generation in spoken dialogue systems | 4
A neural network approach for speech activity detection for Apollo corpus | 4
An automated quality evaluation framework of psychotherapy conversations with local quality estimates | 4
Sequential routing framework: Fully capsule network-based speech recognition | 4
Exploring accidental triggers of smart speakers | 4
On significance of constant-Q transform for pop noise detection | 4
Joint speaker diarization and speech recognition based on region proposal networks | 4
Effects of cross-cultural language differences on social cognition during human-agent interaction in cooperative game environments | 4
Novel textual entailment technique for the Arabic language using genetic algorithm | 3
The phonetic footprint of Parkinson’s disease | 3
A study of bias mitigation strategies for speaker recognition | 3
Learning to generate structured queries from natural language with indirect supervision | 3
Maximal activation weighted memory for aspect based sentiment analysis | 3
MS-Transformer: Introduce multiple structural priors into a unified transformer for encoding sentences | 3
A novel approach to unsupervised pattern discovery in speech using Convolutional Neural Network | 3
Self-segmentation of pass-phrase utterances for deep feature learning in text-dependent speaker verification | 3
Learning to extract from multiple perspectives for neural keyphrase extraction | 3
On the use of blind channel response estimation and a residual neural network to detect physical access attacks to speaker verification systems | 3
Stochastic models of glottal pulses from the Rosenberg and Liljencrants-Fant models with unified parameters | 3
Automatic screening of mild cognitive impairment and Alzheimer’s disease by means of posterior-thresholding hesitation representation | 3
Multi-level context features extraction for named entity recognition | 3
DORA: Towards policy optimization for task-oriented dialogue system with efficient context | 3
Gated dynamic convolutions with deep layer fusion for abstractive document summarization | 3
An intention multiple-representation model with expanded information | 3
Voice privacy using CycleGAN and time-scale modification | 3
Learning Multi-Level Information for Dialogue Response Selection by Highway Recurrent Transformer | 3
Prosodic event detection in children’s read speech | 3
Generating identities with mixture models for speaker anonymization | 3
Towards inclusive automatic speech recognition | 3
A methodological approach to enable natural language interaction in an Intelligent Tutoring System | 3
Neural candidate-aware language models for speech recognition | 3
Supervised speech separation combined with adaptive beamforming | 3
A combined syntactic-semantic embedding model based on lexicalized tree-adjoining grammar | 3
Novel textual features for language modeling of intra-sentential code-switching data | 3
Train from scratch: Single-stage joint training of speech separation and recognition | 3
Knowledge-grounded dialogue modelling with dialogue-state tracking, domain tracking, and entity extraction | 3
A mobile application using automatic speech analysis for classifying Alzheimer's disease and mild cognitive impairment | 3
A local dynamic feature selection fusion method for voice diagnosis of Parkinson's disease | 3
Morphologically motivated word classes for very large vocabulary speech recognition of Finnish and Estonian | 3
Exploring the relationship between children's facial emotion processing characteristics and speech communication ability using deep learning on eye tracking and speech performance measures | 3
Improving BERT with local context comprehension for multi-turn response selection in retrieval-based dialogue systems | 3
PLDA inspired Siamese networks for speaker verification | 3
Tamil Handwritten Character Recognition System using Statistical Algorithmic Approaches | 3
The predictive capabilities of mathematical models for the type-token relationship in English language corpora | 3
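
The selection rule stated in the note at the top of the table (publication window, median threshold, cap of 250 rows) can be sketched roughly as below. This is only an illustration, not the site's actual procedure: the `Paper` class, the `select_papers` function, and the sample records are hypothetical. It also assumes that the median is taken over the papers inside the window, that the window bounds are inclusive, and that papers equal to the median are kept (the last point is inferred from the rows with 3 citations in the table).

```python
from dataclasses import dataclass
from datetime import date
from statistics import median


@dataclass
class Paper:
    title: str
    citations: int
    published: date


def select_papers(papers, start, end, max_rows=250):
    """Return (threshold, rows): papers in [start, end] with citations >= median."""
    # Assumption: the window bounds are inclusive.
    in_window = [p for p in papers if start <= p.published <= end]
    if not in_window:
        return 0, []
    # Assumption: the median is computed over the papers inside the window.
    threshold = median(p.citations for p in in_window)
    kept = [p for p in in_window if p.citations >= threshold]
    # Most-cited first, then cap the number of rows.
    kept.sort(key=lambda p: p.citations, reverse=True)
    return threshold, kept[:max_rows]


# Hypothetical sample records, not taken from the table.
sample = [
    Paper("paper A", 10, date(2021, 5, 1)),
    Paper("paper B", 5, date(2022, 1, 15)),
    Paper("paper C", 3, date(2020, 6, 30)),
    Paper("paper D", 3, date(2023, 3, 2)),
    Paper("paper E", 2, date(2023, 9, 9)),
    Paper("paper F", 1, date(2021, 11, 11)),
    Paper("paper G", 0, date(2024, 2, 20)),
    Paper("paper H", 50, date(2019, 1, 1)),  # outside the window, ignored
]

threshold, kept = select_papers(sample, date(2020, 4, 1), date(2024, 4, 1))
for p in kept:
    print(f"{p.title} | {p.citations}")
```

With a threshold that is inclusive, ties at the median appear in the output, which matches the block of 3-citation rows at the end of the table.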