Speech Communication

Papers
(The H4-Index of Speech Communication is 18. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
CN-Celeb: Multi-genre speaker recognition61
Emotional voice conversion: Theory, databases and ESD57
Learning deep multimodal affective features for spontaneous speech emotion recognition57
Masked multi-head self-attention for causal speech enhancement54
Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion46
The Hearing-Aid Speech Perception Index (HASPI) Version 236
Unsupervised Automatic Speech Recognition: A review33
Two-stage dimensional emotion recognition by fusing predictions of acoustic and text networks using SVM31
Fusion of deep learning features with mixture of brain emotional learning for audio-visual emotion recognition30
Modulation spectral features for speech emotion recognition using deep neural networks27
Multi-modal speech emotion recognition using self-attention mechanism and multi-scale fusion framework25
GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition24
CyTex: Transforming speech to textured images for speech emotion recognition22
Uneven success: automatic speech recognition and ethnicity-related dialects20
A formant modification method for improved ASR of children’s speech19
A study on data augmentation in voice anti-spoofing19
Computer-assisted pronunciation training—Speech synthesis is almost all you need19
A time–frequency smoothing neural network for speech enhancement18
Speech enhancement using a DNN-augmented colored-noise Kalman filter18
Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition18
0.034221887588501