EURASIP Journal on Audio Speech and Music Processing

Papers
(The TQCC of EURASIP Journal on Audio Speech and Music Processing is 4. The table below lists the papers that meet or exceed that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
Article | Citations
A review of infant cry analysis and classification | 42
Ensemble of convolutional neural networks to improve animal audio classification | 38
End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network | 17
Progressive loss functions for speech enhancement with deep neural networks | 15
Accent modification for speech recognition of non-native speakers using neural style transfer | 14
Performance vs. hardware requirements in state-of-the-art automatic speech recognition | 14
Dynamically localizing multiple speakers based on the time-frequency domain | 14
A depthwise separable convolutional neural network for keyword spotting on an embedded system | 13
Auxiliary function-based algorithm for blind extraction of a moving speaker | 11
Towards cross-modal pre-training and learning tempo-spatial characteristics for audio recognition with convolutional and recurrent neural networks | 11
An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction | 10
dEchorate: a calibrated room impulse response dataset for echo-aware signal processing | 9
Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information | 9
Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition | 8
Steerable differential beamformers with planar microphone arrays | 8
Automated audio captioning: an overview of recent progress and new challenges | 8
Components loss for neural networks in mask-based speech enhancement | 8
Transformer-based ensemble method for multiple predominant instruments recognition in polyphonic music | 7
Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation | 7
Acoustic DOA estimation using space alternating sparse Bayesian learning | 7
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices | 7
Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling | 6
DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization | 6
Joint speaker localization and array calibration using expectation-maximization | 6
A simulation study on optimal scores for speaker recognition | 6
Comparison of semi-supervised deep learning algorithms for audio classification | 6
Time–frequency scattering accurately models auditory similarities between instrumental playing techniques | 6
MetaMGC: a music generation framework for concerts in metaverse | 6
Low-complexity artificial noise suppression methods for deep learning-based speech enhancement algorithms | 6
Deep neural networks for automatic speech processing: a survey from large corpora to limited data | 6
Anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit | 5
Estimation of acoustic echoes using expectation-maximization methods | 5
Depression-level assessment from multi-lingual conversational speech data using acoustic and text features | 5
RPCA-DRNN technique for monaural singing voice separation | 5
Comparative evaluation of interpolation methods for the directivity of musical instruments | 4
Audio source separation by activity probability detection with maximum correlation and simplex geometry | 4
MYRiAD: a multi-array room acoustic database | 4
Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speech | 4
Timestamp-aligning and keyword-biasing end-to-end ASR front-end for a KWS system | 4
Paralinguistic singing attribute recognition using supervised machine learning for describing the classical tenor solo singing voice in vocal pedagogy | 4
A CNN-based approach to identification of degradations in speech signals | 4
Single-channel speech enhancement based on joint constrained dictionary learning | 4
Discriminative features based on modified log magnitude spectrum for playback speech detection | 4
AUC optimization for deep learning-based voice activity detection | 4
Noise power spectral density scaled SNR response estimation with restricted range search for sound source localisation using unmanned aerial vehicles | 4
Estimation of playable piano fingering by pitch-difference fingering match model | 4