EURASIP Journal on Audio Speech and Music Processing

Papers
(The TQCC of EURASIP Journal on Audio Speech and Music Processing is 3. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-02-01 to 2025-02-01.)
ArticleCitations
Paralinguistic singing attribute recognition using supervised machine learning for describing the classical tenor solo singing voice in vocal pedagogy50
Automated audio captioning: an overview of recent progress and new challenges21
Analysis of transition cost and model parameters in speaker diarization for meetings20
Improving speech recognition systems for the morphologically complex Malayalam language using subword tokens for language modeling18
Data-based spatial audio processing18
Timestamp-aligning and keyword-biasing end-to-end ASR front-end for a KWS system18
Multi-source localization by using offset residual weight16
PlugSonic: a web- and mobile-based platform for dynamic and navigable binaural audio15
Gated recurrent unit predictor model-based adaptive differential pulse code modulation speech decoder13
Residual feedback suppression with extended model-based postfilters12
Learning-based robust speaker counting and separation with the aid of spatial coherence11
Learning domain-heterogeneous speaker recognition systems with personalized continual federated learning11
Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model10
The power of humorous audio: exploring emotion regulation in traffic congestion through EEG-based study10
UTran-DSR: a novel transformer-based model using feature enhancement for dysarthric speech recognition10
Auxiliary function-based algorithm for blind extraction of a moving speaker9
Neural network-based non-intrusive speech quality assessment using attention pooling function9
Automatic detection of attachment style in married couples through conversation analysis9
MIRACLE—a microphone array impulse response dataset for acoustic learning8
Heterogeneous separation consistency training for adaptation of unsupervised speech separation8
Stripe-Transformer: deep stripe feature learning for music source separation8
NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain7
dEchorate: a calibrated room impulse response dataset for echo-aware signal processing7
Dynamically localizing multiple speakers based on the time-frequency domain6
On the selection of the number of beamformers in beamforming-based binaural reproduction6
Correction: N-dimensional N-microphone sound source localization5
Deep neural networks for automatic speech processing: a survey from large corpora to limited data5
Continuous lipreading based on acoustic temporal alignments5
Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones5
An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction5
A lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams5
Modelling note’s pitch and duration in trained professional singers5
AUC optimization for deep learning-based voice activity detection5
Robustness of ad hoc microphone clustering using speaker embeddings: evaluation under realistic and challenging scenarios4
Correction: Robustness of ad hoc microphone clustering using speaker embeddings: evaluation under realistic and challenging scenarios4
Generating chord progression from melody with flexible harmonic rhythm and controllable harmonic density4
Correction: DeepDet: YAMNet with BottleNeck Attention Module (BAM) for TTS synthesis detection4
A latent rhythm complexity model for attribute-controlled drum pattern generation4
Optimizing feature fusion for improved zero-shot adaptation in text-to-speech synthesis4
Improving multi-talker binaural DOA estimation by combining periodicity and spatial features in convolutional neural networks4
Microphone utility estimation in acoustic sensor networks using single-channel signal features4
AAM: a dataset of Artificial Audio Multitracks for diverse music information retrieval tasks3
Frequency-dependent auto-pooling function for weakly supervised sound event detection3
Masked multi-center angular margin loss for language recognition3
A multichannel learning-based approach for sound source separation in reverberant environments3
A noise PSD estimation algorithm using derivative-based high-pass filter in non-stationary noise conditions3
Musical note onset detection based on a spectral sparsity measure3
Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?3
Supervised Attention Multi-Scale Temporal Convolutional Network for monaural speech enhancement3
Quantifying headphone listening experience in virtual sound environments using distraction3
Synthesis of soundfields through irregular loudspeaker arrays based on convolutional neural networks3
0.051681995391846