Multimedia Systems

Papers
(The H4-Index of Multimedia Systems is 26. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-09-01 to 2025-09-01.)
ArticleCitations
Unsupervised deep metric learning algorithm for crop disease images based on knowledge distillation networks109
Pseudo-global strategy-based visual comfort assessment considering attention mechanism88
A research for sound event localization and detection based on local–global adaptive fusion and temporal importance network79
A visual question answering model based on image captioning72
SS-CMT: a label independent cross-modal transferable adversarial video attack with sparse strategy67
Correction: STASiamRPN: visual tracking based on spatiotemporal and attention66
360° video quality assessment based on saliency-guided viewport extraction58
Model-based portrait video compression with spatial constraint and adaptive pose processing58
CAPNet: tomato leaf disease detection network based on adaptive feature fusion and convolutional enhancement49
Automatic lymph node segmentation using deep parallel squeeze & excitation and attention Unet46
SFRA: spatial fusion regression augmentation network for facial landmark detection45
Segmentation-aware image super-resolution with generative adversarial networks44
Multi-level sentiment-aware clustering for denoising in multimodal sentiment analysis with ASR errors40
User authentication method based on keystroke dynamics and mouse dynamics using HDA39
SEMNet: a simple and efficient MLP-based network for 3D Face point clouds landmarks localization39
Feature fusion and optimization integrated refined deep residual network for diabetic retinopathy severity classification using fundus image38
Improving text-image cross-modal retrieval with contrastive loss38
The segmented UEC Food-100 dataset with benchmark experiment on food detection37
Face and voice cross-modal association with learning convex feature embedding34
GVA: guided visual attention approach for automatic image caption generation31
Deep Learning-based forgery detection and localization for compressed images using a hybrid optimization model29
Towards domain adaptation underwater image enhancement and restoration28
BENet: bi-directional enhanced network for image captioning27
Dual convolutional neural network with attention for image blind denoising26
Real emotion seeker: recalibrating annotation for facial expression recognition26
Point cloud inpainting with normal-based feature matching26
0.066019058227539