Multimedia Systems

Papers
(The H4-Index of Multimedia Systems is 26. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-10-01 to 2025-10-01.)
ArticleCitations
Unsupervised deep metric learning algorithm for crop disease images based on knowledge distillation networks113
Pseudo-global strategy-based visual comfort assessment considering attention mechanism90
A visual question answering model based on image captioning81
SS-CMT: a label independent cross-modal transferable adversarial video attack with sparse strategy75
A research for sound event localization and detection based on local–global adaptive fusion and temporal importance network71
Generalizing sentence-level lipreading to unseen speakers: a two-stream end-to-end approach69
On-line monitoring of structural performance of scraper conveyor driven by digital twin65
SS-YOLOv8: small-size object detection algorithm based on improved YOLOv8 for UAV imagery58
360° video quality assessment based on saliency-guided viewport extraction55
Model-based portrait video compression with spatial constraint and adaptive pose processing47
Automatic lymph node segmentation using deep parallel squeeze & excitation and attention Unet45
Segmentation-aware image super-resolution with generative adversarial networks44
SFRA: spatial fusion regression augmentation network for facial landmark detection44
Improving text-image cross-modal retrieval with contrastive loss39
SEMNet: a simple and efficient MLP-based network for 3D Face point clouds landmarks localization39
Multi-level sentiment-aware clustering for denoising in multimodal sentiment analysis with ASR errors39
The segmented UEC Food-100 dataset with benchmark experiment on food detection38
Deep Learning-based forgery detection and localization for compressed images using a hybrid optimization model38
Real emotion seeker: recalibrating annotation for facial expression recognition38
Point cloud inpainting with normal-based feature matching32
User authentication method based on keystroke dynamics and mouse dynamics using HDA31
CHCoT-MSLU: a coupled hierarchical chain-of-thought prompt learning model for multi-intent spoken language understanding29
Multi-view Isolated sign language recognition based on cross-view and multi-level transformer29
DiffRA: universal restorative adversarial attack based on diffusion model28
CAPNet: tomato leaf disease detection network based on adaptive feature fusion and convolutional enhancement28
SFFN-YOLO for small object detection in aerial images28
ConASD: Contrastive Few Shot Learning for Detecting Autism Spectrum Disorder via Eye Tracking Scanpath26
LMFE-RDD: a road damage detector with a lightweight multi-feature extraction network26
0.18747997283936