OOIR: Observatory of International Research

Papers

(The TQCC of International Journal of Multimedia Information Retrieval is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)

Article	Citations
A voting-based novel spatio-temporal fusion framework for video saliency using transfer learning mechanism	329
DAABNet: depth-wise asymmetric attention bottleneck for real-time semantic segmentation	87
Editorial: web of science and scopus impact in IJMIR	44
How can users’ comments posted on social media videos be a source of effective tags?	44
Detecting abnormal behavior in megastore for crime prevention using a deep neural architecture	37
Multimodal music datasets? Challenges and future goals in music processing	25
VERITE: a Robust benchmark for multimodal misinformation detection accounting for unimodal bias	24
Style-aware adversarial pairwise ranking for image recommendation systems	21
Enhancing the performance of 3D auto-correlation gradient features in depth action classification	21
Stratified Graph Indexing for efficient search in deep descriptor databases	17
Mual: enhancing multimodal sentiment analysis with cross-modal attention and difference loss	15
End-to-end residual learning-based deep neural network model deployment for human activity recognition	14
Visual and semantic ensemble for scene text recognition with gated dual mutual attention	13
Similar interior coordination image retrieval with multi-view features	12
Reinforcement learning applied to machine vision: state of the art	11
Towards a high robust neural network via feature matching	11
Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition	10
Correction to: Different techniques for Alzheimer’s disease classification using brain images: a study	10
An interactive attribute-preserving fashion recommendation with 3D image-based virtual try-on	9
LG-MLFormer: local and global MLP for image captioning	9
How does a kernel based on gradients of infinite-width neural networks come to be widely used: a review of the neural tangent kernel	9
Gender classification from face images using central difference convolutional networks	9
CAMIR: fine-tuning CLIP and multi-head cross-attention mechanism for multimodal image retrieval with sketch and text features	8
RGBD deep multi-scale network for background subtraction	8
Optimized RT-DETR for accurate and efficient video object detection via decoupled feature aggregation	8

Neural style transfer generative adversarial network (NST-GAN) for facial expression recognition	8
Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey	8
Improving skeleton-based action recognition with interactive object information	8
Recent trends in recommender systems: a survey	8
Advancements in machine learning techniques for threat item detection in X-ray images: a comprehensive survey	8
Ornament image retrieval using few-shot learning	7
Caption TLSTMs: combining transformer with LSTMs for image captioning	7
Maximizing mutual information inside intra- and inter-modality for audio-visual event retrieval	7
Multi-modal emotion recognition using tensor decomposition fusion and self-supervised multi-tasking	7
Video anomaly detection with memory-guided multilevel embedding	7
A review on deep learning in medical image analysis	7
State of art and emerging trends on group recommender system: a comprehensive review	6
Dual-matrix guided reconstruction hashing for unsupervised cross-modal retrieval	6
Multiple feedback based adversarial collaborative filtering with aesthetics	6
3D skeleton-based human motion prediction using spatial–temporal graph convolutional network	6