OOIR: Observatory of International Research

Papers

(The TQCC of International Journal of Multimedia Information Retrieval is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)

Article	Citations
Editorial: web of science and scopus impact in IJMIR	397
Towards a high robust neural network via feature matching	104
Recent trends in recommender systems: a survey	55
Video anomaly detection with memory-guided multilevel embedding	47
VERITE: a Robust benchmark for multimodal misinformation detection accounting for unimodal bias	46
Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey	36
Strengthening attention: knowledge distillation via cross-layer feature fusion for image classification	29
DELIGHT-Net: DEep and LIGHTweight network to segment Indian text at word level from wild scenic images	26
Enhanced YOLOv10 for small object detection with context-aware and adaptive modules	23
VPC-VoxelNet: multi-modal fusion 3D object detection networks based on virtual point clouds	23
A local representation-enhanced recurrent convolutional network for image captioning	20
Prototype local–global alignment network for image–text retrieval	19
Early-stopped learning for action prediction in videos	18
MMDL: a multi-modal deep learning for video highlight detection in sports	16
Similarity-based face image retrieval using sparsely embedded deep features and binary code learning	15
DC-GNN: drop channel graph neural network for object classification and part segmentation in the point cloud	15
CAMIR: fine-tuning CLIP and multi-head cross-attention mechanism for multimodal image retrieval with sketch and text features	15
Visual and semantic ensemble for scene text recognition with gated dual mutual attention	14
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysis	13
How can users’ comments posted on social media videos be a source of effective tags?	13
Human behavior recognition based on DualBiNet model	13
Multimodal music datasets? Challenges and future goals in music processing	12
An emotion-driven, transformer-based network for multimodal fake news detection	11
Semantic-enhanced discriminative embedding learning for cross-modal retrieval	11
Few2Decide: towards a robust model via using few neuron connections to decide	11

State of art and emerging trends on group recommender system: a comprehensive review	10
MFAFD: a few-shot learning method for cascading models with parameter free attention and finite discrete space	10
Image enhancement with bi-directional normalization and color attention-guided generative adversarial networks	10
Cross-domain image retrieval: methods and applications	10
Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review	10
Multi-view learning for camouflaged object detection with PVTv2	10
DAF-Net: dense attention feature pyramid network for multiscale object detection	10
Generative adversarial networks for 2D-based CNN pose-invariant face recognition	10
Human action recognition using an optical flow-gated recurrent neural network	10
Concept-based and embedding-based models in lifelog retrieval: an empirical comparison of performance	9
InceptionDepth-wiseYOLOv2: improved implementation of YOLO framework for pedestrian detection	9
Multi-sensor human activity recognition using CNN and GRU	9
Study of Alzheimer’s disease brain impairment and methods for its early diagnosis: a comprehensive survey	8
A voting-based novel spatio-temporal fusion framework for video saliency using transfer learning mechanism	8
FOF: a fine-grained object detection and feature extraction end-to-end network	8
Optical music recognition for homophonic scores with neural networks and synthetic music generation	8