International Journal of Multimedia Information Retrieval

Papers
(The median citation count of International Journal of Multimedia Information Retrieval is 1. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
A review on deep learning in medical image analysis240
Anomaly detection using edge computing in video surveillance system: review65
Design ensemble deep learning model for pneumonia disease classification44
Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown39
Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey36
Music similarity measurement and recommendation system using convolutional neural networks33
Contrastive self-supervised learning: review, progress, challenges and future research directions26
A study of classification and feature extraction techniques for brain tumor detection23
Optimized MobileNet + SSD: a real-time pedestrian detection on a low-end edge device19
Multi-sensor human activity recognition using CNN and GRU17
Text detection, recognition, and script identification in natural scene images: a Review17
Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products13
A literature review and perspectives in deepfakes: generation, detection, and applications13
AMS-CNN: Attentive multi-stream CNN for video-based crowd counting11
Music emotion recognition based on segment-level two-stage learning11
An automatic approach of audio feature engineering for the extraction, analysis and selection of descriptors10
Multimodal news analytics using measures of cross-modal entity and context consistency9
Cluster-based quotas for fairness improvements in music recommendation systems9
Cross-domain image retrieval: methods and applications9
Siamese coding network and pair similarity prediction for near-duplicate image detection8
Content-based image retrieval using Group Normalized-Inception-Darknet-538
InceptionDepth-wiseYOLOv2: improved implementation of YOLO framework for pedestrian detection8
A novel method for video shot boundary detection using CNN-LSTM approach8
Human pose estimation using deep learning: review, methodologies, progress and future research directions8
PDS-Net: A novel point and depth-wise separable convolution for real-time object detection8
Reinforcement learning applied to machine vision: state of the art8
Different techniques for Alzheimer’s disease classification using brain images: a study7
Gender classification from face images using central difference convolutional networks7
Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review7
Caption TLSTMs: combining transformer with LSTMs for image captioning7
FCT: fusing CNN and transformer for scene classification7
A comprehensive survey of multimodal fake news detection techniques: advances, challenges, and opportunities7
CLIP-based fusion-modal reconstructing hashing for large-scale unsupervised cross-modal retrieval6
Few-shot and meta-learning methods for image understanding: a survey6
Deep learning for video-text retrieval: a review5
An improved customized CNN model for adaptive recognition of cerebral palsy people’s handwritten digits in assessment5
Alleviating the cold-start playlist continuation in music recommendation using latent semantic indexing5
Multi-class imbalanced image classification using conditioned GANs5
Sentiment analysis using deep learning techniques: a comprehensive review5
Optical music recognition for homophonic scores with neural networks and synthetic music generation5
Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition5
A unified approach of detecting misleading images via tracing its instances on web and analyzing its past context for the verification of multimedia content4
Medical image watermarking: a survey on applications, approach and performance requirement compliance4
Counterfactual attribute-based visual explanations for classification4
Study of Alzheimer’s disease brain impairment and methods for its early diagnosis: a comprehensive survey4
Multimodal image and audio music transcription4
Content-based image retrieval using handcraft feature fusion in semantic pyramid4
DC-GNN: drop channel graph neural network for object classification and part segmentation in the point cloud3
Recognition of student engagement in classroom from affective states3
An interactive attribute-preserving fashion recommendation with 3D image-based virtual try-on3
Neural style transfer generative adversarial network (NST-GAN) for facial expression recognition3
FDAM: full-dimension attention module for deep convolutional neural networks3
TCKGE: Transformers with contrastive learning for knowledge graph embedding3
End-to-end residual learning-based deep neural network model deployment for human activity recognition3
Who is gambling? Finding cryptocurrency gamblers using multi-modal retrieval methods3
Generative adversarial networks for 2D-based CNN pose-invariant face recognition3
Enhancing the performance of 3D auto-correlation gradient features in depth action classification3
Semantic-aware visual scene representation2
SPSD: Similarity-preserving self-distillation for video–text retrieval2
LG-MLFormer: local and global MLP for image captioning2
MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning2
Prototype local–global alignment network for image–text retrieval2
MemeTector: enforcing deep focus for meme detection2
RGBD deep multi-scale network for background subtraction2
Early-stopped learning for action prediction in videos2
A local representation-enhanced recurrent convolutional network for image captioning2
A fast and robust affine-invariant method for shape registration under partial occlusion2
Ornament image retrieval using few-shot learning2
How does a kernel based on gradients of infinite-width neural networks come to be widely used: a review of the neural tangent kernel1
VERITE: a Robust benchmark for multimodal misinformation detection accounting for unimodal bias1
A comprehensive survey on chest diseases analysis: technique, challenges and future research directions1
Nested-Net: a deep nested network for background subtraction1
PSNet: position-shift alignment network for image caption1
Modal interaction-enhanced prompt learning by transformer decoder for vision-language models1
Deep multiple aggregation networks for action recognition1
Special issue on cross-modal retrieval and analysis1
State of art and emerging trends on group recommender system: a comprehensive review1
Visual and semantic ensemble for scene text recognition with gated dual mutual attention1
Detecting abnormal behavior in megastore for crime prevention using a deep neural architecture1
Text-assisted attention-based cross-modal hashing1
Multi-knowledge-driven enhanced module for visible-infrared cross-modal person Re-identification1
Emotion-aware music tower blocks (EmoMTB ): an intelligent audiovisual interface for music discovery and recommendation1
Joint multi-scale information and long-range dependence for video captioning1
Dual-feature collaborative relation-attention networks for visual question answering1
Tri-RAT: optimizing the attention scores for image captioning1
Semantic-enhanced discriminative embedding learning for cross-modal retrieval1
Mual: enhancing multimodal sentiment analysis with cross-modal attention and difference loss1
Maximizing mutual information inside intra- and inter-modality for audio-visual event retrieval1
FOF: a fine-grained object detection and feature extraction end-to-end network1
Attribute-wise reasoning reinforcement learning for pedestrian attribute retrieval1
A deep image retrieval network using Max-m-Min pooling and morphological feature generating residual blocks1
ConvST-LSTM-Net: convolutional spatiotemporal LSTM networks for skeleton-based human action recognition1
A lightweight small object detection algorithm based on improved YOLOv5 for driving scenarios1
Few2Decide: towards a robust model via using few neuron connections to decide1
0.049641847610474