IEEE Transactions on Multimedia

Papers
(The H4-Index of IEEE Transactions on Multimedia is 59. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-03-01 to 2024-03-01.)
ArticleCitations
Scalable Discrete and Asymmetric Unequal Length Hashing Learning for Cross-Modal Retrieval976500490
A Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification321
Human Memory Update Strategy: A Multi-Layer Template Update Mechanism for Remote Visual Monitoring213
Low-Light Image Enhancement With Semi-Decoupled Decomposition171
Extended Feature Pyramid Network for Small Object Detection145
AttentionFGAN: Infrared and Visible Image Fusion Using Attention-Based Generative Adversarial Networks144
Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions144
Reversible Data Hiding in Encrypted Images Based on Multi-MSB Prediction and Huffman Coding138
MFDNet: Collaborative Poses Perception and Matrix Fisher Distribution for Head Pose Estimation136
3D Room Layout Estimation From a Single RGB Image133
StrongSORT: Make DeepSORT Great Again132
Coarse-to-Fine CNN for Image Super-Resolution122
Automated Colorization of a Grayscale Image With Seed Points Propagation115
DSLR: Deep Stacked Laplacian Restorer for Low-Light Image Enhancement112
Beyond Triplet Loss: Person Re-Identification With Fine-Grained Difference-Aware Pairwise Loss108
Image-to-Image Translation: Methods and Applications102
Learning Deep Multi-Level Similarity for Thermal Infrared Object Tracking98
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation98
Parameter Sharing Exploration and Hetero-Center Triplet Loss for Visible-Thermal Person Re-Identification97
Spatio-Temporal Attention Networks for Action Recognition and Detection95
Geometric Back-Projection Network for Point Cloud Classification93
A Dilated Inception Network for Visual Saliency Prediction91
PDR-Net: Perception-Inspired Single Image Dehazing Network With Refinement90
Spatial-Temporal Cascade Autoencoder for Video Anomaly Detection in Crowded Scenes86
Adaptive Graph Completion Based Incomplete Multi-View Clustering85
Consensus Graph Learning for Multi-View Clustering83
TBEFN: A Two-Branch Exposure-Fusion Network for Low-Light Image Enhancement79
CCAFNet: Crossflow and Cross-Scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images79
VehicleNet: Learning Robust Visual Representation for Vehicle Re-Identification79
Jointly Learning Kernel Representation Tensor and Affinity Matrix for Multi-View Clustering76
Kernelized Multiview Subspace Analysis By Self-Weighted Learning74
STNReID: Deep Convolutional Networks With Pairwise Spatial Transformer Networks for Partial Person Re-Identification73
Real-Time and Accurate UAV Pedestrian Detection for Social Distancing Monitoring in COVID-19 Pandemic73
Food Recommendation: Framework, Existing Solutions, and Challenges73
Image-Text Multimodal Emotion Classification via Multi-View Attentional Network72
An Improved Reversible Data Hiding in Encrypted Images Using Parametric Binary Tree Labeling72
Stacked U-Shape Network With Channel-Wise Attention for Salient Object Detection71
ATMFN: Adaptive-Threshold-Based Multi-Model Fusion Network for Compressed Face Hallucination71
Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning70
PTB-TIR: A Thermal Infrared Pedestrian Tracking Benchmark70
EHPE: Skeleton Cues-based Gaussian Coordinate Encoding for Efficient Human Pose Estimation68
Low-Rank Pairwise Alignment Bilinear Network For Few-Shot Fine-Grained Image Classification68
SiamCorners: Siamese Corner Networks for Visual Tracking68
Multi-View Multi-Label Learning With Sparse Feature Selection for Image Annotation68
Deep Multi-View Subspace Clustering With Unified and Discriminative Learning68
A Flexible Deep CNN Framework for Image Restoration65
Predicting the Perceptual Quality of Point Cloud: A 3D-to-2D Projection-Based Exploration64
Interact as You Intend: Intention-Driven Human-Object Interaction Detection64
PixelRL: Fully Convolutional Network With Reinforcement Learning for Image Processing64
Multi-Channel Deep Networks for Block-Based Image Compressive Sensing63
Multi-Level Correlation Adversarial Hashing for Cross-Modal Retrieval63
Deep Fusion Feature Representation Learning With Hard Mining Center-Triplet Loss for Person Re-Identification63
Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation63
Deep-IRTarget: An Automatic Target Detector in Infrared Imagery Using Dual-Domain Feature Extraction and Allocation62
Anti-Forensics for Face Swapping Videos via Adversarial Training62
Luminance-Aware Pyramid Network for Low-Light Image Enhancement62
Illumination-Adaptive Person Re-Identification62
PointHop: An Explainable Machine Learning Method for Point Cloud Classification62
A Recursive Reversible Data Hiding in Encrypted Images Method With a Very High Payload62
An Automated and Robust Image Watermarking Scheme Based on Deep Neural Networks59
WSCNet: Weakly Supervised Coupled Networks for Visual Sentiment Classification and Detection59
YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer59
0.049591064453125