IEEE Transactions on Image Processing

Papers
(The H4-Index of IEEE Transactions on Image Processing is 73. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
ArticleCitations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate470
Variational Structured Attention Networks for Deep Visual Representation Learning462
Multiframe Joint Enhancement for Early Interlaced Videos443
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation386
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency332
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets304
Canonical Correlation Analysis With Low-Rank Learning for Image Representation281
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion264
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach214
Automatic Quaternion-Domain Color Image Stitching212
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering210
Toward Projected Clustering With Aggregated Mapping199
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal188
Dual Alternating Direction Method of Multipliers for Inverse Imaging163
Multimodal Unrolled Robust PCA for Background Foreground Separation159
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting159
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors155
Self-Supervised Matting-Specific Portrait Enhancement and Generation150
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization150
Differentiable SAR Renderer and Image-Based Target Reconstruction144
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition138
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection138
GMLight: Lighting Estimation via Geometric Distribution Approximation134
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences134
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining130
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence129
Discrete Metric Learning for Fast Image Set Classification128
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model124
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation120
Pose-Appearance Relational Modeling for Video Action Recognition118
Cross-Modality Pyramid Alignment for Visual Intention Understanding115
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection115
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments114
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation110
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment108
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering105
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering103
Real Image Denoising With a Locally-Adaptive Bitonic Filter103
Fine-Grained Recognition With Learnable Semantic Data Augmentation102
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond101
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification100
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning99
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering98
Inverse Image Frequency for Long-Tailed Image Recognition96
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring95
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning95
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction94
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning93
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation92
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm90
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels90
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing90
Grammar-Induced Wavelet Network for Human Parsing88
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression88
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching88
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision88
Point-Based Learnable Query Generator for Human–Object Interaction Detection88
Distractor-Aware Event-Based Tracking86
Stacked Deconvolutional Network for Semantic Segmentation85
Unsupervised Person Re-Identification With Stochastic Training Strategy85
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching84
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification83
Rethinking Sampling Strategies for Unsupervised Person Re-Identification83
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data81
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation81
DUT: Learning Video Stabilization by Simply Watching Unstable Videos80
IMU-Assisted Online Video Background Identification79
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model76
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement75
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition75
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection74
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection74
Multi-Exposure Image Fusion via Deformable Self-Attention73
0.13045120239258