IEEE Transactions on Image Processing

Papers
(The H4-Index of IEEE Transactions on Image Processing is 86. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-12-01 to 2025-12-01.)
ArticleCitations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate717
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model700
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences619
Dual Alternating Direction Method of Multipliers for Inverse Imaging615
An Explanation Method Based on Interpretable Linear Model With Four Key Characteristics502
Multiframe Joint Enhancement for Early Interlaced Videos456
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing443
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization409
Toward Efficient Test Time Adaptation With Hierarchical Distribution Alignment402
Cross-Domain Few-Shot Medical Image Segmentation via Dynamic Semantic Matching376
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors301
Variational Structured Attention Networks for Deep Visual Representation Learning293
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence255
Self-Supervised Matting-Specific Portrait Enhancement and Generation246
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals244
Canonical Correlation Analysis With Low-Rank Learning for Image Representation229
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion210
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation209
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal209
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency205
Spatial Frequency Modulation Network for Efficient Image Dehazing200
Pose-Appearance Relational Modeling for Video Action Recognition195
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection194
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation188
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets188
Discrete Metric Learning for Fast Image Set Classification187
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach186
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation175
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation174
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments173
Automatic Quaternion-Domain Color Image Stitching169
Fine-Grained Recognition With Learnable Semantic Data Augmentation168
Cross-Modality Pyramid Alignment for Visual Intention Understanding163
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation155
Multimodal Unrolled Robust PCA for Background Foreground Separation154
Differentiable SAR Renderer and Image-Based Target Reconstruction148
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond147
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments145
Real Image Denoising With a Locally-Adaptive Bitonic Filter138
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition137
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection136
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering134
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models132
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining132
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting131
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification131
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering129
GMLight: Lighting Estimation via Geometric Distribution Approximation129
Toward Projected Clustering With Aggregated Mapping128
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment127
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering127
Advances in Predictive RAHT for Geometric Point Cloud Compression125
Interactive Face Video Coding: A Generative Compression Framework124
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction124
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning124
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction123
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning122
IMU-Assisted Online Video Background Identification120
Distractor-Aware Event-Based Tracking118
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning117
Motion and Appearance Decoupling Representation for Event Cameras117
SRS: Siamese Reconstruction-Segmentation Network Based on Dynamic-Parameter Convolution112
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection110
Learning Dynamic Prompts for All-in-One Image Restoration109
ScaleNet: Scaling up Pretrained Neural Networks With Incremental Parameters108
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data108
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm108
Unsupervised Person Re-Identification With Stochastic Training Strategy107
Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain103
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion103
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection102
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification101
Rethinking Sampling Strategies for Unsupervised Person Re-Identification100
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression99
Grammar-Induced Wavelet Network for Human Parsing98
DUT: Learning Video Stabilization by Simply Watching Unstable Videos95
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation94
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring92
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching92
FsaNet: Frequency Self-Attention for Semantic Segmentation91
Multi-Exposure Image Fusion via Deformable Self-Attention90
Stacked Deconvolutional Network for Semantic Segmentation90
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering89
Point-Based Learnable Query Generator for Human–Object Interaction Detection89
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels89
Fast 3D Room Layout Estimation Based on Compact High-Level Representation88
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation86
0.25111603736877