IEEE Transactions on Image Processing

Papers
(The H4-Index of IEEE Transactions on Image Processing is 79. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate580
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets553
Canonical Correlation Analysis With Low-Rank Learning for Image Representation526
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach471
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering386
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors376
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model360
Automatic Quaternion-Domain Color Image Stitching317
Spatial Frequency Modulation Network for Efficient Image Dehazing276
Toward Projected Clustering With Aggregated Mapping274
Self-Supervised Matting-Specific Portrait Enhancement and Generation248
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection240
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering233
Real Image Denoising With a Locally-Adaptive Bitonic Filter206
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization202
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence179
Pose-Appearance Relational Modeling for Video Action Recognition178
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection173
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation172
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion171
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering164
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences160
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting159
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation159
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation153
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments153
Multiframe Joint Enhancement for Early Interlaced Videos151
Differentiable SAR Renderer and Image-Based Target Reconstruction147
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation143
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing141
Cross-Modality Pyramid Alignment for Visual Intention Understanding141
Variational Structured Attention Networks for Deep Visual Representation Learning137
Dual Alternating Direction Method of Multipliers for Inverse Imaging130
Multimodal Unrolled Robust PCA for Background Foreground Separation121
GMLight: Lighting Estimation via Geometric Distribution Approximation121
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining121
Discrete Metric Learning for Fast Image Set Classification119
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond119
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal118
Fine-Grained Recognition With Learnable Semantic Data Augmentation116
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition115
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments112
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment112
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification112
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation111
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency110
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction110
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning109
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm108
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision107
Grammar-Induced Wavelet Network for Human Parsing107
Stacked Deconvolutional Network for Semantic Segmentation106
Distractor-Aware Event-Based Tracking106
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression104
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching101
Unsupervised Person Re-Identification With Stochastic Training Strategy100
Precise Facial Landmark Detection by Reference Heatmap Transformer99
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels99
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection98
IMU-Assisted Online Video Background Identification96
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction96
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression94
Variational Bayes Image Restoration With Compressive Autoencoders94
Advances in Predictive RAHT for Geometric Point Cloud Compression91
Learning Dynamic Prompts for All-in-One Image Restoration91
Interactive Face Video Coding: A Generative Compression Framework91
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning91
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion91
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching90
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model89
Fast 3D Room Layout Estimation Based on Compact High-Level Representation88
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation87
Multi-Exposure Image Fusion via Deformable Self-Attention87
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering83
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection82
FsaNet: Frequency Self-Attention for Semantic Segmentation81
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition81
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation81
Rethinking Sampling Strategies for Unsupervised Person Re-Identification80
0.49743008613586