IEEE Transactions on Image Processing

Papers
(The TQCC of IEEE Transactions on Image Processing is 20. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-10-01 to 2025-10-01.)
ArticleCitations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate641
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets619
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model563
Real Image Denoising With a Locally-Adaptive Bitonic Filter534
Differentiable SAR Renderer and Image-Based Target Reconstruction438
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency417
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation389
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification342
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond341
Fine-Grained Recognition With Learnable Semantic Data Augmentation338
Pose-Appearance Relational Modeling for Video Action Recognition272
Dual Alternating Direction Method of Multipliers for Inverse Imaging270
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion246
Multiframe Joint Enhancement for Early Interlaced Videos222
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals220
GMLight: Lighting Estimation via Geometric Distribution Approximation202
Toward Projected Clustering With Aggregated Mapping197
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation194
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering186
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation186
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence185
Self-Supervised Matting-Specific Portrait Enhancement and Generation182
Automatic Quaternion-Domain Color Image Stitching179
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing172
Multimodal Unrolled Robust PCA for Background Foreground Separation170
Variational Structured Attention Networks for Deep Visual Representation Learning170
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization163
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors162
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences158
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering154
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation154
Canonical Correlation Analysis With Low-Rank Learning for Image Representation152
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments147
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition137
Spatial Frequency Modulation Network for Efficient Image Dehazing131
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal130
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting130
An Explanation Method Based on Interpretable Linear Model with Four Key Characteristics128
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments127
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering127
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection125
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach124
Cross-Modality Pyramid Alignment for Visual Intention Understanding123
Discrete Metric Learning for Fast Image Set Classification122
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining122
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation120
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment119
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection119
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction117
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning116
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval114
Grammar-Induced Wavelet Network for Human Parsing114
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision113
Interactive Face Video Coding: A Generative Compression Framework113
Fast 3D Room Layout Estimation Based on Compact High-Level Representation113
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering113
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching110
SRS: Siamese Reconstruction-Segmentation Network based on Dynamic-Parameter Convolution110
Inverse Image Frequency for Long-Tailed Image Recognition110
Unsupervised Person Re-Identification With Stochastic Training Strategy107
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression104
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data104
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning104
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing102
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning102
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement101
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion101
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images100
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction99
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection99
IMU-Assisted Online Video Background Identification97
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification93
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm92
Variational Bayes Image Restoration With Compressive Autoencoders92
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression91
Advances in Predictive RAHT for Geometric Point Cloud Compression90
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching89
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring88
Multi-Exposure Image Fusion via Deformable Self-Attention88
Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain87
Learning Dynamic Prompts for All-in-One Image Restoration86
Point-Based Learnable Query Generator for Human–Object Interaction Detection85
Rethinking Sampling Strategies for Unsupervised Person Re-Identification85
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition85
Stacked Deconvolutional Network for Semantic Segmentation85
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation82
Precise Facial Landmark Detection by Reference Heatmap Transformer82
Video Moment Retrieval With Cross-Modal Neural Architecture Search81
FsaNet: Frequency Self-Attention for Semantic Segmentation78
KSS-ICP: Point Cloud Registration Based on Kendall Shape Space77
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection76
DUT: Learning Video Stabilization by Simply Watching Unstable Videos76
Motion and Appearance Decoupling Representation for Event Cameras76
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation74
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model74
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels74
Distractor-Aware Event-Based Tracking74
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation74
Weighted Feature Fusion of Convolutional Neural Network and Graph Attention Network for Hyperspectral Image Classification74
RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation73
Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer’s Disease Diagnosis73
One Sketch for All: One-Shot Personalized Sketch Segmentation72
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network72
MaskFaceGAN: High-Resolution Face Editing With Masked GAN Latent Code Optimization72
PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates72
Joint Local and Nonlocal Progressive Prediction for Versatile Video Coding72
Generalizing to Out-of-Sample Degradations via Model Reprogramming71
Energy-Based Domain Adaptation Without Intermediate Domain Dataset for Foggy Scene Segmentation70
RobustMat: Neural Diffusion for Street Landmark Patch Matching Under Challenging Environments70
HQ2CL: A High-Quality Class Center Learning System for Deep Face Recognition70
Learned Spherical Image Compression With Spherical Convolution-Self-Attention and Transformer Context Model69
Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment69
Fast Learning Radiance Fields by Shooting Much Fewer Rays69
Rich Action-Semantic Consistent Knowledge for Early Action Prediction68
Hierarchical Random Walker Segmentation for Large Volumetric Biomedical Images68
Spatially Consistent Transformer for Colorization in Monochrome-Color Dual-Lens System66
Graph-Based Depth Denoising & Dequantization for Point Cloud Enhancement66
Mutually Reinforcing Learning of Decoupled Degradation and Diffusion Enhancement for Unpaired Low-Light Image Lightening65
Causal Inference Hashing for Long-Tailed Image Retrieval65
A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding65
Exploring the Potential of Pooling Techniques for Universal Image Restoration65
High-Quality and Diverse Few-Shot Image Generation via Masked Discrimination64
Enhancing Few-Shot Out-of-Distribution Detection With Pre-Trained Model Features64
Continual Referring Expression Comprehension via Dual Modular Memorization63
Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models63
Implicit-Explicit Integrated Representations for Multi-View Video Compression62
Image Reconstruction for Accelerated MR Scan With Faster Fourier Convolutional Neural Networks62
DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection62
Compact Representation and Reliable Classification Learning for Point-Level Weakly-Supervised Action Localization62
UVaT: Uncertainty Incorporated View-Aware Transformer for Robust Multi-View Classification61
Semantic Representation and Attention Alignment for Graph Information Bottleneck in Video Summarization61
Semi-Supervised Domain Adaptive Structure Learning61
Reduced Biquaternion Dual-Branch Deraining U-Network via Multi-Attention Mechanism61
Characteristic Mapping for Ellipse Detection Acceleration61
Rethinking Object Saliency Ranking: A Novel Whole-Flow Processing Paradigm60
Multi-Scale Fusion and Decomposition Network for Single Image Deraining60
NesTD-Net: Deep NESTA-Inspired Unfolding Network With Dual-Path Deblocking Structure for Image Compressive Sensing59
A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification59
Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation58
AAP-MIT: Attentive Atrous Pyramid Network and Memory Incorporated Transformer for Multisentence Video Description58
HOPE: Enhanced Position Image Priors via High-Order Implicit Representations58
Robust Ellipse Fitting Based on Maximum Correntropy Criterion With Variable Center58
Fuzzy Sparse Deviation Regularized Robust Principal Component Analysis58
Noise Prior Knowledge Informed Bayesian Inference Network for Hyperspectral Super-Resolution57
Joint Denoising-Demosaicking Network for Long-Wave Infrared Division-of-Focal-Plane Polarization Images With Mixed Noise Level Estimation57
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding57
Cross-Modal Causal Representation Learning for Radiology Report Generation56
RoMo: Robust Unsupervised Multimodal Learning With Noisy Pseudo Labels56
Reviewer Summary for Transactions on Image Processing56
Hierarchical Superpixel Segmentation by Parallel CRTrees Labeling56
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation56
Image-Level Adaptive Adversarial Ranking for Person Re-Identification55
Arbitrary-Scale Texture Generation From Coarse-Grained Control55
MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection55
Data Augmentation Using Bitplane Information Recombination Model54
Multi-Person Pose Tracking With Sparse Key-Point Flow Estimation and Hierarchical Graph Distance Minimization54
View-Wise Versus Cluster-Wise Weight: Which Is Better for Multi-View Clustering?54
Toward Scalable and Unified Example-Based Explanation and Outlier Detection54
Sampling Agnostic Feature Representation for Long-Term Person Re-Identification53
Boosting Monocular 3D Human Pose Estimation With Part Aware Attention53
SSL++: Improving Self-Supervised Learning by Mitigating the Proxy Task-Specificity Problem53
BPMTrack: Multi-Object Tracking With Detection Box Application Pattern Mining53
Source-Guided Target Feature Reconstruction for Cross-Domain Classification and Detection53
Bayesian Nonnegative Tensor Completion With Automatic Rank Determination53
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches53
Learning Transferable Conceptual Prototypes for Interpretable Unsupervised Domain Adaptation53
CartoonLossGAN: Learning Surface and Coloring of Images for Cartoonization53
Multi-Label Auroral Image Classification Based on CNN and Transformer52
FOVQA: Blind Foveated Video Quality Assessment52
Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition52
Restoration of Images Taken Through a Dirty Window Using Optics-Guided Transformer52
MetaAge: Meta-Learning Personalized Age Estimators51
Bi-Directional Pseudo-Three-Dimensional Network for Video Frame Interpolation51
PointFormer: Keypoint-Guided Transformer for Simultaneous Nuclei Segmentation and Classification in Multi-Tissue Histology Images51
U-N2C: A Dual Memory-Guided Disentanglement Framework for Unsupervised System Matrix Denoising in Magnetic Particle Imaging51
UniEmoX: Cross-Modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception50
Learning Domain Invariant Representations for Generalizable Person Re-Identification50
Deep Ranking Exemplar-Based Dynamic Scene Deblurring50
Zero-Shot Camouflaged Object Detection50
Attribute and State Guided Structural Embedding Network for Vehicle Re-Identification50
Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy49
Contrastive Conditional Latent Diffusion for Audio-Visual Segmentation49
FABNet: Frequency-Aware Binarized Network for Single Image Super-Resolution49
Enhancing Text-Based Person Retrieval by Combining Fused Representation and Reciprocal Learning With Adaptive Loss Refinement49
PFONet: A Progressive Feedback Optimization Network for Lightweight Single Image Dehazing49
DVMark: A Deep Multiscale Framework for Video Watermarking48
Model-Induced Generalization Error Bound for Information-Theoretic Representation Learning in Source-Data-Free Unsupervised Domain Adaptation48
Degraded Reference Image Quality Assessment48
Dynamic Slimmable Denoising Network48
SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses48
Rotational Convolution: Rethinking Convolution for Downside Fisheye Images48
A New Non-Linear Hyperbolic-Parabolic Coupled PDE Model for Image Despeckling47
DO-Conv: Depthwise Over-Parameterized Convolutional Layer47
CKD: Contrastive Knowledge Distillation From a Sample-Wise Perspective47
Multistage Spatio-Temporal Networks for Robust Sketch Recognition47
TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution47
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation47
Sensitivity Decouple Learning for Image Compression Artifacts Reduction47
Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction47
Hierarchical Hashing Learning for Image Set Classification46
Deep Underwater Image Quality Assessment With Explicit Degradation Awareness Embedding46
MBFQuant: A Multiplier-Bitwidth-Fixed, Mixed-Precision Quantization Method for Mobile CNN-Based Applications46
Image Compression Using Stochastic-AFD Based Multisignal Sparse Representation46
Few-Shot Domain Adaptation via Mixup Optimal Transport46
Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules45
Underwater Image Enhancement With Hyper-Laplacian Reflectance Priors45
Hyperspectral Image Classification via Cascaded Spatial Cross-Attention Network45
PCE-GAN: A Generative Adversarial Network for Point Cloud Attribute Quality Enhancement Based on Optimal Transport45
Interpretable Neural Networks for Video Separation: Deep Unfolding RPCA With Foreground Masking45
Improving Transferability of Universal Adversarial Perturbation With Feature Disruption44
Underwater Image Enhancement via Minimal Color Loss and Locally Adaptive Contrast Enhancement44
Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing44
IEEE Transactions on Image Processing publication information44
SDSFusion: A Semantic-Aware Infrared and Visible Image Fusion Network for Degraded Scenes44
Lightweight Deep Neural Networks for Ship Target Detection in SAR Imagery44
U-Shape Transformer for Underwater Image Enhancement44
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks44
Linearly Transformed Color Guide for Low-Bitrate Diffusion-Based Image Compression44
Coupled Splines for Sparse Curve Fitting43
Siamese-DETR for Generic Multi-Object Tracking43
Leveraging Frequency Analysis for Image Denoising Network Pruning43
BVI-VFI: A Video Quality Database for Video Frame Interpolation43
Enhancing Multimodal Learning via Hierarchical Fusion Architecture Search With Inconsistency Mitigation43
Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding43
Magi-Net: Meta Negative Network for Early Activity Prediction43
Interactive Learning of Intrinsic and Extrinsic Properties for All-Day Semantic Segmentation42
Decoupling Discriminative Attributes for Few-Shot Fine-Grained Recognition42
Learning to Compare Relation: Semantic Alignment for Few-Shot Learning42
DisAVR: Disentangled Adaptive Visual Reasoning Network for Diagram Question Answering42
Hyperpixels: Flexible 4D Over-Segmentation for Dense and Sparse Light Fields42
Toward Transparent Deep Image Aesthetics Assessment With Tag-Based Content Descriptors42
Image Copy-Move Forgery Detection via Deep PatchMatch and Pairwise Ranking Learning41
Learning Structure Aware Deep Spectral Embedding41
Diverse Target and Contribution Scheduling for Domain Generalization41
Scale-Aware Crowd Counting Network With Annotation Error Modeling41
YOLOH: You Only Look One Hourglass for Real-Time Object Detection40
Hierarchical Prior-Based Super Resolution for Point Cloud Geometry Compression40
Robust Palmprint Recognition via Multi-Stage Noisy Label Selection and Correction40
ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection40
Accurate 3D Measurement of Complex Texture Objects by Height Compensation Using a Dual-Projector Structure40
Dual Mixture Model Based CNN for Image Denoising40
Conditional Feature Learning Based Transformer for Text-Based Person Search40
Multimodal Composition Example Mining for Composed Query Image Retrieval40
Designing an Illumination-Aware Network for Deep Image Relighting40
Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature Alignment40
Deepfake Forensics via an Adversarial Game40
StreakNet-Arch: An Anti-Scattering Network-Based Architecture for Underwater Carrier LiDAR-Radar Imaging40
Multi-Branch and Progressive Network for Low-Light Image Enhancement39
State-Aware Compositional Learning Toward Unbiased Training for Scene Graph Generation39
Multi-Modal Remote Sensing Image Matching Considering Co-Occurrence Filter39
0.25175213813782