IEEE Transactions on Image Processing

Papers
(The TQCC of IEEE Transactions on Image Processing is 17. The table below lists the papers whose CrossRef citation counts exceed that threshold, capped at 250 papers. It covers papers published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
Article | Citations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate | 470
Variational Structured Attention Networks for Deep Visual Representation Learning | 462
Multiframe Joint Enhancement for Early Interlaced Videos | 443
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation | 386
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency | 332
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets | 304
Canonical Correlation Analysis With Low-Rank Learning for Image Representation | 281
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion | 264
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach | 214
Automatic Quaternion-Domain Color Image Stitching | 212
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering | 210
Toward Projected Clustering With Aggregated Mapping | 199
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal | 188
Dual Alternating Direction Method of Multipliers for Inverse Imaging | 163
Multimodal Unrolled Robust PCA for Background Foreground Separation | 159
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting | 159
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors | 155
Self-Supervised Matting-Specific Portrait Enhancement and Generation | 150
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization | 150
Differentiable SAR Renderer and Image-Based Target Reconstruction | 144
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition | 138
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection | 138
GMLight: Lighting Estimation via Geometric Distribution Approximation | 134
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences | 134
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining | 130
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence | 129
Discrete Metric Learning for Fast Image Set Classification | 128
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model | 124
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation | 120
Pose-Appearance Relational Modeling for Video Action Recognition | 118
Cross-Modality Pyramid Alignment for Visual Intention Understanding | 115
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection | 115
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments | 114
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation | 110
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment | 108
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering | 105
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering | 103
Real Image Denoising With a Locally-Adaptive Bitonic Filter | 103
Fine-Grained Recognition With Learnable Semantic Data Augmentation | 102
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond | 101
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification | 100
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning | 99
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering | 98
Inverse Image Frequency for Long-Tailed Image Recognition | 96
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring | 95
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning | 95
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction | 94
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning | 93
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation | 92
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm | 90
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels | 90
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing | 90
Grammar-Induced Wavelet Network for Human Parsing | 88
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression | 88
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching | 88
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision | 88
Point-Based Learnable Query Generator for Human–Object Interaction Detection | 88
Distractor-Aware Event-Based Tracking | 86
Stacked Deconvolutional Network for Semantic Segmentation | 85
Unsupervised Person Re-Identification With Stochastic Training Strategy | 85
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching | 84
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification | 83
Rethinking Sampling Strategies for Unsupervised Person Re-Identification | 83
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data | 81
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation | 81
DUT: Learning Video Stabilization by Simply Watching Unstable Videos | 80
IMU-Assisted Online Video Background Identification | 79
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model | 76
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement | 75
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition | 75
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection | 74
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection | 74
Multi-Exposure Image Fusion via Deformable Self-Attention | 73
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression | 72
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval | 72
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction | 72
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images | 71
Video Moment Retrieval With Cross-Modal Neural Architecture Search | 70
Precise Facial Landmark Detection by Reference Heatmap Transformer | 70
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion | 70
Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain | 69
KSS-ICP: Point Cloud Registration Based on Kendall Shape Space | 68
RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation | 68
FsaNet: Frequency Self-Attention for Semantic Segmentation | 67
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation | 67
Weighted Feature Fusion of Convolutional Neural Network and Graph Attention Network for Hyperspectral Image Classification | 67
RobustMat: Neural Diffusion for Street Landmark Patch Matching Under Challenging Environments | 65
Fast Learning Radiance Fields by Shooting Much Fewer Rays | 63
Noise Prior Knowledge Informed Bayesian Inference Network for Hyperspectral Super-Resolution | 63
Mutually Reinforcing Learning of Decoupled Degradation and Diffusion Enhancement for Unpaired Low-Light Image Lightening | 62
Joint Denoising-Demosaicking Network for Long-Wave Infrared Division-of-Focal-Plane Polarization Images With Mixed Noise Level Estimation | 62
One Sketch for All: One-Shot Personalized Sketch Segmentation | 62
Robust Ellipse Fitting Based on Maximum Correntropy Criterion With Variable Center | 62
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding | 62
Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation | 61
NesTD-Net: Deep NESTA-Inspired Unfolding Network With Dual-Path Deblocking Structure for Image Compressive Sensing | 60
Semi-Supervised Domain Adaptive Structure Learning | 59
Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer’s Disease Diagnosis | 58
A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification | 57
Compact Representation and Reliable Classification Learning for Point-Level Weakly-Supervised Action Localization | 55
AAP-MIT: Attentive Atrous Pyramid Network and Memory Incorporated Transformer for Multisentence Video Description | 54
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network | 54
High-Quality and Diverse Few-Shot Image Generation via Masked Discrimination | 54
Characteristic Mapping for Ellipse Detection Acceleration | 54
Generalizing to Out-of-Sample Degradations via Model Reprogramming | 53
Rich Action-Semantic Consistent Knowledge for Early Action Prediction | 53
Image Reconstruction for Accelerated MR Scan With Faster Fourier Convolutional Neural Networks | 53
Enhancing Few-Shot Out-of-Distribution Detection With Pre-Trained Model Features | 52
Joint Local and Nonlocal Progressive Prediction for Versatile Video Coding | 52
Energy-Based Domain Adaptation Without Intermediate Domain Dataset for Foggy Scene Segmentation | 52
Semantic Representation and Attention Alignment for Graph Information Bottleneck in Video Summarization | 52
UVaT: Uncertainty Incorporated View-Aware Transformer for Robust Multi-View Classification | 52
HQ2CL: A High-Quality Class Center Learning System for Deep Face Recognition | 52
Continual Referring Expression Comprehension via Dual Modular Memorization | 51
Hierarchical Superpixel Segmentation by Parallel CRTrees Labeling | 51
Hierarchical Random Walker Segmentation for Large Volumetric Biomedical Images | 51
Fuzzy Sparse Deviation Regularized Robust Principal Component Analysis | 51
A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding | 51
Implicit-Explicit Integrated Representations for Multi-View Video Compression | 51
Graph-Based Depth Denoising & Dequantization for Point Cloud Enhancement | 50
DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection | 50
MaskFaceGAN: High-Resolution Face Editing With Masked GAN Latent Code Optimization | 50
Spatially Consistent Transformer for Colorization in Monochrome-Color Dual-Lens System | 50
Multi-Scale Fusion and Decomposition Network for Single Image Deraining | 50
PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates | 50
Rethinking Object Saliency Ranking: A Novel Whole-Flow Processing Paradigm | 50
Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment | 50
RoMo: Robust Unsupervised Multimodal Learning With Noisy Pseudo Labels | 50
Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models | 49
Reviewer Summary for Transactions on Image Processing | 49
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation | 48
Image-Level Adaptive Adversarial Ranking for Person Re-Identification | 48
MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection | 48
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation | 48
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches | 48
Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy | 48
Arbitrary-Scale Texture Generation From Coarse-Grained Control | 47
Image Compression Using Stochastic-AFD Based Multisignal Sparse Representation | 47
Multi-Person Pose Tracking With Sparse Key-Point Flow Estimation and Hierarchical Graph Distance Minimization | 46
Attribute and State Guided Structural Embedding Network for Vehicle Re-Identification | 46
Source-Guided Target Feature Reconstruction for Cross-Domain Classification and Detection | 46
Rotational Convolution: Rethinking Convolution for Downside Fisheye Images | 46
Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition | 45
Data Augmentation Using Bitplane Information Recombination Model | 45
Model-Induced Generalization Error Bound for Information-Theoretic Representation Learning in Source-Data-Free Unsupervised Domain Adaptation | 45
Sampling Agnostic Feature Representation for Long-Term Person Re-Identification | 45
A New Non-Linear Hyperbolic-Parabolic Coupled PDE Model for Image Despeckling | 45
Toward Scalable and Unified Example-Based Explanation and Outlier Detection | 45
Degraded Reference Image Quality Assessment | 45
SSL++: Improving Self-Supervised Learning by Mitigating the Proxy Task-Specificity Problem | 45
MBFQuant: A Multiplier-Bitwidth-Fixed, Mixed-Precision Quantization Method for Mobile CNN-Based Applications | 44
Deep Underwater Image Quality Assessment With Explicit Degradation Awareness Embedding | 43
View-Wise Versus Cluster-Wise Weight: Which Is Better for Multi-View Clustering? | 43
Multi-Label Auroral Image Classification Based on CNN and Transformer | 43
Interpretable Neural Networks for Video Separation: Deep Unfolding RPCA With Foreground Masking | 43
Bayesian Nonnegative Tensor Completion With Automatic Rank Determination | 43
Few-Shot Domain Adaptation via Mixup Optimal Transport | 42
FOVQA: Blind Foveated Video Quality Assessment | 42
Dynamic Slimmable Denoising Network | 42
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks | 42
Zero-Shot Camouflaged Object Detection | 42
Learning Domain Invariant Representations for Generalizable Person Re-Identification | 42
MetaAge: Meta-Learning Personalized Age Estimators | 41
PFONet: A Progressive Feedback Optimization Network for Lightweight Single Image Dehazing | 41
Sensitivity Decouple Learning for Image Compression Artifacts Reduction | 41
Hyperspectral Image Classification via Cascaded Spatial Cross-Attention Network | 41
Learning Transferable Conceptual Prototypes for Interpretable Unsupervised Domain Adaptation | 41
Bi-Directional Pseudo-Three-Dimensional Network for Video Frame Interpolation | 41
Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction | 41
CartoonLossGAN: Learning Surface and Coloring of Images for Cartoonization | 41
Deep Ranking Exemplar-Based Dynamic Scene Deblurring | 41
Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules | 41
DVMark: A Deep Multiscale Framework for Video Watermarking | 40
BPMTrack: Multi-Object Tracking With Detection Box Application Pattern Mining | 40
SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses | 40
Underwater Image Enhancement via Minimal Color Loss and Locally Adaptive Contrast Enhancement | 40
Hierarchical Hashing Learning for Image Set Classification | 39
Underwater Image Enhancement With Hyper-Laplacian Reflectance Priors | 39
Multistage Spatio-Temporal Networks for Robust Sketch Recognition | 39
U-Shape Transformer for Underwater Image Enhancement | 39
DO-Conv: Depthwise Over-Parameterized Convolutional Layer | 39
FABNet: Frequency-Aware Binarized Network for Single Image Super-Resolution | 39
Boosting Monocular 3D Human Pose Estimation With Part Aware Attention | 39
TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution | 39
Coupled Splines for Sparse Curve Fitting | 38
YOLOH: You Only Look One Hourglass for Real-Time Object Detection | 38
Image Copy-Move Forgery Detection via Deep PatchMatch and Pairwise Ranking Learning | 38
Deepfake Forensics via an Adversarial Game | 38
Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding | 38
State-Aware Compositional Learning Toward Unbiased Training for Scene Graph Generation | 38
Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing | 38
IEEE Transactions on Image Processing publication information | 38
Hierarchical Prior-Based Super Resolution for Point Cloud Geometry Compression | 38
Linearly Transformed Color Guide for Low-Bitrate Diffusion-Based Image Compression | 38
Siamese-DETR for Generic Multi-Object Tracking | 38
Versatile Denoising-Based Approximate Message Passing for Compressive Sensing | 37
Hyperpixels: Flexible 4D Over-Segmentation for Dense and Sparse Light Fields | 37
Leveraging Frequency Analysis for Image Denoising Network Pruning | 37
An Efficient Transformer Based on Global and Local Self-Attention for Face Photo-Sketch Synthesis | 37
Multimodal Composition Example Mining for Composed Query Image Retrieval | 36
Conditional Feature Learning Based Transformer for Text-Based Person Search | 36
Cluster-Guided Asymmetric Contrastive Learning for Unsupervised Person Re-Identification | 36
Towards Transparent Deep Image Aesthetics Assessment with Tag-based Content Descriptors | 36
Multi-Branch and Progressive Network for Low-Light Image Enhancement | 36
DisAVR: Disentangled Adaptive Visual Reasoning Network for Diagram Question Answering | 36
Accurate 3D Measurement of Complex Texture Objects by Height Compensation Using a Dual-Projector Structure | 35
Scale-Aware Crowd Counting Network with Annotation Error Modeling | 35
Improving Transferability of Universal Adversarial Perturbation With Feature Disruption | 35
BVI-VFI: A Video Quality Database for Video Frame Interpolation | 35
Lightweight Deep Neural Networks for Ship Target Detection in SAR Imagery | 35
View-Consistency Learning for Incomplete Multiview Clustering | 35
Dual Mixture Model Based CNN for Image Denoising | 35
Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning | 35
Designing an Illumination-Aware Network for Deep Image Relighting | 34
Learning to Compare Relation: Semantic Alignment for Few-Shot Learning | 34
Interactive Learning of Intrinsic and Extrinsic Properties for All-Day Semantic Segmentation | 34
Magi-Net: Meta Negative Network for Early Activity Prediction | 34
Learning Structure Aware Deep Spectral Embedding | 34
Anomaly Detection for Medical Images Using Heterogeneous Auto-Encoder | 34
Fast Generation of Superpixels With Lattice Topology | 33
Reformulating Graph Kernels for Self-Supervised Space-Time Correspondence Learning | 33
Enhancing Face Recognition With Detachable Self-Supervised Bypass Networks | 33
Alignment Relation is What You Need for Diagram Parsing | 33
Multi-Modal Remote Sensing Image Matching Considering Co-Occurrence Filter | 33
DREAM-PCD: Deep Reconstruction and Enhancement of mmWave Radar Pointcloud | 33
Disparity-Aware Reference Frame Generation Network for Multiview Video Coding | 33
Neuromorphic Synergy for Video Binarization | 33
Adaptive Betweenness Clustering for Semi-Supervised Domain Adaptation | 33
ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection | 33
Video Instance Shadow Detection Under the Sun and Sky | 33
Grouping Boundary Proposals for Fast Interactive Image Segmentation | 33
Self-Supervised Learning of Perceptually Optimized Block Motion Estimates for Video Compression | 33
HDR or SDR? A Subjective and Objective Study of Scaled and Compressed Videos | 32
Learning Common Semantics via Optimal Transport for Contrastive Multi-View Clustering | 32
Event-Based Optical Flow via Transforming Into Motion-Dependent View | 32
Joint Under-Sampling Pattern and Dual-Domain Reconstruction for Accelerating Multi-Contrast MRI | 32
INformer: Inertial-Based Fusion Transformer for Camera Shake Deblurring | 31
Diffusion Models as Strong Adversaries | 31
Study of Spatio-Temporal Modeling in Video Quality Assessment | 31
See360: Novel Panoramic View Interpolation | 31
Error Model and Concise Temporal Network for Indirect Illumination in 3D Reconstruction | 31
X-View: Non-Egocentric Multi-View 3D Object Detector | 31
Nowhere to Disguise: Spot Camouflaged Objects via Saliency Attribute Transfer | 31
Taylor Neural Network for Real-World Image Super-Resolution | 31
Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression | 31
Latitude-Redundancy-Aware All-Zero Block Detection for Fast 360-Degree Video Coding | 31
Improving Inconspicuous Attributes Modeling for Person Search by Language | 31
Complementary Calibration: Boosting General Continual Learning With Collaborative Distillation and Self-Supervision | 30
Multi-Modal Convolutional Dictionary Learning | 30
Task-to-Instance Prompt Learning for Vision-Language Models at Test Time | 30