IEEE Transactions on Image Processing

Papers
(The median citation count of IEEE Transactions on Image Processing is 4. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-02-01 to 2025-02-01.)
ArticleCitations
Helping Visually Impaired People Take Better Quality Pictures344
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach299
BooDet: Gradient Boosting Object Detection With Additive Learning-Based Prediction Aggregation298
Self-Paced Multi-Grained Cross-Modal Interaction Modeling for Referring Expression Comprehension245
Video Reenactment as Inductive Bias for Content-Motion Disentanglement234
Robust Object Detection via Adversarial Novel Style Exploration221
An Embeddable Implicit IUVD Representation for Part-Based 3D Human Surface Reconstruction207
To Boost Zero-Shot Generalization for Embodied Reasoning With Vision-Language Pre-Training168
Restructuring the Teacher and Student in Self-Distillation166
Text Prior Guided Scene Text Image Super-Resolution153
Toward Adversarial Robustness in Unlabeled Target Domains153
Temporal Phase Unwrapping Based on Unequal Phase-Shifting Code137
Dynamical Hyperspectral Unmixing With Variational Recurrent Neural Networks136
FlexHDR: Modeling Alignment and Exposure Uncertainties for Flexible HDR Imaging124
Content-Aware Scalable Deep Compressed Sensing113
Towards Low Light Enhancement With RAW Images109
Local Intensity Order Transformation for Robust Curvilinear Object Segmentation109
Multisubject Task-Related fMRI Data Processing via a Two-Stage Generalized Canonical Correlation Analysis106
Exploring the Robustness of Human Parsers Toward Common Corruptions105
Interactive Regression and Classification for Dense Object Detector101
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering100
Unsupervised Synthetic Acoustic Image Generation for Audio-Visual Scene Understanding99
VGSG: Vision-Guided Semantic-Group Network for Text-Based Person Search97
Cross-Modal Graph With Meta Concepts for Video Captioning94
Toward Top-Down Just Noticeable Difference Estimation of Natural Images91
Differentiable SAR Renderer and Image-Based Target Reconstruction90
Point Cloud Video Super-Resolution via Partial Point Coupling and Graph Smoothness88
Bi-Fusion of Structure and Deformation at Multi-Scale for Joint Segmentation and Registration87
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency87
Broad Spectrum Image Deblurring via an Adaptive Super-Network83
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation83
Multiframe Joint Enhancement for Early Interlaced Videos82
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation82
Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation79
Coarse Mask Guided Interactive Object Segmentation78
Real Image Denoising With a Locally-Adaptive Bitonic Filter78
Tree-Structured Data Clustering-Driven Neural Network for Intra Prediction in Video Coding74
A Novel Hybrid Level Set Model for Non-Rigid Object Contour Tracking72
Dynamic Frame Interpolation in Wavelet Domain71
Fast Multi-Grid Methods for Minimizing Curvature Energies71
Learnable Feature Augmentation Framework for Temporal Action Localization69
Feature Preserving Non-Rigid Iterative Weighted Closest Point and Semi-Curvature Registration68
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal67
Data Acquisition and Preparation for Dual-Reference Deep Learning of Image Super-Resolution66
Multi-Label Adversarial Attack With New Measures and Self-Paced Constraint Weighting66
Tolerating Annotation Displacement in Dense Object Counting via Point Annotation Probability Map66
Distance-Aware Occlusion Detection With Focused Attention66
Efficient 3D Scene Semantic Segmentation via Active Learning on Rendered 2D Images66
Fast and High-Performance Learned Image Compression With Improved Checkerboard Context Model, Deformable Residual Module, and Knowledge Distillation65
Fast Scalable Image Restoration Using Total Variation Priors and Expectation Propagation64
UMCGL: Universal Multi-View Consensus Graph Learning With Consistency and Diversity63
Self-Supervised Matting-Specific Portrait Enhancement and Generation63
NCSiam: Reliable Matching via Neighborhood Consensus for Siamese-Based Object Tracking60
A no-Reference Stereoscopic Image Quality Assessment Network Based on Binocular Interaction and Fusion Mechanisms60
Beyond Appearance: Multi-Frame Spatio-Temporal Context Memory Networks for Efficient and Robust Video Object Segmentation60
From Global to Local: Multi-Patch and Multi-Scale Contrastive Similarity Learning for Unsupervised Defocus Blur Detection58
Discrete Metric Learning for Fast Image Set Classification58
Tensor Cascaded-Rank Minimization in Subspace: A Unified Regime for Hyperspectral Image Low-Level Vision57
Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA57
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering56
Learning Weak Semantics by Feature Graph for Attribute-Based Person Search55
Efficient Robust Watermarking Based on Structure-Preserving Quaternion Singular Value Decomposition55
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification55
Pre-Demosaic Graph-Based Light Field Image Compression55
INSURE: An Information Theory iNspired diSentanglement and pURification modEl for Domain Generalization55
Exploiting Intra-Slice and Inter-Slice Redundancy for Learning-Based Lossless Volumetric Image Compression54
EDDMF: An Efficient Deep Discrepancy Measuring Framework for Full-Reference Light Field Image Quality Assessment54
Two-Stage Copy-Move Forgery Detection With Self Deep Matching and Proposal SuperGlue54
Delving Into Crispness: Guided Label Refinement for Crisp Edge Detection53
CalibNet: Dual-Branch Cross-Modal Calibration for RGB-D Salient Instance Segmentation52
Fine-Grained Essential Tensor Learning for Robust Multi-View Spectral Clustering52
Change Representation and Extraction in Stripes: Rethinking Unsupervised Hyperspectral Image Change Detection With an Untrained Network51
CWSCNet: Channel-Weighted Skip Connection Network for Underwater Object Detection51
TransVQA: Transferable Vector Quantization Alignment for Unsupervised Domain Adaptation49
Deep-Based Film Grain Removal and Synthesis49
Pattern-Based Reconstruction of K-Level Images From Cutsets48
Geometry-Aware Deep Video Deblurring via Recurrent Feature Refinement48
Contrastive Proposal Extension With LSTM Network for Weakly Supervised Object Detection47
Discriminative Style Learning for Cross-Domain Image Captioning47
Learning Virtual View Selection for 3D Scene Semantic Segmentation46
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization46
DO-SA&R: Distant Object Augmented Set Abstraction and Regression for Point-Based 3D Object Detection46
Segmentation-Free Velocity Field Super-Resolution on 4D Flow MRI45
Learning to Generate Parameters of ConvNets for Unseen Image Data45
Adapting Vision-Language Models via Learning to Inject Knowledge44
Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos44
CPI-Parser: Integrating Causal Properties Into Multiple Human Parsing43
Fine-Grained Recognition With Learnable Semantic Data Augmentation43
MFNet: A Novel GNN-Based Multi-Level Feature Network With Superpixel Priors42
Automatic Quaternion-Domain Color Image Stitching42
Composition and Style Attributes Guided Image Aesthetic Assessment42
Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization42
Uncertainty Modeling for Gaze Estimation41
Comprehensive Attribute Prediction Learning for Person Search by Language41
Enhancing Person Re-Identification Performance Through In Vivo Learning41
Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification41
Exploring Spatial Correlation for Light Field Saliency Detection: Expansion From a Single View41
Fast Projected Fuzzy Clustering With Anchor Guidance for Multimodal Remote Sensing Imagery41
DCT2net: An Interpretable Shallow CNN for Image Denoising40
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors40
Neural Reference Synthesis for Inter Frame Coding40
MetaLabelNet: Learning to Generate Soft-Labels From Noisy-Labels39
IEEE Transactions on Image Processing Publication Information39
Graph Embedding Interclass Relation-Aware Adaptive Network for Cross-Scene Classification of Multisource Remote Sensing Data39
EGRC-Net: Embedding-Induced Graph Refinement Clustering Network39
Learning a Deep Demosaicing Network for Spike Camera With Color Filter Array39
Learning a Locally Unified 3D Point Cloud for View Synthesis39
Angular Isotonic Loss Guided Multi-Layer Integration for Few-Shot Fine-Grained Image Classification39
Exploring Multi-Modal Spatial–Temporal Contexts for High-Performance RGB-T Tracking37
Deep Hypersphere Feature Regularization for Weakly Supervised RGB-D Salient Object Detection37
Weakly Supervised Visual Saliency Prediction37
Learning a Prototype Discriminator With RBF for Multimodal Image Synthesis37
Locality-Aware Channel-Wise Dropout for Occluded Face Recognition37
Unfolded Proximal Neural Networks for Robust Image Gaussian Denoising37
Learning Feature Channel Weighting for Real-Time Visual Tracking36
Additivity Constrained Linearisation of Camera Calibration Data36
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate35
A Robust Movement Quantification Algorithm of Hyperactivity Detection for ADHD Children Based on 3D Depth Images35
Semantic-Aware Modular Capsule Routing for Visual Question Answering35
Saliency Guided Deep Neural Network for Color Transfer With Light Optimization34
Toward Projected Clustering With Aggregated Mapping34
Inference-Domain Network Evolution: A New Perspective for One-Shot Multi-Object Tracking34
Correntropy-Induced Wasserstein GCN: Learning Graph Embedding via Domain Adaptation33
Locality-Guided Global-Preserving Optimization for Robust Feature Matching33
Weakly-Supervised RGBD Video Object Segmentation33
Motion-Compensated Predictive RAHT for Dynamic Point Clouds33
Anycost Network Quantization for Image Super-Resolution33
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering33
Progressive Transfer Learning32
Relationship-Guided Knowledge Transfer for Class-Incremental Facial Expression Recognition32
Scale-Consistent Fusion: From Heterogeneous Local Sampling to Global Immersive Rendering32
DRNet: Double Recalibration Network for Few-Shot Semantic Segmentation32
Ray-Space Motion Compensation for Lenslet Plenoptic Video Coding32
VPU: A Video-Based Point Cloud Upsampling Framework32
Learning to Discover Knowledge: A Weakly-Supervised Partial Domain Adaptation Approach32
Exploring Long- and Short-Range Temporal Information for Learned Video Compression32
JNMR: Joint Non-Linear Motion Regression for Video Frame Interpolation32
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining31
AFT: Adaptive Fusion Transformer for Visible and Infrared Images31
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments31
Learning-Based Rate Control for Video-Based Point Cloud Compression31
Triple-Level Model Inferred Collaborative Network Architecture for Video Deraining31
Perspectively Equivariant Keypoint Learning for Omnidirectional Images31
Deep Unrolled Low-Rank Tensor Completion for High Dynamic Range Imaging31
Dynamic Facial Expression Recognition Under Partial Occlusion With Optical Flow Reconstruction31
A Machine Learning Approach to Design of Aperiodic, Clustered-Dot Halftone Screens via Direct Binary Search31
Fine-Grained Video Retrieval With Scene Sketches31
Revisiting Domain-Adaptive Semantic Segmentation via Knowledge Distillation30
MERF: A Practical HDR-Like Image Generator via Mutual-Guided Learning Between Multi-Exposure Registration and Fusion30
MagConv: Mask-Guided Convolution for Image Inpainting30
RGB-Guided Depth Map Recovery by Two-Stage Coarse-to-Fine Dense CRF Models30
Multiview Spectral Clustering With Bipartite Graph30
Wavelet-Guided Promotion-Suppression Transformer for Surface-Defect Detection30
Gloss Prior Guided Visual Feature Learning for Continuous Sign Language Recognition30
Latent Space Semantic Supervision Based on Knowledge Distillation for Cross-Modal Retrieval30
Cross-Modality Pyramid Alignment for Visual Intention Understanding30
Scalable Face Image Coding via StyleGAN Prior: Toward Compression for Human-Machine Collaborative Vision30
Subjective and Objective Audio-Visual Quality Assessment for User Generated Content29
Semi-Supervised Learning With Heterogeneous Distribution Consistency for Visible Infrared Person Re-Identification28
Multimodal Unrolled Robust PCA for Background Foreground Separation28
Learning Shadow Removal From Unpaired Samples via Reciprocal Learning28
User-Guided Deep Human Image Matting Using Arbitrary Trimaps28
A Deep Stochastic Adaptive Fourier Decomposition Network for Hyperspectral Image Classification28
Frequency Information Disentanglement Network for Video-Based Person Re-Identification28
A Geodesic Translation Model for Spherical Video Compression28
MuTrans: Multiple Transformers for Fusing Feature Pyramid on 2D and 3D Object Detection28
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion28
R-PointHop: A Green, Accurate, and Unsupervised Point Cloud Registration Method28
Single-Image-Based Deep Learning for Segmentation of Early Esophageal Cancer Lesions27
Dual Alternating Direction Method of Multipliers for Inverse Imaging27
Multi-Biometric Unified Network for Cloth-Changing Person Re-Identification27
Spectral Clustering Super-Resolution Imaging Based on Multispectral Camera Array27
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond27
Defending Against Multiple and Unforeseen Adversarial Videos27
Facial Prior Guided Micro-Expression Generation26
ShaTure: Shape and Texture Deformation for Human Pose and Attribute Transfer26
Deep Multi-Exposure Image Fusion for Dynamic Scenes26
Pose-Appearance Relational Modeling for Video Action Recognition26
Effective Multimodal Encoding for Image Paragraph Captioning26
Variational Structured Attention Networks for Deep Visual Representation Learning26
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting26
Super-Resolution Phase Retrieval Network for Single-Pattern Structured Light 3D Imaging26
Exploiting Non-Local Priors via Self-Convolution for Highly-Efficient Image Restoration26
Guided Filter Network for Semantic Image Segmentation26
Local Orthogonal Moments for Local Features25
A Closer Look at the Joint Training of Object Detection and Re-Identification in Multi-Object Tracking25
Rebalanced Zero-Shot Learning25
6D-ViT: Category-Level 6D Object Pose Estimation via Transformer-Based Instance Representation Learning25
Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation25
Co-Learning Meets Stitch-Up for Noisy Multi-Label Visual Recognition25
Physics-Informed Compressed Sensing for PC-MRI: An Inverse Navier-Stokes Problem25
Self-Parameter Distillation Dehazing25
SWFormer: Stochastic Windows Convolutional Transformer for Hybrid Modality Hyperspectral Classification24
CLIP4STR: A Simple Baseline for Scene Text Recognition With Pre-Trained Vision-Language Model24
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences24
Unsupervised Meta Learning With Multiview Constraints for Hyperspectral Image Small Sample set Classification24
Pro-UIGAN: Progressive Face Hallucination From Occluded Thumbnails24
Concept-Aware Video Captioning: Describing Videos With Effective Prior Information24
A General Gaussian Heatmap Label Assignment for Arbitrary-Oriented Object Detection24
Rotation-Invariant Attention Network for Hyperspectral Image Classification24
Learning Frame-Event Fusion for Motion Deblurring24
CBNet: A Composite Backbone Network Architecture for Object Detection23
DTCM: Joint Optimization of Dark Enhancement and Action Recognition in Videos23
A General Dynamic Knowledge Distillation Method for Visual Analytics23
MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers23
Ingredient Prediction via Context Learning Network With Class-Adaptive Asymmetric Loss23
PFDN: Pyramid Feature Decoupling Network for Single Image Deraining23
Weakly-Supervised Salient Object Detection on Light Fields23
Revisiting the Regularizers in Blind Image Deblurring With a New One23
A Gating Model for Bias Calibration in Generalized Zero-shot Learning23
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization22
GMLight: Lighting Estimation via Geometric Distribution Approximation22
BANet: A Blur-Aware Attention Network for Dynamic Scene Deblurring22
Feature Aggregation and Propagation Network for Camouflaged Object Detection22
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation22
PUFA-GAN: A Frequency-Aware Generative Adversarial Network for 3D Point Cloud Upsampling22
Prototype Adaption and Projection for Few- and Zero-Shot 3D Point Cloud Semantic Segmentation22
JigsawGAN: Auxiliary Learning for Solving Jigsaw Puzzles With Generative Adversarial Networks22
Wavelet-Based Texture Reformation Network for Image Super-Resolution22
Spatial-Temporal Pyramid Graph Reasoning for Action Recognition22
Feedback Graph Convolutional Network for Skeleton-Based Action Recognition22
EDN: Salient Object Detection via Extremely-Downsampled Network22
Attention-Guided Progressive Neural Texture Fusion for High Dynamic Range Image Restoration22
Dynamic Neural Network for Lossy-to-Lossless Image Coding21
Improving Embedding Generalization in Few-Shot Learning With Instance Neighbor Constraints21
HGR-Net: Hierarchical Graph Reasoning Network for Arbitrary Shape Scene Text Detection21
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval21
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition21
Plug-and-Play Regulators for Image-Text Matching21
Event-Aware Video Deraining via Multi-Patch Progressive Learning21
Polarization Guided HDR Reconstruction via Pixel-Wise Depolarization21
From Global to Local: Multi-Scale Out-of-Distribution Detection21
Regular Splitting Graph Network for 3D Human Pose Estimation21
Single Image Super-Resolution Quality Assessment: A Real-World Dataset, Subjective Studies, and an Objective Metric21
Sampling Equivariant Self-Attention Networks for Object Detection in Aerial Images21
A Dual-Branch Self-Boosting Framework for Self-Supervised 3D Hand Pose Estimation21
BLPSeg: Balance the Label Preference in Scribble-Supervised Semantic Segmentation21
NIM-Nets: Noise-Aware Incomplete Multi-View Learning Networks21
A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images21
ECSU-Net: An Embedded Clustering Sliced U-Net Coupled With Fusing Strategy for Efficient Intervertebral Disc Segmentation and Classification21
Canonical Correlation Analysis With Low-Rank Learning for Image Representation21
Human Co-Parsing Guided Alignment for Occluded Person Re-Identification20
MM-Net: A MixFormer-Based Multi-Scale Network for Anatomical and Functional Image Fusion20
Dif-Fusion: Toward High Color Fidelity in Infrared and Visible Image Fusion With Diffusion Models20
Confusing Image Quality Assessment: Toward Better Augmented Reality Experience20
No-Reference Image Quality Assessment by Hallucinating Pristine Features20
SR-GNN: Spatial Relation-Aware Graph Neural Network for Fine-Grained Image Categorization20
Color Image Recovery Using Low-Rank Quaternion Matrix Completion Algorithm20
GaitMPL: Gait Recognition With Memory-Augmented Progressive Learning20
Discrepancy-Aware Meta-Learning for Zero-Shot Face Manipulation Detection20
0.046406984329224