IEEE Transactions on Image Processing

Papers
(The TQCC of IEEE Transactions on Image Processing is 16. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
ArticleCitations
Variational Structured Attention Networks for Deep Visual Representation Learning952
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach872
TSFormer: Efficient Ultra-High-Definition Image Restoration via Trusted Min- p794
An Explanation Method Based on Interpretable Linear Model With Four Key Characteristics693
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation689
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting551
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals317
Focus on Finding Deepfakes: A Robust Proactive Detection Method Based on Orthogonal Moment Watermarking294
Cross-Domain Few-Shot Medical Image Segmentation via Dynamic Semantic Matching291
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing279
Harnessing Multi-Modal Large Language Models for Measuring and Interpreting Color Differences257
Toward Efficient Test Time Adaptation With Hierarchical Distribution Alignment244
Pose-Appearance Relational Modeling for Video Action Recognition235
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence195
Global Modeling Matters: A Fast, Lightweight, and Effective Baseline for Efficient Image Restoration191
Toward Projected Clustering With Aggregated Mapping186
LearnMat: Semantic-Aware Self-Supervision Fine-Grained Visual Recognition179
COME: A Collaborative Optimization Framework With Low-Rank MoE for Indoor 3D Object Detection179
Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning172
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate171
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets171
Revisiting Fine-Grained Image Analysis by Semantic-Part Alignment167
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering164
High-Fidelity Seismic Super-Resolution Using Prior-Informed Deep Learning With 3D Awareness154
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection153
Zero-Pose-Prior NeRF: Recursive Radiance Field Reconstruction From Unposed and Unordered Images148
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency143
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation143
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering139
Advancing Pre-trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection126
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering121
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments114
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment114
Fine-Grained Recognition With Learnable Semantic Data Augmentation106
Star-Shaped Multi-Person Interaction Graph Model for Group Skeleton-Based Action Recognition104
Automatic Quaternion-Domain Color Image Stitching101
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model101
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection100
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models100
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation100
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation98
Spatial Frequency Modulation Network for Efficient Image Dehazing95
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining94
Cross-Modality Pyramid Alignment for Visual Intention Understanding93
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments91
Inverse Image Frequency for Long-Tailed Image Recognition87
Advances in Predictive RAHT for Geometric Point Cloud Compression87
Fast 3D Room Layout Estimation Based on Compact High-Level Representation87
ASDTracker: Adaptively Sparse Detection With Attention-Guided Refinement for Efficient Multi-Object Tracking86
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning83
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching82
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning81
Precise Facial Landmark Detection by Reference Heatmap Transformer79
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression78
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation78
Spatial-Temporal Scene Graph Generation for Open-Vocabulary Multiple Object Tracking78
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching77
Toward Generalizable Forgery Detection and Reasoning77
Rethinking Sampling Strategies for Unsupervised Person Re-Identification76
FD-SCU: Frequency Decomposition-Based Spectrum Collaborative Upsampling for Point Cloud Color Attribute75
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm74
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction74
ScaleNet: Scaling up Pretrained Neural Networks With Incremental Parameters73
Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage and Sharing in LLMs73
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation73
TSCCD: Temporal Self-Construction Cross-Domain Learning for Unsupervised Hyperspectral Change Detection73
SRS: Siamese Reconstruction-Segmentation Network Based on Dynamic-Parameter Convolution71
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision71
Point-Based Learnable Query Generator for Human–Object Interaction Detection70
Distractor-Aware Event-Based Tracking69
Stacked Deconvolutional Network for Semantic Segmentation69
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification69
MCIB: Multi-Modal Complementary Information Bottleneck for Hyperspectral and LiDAR Classification68
Learning Dynamic Prompts for All-in-One Image Restoration67
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition67
Interactive Face Video Coding: A Generative Compression Framework66
Variational Bayes Image Restoration With Compressive Autoencoders66
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing66
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction65
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement65
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval64
MicroSDF: Microfacet-Driven Hybrid Neural SDFs for Mixed-Reflectance Surface Reconstruction64
Motion and Appearance Decoupling Representation for Event Cameras64
KSS-ICP: Point Cloud Registration Based on Kendall Shape Space64
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection63
RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation63
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels62
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model61
Multi-Exposure Image Fusion via Deformable Self-Attention61
DCD-UIE: Decoupled Chromatic Diffusion Model for Underwater Image Enhancement61
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images61
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation61
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring61
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection60
FsaNet: Frequency Self-Attention for Semantic Segmentation60
Learned Spherical Image Compression With Spherical Convolution-Self-Attention and Transformer Context Model59
CGMNet: A Center-Pixel and Gated Mechanism-Based Attention Network for Hyperspectral Change Detection58
RobustMat: Neural Diffusion for Street Landmark Patch Matching Under Challenging Environments58
UVaT: Uncertainty Incorporated View-Aware Transformer for Robust Multi-View Classification58
Bayesian Multifractal Image Segmentation58
Energy-Based Domain Adaptation Without Intermediate Domain Dataset for Foggy Scene Segmentation57
Rich Action-Semantic Consistent Knowledge for Early Action Prediction57
HOPE: Enhanced Position Image Priors via High-Order Implicit Representations57
Attack-Augmented Mixing-Contrastive Skeletal Representation Learning57
CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation56
PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates56
Generalizing to Out-of-Sample Degradations via Model Reprogramming56
Rethinking Object Saliency Ranking: A Novel Whole-Flow Processing Paradigm55
Characteristic Mapping for Ellipse Detection Acceleration55
High-Quality and Diverse Few-Shot Image Generation via Masked Discrimination54
Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer’s Disease Diagnosis54
Positional Encoding Image Prior53
IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting53
Semi-Negative Contrastive Subclass Discriminative Network for Compositional Zero-Shot Learning52
Image Reconstruction for Accelerated MR Scan With Faster Fourier Convolutional Neural Networks52
Mutually Reinforcing Learning of Decoupled Degradation and Diffusion Enhancement for Unpaired Low-Light Image Lightening52
Implicit-Explicit Integrated Representations for Multi-View Video Compression51
LNet: Lightweight Network for Driver Attention Estimation via Scene and Gaze Consistency51
Quality-Aware Spatio-Temporal Transformer Network for RGBT Tracking51
A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding51
RoMo: Robust Unsupervised Multimodal Learning With Noisy Pseudo Labels51
Perception-Inspired Network for Stereo Image Quality Assessment51
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding50
Semantic Representation and Attention Alignment for Graph Information Bottleneck in Video Summarization49
Noise Prior Knowledge Informed Bayesian Inference Network for Hyperspectral Super-Resolution49
Self-Anchored Progressive Framework With Noise Mitigation for Unsupervised Camouflaged Object Detection49
Unsupervised Domain Adaptation in Biomedical Images Segmentation With Guided Diffusion Generative Prior49
Joint Denoising-Demosaicking Network for Long-Wave Infrared Division-of-Focal-Plane Polarization Images With Mixed Noise Level Estimation48
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network48
CAS-ViT: Convolutional Additive Self-Attention Vision Transformers for Efficient Mobile Applications48
Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment48
Fast Learning Radiance Fields by Shooting Much Fewer Rays48
Prompt to Restore, Restore to Prompt: Cyclic Prompting for Universal Adverse Weather Removal47
Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models47
Exploring the Potential of Pooling Techniques for Universal Image Restoration46
Uncertainty Quantification for Semi-Supervised Object Detection in Remote Sensing Images46
Multi-Scale Fusion and Decomposition Network for Single Image Deraining45
Enhancing Few-Shot Out-of-Distribution Detection With Pre-Trained Model Features45
Individual and Common Attack: Enhancing Transferability in VLP Models Through Modal Feature Exploitation45
Robust Ellipse Fitting Based on Maximum Correntropy Criterion With Variable Center45
LB-PTQ: Effective Low-Bit Post-Training Quantization for Vision Transformers44
NesTD-Net: Deep NESTA-Inspired Unfolding Network With Dual-Path Deblocking Structure for Image Compressive Sensing44
MaskFaceGAN: High-Resolution Face Editing With Masked GAN Latent Code Optimization44
Causal Inference Hashing for Long-Tailed Image Retrieval44
Cross-Modal Causal Representation Learning for Radiology Report Generation44
Counterfactual Risk Minimization for Out-of-Distribution Generalization43
IMPRESS: Incomplete Human Motion Prediction via Motion Recovery and Structural-Semantic Fusion43
Reviewer Summary for Transactions on Image Processing42
Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation42
TransDiff: Unsupervised Non-Line-of-Sight Imaging With Aperture-Limited Relay Surfaces42
A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification42
Neural Scene Designer: Self-Styled Semantic Image Manipulation42
Reduced Biquaternion Dual-Branch Deraining U-Network via Multi-Attention Mechanism42
Image-Level Adaptive Adversarial Ranking for Person Re-Identification41
MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection41
LPATR-Net: Learnable Piecewise Affine Transformation Regression Assisted Data-Driven Dehazing Framework41
Unsupervised Domain Adaptive Object Detection via Semantic Consistency and Compactness Learning41
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches41
Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy41
IEEE Transactions on Image Processing publication information41
DACESR: Degradation-Aware Conditional Embedding for Real-World Image Super-Resolution41
Dynamic Slimmable Denoising Network40
Multi-Label Auroral Image Classification Based on CNN and Transformer40
Rethinking the Low-Light Video Enhancement: Benchmark Datasets and Methods40
Multi-Person Pose Tracking With Sparse Key-Point Flow Estimation and Hierarchical Graph Distance Minimization40
BVSR-EvD: Blurry Video Space-Time Super-Resolution With Events via Diffusion Models39
Improving Unsupervised Ultrasonic Image Anomaly Detection via Frequency-Spatial Feature Filtering and Gaussian Mixture Modeling39
Bayesian Nonnegative Tensor Completion With Automatic Rank Determination39
MBFQuant: A Multiplier-Bitwidth-Fixed, Mixed-Precision Quantization Method for Mobile CNN-Based Applications39
Degraded Reference Image Quality Assessment39
Enhancing Text-Based Person Retrieval by Combining Fused Representation and Reciprocal Learning With Adaptive Loss Refinement39
PCE-GAN: A Generative Adversarial Network for Point Cloud Attribute Quality Enhancement Based on Optimal Transport39
Bidirectional Cross-Modal Collaborative Alignment via Semantic-Guided Visual Embeddings for Partially Relevant Video Retrieval39
BP-NeRF: End-to-End Neural Radiance Fields for Sparse Images Without Camera Pose in Complex Scenes38
Restoration of Images Taken Through a Dirty Window Using Optics-Guided Transformer38
Interpretable Neural Networks for Video Separation: Deep Unfolding RPCA With Foreground Masking38
Hierarchical Hashing Learning for Image Set Classification38
U-N2C: A Dual Memory-Guided Disentanglement Framework for Unsupervised System Matrix Denoising in Magnetic Particle Imaging37
SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses37
MotionPrior: Exploring Efficient Learning of Motion Concepts for Few-Shot Video Generation37
UniEmoX: Cross-Modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception37
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks37
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation36
Learning Transferable Conceptual Prototypes for Interpretable Unsupervised Domain Adaptation36
Sensitivity Decouple Learning for Image Compression Artifacts Reduction36
Source-Guided Target Feature Reconstruction for Cross-Domain Classification and Detection36
Long-Tailed and Inter-Class Homogeneity Matters in Multi-Class Weakly Supervised Tissue Segmentation of Histopathology Images36
PFONet: A Progressive Feedback Optimization Network for Lightweight Single Image Dehazing36
FABNet: Frequency-Aware Binarized Network for Single Image Super-Resolution36
BPMTrack: Multi-Object Tracking With Detection Box Application Pattern Mining36
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation36
CKD: Contrastive Knowledge Distillation From a Sample-Wise Perspective36
Enhancing Target Recognition Performance in SSVEP-Based Brain–Computer Interfaces via Deep Neural Networks with Pyramid Squeeze Attention35
Learning Domain Invariant Representations for Generalizable Person Re-Identification35
Hyperspectral Image Classification via Cascaded Spatial Cross-Attention Network35
DVMark: A Deep Multiscale Framework for Video Watermarking35
Rotational Convolution: Rethinking Convolution for Downside Fisheye Images35
Deep Underwater Image Quality Assessment With Explicit Degradation Awareness Embedding35
SDSFusion: A Semantic-Aware Infrared and Visible Image Fusion Network for Degraded Scenes34
Contrastive Conditional Latent Diffusion for Audio-Visual Segmentation34
One Step Diffusion-Based Super-Resolution With Time-Aware Distillation34
Padé Neurons for Efficient Neural Models34
Diverse Target and Contribution Scheduling for Domain Generalization34
Magi-Net: Meta Negative Network for Early Activity Prediction34
Zero-Shot Camouflaged Object Detection34
Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition34
U-Shape Transformer for Underwater Image Enhancement34
Learn From Examples: In-Context Learning for Camouflaged Object Detection34
Learning Structure Aware Deep Spectral Embedding34
TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution34
Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules34
PointFormer: Keypoint-Guided Transformer for Simultaneous Nuclei Segmentation and Classification in Multi-Tissue Histology Images34
Image Copy-Move Forgery Detection via Deep PatchMatch and Pairwise Ranking Learning34
Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature Alignment34
SketchAging: Face Photo-Sketch Synthesis and Aging With Multi-Scale Feature Extraction33
Versatile Denoising-Based Approximate Message Passing for Compressive Sensing33
Hyperpixels: Flexible 4D Over-Segmentation for Dense and Sparse Light Fields33
DisAVR: Disentangled Adaptive Visual Reasoning Network for Diagram Question Answering33
C-NeRF: Representing Scene Changes as Directional Consistency Difference-Based NeRF33
Rethinking Generalized Zero-Shot Learning: A Synthesized Per-Instance Attribute Perspective33
Active Style-Content Dual-Branch Domain Adaptation for Semi-Supervised SAR Object Detection33
Completing Missing Entities: Exploring Consistency Reasoning for Remote Sensing Object Detection33
StreakNet-Arch: An Anti-Scattering Network-Based Architecture for Underwater Carrier LiDAR-Radar Imaging33
Advancing Weakly-Supervised Change Detection in Satellite Images via Adversarial Class Prompting33
An Efficient Transformer Based on Global and Local Self-Attention for Face Photo-Sketch Synthesis33
Anomaly Detection for Medical Images Using Heterogeneous Auto-Encoder33
Interactive Learning of Intrinsic and Extrinsic Properties for All-Day Semantic Segmentation33
Coupled Diffusion Posterior Sampling for Unsupervised Hyperspectral and Multispectral Images Fusion32
Interpretable Few-Shot Image Classification via Prototypical Concept-Guided Mixture of LoRA Experts32
BVI-VFI: A Video Quality Database for Video Frame Interpolation32
Hierarchical Prior-Based Super Resolution for Point Cloud Geometry Compression32
ROOT: Region-Word Alignment With Partial Optimal Transport for Open-Vocabulary Object Detection32
Broadcast-Gated Attention With Identity Adaptive Integration for Efficient Image Super-Resolution32
Scale-Aware Crowd Counting Network With Annotation Error Modeling32
DiffLLFace: Learning Alternate Illumination-Diffusion Adaptation for Low-Light Face Super-Resolution and Beyond32
Robust Palmprint Recognition via Multi-Stage Noisy Label Selection and Correction31
Leveraging Frequency Analysis for Image Denoising Network Pruning31
Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing31
Lightweight Deep Neural Networks for Ship Target Detection in SAR Imagery31
Multi-Branch and Progressive Network for Low-Light Image Enhancement31
Compression-Oriented Video Super-Resolution31
Linearly Transformed Color Guide for Low-Bitrate Diffusion-Based Image Compression31
YOLOH: You Only Look One Hourglass for Real-Time Object Detection31
Action Quality Assessment via Hierarchical Pose-Guided Multi-Stage Contrastive Regression31
Enhancing Multimodal Learning via Hierarchical Fusion Architecture Search With Inconsistency Mitigation30
Underdetermined Blind Source Separation via Weighted Simplex Shrinkage Regularization and Quantum Deep Image Prior30
Toward Transparent Deep Image Aesthetics Assessment With Tag-Based Content Descriptors30
Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning30
Tensor Wheel Decomposition: Theory and Application to Tensor Completion30
Improving Transferability of Universal Adversarial Perturbation With Feature Disruption30
0.069322109222412