OOIR: Observatory of International Research

Papers

(The median citation count of IEEE Transactions on Image Processing is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)

Article	Citations
Consensus Sparsity: Multi-Context Sparse Image Representation via L _∞-Induced Matrix Variate	755
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model	731
Dual Alternating Direction Method of Multipliers for Inverse Imaging	651
An Explanation Method Based on Interpretable Linear Model With Four Key Characteristics	640
Multiframe Joint Enhancement for Early Interlaced Videos	529
Toward Efficient Test Time Adaptation With Hierarchical Distribution Alignment	483
Cross-Domain Few-Shot Medical Image Segmentation via Dynamic Semantic Matching	471
Variational Structured Attention Networks for Deep Visual Representation Learning	421
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond	417
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing	381
GMLight: Lighting Estimation via Geometric Distribution Approximation	315
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence	301
Self-Supervised Matting-Specific Portrait Enhancement and Generation	268
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals	262
Canonical Correlation Analysis With Low-Rank Learning for Image Representation	249
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency	249
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection	223
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets	219
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation	214
Discrete Metric Learning for Fast Image Set Classification	212
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation	211
Automatic Quaternion-Domain Color Image Stitching	201
Multimodal Unrolled Robust PCA for Background Foreground Separation	200
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering	197
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting	194

Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation	193
Graph Convolutional Dictionary Selection With L₂_, ₚ Norm for Video Summarization	191
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering	185
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering	180
Toward Projected Clustering With Aggregated Mapping	178
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion	175
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition	175
Fine-Grained Recognition With Learnable Semantic Data Augmentation	169
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining	161
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors	160
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments	156
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal	151
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models	150
Differentiable SAR Renderer and Image-Based Target Reconstruction	141
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation	141
Cross-Modality Pyramid Alignment for Visual Intention Understanding	140
TSFormer: Efficient Ultra-High-Definition Image Restoration via Trusted Min- p	138
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment	137
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification	137
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation	137
Pose-Appearance Relational Modeling for Video Action Recognition	136
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences	135
Spatial Frequency Modulation Network for Efficient Image Dehazing	134
Real Image Denoising With a Locally-Adaptive Bitonic Filter	132
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments	131
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection	131
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach	131
Advances in Predictive RAHT for Geometric Point Cloud Compression	129
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval	129
Interactive Face Video Coding: A Generative Compression Framework	128
Variational Bayes Image Restoration With Compressive Autoencoders	127
Unsupervised Person Re-Identification With Stochastic Training Strategy	126
Grammar-Induced Wavelet Network for Human Parsing	125
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring	120
Distractor-Aware Event-Based Tracking	120
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation	118
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction	117
RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation	115
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction	112
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning	112
Video Moment Retrieval With Cross-Modal Neural Architecture Search	111
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning	109
ScaleNet: Scaling up Pretrained Neural Networks With Incremental Parameters	108
IMU-Assisted Online Video Background Identification	106
Learning Dynamic Prompts for All-in-One Image Restoration	106
Multi-Exposure Image Fusion via Deformable Self-Attention	105
SRS: Siamese Reconstruction-Segmentation Network Based on Dynamic-Parameter Convolution	105
Stacked Deconvolutional Network for Semantic Segmentation	102
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering	100
Fast 3D Room Layout Estimation Based on Compact High-Level Representation	100

Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection	99
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning	97
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion	97
KSS-ICP: Point Cloud Registration Based on Kendall Shape Space	96
FsaNet: Frequency Self-Attention for Semantic Segmentation	95
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation	93
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision	93
Inverse Image Frequency for Long-Tailed Image Recognition	92
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression	92
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data	92
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection	92
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching	91
Precise Facial Landmark Detection by Reference Heatmap Transformer	89
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression	88
Rethinking Sampling Strategies for Unsupervised Person Re-Identification	88
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching	88
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images	87
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm	87
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing	85
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition	84
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement	83
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification	83
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels	82
Motion and Appearance Decoupling Representation for Event Cameras	82
Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain	82
Point-Based Learnable Query Generator for Human–Object Interaction Detection	81
DUT: Learning Video Stabilization by Simply Watching Unstable Videos	81
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model	81
Weighted Feature Fusion of Convolutional Neural Network and Graph Attention Network for Hyperspectral Image Classification	81
Joint Local and Nonlocal Progressive Prediction for Versatile Video Coding	80
HQ2CL: A High-Quality Class Center Learning System for Deep Face Recognition	80
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation	80
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network	80
Learned Spherical Image Compression With Spherical Convolution-Self-Attention and Transformer Context Model	79
A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding	79
RobustMat: Neural Diffusion for Street Landmark Patch Matching Under Challenging Environments	79
Rethinking Object Saliency Ranking: A Novel Whole-Flow Processing Paradigm	78
HOPE: Enhanced Position Image Priors via High-Order Implicit Representations	78
UVaT: Uncertainty Incorporated View-Aware Transformer for Robust Multi-View Classification	78
Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models	76
Continual Referring Expression Comprehension via Dual Modular Memorization	75
Causal Inference Hashing for Long-Tailed Image Retrieval	75
Implicit-Explicit Integrated Representations for Multi-View Video Compression	74
Fuzzy Sparse Deviation Regularized Robust Principal Component Analysis	73
Noise Prior Knowledge Informed Bayesian Inference Network for Hyperspectral Super-Resolution	73
Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer’s Disease Diagnosis	72
A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification	72
Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment	71
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding	71
Enhancing Few-Shot Out-of-Distribution Detection With Pre-Trained Model Features	70
Image Reconstruction for Accelerated MR Scan With Faster Fourier Convolutional Neural Networks	70
Hierarchical Random Walker Segmentation for Large Volumetric Biomedical Images	70
One Sketch for All: One-Shot Personalized Sketch Segmentation	69
Energy-Based Domain Adaptation Without Intermediate Domain Dataset for Foggy Scene Segmentation	69
Uncertainty Quantification for Semi-Supervised Object Detection in Remote Sensing Images	68
Mutually Reinforcing Learning of Decoupled Degradation and Diffusion Enhancement for Unpaired Low-Light Image Lightening	68
Quality-Aware Spatio-Temporal Transformer Network for RGBT Tracking	68
Generalizing to Out-of-Sample Degradations via Model Reprogramming	67
Graph-Based Depth Denoising & Dequantization for Point Cloud Enhancement	67
Spatially Consistent Transformer for Colorization in Monochrome-Color Dual-Lens System	67
Hierarchical Superpixel Segmentation by Parallel CRTrees Labeling	66
Compact Representation and Reliable Classification Learning for Point-Level Weakly-Supervised Action Localization	66
Semantic Representation and Attention Alignment for Graph Information Bottleneck in Video Summarization	66
Semi-Supervised Domain Adaptive Structure Learning	64
Reduced Biquaternion Dual-Branch Deraining U-Network via Multi-Attention Mechanism	64
Prompt to Restore, Restore to Prompt: Cyclic Prompting for Universal Adverse Weather Removal	64
Robust Ellipse Fitting Based on Maximum Correntropy Criterion With Variable Center	63
MaskFaceGAN: High-Resolution Face Editing With Masked GAN Latent Code Optimization	63
Exploring the Potential of Pooling Techniques for Universal Image Restoration	62
Rich Action-Semantic Consistent Knowledge for Early Action Prediction	62
Fast Learning Radiance Fields by Shooting Much Fewer Rays	62
Multi-Scale Fusion and Decomposition Network for Single Image Deraining	62
Joint Denoising-Demosaicking Network for Long-Wave Infrared Division-of-Focal-Plane Polarization Images With Mixed Noise Level Estimation	62
Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation	61
DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection	61
Bayesian Multifractal Image Segmentation	61
AAP-MIT: Attentive Atrous Pyramid Network and Memory Incorporated Transformer for Multisentence Video Description	61
High-Quality and Diverse Few-Shot Image Generation via Masked Discrimination	60
Cross-Modal Causal Representation Learning for Radiology Report Generation	60
LNet: Lightweight Network for Driver Attention Estimation via Scene and Gaze Consistency	60

Characteristic Mapping for Ellipse Detection Acceleration	60
PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates	60
NesTD-Net: Deep NESTA-Inspired Unfolding Network With Dual-Path Deblocking Structure for Image Compressive Sensing	60
FOVQA: Blind Foveated Video Quality Assessment	59
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation	59
Arbitrary-Scale Texture Generation From Coarse-Grained Control	59
Reviewer Summary for Transactions on Image Processing	59
Data Augmentation Using Bitplane Information Recombination Model	59
RoMo: Robust Unsupervised Multimodal Learning With Noisy Pseudo Labels	59
UniEmoX: Cross-Modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception	58
Model-Induced Generalization Error Bound for Information-Theoretic Representation Learning in Source-Data-Free Unsupervised Domain Adaptation	58
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation	58
SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses	58
Restoration of Images Taken Through a Dirty Window Using Optics-Guided Transformer	58
Image Compression Using Stochastic-AFD Based Multisignal Sparse Representation	58
Neural Scene Designer: Self-Styled Semantic Image Manipulation	58
TransDiff: Unsupervised Non-Line-of-Sight Imaging With Aperture-Limited Relay Surfaces	57
Rotational Convolution: Rethinking Convolution for Downside Fisheye Images	57
Bi-Directional Pseudo-Three-Dimensional Network for Video Frame Interpolation	57
Enhancing Text-Based Person Retrieval by Combining Fused Representation and Reciprocal Learning With Adaptive Loss Refinement	57
Deep Ranking Exemplar-Based Dynamic Scene Deblurring	57
Bayesian Nonnegative Tensor Completion With Automatic Rank Determination	56
Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy	56
Image-Level Adaptive Adversarial Ranking for Person Re-Identification	56
Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition	56
Sensitivity Decouple Learning for Image Compression Artifacts Reduction	56
PFONet: A Progressive Feedback Optimization Network for Lightweight Single Image Dehazing	56
LPATR-Net: Learnable Piecewise Affine Transformation Regression Assisted Data-Driven Dehazing Framework	56
Hierarchical Hashing Learning for Image Set Classification	56
Deep Underwater Image Quality Assessment With Explicit Degradation Awareness Embedding	56
Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules	55
PointFormer: Keypoint-Guided Transformer for Simultaneous Nuclei Segmentation and Classification in Multi-Tissue Histology Images	55
Contrastive Conditional Latent Diffusion for Audio-Visual Segmentation	55
BPMTrack: Multi-Object Tracking With Detection Box Application Pattern Mining	55
SSL++: Improving Self-Supervised Learning by Mitigating the Proxy Task-Specificity Problem	55
Degraded Reference Image Quality Assessment	55
DVMark: A Deep Multiscale Framework for Video Watermarking	55
Rethinking the Low-Light Video Enhancement: Benchmark Datasets and Methods	55
FABNet: Frequency-Aware Binarized Network for Single Image Super-Resolution	54
Zero-Shot Camouflaged Object Detection	54
Toward Scalable and Unified Example-Based Explanation and Outlier Detection	54
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks	53
MetaAge: Meta-Learning Personalized Age Estimators	53
Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction	53
Learning Transferable Conceptual Prototypes for Interpretable Unsupervised Domain Adaptation	53
CKD: Contrastive Knowledge Distillation From a Sample-Wise Perspective	52
Multi-Person Pose Tracking With Sparse Key-Point Flow Estimation and Hierarchical Graph Distance Minimization	52
Dynamic Slimmable Denoising Network	52
Attribute and State Guided Structural Embedding Network for Vehicle Re-Identification	52
Few-Shot Domain Adaptation via Mixup Optimal Transport	52
MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection	52
Sampling Agnostic Feature Representation for Long-Term Person Re-Identification	51
Source-Guided Target Feature Reconstruction for Cross-Domain Classification and Detection	51
Hyperspectral Image Classification via Cascaded Spatial Cross-Attention Network	51
A New Non-Linear Hyperbolic-Parabolic Coupled PDE Model for Image Despeckling	51
U-N2C: A Dual Memory-Guided Disentanglement Framework for Unsupervised System Matrix Denoising in Magnetic Particle Imaging	50
Interpretable Neural Networks for Video Separation: Deep Unfolding RPCA With Foreground Masking	50
Multi-Label Auroral Image Classification Based on CNN and Transformer	50
PCE-GAN: A Generative Adversarial Network for Point Cloud Attribute Quality Enhancement Based on Optimal Transport	50
View-Wise Versus Cluster-Wise Weight: Which Is Better for Multi-View Clustering?	49
Boosting Monocular 3D Human Pose Estimation With Part Aware Attention	49
U-Shape Transformer for Underwater Image Enhancement	49
DO-Conv: Depthwise Over-Parameterized Convolutional Layer	49
Multistage Spatio-Temporal Networks for Robust Sketch Recognition	49
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches	49
Underwater Image Enhancement via Minimal Color Loss and Locally Adaptive Contrast Enhancement	49
BVSR-EvD: Blurry Video Space-Time Super-Resolution With Events via Diffusion Models	49
Learning Domain Invariant Representations for Generalizable Person Re-Identification	49
Underwater Image Enhancement With Hyper-Laplacian Reflectance Priors	49
IEEE Transactions on Image Processing publication information	48
CartoonLossGAN: Learning Surface and Coloring of Images for Cartoonization	48
Leveraging Frequency Analysis for Image Denoising Network Pruning	48
TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution	48
MBFQuant: A Multiplier-Bitwidth-Fixed, Mixed-Precision Quantization Method for Mobile CNN-Based Applications	48
Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding	48
SDSFusion: A Semantic-Aware Infrared and Visible Image Fusion Network for Degraded Scenes	48
Coupled Splines for Sparse Curve Fitting	47
Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing	47
ROOT: Region-word Alignment with Partial Optimal Transport for Open-vocabulary Object Detection	47
Rethinking Generalized Zero-Shot Learning: A Synthesized Per-Instance Attribute Perspective	47
DisAVR: Disentangled Adaptive Visual Reasoning Network for Diagram Question Answering	47
Linearly Transformed Color Guide for Low-Bitrate Diffusion-Based Image Compression	47
Learning to Compare Relation: Semantic Alignment for Few-Shot Learning	46
Magi-Net: Meta Negative Network for Early Activity Prediction	46
SketchAging: Face Photo-Sketch Synthesis and Aging With Multi-Scale Feature Extraction	46
Multimodal Composition Example Mining for Composed Query Image Retrieval	46
Hyperpixels: Flexible 4D Over-Segmentation for Dense and Sparse Light Fields	45
Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature Alignment	45
Toward Transparent Deep Image Aesthetics Assessment With Tag-Based Content Descriptors	45
Enhancing Multimodal Learning via Hierarchical Fusion Architecture Search With Inconsistency Mitigation	45
Siamese-DETR for Generic Multi-Object Tracking	45
Scale-Aware Crowd Counting Network With Annotation Error Modeling	45
SUIT: Spatial-Spectral Union-Intersection Interaction Network for Hyperspectral Object Tracking	45
Diverse Target and Contribution Scheduling for Domain Generalization	44
Improving Transferability of Universal Adversarial Perturbation With Feature Disruption	44