OOIR: Observatory of International Research

Papers

(The TQCC of IEEE Transactions on Image Processing is 16. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)

Article	Citations
Variational Structured Attention Networks for Deep Visual Representation Learning	989
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach	907
TSFormer: Efficient Ultra-High-Definition Image Restoration via Trusted Min- p	838
An Explanation Method Based on Interpretable Linear Model With Four Key Characteristics	729
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation	724
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting	587
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals	336
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing	299
Toward Efficient Test Time Adaptation With Hierarchical Distribution Alignment	296
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence	289
Global Modeling Matters: A Fast, Lightweight, and Effective Baseline for Efficient Image Restoration	266
Toward Projected Clustering With Aggregated Mapping	254
LearnMat: Semantic-Aware Self-Supervision Fine-Grained Visual Recognition	242
COME: A Collaborative Optimization Framework With Low-Rank MoE for Indoor 3D Object Detection	204
Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning	201
High-Fidelity Seismic Super-Resolution Using Prior-Informed Deep Learning With 3D Awareness	195
Zero-Pose-Prior NeRF: Recursive Radiance Field Reconstruction From Unposed and Unordered Images	190
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation	188
Advancing Pre-Trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection	178
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection	177
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models	174
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets	174
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments	172
Consensus Sparsity: Multi-Context Sparse Image Representation via L _∞-Induced Matrix Variate	161
Revisiting Fine-Grained Image Analysis by Semantic-Part Alignment	159

Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model	150
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering	147
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining	146
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency	141
Star-Shaped Multi-Person Interaction Graph Model for Group Skeleton-Based Action Recognition	129
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering	126
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering	118
Automatic Quaternion-Domain Color Image Stitching	116
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation	110
Cross-Modality Pyramid Alignment for Visual Intention Understanding	109
Fine-Grained Recognition With Learnable Semantic Data Augmentation	108
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments	105
Leveraging Feature Alignment in Grassmannian Manifold for Multi-output Regression Tasks	104
Spatial Frequency Modulation Network for Efficient Image Dehazing	103
Pose-Appearance Relational Modeling for Video Action Recognition	102
Harnessing Multi-Modal Large Language Models for Measuring and Interpreting Color Differences	101
Cross-Domain Few-Shot Medical Image Segmentation via Dynamic Semantic Matching	98
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation	97
Focus on Finding Deepfakes: A Robust Proactive Detection Method Based on Orthogonal Moment Watermarking	96
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection	93
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment	92
Advances in Predictive RAHT for Geometric Point Cloud Compression	91
Inverse Image Frequency for Long-Tailed Image Recognition	90
Fast 3D Room Layout Estimation Based on Compact High-Level Representation	89
ASDTracker: Adaptively Sparse Detection With Attention-Guided Refinement for Efficient Multi-Object Tracking	87
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning	84
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression	83
Spatial-Temporal Scene Graph Generation for Open-Vocabulary Multiple Object Tracking	83
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching	81
Toward Generalizable Forgery Detection and Reasoning	81
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm	80
FD-SCU: Frequency Decomposition-Based Spectrum Collaborative Upsampling for Point Cloud Color Attribute	80
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction	79
TSCCD: Temporal Self-Construction Cross-Domain Learning for Unsupervised Hyperspectral Change Detection	79
ScaleNet: Scaling up Pretrained Neural Networks With Incremental Parameters	79
Stacked Deconvolutional Network for Semantic Segmentation	77
Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage and Sharing in LLMs	77
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning	77
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision	75
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching	75
MicroSDF: Microfacet-Driven Hybrid Neural SDFs for Mixed-Reflectance Surface Reconstruction	73
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement	73
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval	73
SRS: Siamese Reconstruction-Segmentation Network Based on Dynamic-Parameter Convolution	72
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring	72
Precise Facial Landmark Detection by Reference Heatmap Transformer	70
Soft Supervision Guided Spatial-Temporal Refinement Network For Video-based Visible-Infrared Person Re-Identification	70
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection	70
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation	70
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection	69

RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation	69
Rethinking Sampling Strategies for Unsupervised Person Re-Identification	69
Interactive Face Video Coding: A Generative Compression Framework	69
Variational Bayes Image Restoration With Compressive Autoencoders	68
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition	68
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction	68
MCIB: Multi-Modal Complementary Information Bottleneck for Hyperspectral and LiDAR Classification	68
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification	67
Point-Based Learnable Query Generator for Human–Object Interaction Detection	67
Motion and Appearance Decoupling Representation for Event Cameras	66
Toward Robust Alignment for Video Dehazing With Temporal Lookup Table	66
Distractor-Aware Event-Based Tracking	66
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing	65
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model	65
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation	65
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images	64
DCD-UIE: Decoupled Chromatic Diffusion Model for Underwater Image Enhancement	64
Learning Dynamic Prompts for All-in-One Image Restoration	64
FsaNet: Frequency Self-Attention for Semantic Segmentation	63
Multi-Exposure Image Fusion via Deformable Self-Attention	62
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation	61
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels	61
UVaT: Uncertainty Incorporated View-Aware Transformer for Robust Multi-View Classification	59
KSS-ICP: Point Cloud Registration Based on Kendall Shape Space	59
Bayesian Multifractal Image Segmentation	59
CGMNet: A Center-Pixel and Gated Mechanism-Based Attention Network for Hyperspectral Change Detection	59
RobustMat: Neural Diffusion for Street Landmark Patch Matching Under Challenging Environments	59
Fast Learning Radiance Fields by Shooting Much Fewer Rays	58
IMPRESS: Incomplete Human Motion Prediction via Motion Recovery and Structural-Semantic Fusion	58
Rich Action-Semantic Consistent Knowledge for Early Action Prediction	58
Counterfactual Risk Minimization for Out-of-Distribution Generalization	58
Learned Spherical Image Compression With Spherical Convolution-Self-Attention and Transformer Context Model	57
Exploring the Potential of Pooling Techniques for Universal Image Restoration	57
LB-PTQ: Effective Low-Bit Post-Training Quantization for Vision Transformers	57
PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates	56
HOPE: Enhanced Position Image Priors via High-Order Implicit Representations	56
Mutually Reinforcing Learning of Decoupled Degradation and Diffusion Enhancement for Unpaired Low-Light Image Lightening	56
Generalizing to Out-of-Sample Degradations via Model Reprogramming	56
Semi-Negative Contrastive Subclass Discriminative Network for Compositional Zero-Shot Learning	56
Energy-Based Domain Adaptation Without Intermediate Domain Dataset for Foggy Scene Segmentation	56
A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding	55
LNet: Lightweight Network for Driver Attention Estimation via Scene and Gaze Consistency	55
Reduced Biquaternion Dual-Branch Deraining U-Network via Multi-Attention Mechanism	54
Characteristic Mapping for Ellipse Detection Acceleration	53
High-Quality and Diverse Few-Shot Image Generation via Masked Discrimination	53
Rethinking Object Saliency Ranking: A Novel Whole-Flow Processing Paradigm	53
Perception-Inspired Network for Stereo Image Quality Assessment	52
IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting	52
Positional Encoding Image Prior	52
Noise Prior Knowledge Informed Bayesian Inference Network for Hyperspectral Super-Resolution	52
Semantic Representation and Attention Alignment for Graph Information Bottleneck in Video Summarization	51
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network	51
Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models	51
Unsupervised Domain Adaptation in Biomedical Images Segmentation With Guided Diffusion Generative Prior	51
Prompt to Restore, Restore to Prompt: Cyclic Prompting for Universal Adverse Weather Removal	51
Robust Ellipse Fitting Based on Maximum Correntropy Criterion With Variable Center	49
Uncertainty Quantification for Semi-Supervised Object Detection in Remote Sensing Images	49
Enhancing Few-Shot Out-of-Distribution Detection With Pre-Trained Model Features	48
Causal Inference Hashing for Long-Tailed Image Retrieval	48
Individual and Common Attack: Enhancing Transferability in VLP Models Through Modal Feature Exploitation	48
CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation	48
Cross-Modal Causal Representation Learning for Radiology Report Generation	47
Quality-Aware Spatio-Temporal Transformer Network for RGBT Tracking	47
Implicit-Explicit Integrated Representations for Multi-View Video Compression	47
NesTD-Net: Deep NESTA-Inspired Unfolding Network With Dual-Path Deblocking Structure for Image Compressive Sensing	46
Self-Anchored Progressive Framework With Noise Mitigation for Unsupervised Camouflaged Object Detection	46
Attack-Augmented Mixing-Contrastive Skeletal Representation Learning	46
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding	46
Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment	45
Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation	45
Joint Denoising-Demosaicking Network for Long-Wave Infrared Division-of-Focal-Plane Polarization Images With Mixed Noise Level Estimation	45
MaskFaceGAN: High-Resolution Face Editing With Masked GAN Latent Code Optimization	45
ReCoTR: Reducing Semantic Cognitive Shift via Dual-Consensus Token Compression for Remote Sensing Image-Text Retrieval	44
A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification	44
RoMo: Robust Unsupervised Multimodal Learning With Noisy Pseudo Labels	44
Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer’s Disease Diagnosis	44
Reviewer Summary for Transactions on Image Processing	43
CAS-ViT: Convolutional Additive Self-Attention Vision Transformers for Efficient Mobile Applications	43
HANeRV: Hierarchically Adaptive Neural Representation for Video Compression	43
Image Reconstruction for Accelerated MR Scan With Faster Fourier Convolutional Neural Networks	43

Multi-Scale Fusion and Decomposition Network for Single Image Deraining	43
Dynamic Slimmable Denoising Network	42
TransDiff: Unsupervised Non-Line-of-Sight Imaging With Aperture-Limited Relay Surfaces	42
Neural Scene Designer: Self-Styled Semantic Image Manipulation	42
LPATR-Net: Learnable Piecewise Affine Transformation Regression Assisted Data-Driven Dehazing Framework	42
Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy	42
Rethinking the Low-Light Video Enhancement: Benchmark Datasets and Methods	42
MotionPrior: Exploring Efficient Learning of Motion Concepts for Few-Shot Video Generation	41
Learning Transferable Conceptual Prototypes for Interpretable Unsupervised Domain Adaptation	41
UniEmoX: Cross-Modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception	41
DACESR: Degradation-Aware Conditional Embedding for Real-World Image Super-Resolution	41
BVSR-EvD: Blurry Video Space-Time Super-Resolution With Events via Diffusion Models	41
Unsupervised Domain Adaptive Object Detection via Semantic Consistency and Compactness Learning	41
SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses	41
Multi-Label Auroral Image Classification Based on CNN and Transformer	40
PCE-GAN: A Generative Adversarial Network for Point Cloud Attribute Quality Enhancement Based on Optimal Transport	40
Multi-Person Pose Tracking With Sparse Key-Point Flow Estimation and Hierarchical Graph Distance Minimization	40
Degraded Reference Image Quality Assessment	40
Long-Tailed and Inter-Class Homogeneity Matters in Multi-Class Weakly Supervised Tissue Segmentation of Histopathology Images	39
IEEE Transactions on Image Processing publication information	39
BP-NeRF: End-to-End Neural Radiance Fields for Sparse Images Without Camera Pose in Complex Scenes	39
MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection	39
Source-Guided Target Feature Reconstruction for Cross-Domain Classification and Detection	39
Bidirectional Cross-Modal Collaborative Alignment via Semantic-Guided Visual Embeddings for Partially Relevant Video Retrieval	39
Image-Level Adaptive Adversarial Ranking for Person Re-Identification	39
Learning Domain Invariant Representations for Generalizable Person Re-Identification	39
PointFormer: Keypoint-Guided Transformer for Simultaneous Nuclei Segmentation and Classification in Multi-Tissue Histology Images	39
Bayesian Nonnegative Tensor Completion With Automatic Rank Determination	39
Improving Unsupervised Ultrasonic Image Anomaly Detection via Frequency-Spatial Feature Filtering and Gaussian Mixture Modeling	39
Restoration of Images Taken Through a Dirty Window Using Optics-Guided Transformer	39
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation	38
FABNet: Frequency-Aware Binarized Network for Single Image Super-Resolution	38
Contrastive Conditional Latent Diffusion for Audio-Visual Segmentation	38
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation	37
Hierarchical Hashing Learning for Image Set Classification	37
Enhancing Target Recognition Performance in SSVEP-Based Brain–Computer Interfaces via Deep Neural Networks With Pyramid Squeeze Attention	37
CKD: Contrastive Knowledge Distillation From a Sample-Wise Perspective	37
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks	37
Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition	37
Rotational Convolution: Rethinking Convolution for Downside Fisheye Images	37
Interpretable Neural Networks for Video Separation: Deep Unfolding RPCA With Foreground Masking	37
BPMTrack: Multi-Object Tracking With Detection Box Application Pattern Mining	36
SDSFusion: A Semantic-Aware Infrared and Visible Image Fusion Network for Degraded Scenes	36
Exploiting Cross-Task Synergy via Frequency-Driven Hierarchical Learning for Multi-Task Dense Prediction	36
Hyperspectral Image Classification via Cascaded Spatial Cross-Attention Network	36
Enhancing Text-Based Person Retrieval by Combining Fused Representation and Reciprocal Learning With Adaptive Loss Refinement	36
Single Image Reflection Removal via Iterative Prompt Learning of Reflection Level	36
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches	36
SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification	36
PFONet: A Progressive Feedback Optimization Network for Lightweight Single Image Dehazing	35
Zero-Shot Camouflaged Object Detection	35
Deep Underwater Image Quality Assessment With Explicit Degradation Awareness Embedding	35
Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules	35
MBFQuant: A Multiplier-Bitwidth-Fixed, Mixed-Precision Quantization Method for Mobile CNN-Based Applications	35
DVMark: A Deep Multiscale Framework for Video Watermarking	35
COMBINER: Composed Image Retrieval Guided by Attribute-Based Neighbor Relations	35
U-Shape Transformer for Underwater Image Enhancement	35
One Step Diffusion-Based Super-Resolution With Time-Aware Distillation	35
U-N2C: A Dual Memory-Guided Disentanglement Framework for Unsupervised System Matrix Denoising in Magnetic Particle Imaging	35
Sensitivity Decouple Learning for Image Compression Artifacts Reduction	35
TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution	34
Diverse Target and Contribution Scheduling for Domain Generalization	34
Versatile Denoising-Based Approximate Message Passing for Compressive Sensing	34
Active Style-Content Dual-Branch Domain Adaptation for Semi-Supervised SAR Object Detection	34
Learning Structure Aware Deep Spectral Embedding	34
C-NeRF: Representing Scene Changes as Directional Consistency Difference-Based NeRF	34
Rethinking Generalized Zero-Shot Learning: A Synthesized Per-Instance Attribute Perspective	34
Hyperpixels: Flexible 4D Over-Segmentation for Dense and Sparse Light Fields	34
Broadcast-Gated Attention With Identity Adaptive Integration for Efficient Image Super-Resolution	34
Padé Neurons for Efficient Neural Models	34
Coupled Diffusion Posterior Sampling for Unsupervised Hyperspectral and Multispectral Images Fusion	34
An Efficient Transformer Based on Global and Local Self-Attention for Face Photo-Sketch Synthesis	34
Learn From Examples: In-Context Learning for Camouflaged Object Detection	34
SketchAging: Face Photo-Sketch Synthesis and Aging With Multi-Scale Feature Extraction	33
Linearly Transformed Color Guide for Low-Bitrate Diffusion-Based Image Compression	33
Interactive Learning of Intrinsic and Extrinsic Properties for All-Day Semantic Segmentation	33
Scale-Aware Crowd Counting Network With Annotation Error Modeling	33
ROOT: Region-Word Alignment With Partial Optimal Transport for Open-Vocabulary Object Detection	33
Interpretable Few-Shot Image Classification via Prototypical Concept-Guided Mixture of LoRA Experts	33
DisAVR: Disentangled Adaptive Visual Reasoning Network for Diagram Question Answering	33
BVI-VFI: A Video Quality Database for Video Frame Interpolation	33
DiffLLFace: Learning Alternate Illumination-Diffusion Adaptation for Low-Light Face Super-Resolution and Beyond	33
StreakNet-Arch: An Anti-Scattering Network-Based Architecture for Underwater Carrier LiDAR-Radar Imaging	33
Leveraging Frequency Analysis for Image Denoising Network Pruning	32
Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning	32
Tensor Wheel Decomposition: Theory and Application to Tensor Completion	32
Robust Palmprint Recognition via Multi-Stage Noisy Label Selection and Correction	32
Compression-Oriented Video Super-Resolution	32
Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing	32
Accurate 3D Measurement of Complex Texture Objects by Height Compensation Using a Dual-Projector Structure	32
Multimodal Composition Example Mining for Composed Query Image Retrieval	32
Siamese-DETR for Generic Multi-Object Tracking	32
Advancing Weakly-Supervised Change Detection in Satellite Images via Adversarial Class Prompting	32
Lightweight Deep Neural Networks for Ship Target Detection in SAR Imagery	31
Magi-Net: Meta Negative Network for Early Activity Prediction	31