OOIR: Observatory of International Research

Papers

(The TQCC of IEEE Transactions on Pattern Analysis and Machine Intelligence is 22. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)

Article	Citations
Learn to Predict Sets Using Feed-Forward Neural Networks	2865
DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus	2478
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification	1382
Modeling Noisy Annotations for Point-Wise Supervision	1361
Cover	1310
Editorial Board	1285
[Back cover - Table of contents, continued]	1241
Front Cover	1211
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images	1202
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference	904
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing	851
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems	783
One-for-All: Towards Universal Domain Translation With a Single StyleGAN	662
Deep Non-Rigid Structure From Motion With Missing Data	659
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion	577
Enhancing Representations Through Heterogeneous Self-Supervised Learning	577
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning	541
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search	494
Quadratic Matrix Factorization With Applications to Manifold Learning	472
A Hybrid Stochastic-Deterministic Minibatch Proximal Gradient Method for Efficient Optimization and Generalization	469
Interactive NeRF Geometry Editing With Shape Priors	460
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting	429
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation	429
Learning Graph Convolutional Networks for Multi-Label Recognition and Applications	418
A Clustering Validity Index with Multi-Granularity Fusion for Multiple Fuzzy Clustering Algorithms	417

Instance Shadow Detection with A Single-Stage Detector	399
ResNet-LDDMM: Advancing the LDDMM Framework using Deep Residual Networks	393
Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures	383
Graph Convolutional Module for Temporal Action Localization in Videos	378
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification	368
Event-based Photometric Bundle Adjustment	367
Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes	367
Prior Image Guided Snapshot Compressive Spectral Imaging	365
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting	356
Towards Accurate and Compact Architectures via Neural Architecture Transformer	355
Invariant Policy Learning: A Causal Perspective	352
A Generative Model for Generic Light Field Reconstruction	345
Structure-Preserving Image Super-Resolution	342
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting	323
Affective Image Content Analysis: Two Decades Review and New Perspectives	313
Towards Unified Deep Image Deraining: A Survey and a New Benchmark	305
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models	305
Centerless Clustering	300
Physics-Informed Guided Disentanglement in Generative Networks	297
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks	295
VATr++: Choose Your Words Wisely for Handwritten Text Generation	294
Locating and Counting Heads in Crowds With a Depth Prior	289
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains	283
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models	282
Face Forgery Detection by 3D Decomposition and Composition Search	279
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach	277
Active Supervised Cross-Modal Retrieval	270
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation	259
On the Trade-off between Flatness and Optimization in Distributed Learning	259
Multi-Dataset, Multitask Learning of Egocentric Vision Tasks	258
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data	257
DVIS++: Improved Decoupled Framework for Universal Video Segmentation	243
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks	241
Sparse-to-Dense Matching Network for Large-Scale LiDAR Point Cloud Registration	236
Multi-Task Head Pose Estimation in-the-Wild	231
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation	227
Transformer-Based Visual Segmentation: A Survey	227
Deep Long-Tailed Learning: A Survey	223
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness	218
Face Generation and Editing With StyleGAN: A Survey	214
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching	214
Optimization-Based Post-Training Quantization With Bit-Split and Stitching	214
Are Graph Convolutional Networks With Random Weights Feasible?	211
Learning to Guide a Saturation-Based Theorem Prover	210
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution	210
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method	206
Unsupervised Domain Adaptation via Discriminative Manifold Propagation	198
Inferring Point Cloud Quality via Graph Similarity	197
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation	197
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition	195

Simplicial Complex Neural Networks	192
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation	188
Fast Component Tree Computation for Images of Limited Levels	187
Human-Centric Transformer for Domain Adaptive Action Recognition	186
Cover	185
Cover 2	185
Cover	183
Cover	179
Table of Contents	179
IEEE Computer Society Has You Covered!	177
TPAMI Information for Authors	176
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning	176
Point Set Registration for 3D Range Scans Using Fuzzy Cluster-Based Metric and Efficient Global Optimization	176
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets	175
A Variational EM Acceleration for Efficient Clustering at Very Large Scales	174
Image Lens Flare Removal Using Adversarial Curve Learning	173
MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network	173
Discriminant Feature Extraction by Generalized Difference Subspace	171
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation	169
Towards Pointsets Representation Learning via Self-Supervised Learning and Set Augmentation	167
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation	167
LMP-GAN: Out-of-Distribution Detection for Non-Control Data Malware Attacks	166
Universal Image Segmentation with Efficiency	165
Image-to-Image Translation With Disentangled Latent Vectors for Face Editing	164
Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes for Pruning is Possible Without Retraining	163
Accurate and Efficient Stereo Matching via Attention Concatenation Volume	160
On Positive-Unlabeled Classification From Corrupted Data in GANs	158
Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey	157
Hypergraph-Based Multi-View Action Recognition Using Event Cameras	154
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach	153
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks	152
Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification	151
Temporal Feature Matters: A Framework for Diffusion Model Quantization	150
Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification	149
Dynamic Self-Supervised Teacher-Student Network Learning	144
Bridging Actions: Generate 3D Poses and Shapes In-Between Photos	143
Learning Graph Attentions via Replicator Dynamics	142
Reduced-Rank Tensor-on-Tensor Regression and Tensor-Variate Analysis of Variance	140
Adaptive Transfer Kernel Learning for Transfer Gaussian Process Regression	136
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration	136
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning	135
Differentially Private Graph Neural Networks for Whole-Graph Classification	133
BNET: Batch Normalization With Enhanced Linear Transformation	132
Inter-Intra Hypergraph Computation for Survival Prediction on Whole Slide Images	131
VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision	129
Revisiting Nonlocal Self-Similarity from Continuous Representation	129
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition	128
Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification	127
On the Robustness of Average Losses for Partial-Label Learning	127
Correcting Optical Aberration via Depth-Aware Point Spread Functions	126
Asymmetric Loss Functions for Noise-Tolerant Learning: Theory and Applications	125
GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector	123
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning	123
GradMDM: Adversarial Attack on Dynamic Networks	122
Matrix Completion via Non-Convex Relaxation and Adaptive Correlation Learning	122
Domain Generalization: A Survey	121
From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing	119
Variational Data-Free Knowledge Distillation for Continual Learning	119
Enhancing Photorealism Enhancement	119
A Fully Automated Method for 3D Individual Tooth Identification and Segmentation in Dental CBCT	119
Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks	119
Interpretable Optimization-Inspired Unfolding Network for Low-Light Image Enhancement	118
Learning to See Through With Events	118
Unbiased Scene Graph Generation via Two-Stage Causal Modeling	118
Rate-Distortion Theory in Coding for Machines and Its Applications	118
Differential Viewpoints for Ground Terrain Material Recognition	116
Random Permutation Set Reasoning	114
Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond	114
Human Interaction Understanding With Consistency-Aware Learning	111
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking	110
Reconstruction Guided Meta-Learning for Few Shot Open Set Recognition	110
Robust Multimodal Learning With Missing Modalities via Parameter-Efficient Adaptation	109
Spatial-Temporal Transformer for Video Snapshot Compressive Imaging	109
SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics	108
Knowledge-Based Embodied Question Answering	108
Deep Learning for Face Anti-Spoofing: A Survey	108
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion	108
Compositional Scene Representation Learning via Reconstruction: A Survey	108
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation	107
Orientational Distribution Learning with Hierarchical Spatial Attention for Open Set Recognition	106

Self-Scalable Tanh (Stan): Multi-Scale Solutions for Physics-Informed Neural Networks	106
Self-Supervised Multimodal Learning: A Survey	103
Deep Learning-Based Point Cloud Compression: An In-Depth Survey and Benchmark	103
Advances and Challenges in Meta-Learning: A Technical Review	103
Deep Gait Recognition: A Survey	103
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving	103
GenPoly: Learning Generalized and Tessellated Shape Priors via 3D Polymorphic Evolving	102
AutoNovel: Automatically Discovering and Learning Novel Visual Categories	102
A New Brain Network Construction Paradigm for Brain Disorder via Diffusion-Based Graph Contrastive Learning	101
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap	101
A Style-Based Generator Architecture for Generative Adversarial Networks	100
ComputingEdge ad	100
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses	100
PathNet: Path-Selective Point Cloud Denoising	100
P2T: Pyramid Pooling Transformer for Scene Understanding	100
ONNXPruner: ONNX-Based General Model Pruning Adapter	99
JointFormer: A Unified Framework With Joint Modeling for Video Object Segmentation	98
luvHarris: A Practical Corner Detector for Event-Cameras	98
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning	97
Dynamic Differential Image Circle Diameter Measurement Precision Assessment: Application to Burning Droplets	97
Multi-Modality Multi-Attribute Contrastive Pre-Training for Image Aesthetics Computing	97
Human as Points: Explicit Point-Based 3D Human Reconstruction From Single-View RGB Images	97
Learning With Constraint Learning: New Perspective, Solution Strategy and Various Applications	97
Low-Shot Video Object Segmentation	97
Semi-Supervised Learning for FGVC With Out-of-Category Data	96
Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification	96
Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning	96
TN-ZSTAD: Transferable Network for Zero-Shot Temporal Activity Detection	94
Distributionally Location-Aware Transferable Adversarial Patches for Facial Images	94
Cover 3	94
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding	93
The Cluster Structure Function	93
Editorial: Special Section on Egocentric Perception	92
An Energy-Based Prior for Generative Saliency	92
Heterogeneous Feature Re-Sampling for Balanced Pedestrian Attribute Recognition	91
Scale Propagation Network for Generalizable Depth Completion	91
Reusable Architecture Growth for Continual Stereo Matching	91
Generalized Task-Driven Medical Image Quality Enhancement With Gradient Promotion	91
Test-time Training for Hyperspectral Image Super-resolution	90
The Bayesian Cut	89
Relationship Quantification of Image Degradations	89
Unified Adversarial Patch for Visible-Infrared Cross-Modal Attacks in the Physical World	89
Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI	89
STAR-FC: Structure-Aware Face Clustering on Ultra-Large-Scale Graphs	89
Stimulative Training++: Go Beyond the Performance Limits of Residual Networks	89
Continual Unsupervised Generative Modeling	88
Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation	88
LCBM: A Multi-View Probabilistic Model for Multi-Label Classification	88
Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation	87
Supervision by Denoising	86
Adaptive Perspective Distillation for Semantic Segmentation	86
Conformal Prediction for Time Series	86
Analysis of the Hands in Egocentric Vision: A Survey	86
RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating	86
On the Universal Approximation Properties of Deep Neural Networks Using MAM Neurons	85
TE141K: Artistic Text Benchmark for Text Effect Transfer	85
Deep Learning on Object-Centric 3D Neural Fields	84
FreeFusion: Infrared and Visible Image Fusion via Cross Reconstruction Learning	84
MoBluRF: Motion Deblurring Neural Radiance Fields for Blurry Monocular Video	83
GhostingNet: A Novel Approach for Glass Surface Detection With Ghosting Cues	83
Rolling Shutter Homography and its Applications	82
Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained Visual Recognition	81
Learning to Super-Resolve Blurry Images With Events	81
AutoEval: Are Labels Always Necessary for Classifier Accuracy Evaluation?	81
Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization	81
Compositional Physical Reasoning of Objects and Events from Videos	81
Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses	80
Cascaded Dynamic Memory Refinement and Semantic Alignment for Exo-to-Ego Cross-View Video Generation	80
Adversarially Robust Neural Architectures	80
Any Fashion Attribute Editing: Dataset and Pretrained Models	79
CC4S: Encouraging Certainty and Consistency in Scribble-Supervised Semantic Segmentation	79
Pixel Distillation: Cost-Flexible Distillation Across Image Sizes and Heterogeneous Networks	79
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging	79
S $^{2}$ O: Enhancing Adversarial Training with Second-Order Statistics of Weights	79
FNA++: Fast Network Adaptation via Parameter Remapping and Architecture Search	78
Progressive Instance-Aware Feature Learning for Compositional Action Recognition	78
DeepMesh: Differentiable Iso-Surface Extraction	78
Single Image Deraining: From Model-Based to Data-Driven and Beyond	78
Noisy Label Learning With Provable Consistency for a Wider Family of Losses	78
Reframing Neural Networks: Deep Structure in Overcomplete Representations	77
$\mathcal {X}$-Metric: An N-Dimensional Information-Theoretic Framework for Groupwise Registration and Deep Combined Computing	77
Towards Reliable and Faithful Explanations: A Disentanglement-Augmented Approach for Selective Rationalization	77
SS-TBN: A Semi-Supervised Tri-Branch Network for COVID-19 Screening and Lesion Segmentation	77
Optimizing Regularized Cholesky Score for Order-Based Learning of Bayesian Networks	76
Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset	76
Joint Framework for Single Image Reconstruction and Super-Resolution With an Event Camera	76
3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency?	76
CycMuNet+: Cycle-Projected Mutual Learning for Spatial-Temporal Video Super-Resolution	75
Semantic Object Accuracy for Generative Text-to-Image Synthesis	75
WildVideo: Benchmarking LMMs for Understanding Video-Language Interaction	75
Probabilistic Directed Distance Fields for Ray-Based Shape Representations	75
Support Vector Machine Classifier via Soft-Margin Loss	75
Revealing the Dark Side of Non-Local Attention in Single Image Super-Resolution	75
Map-Guided Curriculum Domain Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image Segmentation	75
Point Spatio-Temporal Transformer Networks for Point Cloud Video Modeling	74