IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The median citation count of IEEE Transactions on Pattern Analysis and Machine Intelligence is 8. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
Article | Citations
Front Cover | 3719
One-for-All: Towards Universal Domain Translation With a Single StyleGAN | 1923
On the Trade-Off Between Flatness and Optimization in Distributed Learning | 1676
Learning to Guide a Saturation-Based Theorem Prover | 1504
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning | 935
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search | 889
Interactive NeRF Geometry Editing With Shape Priors | 757
Self-Supervised Skeleton Representation Learning Via Actionlet Contrast and Reconstruct | 695
Editorial: Introduction to the Special Section on Best of CVPR'2022 | 695
BiBBDM: Bidirectional Image Translation With Brownian Bridge Diffusion Models | 667
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models | 661
A Clustering Validity Index With Multi-Granularity Fusion for Multiple Fuzzy Clustering Algorithms | 655
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach | 652
Video Demoireing Using Focused-Defocused Dual-Camera System | 615
Invariant Policy Learning: A Causal Perspective | 596
A Hybrid Stochastic-Deterministic Minibatch Proximal Gradient Method for Efficient Optimization and Generalization | 588
Towards Accurate and Compact Architectures via Neural Architecture Transformer | 581
Seeing Through Satellite Images at Street Views | 564
Next Bit Prediction: A Unified Lossless and Lossy Point Cloud Geometry Compression Framework | 539
LRANet++: Low-Rank Approximation Network for Accurate and Efficient Text Spotting | 536
Modeling Noisy Annotations for Point-Wise Supervision | 526
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images | 523
Test-Time Correction: An Online 3D Detection System via Visual Prompting | 516
Simplicial Complex Neural Networks | 488
Like Human Rethinking: Contour Transformer AutoRegression for Referring Remote Sensing Interpretation | 486
Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures | 472
Rethinking Rotation-Invariant Recognition of Fine-Grained Shapes From the Perspective of Contour Points | 453
SNI-SLAM++: Tightly-Coupled Semantic Neural Implicit SLAM | 443
Revisiting Transformation Invariant Geometric Deep Learning: An Initial Representation Perspective | 436
MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning | 435
A Personalized and Privacy-Preserving Federated Transformer Framework for Multilingual Sentiment Analysis | 431
Quadratic Matrix Factorization With Applications to Manifold Learning | 428
Graph Convolutional Module for Temporal Action Localization in Videos | 421
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting | 405
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing | 402
Learn to Predict Sets Using Feed-Forward Neural Networks | 402
Physics-Informed Guided Disentanglement in Generative Networks | 396
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems | 375
Prior Image Guided Snapshot Compressive Spectral Imaging | 371
Inferring Point Cloud Quality via Graph Similarity | 370
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation | 370
Optimization-Based Post-Training Quantization With Bit-Split and Stitching | 360
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference | 353
AIRPNet: Adaptive Image Restoration With Privacy Protection in Steganographic Domain | 353
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification | 352
Label Hierarchy Transition: Delving into Class Hierarchies to Enhance Deep Classifiers | 351
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data | 338
Face Forgery Detection by 3D Decomposition and Composition Search | 324
SCGT: Towards Scalable and Comprehensive Graph Transformer | 304
Sparse-to-Dense Matching Network for Large-Scale LiDAR Point Cloud Registration | 303
Face Generation and Editing With StyleGAN: A Survey | 299
Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes | 295
VATr++: Choose Your Words Wisely for Handwritten Text Generation | 293
Are Graph Convolutional Networks With Random Weights Feasible? | 293
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness | 292
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models | 292
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution | 283
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification | 281
Towards Unified Deep Image Deraining: A Survey and a New Benchmark | 280
Jailbreak and Guard Aligned Language Models with Only Few In-Context Demonstrations | 276
Reliable and Compact Graph Fine-Tuning Via Graph Sparse Prompting | 275
Transformer-Based Visual Segmentation: A Survey | 269
Structure-Preserving Image Super-Resolution | 268
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | 264
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks | 259
Enhancing Representations Through Heterogeneous Self-Supervised Learning | 259
Learning Graph Convolutional Networks for Multi-Label Recognition and Applications | 247
Centerless Clustering | 247
Affective Image Content Analysis: Two Decades Review and New Perspectives | 238
Active Supervised Cross-Modal Retrieval | 234
Locating and Counting Heads in Crowds With a Depth Prior | 231
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition | 229
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation | 227
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion | 227
DVIS++: Improved Decoupled Framework for Universal Video Segmentation | 227
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching | 226
DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus | 224
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation | 224
Deep Long-Tailed Learning: A Survey | 220
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation | 220
Event-Based Photometric Bundle Adjustment | 219
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting | 217
Multi-Dataset, Multitask Learning of Egocentric Vision Tasks | 213
Rein++: Efficient Generalization and Adaptation for Semantic Segmentation with Vision Foundation Models | 212
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains | 212
Cover 2 | 209
BNET: Batch Normalization With Enhanced Linear Transformation | 208
Temporal Feature Matters: A Framework for Diffusion Model Quantization | 206
Unbiased Scene Graph Generation via Two-Stage Causal Modeling | 203
GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector | 200
Rate-Distortion Theory in Coding for Machines and Its Applications | 199
On the Robustness of Average Losses for Partial-Label Learning | 199
A Unified Experience Replay Framework for Spiking Deep Reinforcement Learning | 198
MESA: Effective Matching Redundancy Reduction by Semantic Area Segmentation | 197
Universal Image Segmentation With Efficiency | 188
Supervised Small-baseline and Large-baseline Homography Learning with Diffusion-based Data Generation | 188
Correcting Optical Aberration via Depth-Aware Point Spread Functions | 184
To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion | 184
Image-to-Image Translation With Disentangled Latent Vectors for Face Editing | 184
Adaptive Transfer Kernel Learning for Transfer Gaussian Process Regression | 180
Self-Scalable Tanh (Stan): Multi-Scale Solutions for Physics-Informed Neural Networks | 176
Towards Pointsets Representation Learning via Self-Supervised Learning and Set Augmentation | 176
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation | 175
Human Interaction Understanding With Consistency-Aware Learning | 173
GradMDM: Adversarial Attack on Dynamic Networks | 173
Deep Orientational Representation Learning for Ordinal Regression | 172
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning | 170
Graph-Oriented Instruction Tuning of Large Language Models for Generic Graph Mining | 170
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation | 169
Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification | 168
Understanding the Effects of Projectors in Knowledge Distillation | 167
Compositional Scene Representation Learning via Reconstruction: A Survey | 166
Hypergraph-Based Multi-View Action Recognition Using Event Cameras | 166
Learning Graph Attentions via Replicator Dynamics | 166
Inter-Intra Hypergraph Computation for Survival Prediction on Whole Slide Images | 166
Discriminant Feature Extraction by Generalized Difference Subspace | 165
SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics | 165
Learning to See Through With Events | 163
Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis with a Large-Scale Dataset | 163
LMP-GAN: Out-of-Distribution Detection for Non-Control Data Malware Attacks | 161
Matrix Completion via Non-Convex Relaxation and Adaptive Correlation Learning | 160
Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks | 160
Controllable Generation With Text-to-Image Diffusion Models: A Survey | 159
VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision | 158
Image Lens Flare Removal Using Adversarial Curve Learning | 158
Continuous Review and Timely Correction: Enhancing the Resistance to Noisy Labels via Self-Not-True and Class-Wise Distillation | 158
Revisiting Nonlocal Self-Similarity from Continuous Representation | 157
AutoNovel: Automatically Discovering and Learning Novel Visual Categories | 157
Interpretable Optimization-Inspired Unfolding Network for Low-Light Image Enhancement | 156
A New Brain Network Construction Paradigm for Brain Disorder via Diffusion-Based Graph Contrastive Learning | 156
Bridging Actions: Generate 3D Poses and Shapes In-Between Photos | 153
Variational Data-Free Knowledge Distillation for Continual Learning | 151
Winsor-CAM: Human-Tunable Visual Explanations from Deep Networks via Layer-Wise Winsorization | 150
SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid 3D Registration | 149
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning | 148
Reconstruction Guided Meta-Learning for Few Shot Open Set Recognition | 148
EvolveNav: Empowering LLM-Based Vision-Language Navigation via Self-Improving Embodied Reasoning | 147
Deep Learning-Based Point Cloud Compression: An In-Depth Survey and Benchmark | 146
A Unified Decision Rule for Generalized Out-of-Distribution Detection | 145
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation | 145
Physics-Informed Matrix Factorization Operator | 143
Revisiting Transferable Adversarial Images: Systemization, Evaluation, and New Insights | 142
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration | 142
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets | 138
Learning Efficient Meshflow and Optical Flow From Event Cameras | 137
On Positive-Unlabeled Classification From Corrupted Data in GANs | 137
A Variational EM Acceleration for Efficient Clustering at Very Large Scales | 137
Privacy Preserving Decentralized Learning with Positive-Incentive Noise | 135
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis With Semantic Graph Prior | 135
M$^{3}$D: A Multimodal, Multilingual and Multitask Dataset for Grounded Document-Level Information Extraction | 134
Asymmetric Loss Functions for Noise-Tolerant Learning: Theory and Applications | 134
From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing | 133
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap | 132
Human-Centric Transformer for Domain Adaptive Action Recognition | 132
Robust Multimodal Learning With Missing Modalities via Parameter-Efficient Adaptation | 132
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach | 132
A Fully Automated Method for 3D Individual Tooth Identification and Segmentation in Dental CBCT | 132
Enhancing Photorealism Enhancement | 129
Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes for Pruning is Possible Without Retraining | 128
Reduced-Rank Tensor-on-Tensor Regression and Tensor-Variate Analysis of Variance | 128
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition | 128
Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification | 127
Differentially Private Graph Neural Networks for Whole-Graph Classification | 126
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses | 126
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving | 126
PathNet: Path-Selective Point Cloud Denoising | 125
Random Permutation Set Reasoning | 125
Accurate and Efficient Stereo Matching via Attention Concatenation Volume | 124
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking | 123
Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond | 122
Deep Gait Recognition: A Survey | 122
Mining Association Patterns From Neighborhood Insight | 122
Advances and Challenges in Meta-Learning: A Technical Review | 120
GenPoly: Learning Generalized and Tessellated Shape Priors via 3D Polymorphic Evolving | 120
Knowledge-Based Embodied Question Answering | 119
Self-Supervised Multimodal Learning: A Survey | 119
Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey | 118
P2T: Pyramid Pooling Transformer for Scene Understanding | 118
Low-Shot Video Object Segmentation | 117
JointFormer: A Unified Framework With Joint Modeling for Video Object Segmentation | 117
ComputingEdge ad | 117
WildVideo: Benchmarking LMMs for Understanding Video-Language Interaction | 116
Cover 3 | 115
Supervision by Denoising | 114
Towards Reliable and Faithful Explanations: A Disentanglement-Augmented Approach for Selective Rationalization | 114
Reframing Neural Networks: Deep Structure in Overcomplete Representations | 114
Noisy Label Learning With Provable Consistency for a Wider Family of Losses | 114
On the Universal Approximation Properties of Deep Neural Networks Using MAM Neurons | 114
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference | 113
PMGT-VR: A Decentralized Proximal-Gradient Algorithmic Framework With Variance Reduction | 113
SS-NeRF: Physically Based Sparse Spectral Rendering With Neural Radiance Field | 113
AutoEval: Are Labels Always Necessary for Classifier Accuracy Evaluation? | 113
Cascaded Dynamic Memory Refinement and Semantic Alignment for Exo-to-Ego Cross-View Video Generation | 111
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation | 111
Distributionally Location-Aware Transferable Adversarial Patches for Facial Images | 111
Deep Learning on Object-Centric 3D Neural Fields | 111
RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating | 108
Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation | 108
Adversarially Robust Neural Architectures | 108
MoBluRF: Motion Deblurring Neural Radiance Fields for Blurry Monocular Video | 106
GLC++: Source-Free Universal Domain Adaptation Through Global-Local Clustering and Contrastive Affinity Learning | 106
SHADOW: Secure Hidden Authenticating Digital Objects in the Wild | 106
Learn to Enhance Sparse Spike Streams | 105
NuwaDynamics+: A Causality-Aware Generative Framework for Spatio-Temporal Representation Learning | 104
Pixel Distillation: Cost-Flexible Distillation Across Image Sizes and Heterogeneous Networks | 103
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning | 103
An Energy-Based Prior for Generative Saliency | 102
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding | 102
The Cluster Structure Function | 102
Semi-Supervised Learning for FGVC With Out-of-Category Data | 102
Heterogeneous Feature Re-Sampling for Balanced Pedestrian Attribute Recognition | 101
Editorial: Special Section on Egocentric Perception | 101
Reusable Architecture Growth for Continual Stereo Matching | 100
Stimulative Training++: Go Beyond the Performance Limits of Residual Networks | 100
Generalized Task-Driven Medical Image Quality Enhancement With Gradient Promotion | 99
Learning With Constraint Learning: New Perspective, Solution Strategy and Various Applications | 99
Continual Unsupervised Generative Modeling | 99
Learning to Super-Resolve Blurry Images With Events | 99
Compositional Physical Reasoning of Objects and Events From Videos | 99
DeepMesh: Differentiable Iso-Surface Extraction | 99
STAR-FC: Structure-Aware Face Clustering on Ultra-Large-Scale Graphs | 98
Joint Framework for Single Image Reconstruction and Super-Resolution With an Event Camera | 98
Any Fashion Attribute Editing: Dataset and Pretrained Models | 98
Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained Visual Recognition | 98
An Algebraic Geometry Approach to Viewing Graph Solvability | 97
Test-Time Training for Hyperspectral Image Super-Resolution | 97
Orthogonal Decoupling Contrastive Regularization: Toward Uncorrelated Feature Decoupling for Unpaired Image Restoration | 96
A Thorough Benchmark and a New Model for Light Field Saliency Detection | 96
Temporal Stereo Matching From Event Cameras via Joint Learning With Stereoscopic Flow | 95
GhostingNet: A Novel Approach for Glass Surface Detection With Ghosting Cues | 94
Dynamic Differential Image Circle Diameter Measurement Precision Assessment: Application to Burning Droplets | 94
Confidence-Aware Pseudo-Label Self-Correction for Weakly Supervised Visual Grounding | 94
Relationship Quantification of Image Degradations | 94
Adaptive Sparse Self-Attention for Efficient Image Super-resolution and beyond | 94
Human as Points: Explicit Point-Based 3D Human Reconstruction From Single-View RGB Images | 94
3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency? | 93
RED++: Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging | 93
Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses | 93
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-World Object Detector | 92
CC4S: Encouraging Certainty and Consistency in Scribble-Supervised Semantic Segmentation | 91
CycMuNet+: Cycle-Projected Mutual Learning for Spatial-Temporal Video Super-Resolution | 91
Compositional Generative Model of Unbounded 4D Cities | 91
Scale Propagation Network for Generalizable Depth Completion | 90
Multi-Modality Multi-Attribute Contrastive Pre-Training for Image Aesthetics Computing | 90
Conformal Prediction for Time Series | 90
S$^{2}$O: Enhancing Adversarial Training With Second-Order Statistics of Weights | 89
SS-TBN: A Semi-Supervised Tri-Branch Network for COVID-19 Screening and Lesion Segmentation | 88
Adaptive Perspective Distillation for Semantic Segmentation | 88
$\mathcal{X}$-Metric: An N-Dimensional Information-Theoretic Framework for Groupwise Registration and Deep Combined Computing | 86
Unified Adversarial Patch for Visible-Infrared Cross-Modal Attacks in the Physical World | 86