IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The TQCC of IEEE Transactions on Pattern Analysis and Machine Intelligence is 25. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-12-01 to 2025-12-01.)
ArticleCitations
[Back cover - Table of contents, continued]3052
Front Cover1641
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference1575
Learn to Predict Sets Using Feed-Forward Neural Networks1545
One-for-All: Towards Universal Domain Translation With a Single StyleGAN1497
Editorial: Introduction to the Special Section on Best of CVPR'20221483
Self-Supervised Skeleton Representation Learning Via Actionlet Contrast and Reconstruct1160
BiBBDM: Bidirectional Image Translation With Brownian Bridge Diffusion Models1135
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems955
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation772
Multi-Dataset, Multitask Learning of Egocentric Vision Tasks727
MECD+ : Unlocking Event-Level Causal Graph Discovery for Video Reasoning625
Face Forgery Detection by 3D Decomposition and Composition Search583
ResNet-LDDMM: Advancing the LDDMM Framework using Deep Residual Networks575
Active Supervised Cross-Modal Retrieval563
DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus555
Event-Based Photometric Bundle Adjustment534
On the Trade-Off Between Flatness and Optimization in Distributed Learning509
VATr++: Choose Your Words Wisely for Handwritten Text Generation504
Revisiting Transformation Invariant Geometric Deep Learning: An Initial Representation Perspective496
Sparse-to-Dense Matching Network for Large-Scale LiDAR Point Cloud Registration475
Learning to Guide a Saturation-Based Theorem Prover472
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting461
Towards Accurate and Compact Architectures via Neural Architecture Transformer460
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation456
Prior Image Guided Snapshot Compressive Spectral Imaging447
A Hybrid Stochastic-Deterministic Minibatch Proximal Gradient Method for Efficient Optimization and Generalization445
Enhancing Representations Through Heterogeneous Self-Supervised Learning441
Quadratic Matrix Factorization With Applications to Manifold Learning426
Invariant Policy Learning: A Causal Perspective411
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains405
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images403
A Generative Model for Generic Light Field Reconstruction403
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification401
Towards Unified Deep Image Deraining: A Survey and a New Benchmark390
SNI-SLAM++: Tightly-Coupled Semantic Neural Implicit SLAM378
Unsupervised Domain Adaptation via Discriminative Manifold Propagation362
A Clustering Validity Index With Multi-Granularity Fusion for Multiple Fuzzy Clustering Algorithms361
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition359
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning358
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution354
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models352
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation349
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data339
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach338
Video Demoireing Using Focused-Defocused Dual-Camera System334
Physics-Informed Guided Disentanglement in Generative Networks333
Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures321
Interactive NeRF Geometry Editing With Shape Priors318
Instance Shadow Detection with A Single-Stage Detector318
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks315
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion313
Modeling Noisy Annotations for Point-Wise Supervision310
Centerless Clustering285
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting283
Rethinking Rotation-Invariant Recognition of Fine-grained Shapes from the Perspective of Contour Points269
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation266
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing261
Inferring Point Cloud Quality via Graph Similarity258
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching257
Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes257
Locating and Counting Heads in Crowds With a Depth Prior252
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting247
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation245
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search243
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks242
Simplicial Complex Neural Networks242
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness236
Are Graph Convolutional Networks With Random Weights Feasible?234
Deep Non-Rigid Structure From Motion With Missing Data230
Learning Graph Convolutional Networks for Multi-Label Recognition and Applications229
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method229
Transformer-Based Visual Segmentation: A Survey225
Face Generation and Editing With StyleGAN: A Survey224
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models223
Affective Image Content Analysis: Two Decades Review and New Perspectives222
Graph Convolutional Module for Temporal Action Localization in Videos216
DVIS++: Improved Decoupled Framework for Universal Video Segmentation215
Optimization-Based Post-Training Quantization With Bit-Split and Stitching215
Structure-Preserving Image Super-Resolution213
Deep Long-Tailed Learning: A Survey209
IEEE Computer Society Has You Covered!209
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification209
Cover 2208
Fast Component Tree Computation for Images of Limited Levels207
BNET: Batch Normalization With Enhanced Linear Transformation205
Universal Image Segmentation With Efficiency204
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration201
Temporal Feature Matters: A Framework for Diffusion Model Quantization201
M3D: a Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction200
Spatial-Temporal Transformer for Video Snapshot Compressive Imaging197
Learning to See Through With Events196
Revisiting Transferable Adversarial Images: Systemization, Evaluation, and New Insights195
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning191
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation187
Robust Multimodal Learning With Missing Modalities via Parameter-Efficient Adaptation183
On Positive-Unlabeled Classification From Corrupted Data in GANs183
Learning Graph Attentions via Replicator Dynamics175
Correcting Optical Aberration via Depth-Aware Point Spread Functions175
Image Lens Flare Removal Using Adversarial Curve Learning174
Bridging Actions: Generate 3D Poses and Shapes In-Between Photos174
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets173
GenPoly: Learning Generalized and Tessellated Shape Priors via 3D Polymorphic Evolving170
To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion170
Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification167
SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics164
Learning Efficient Meshflow and Optical Flow from Event Cameras164
Adaptive Transfer Kernel Learning for Transfer Gaussian Process Regression164
Differential Viewpoints for Ground Terrain Material Recognition164
Deep Learning-Based Point Cloud Compression: An In-Depth Survey and Benchmark163
Reduced-Rank Tensor-on-Tensor Regression and Tensor-Variate Analysis of Variance162
Matrix Completion via Non-Convex Relaxation and Adaptive Correlation Learning161
A Variational EM Acceleration for Efficient Clustering at Very Large Scales160
Human-Centric Transformer for Domain Adaptive Action Recognition158
Inter-Intra Hypergraph Computation for Survival Prediction on Whole Slide Images157
On the Robustness of Average Losses for Partial-Label Learning152
SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid 3D Registration149
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis With Semantic Graph Prior149
Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification148
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach148
GradMDM: Adversarial Attack on Dynamic Networks147
Human Interaction Understanding With Consistency-Aware Learning147
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning147
Reconstruction Guided Meta-Learning for Few Shot Open Set Recognition146
Dynamic Self-Supervised Teacher-Student Network Learning146
Variational Data-Free Knowledge Distillation for Continual Learning146
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation144
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks143
LMP-GAN: Out-of-Distribution Detection for Non-Control Data Malware Attacks141
Compositional Scene Representation Learning via Reconstruction: A Survey141
Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks139
Enhancing Photorealism Enhancement138
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation138
Unbiased Scene Graph Generation via Two-Stage Causal Modeling135
Rate-Distortion Theory in Coding for Machines and Its Applications134
Discriminant Feature Extraction by Generalized Difference Subspace134
MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network130
Graph-oriented Instruction Tuning of Large Language Models for Generic Graph Mining128
Self-Supervised Multimodal Learning: A Survey127
Orientational Distribution Learning with Hierarchical Spatial Attention for Open Set Recognition126
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses126
Deep Gait Recognition: A Survey126
A New Brain Network Construction Paradigm for Brain Disorder via Diffusion-Based Graph Contrastive Learning126
Self-Scalable Tanh (Stan): Multi-Scale Solutions for Physics-Informed Neural Networks126
PathNet: Path-Selective Point Cloud Denoising125
Random Permutation Set Reasoning125
Differentially Private Graph Neural Networks for Whole-Graph Classification123
Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes for Pruning is Possible Without Retraining122
Revisiting Nonlocal Self-Similarity from Continuous Representation121
Image-to-Image Translation With Disentangled Latent Vectors for Face Editing121
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving121
A Fully Automated Method for 3D Individual Tooth Identification and Segmentation in Dental CBCT120
Knowledge-Based Embodied Question Answering120
GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector120
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning119
P2T: Pyramid Pooling Transformer for Scene Understanding119
Towards Pointsets Representation Learning via Self-Supervised Learning and Set Augmentation118
Advances and Challenges in Meta-Learning: A Technical Review117
VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision116
From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing116
Domain Generalization: A Survey116
Interpretable Optimization-Inspired Unfolding Network for Low-Light Image Enhancement116
AutoNovel: Automatically Discovering and Learning Novel Visual Categories116
Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond115
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition115
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion114
Accurate and Efficient Stereo Matching via Attention Concatenation Volume114
A Style-Based Generator Architecture for Generative Adversarial Networks114
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking114
Deep Learning for Face Anti-Spoofing: A Survey114
Asymmetric Loss Functions for Noise-Tolerant Learning: Theory and Applications113
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap113
Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification113
Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey112
Hypergraph-Based Multi-View Action Recognition Using Event Cameras112
Semi-Supervised Learning for FGVC With Out-of-Category Data111
Any Fashion Attribute Editing: Dataset and Pretrained Models110
JointFormer: A Unified Framework With Joint Modeling for Video Object Segmentation110
ComputingEdge ad110
WildVideo: Benchmarking LMMs for Understanding Video-Language Interaction110
Low-Shot Video Object Segmentation110
Hypergraph-Based High-Order Correlation Analysis for Large-Scale Long-Tailed Data Classification109
MoBluRF: Motion Deblurring Neural Radiance Fields for Blurry Monocular Video109
Cover 3109
Test-Time Training for Hyperspectral Image Super-Resolution108
On the Universal Approximation Properties of Deep Neural Networks Using MAM Neurons108
3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency?107
Reframing Neural Networks: Deep Structure in Overcomplete Representations107
SS-NeRF: Physically Based Sparse Spectral Rendering with Neural Radiance Field106
S$^{2}$ 2O: Enhancing Adversarial Training With Second-Order Statistics of Weights105
Self-Guidance: Boosting Flow and Diffusion Generation on Their Own105
DeepMesh: Differentiable Iso-Surface Extraction105
Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses104
STAR-FC: Structure-Aware Face Clustering on Ultra-Large-Scale Graphs104
Supervision by Denoising103
PMGT-VR: a Decentralized Proximal-gradient Algorithmic Framework with Variance Reduction103
Towards Reliable and Faithful Explanations: A Disentanglement-Augmented Approach for Selective Rationalization102
Noisy Label Learning With Provable Consistency for a Wider Family of Losses102
Probabilistic Directed Distance Fields for Ray-Based Shape Representations102
Unified Modality Separation: A Vision-Language Framework for Unsupervised Domain Adaptation102
CycMuNet+: Cycle-Projected Mutual Learning for Spatial-Temporal Video Super-Resolution101
AutoEval: Are Labels Always Necessary for Classifier Accuracy Evaluation?101
Heterogeneous Feature Re-Sampling for Balanced Pedestrian Attribute Recognition101
Scale Propagation Network for Generalizable Depth Completion100
An Energy-Based Prior for Generative Saliency100
The Cluster Structure Function100
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding100
Editorial: Special Section on Egocentric Perception99
Adversarially Robust Neural Architectures99
Continual Unsupervised Generative Modeling99
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference99
Compositional Generative Model of Unbounded 4D Cities98
TN-ZSTAD: Transferable Network for Zero-Shot Temporal Activity Detection98
Compositional Physical Reasoning of Objects and Events From Videos98
Joint Framework for Single Image Reconstruction and Super-Resolution With an Event Camera97
GLC++: Source-Free Universal Domain Adaptation Through Global-Local Clustering and Contrastive Affinity Learning96
luvHarris: A Practical Corner Detector for Event-Cameras96
Orthogonal Decoupling Contrastive Regularization: Towards Uncorrelated Feature Decoupling for Unpaired Image Restoration96
Stimulative Training++: Go Beyond the Performance Limits of Residual Networks95
RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating95
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging94
Multi-Modality Multi-Attribute Contrastive Pre-Training for Image Aesthetics Computing94
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-World Object Detector94
Relationship Quantification of Image Degradations94
Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI94
Reusable Architecture Growth for Continual Stereo Matching93
Unified Adversarial Patch for Visible-Infrared Cross-Modal Attacks in the Physical World93
Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization93
Distributionally Location-Aware Transferable Adversarial Patches for Facial Images92
Dynamic Differential Image Circle Diameter Measurement Precision Assessment: Application to Burning Droplets91
MB-TaylorFormer V2: Improved Multi-Branch Linear Transformer Expanded by Taylor Formula for Image Restoration91
Progressive Instance-Aware Feature Learning for Compositional Action Recognition90
ONNXPruner: ONNX-Based General Model Pruning Adapter90
$\mathcal {X}$-Metric: An N-Dimensional Information-Theoretic Framework for Groupwise Registration and Deep Combined Computing90
Support Vector Machine Classifier via Soft-Margin Loss90
Conformal Prediction for Time Series90
Adaptive Perspective Distillation for Semantic Segmentation90
Disentangled Representation Learning89
Learning to Super-Resolve Blurry Images With Events89
Recurrent Neural Networks for Snapshot Compressive Imaging89
Deep Learning on Object-Centric 3D Neural Fields89
A Thorough Benchmark and a New Model for Light Field Saliency Detection89
SS-TBN: A Semi-Supervised Tri-Branch Network for COVID-19 Screening and Lesion Segmentation88
Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification88
Revealing the Dark Side of Non-Local Attention in Single Image Super-Resolution88
Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning88
Map-Guided Curriculum Domain Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image Segmentation88
Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset88
FreeFusion: Infrared and Visible Image Fusion via Cross Reconstruction Learning87
Analysis of the Hands in Egocentric Vision: A Survey87
0.29142594337463