IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The TQCC of IEEE Transactions on Pattern Analysis and Machine Intelligence is 15. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-02-01 to 2025-02-01.)
ArticleCitations
[Front inside cover]2260
IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information1892
Depth Restoration in Under-Display Time-of-Flight Imaging1537
Cover 21074
Cover 4 [Table of Contents, back cover]1026
Front Cover1019
Towards Robust Probabilistic Modeling on SO(3) via Rotation Laplace Distribution994
Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling987
IEEE Computer Society Information925
Front Cover908
Cover 3868
Cover794
Table of Contents637
[Back inside cover]585
Cover575
Bilinear Image Translation for Temporal Analysis of Photo Collections556
Editorial Board486
Fully Sparse Fusion for 3D Object Detection464
What Makes Deviant Places?463
A Closed-Form, Pairwise Solution to Local Non-Rigid Structure-From-Motion453
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks396
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains377
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models371
Deep Learning Methods for Calibrated Photometric Stereo and Beyond359
Semi-Supervised Multi-View Deep Discriminant Representation Learning359
Physics-Informed Guided Disentanglement in Generative Networks328
Learning to Augment Poses for 3D Human Pose Estimation in Images and Videos314
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time311
Learning by Seeing More Classes298
RayMVSNet++: Learning Ray-Based 1D Implicit Fields for Accurate Multi-View Stereo296
A Unified Visual Information Preservation Framework for Self-supervised Pre-training in Medical Image Analysis296
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation290
Generating Hypergraph-Based High-Order Representations of Whole-Slide Histopathological Images for Survival Prediction282
EM-Driven Unsupervised Learning for Efficient Motion Segmentation279
Deep Order-Preserving Learning With Adaptive Optimal Transport Distance276
Structure-Preserving Image Super-Resolution275
Holistic Prototype Activation for Few-Shot Segmentation273
Geometry-Aware Generation of Adversarial Point Clouds273
Fourier-Based and Rational Graph Filters for Spectral Processing258
FeatAug-DETR: Enriching One-to-Many Matching for DETRs With Feature Augmentation254
Towards a Deeper Understanding of Global Covariance Pooling in Deep Learning: An Optimization Perspective249
Vicinity Vision Transformer245
Deterministic Approximate Methods for Maximum Consensus Robust Fitting236
Dual Instance-Consistent Network for Cross-Domain Object Detection233
ZeroNAS: Differentiable Generative Adversarial Networks Search for Zero-Shot Learning229
Centerless Clustering220
SibNet: Sibling Convolutional Encoder for Video Captioning219
Tensor Low-Rank Representation for Data Recovery and Clustering215
Communication-Efficient Randomized Algorithm for Multi-Kernel Online Federated Learning211
Logarithmic Schatten-p Norm Minimization for Tensorial Multi-view Subspace Clustering209
A Generative Model for Generic Light Field Reconstruction205
Create Your World: Lifelong Text-to-Image Diffusion202
Object Affinity Learning: Towards Annotation-Free Instance Segmentation202
Stereo Confidence Estimation via Locally Adaptive Fusion and Knowledge Distillation201
On the Optimality of Sufficient Statistics-based Quantizers199
IEEE Computer Society Information189
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning186
Learning to Follow and Generate Instructions for Language-Capable Navigation183
From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm181
Point Cloud Attacks in Graph Spectral Domain: When 3D Geometry Meets Graph Signal Processing179
Heterogeneous Multi-Party Learning With Data-Driven Network Sampling178
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems178
Importance Weighted Structure Learning for Scene Graph Generation169
Erratum to “Deep Back-Projection Networks for Single Image Super-Resolution”167
A Stream Algebra for Performance Optimization of Large Scale Computer Vision Pipelines167
Searching a High Performance Feature Extractor for Text Recognition Network166
Graph Transformer GANs With Graph Masked Modeling for Architectural Layout Generation164
B-Cos Alignment for Inherently Interpretable CNNs and Vision Transformers161
Incomplete Gamma Kernels: Generalizing Locally Optimal Projection Operators159
Semantic Probability Distribution Modeling for Diverse Semantic Image Synthesis158
End-to-End Full Projector Compensation157
Text-Driven Video Acceleration: A Weakly-Supervised Reinforcement Learning Method156
GAN Compression: Efficient Architectures for Interactive Conditional GANs154
An Efficient Fisher Matrix Approximation Method for Large-Scale Neural Network Optimization149
Self-Supervised Arbitrary-Scale Implicit Point Clouds Upsampling149
PWLU: Learning Specialized Activation Functions With the Piecewise Linear Unit148
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection148
IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information145
Semi-Dense Feature Matching With Transformers and its Applications in Multiple-View Geometry141
How to Query an Oracle? Efficient Strategies to Label Data140
Invariant Policy Learning: A Causal Perspective140
State of the Journal Editorial139
Publications Seek 2023 Editors in Chief139
[Back cover - Table of contents, continued]137
Unambiguous Text Localization, Retrieval, and Recognition for Cluttered Scenes136
Understanding and Accelerating Neural Architecture Search With Training-Free and Theory-Grounded Metrics135
Point Cloud Instance Segmentation With Semi-Supervised Bounding-Box Mining134
Content-Aware Warping for View Synthesis134
MNET++: Music-Driven Pluralistic Dancing Toward Multiple Dance Genre Synthesis133
Unsupervised Learning of Graph Matching With Mixture of Modes via Discrepancy Minimization131
Optimization-Based Post-Training Quantization With Bit-Split and Stitching130
Face Forgery Detection by 3D Decomposition and Composition Search130
NAAQA: A Neural Architecture for Acoustic Question Answering129
Debiased Scene Graph Generation for Dual Imbalance Learning127
MPS-NeRF: Generalizable 3D Human Rendering From Multiview Images126
Deep Constraint-Based Propagation in Graph Neural Networks126
Unsupervised Face Detection in the Dark123
Global Instance Tracking: Locating Target More Like Humans122
DeepPhaseCut: Deep Relaxation in Phase for Unsupervised Fourier Phase Retrieval121
Context-Aware Graph Inference With Knowledge Distillation for Visual Dialog121
Neural Prompt Search119
PFENet++: Boosting Few-Shot Semantic Segmentation With the Noise-Filtered Context-Aware Prior Mask116
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images114
Learning Bilateral Cost Volume for Rolling Shutter Temporal Super-Resolution114
SODFormer: Streaming Object Detection With Transformer Using Events and Frames112
Hunter: Exploring High-Order Consistency for Point Cloud Registration With Severe Outliers112
Neighbourhood Representative Sampling for Efficient End-to-End Video Quality Assessment110
Relationship-Embedded Representation Learning for Grounding Referring Expressions110
Detecting and Grounding Multi-Modal Media Manipulation and Beyond110
Z-Splat: Z-Axis Gaussian Splatting for Camera-Sonar Fusion109
An Asynchronous Linear Filter Architecture for Hybrid Event-Frame Cameras109
PLMP – Point-Line Minimal Problems in Complete Multi-View Visibility109
Non-Graph Data Clustering via $\mathcal {O}(n)$ Bipartite Graph Convolution106
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation106
Booster: A Benchmark for Depth From Images of Specular and Transparent Surfaces105
Contextualizing Meta-Learning via Learning to Decompose105
An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds104
Deep Image Matting With Sparse User Interactions103
SimSwap++: Towards Faster and High-Quality Identity Swapping103
Cover102
Cover101
[Back cover]101
Cover100
Table of Contents99
Generative Text Convolutional Neural Network for Hierarchical Document Representation Learning98
Front Cover98
Learning Dynamic Scene-Conditioned 3D Object Detectors96
CrossHomo: Cross-Modality and Cross-Resolution Homography Estimation95
Adaptive Perturbation for Adversarial Attack95
Attention in Reasoning: Dataset, Analysis, and Modeling93
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification93
On Symbiosis of Attribute Prediction and Semantic Segmentation93
Modeling the Background for Incremental and Weakly-Supervised Semantic Segmentation92
Learning With Asymmetric Kernels: Least Squares and Feature Interpretation91
On the Power of Gradual Network Alignment Using Dual-Perception Similarities91
Learning to Guide a Saturation-Based Theorem Prover89
IEEE Quantum Week89
Scale-Aware Automatic Augmentations for Object Detection With Dynamic Training89
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning89
Light Field Neural Rendering88
ZJUT-EIFD: A Synchronously Collected External and Internal Fingerprint Database88
Cover 4 [Table of Contents]88
Global Model Selection via Solution Paths for Robust Support Vector Machine88
End-to-End One-Shot Human Parsing87
Cover 287
TAKDE: Temporal Adaptive Kernel Density Estimator for Real-Time Dynamic Density Estimation85
Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation85
Temporal Action Segmentation: An Analysis of Modern Techniques85
Revisiting Computer-Aided Tuberculosis Diagnosis85
Simple Primitives With Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-Shot Learning85
Generative Variational-Contrastive Learning for Self-Supervised Point Cloud Representation85
Towards Understanding Convergence and Generalization of AdamW84
Quadratic Matrix Factorization With Applications to Manifold Learning83
A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection82
Deformable Part Region Learning and Feature Aggregation Tree Representation for Object Detection81
Superadditivity and Convex Optimization for Globally Optimal Cell Segmentation Using Deformable Shape Models81
Cross-Lingual Universal Dependency Parsing Only From One Monolingual Treebank81
SIFT Matching by Context Exposed80
Saliency as Pseudo-Pixel Supervision for Weakly and Semi-Supervised Semantic Segmentation80
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks79
Optical Flow in the Dark79
Minimizing Negative Transfer of Knowledge in Multivariate Gaussian Processes: A Scalable and Regularized Approach78
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition78
Fast Graph Generation via Spectral Diffusion78
From Human Pose Similarity Metric to 3D Human Pose Estimator: Temporal Propagating LSTM Networks78
Diagnosing and Preventing Instabilities in Recurrent Video Processing77
Seed the Views: Hierarchical Semantic Alignment for Contrastive Representation Learning77
Meta-Transfer Learning Through Hard Tasks77
Adjacency Constraint for Efficient Hierarchical Reinforcement Learning77
On the Decision Boundaries of Neural Networks: A Tropical Geometry Perspective76
CaCo: Both Positive and Negative Samples are Directly Learnable via Cooperative-Adversarial Contrastive Learning76
AD-VAT+: An Asymmetric Dueling Mechanism for Learning and Understanding Visual Active Tracking76
Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning76
MIGO-NAS: Towards Fast and Generalizable Neural Architecture Search75
Hierarchical Prototype Networks for Continual Graph Representation Learning75
Evidential Multi-Source-Free Unsupervised Domain Adaptation74
Structure From Motion on XSlit Cameras73
Tractable Maximum Likelihood Estimation for Latent Structure Influence Models With Applications to EEG & ECoG Processing72
The Proxy Step-Size Technique for Regularized Optimization on the Sphere Manifold72
Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting72
A Generic Graph-Based Neural Architecture Encoding Scheme With Multifaceted Information71
Conditional Wasserstein Generator71
Appearance and Pose-Conditioned Human Image Generation Using Deformable GANs71
Cycle Registration in Persistent Homology With Applications in Topological Bootstrap71
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference70
Liquid Warping GAN With Attention: A Unified Framework for Human Image Synthesis70
Compositional Semantic Mix for Domain Adaptation in Point Cloud Segmentation69
Second-Order Unsupervised Feature Selection via Knowledge Contrastive Distillation69
Not All Samples are Trustworthy: Towards Deep Robust SVP Prediction69
PointGLR: Unsupervised Structural Representation Learning of 3D Point Clouds68
ASP: Learn a Universal Neural Solver!67
A Self-Consistent-Field Iteration for Orthogonal Canonical Correlation Analysis67
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation67
Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics67
Self-Training Boosted Multi-Factor Matching Network for Composed Image Retrieval66
A Hybrid Neural Coding Approach for Pattern Recognition With Spiking Neural Networks65
Effective Training of Convolutional Neural Networks With Low-Bitwidth Weights and Activations65
Detecting Line Segments in Motion-Blurred Images With Events65
Learning Rates for Stochastic Gradient Descent With Nonconvex Objectives64
Modeling Noisy Annotations for Point-Wise Supervision64
An Integrated Fast Hough Transform for Multidimensional Data64
Emotional Attention: From Eye Tracking to Computational Modeling64
Model Study of Transient Imaging With Multi-Frequency Time-of-Flight Sensors64
Learning Invariance From Generated Variance for Unsupervised Person Re-Identification63
Discriminative Video Representation Learning Using Support Vector Classifiers63
Real-Time Globally Consistent Dense 3D Reconstruction With Online Texturing63
Earning Extra Performance From Restrictive Feedbacks63
Robust Point Cloud Segmentation With Noisy Annotations63
Learning Optical Flow and Scene Flow With Bidirectional Camera-LiDAR Fusion62
Additive Tree-Structured Conditional Parameter Spaces in Bayesian Optimization: A Novel Covariance Function and a Fast Implementation62
Deep Scene Flow Learning: From 2D Images to 3D Point Clouds62
Learning Energy-Based Spatial-Temporal Generative ConvNets for Dynamic Patterns62
Wavelet Approximation-Aware Residual Network for Single Image Deraining61
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting61
Representational Gradient Boosting: Backpropagation in the Space of Functions60
MCTS with Refinement for Proposals Selection Games in Scene Understanding60
Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency60
Sequential Point Clouds: A Survey58
Sheared Epipolar Focus Spectrum for Dense Light Field Reconstruction58
Rank-One Network: An Effective Framework for Image Restoration58
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models58
Geodesic Multi-Class SVM with Stiefel Manifold Embedding58
Towards High Performance Low Complexity Calibration in Appearance Based Gaze Estimation58
Learning Symbolic Model-Agnostic Loss Functions via Meta-Learning58
The Group Loss++: A Deeper Look Into Group Loss for Deep Metric Learning57
Guest Editorial: Non-Euclidean Machine Learning57
Multi-Label Classification via Adaptive Resonance Theory-Based Clustering57
Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection57
Adaptive Cross-Modal Transferable Adversarial Attacks From Images to Videos57
A Deterministic Approximation to Neural SDEs56
Consistency-Aware Anchor Pyramid Network for Crowd Localization56
Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation56
Learn to Predict Sets Using Feed-Forward Neural Networks56
Eigendecomposition-Free Training of Deep Networks for Linear Least-Square Problems56
Semantic Scene Completion Using Local Deep Implicit Functions on LiDAR Data55
Affective Image Content Analysis: Two Decades Review and New Perspectives55
Perceptual Texture Similarity Estimation: An Evaluation of Computational Features54
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation54
Salient Object Detection in the Deep Learning Era: An In-Depth Survey54
GCP: Graph Encoder With Content-Planning for Sentence Generation From Knowledge Bases54
Few-Shot Partial Multi-View Learning53
Variational Nested Dropout53
Temporal Pixel-Level Semantic Understanding Through the VSPW Dataset53
Revisiting 2D Convolutional Neural Networks for Graph-Based Applications53
Few-Shot Multi-Agent Perception With Ranking-Based Feature Learning53
Bayesian Joint Matrix Decomposition for Data Integration with Heterogeneous Noise53
On Exploring Multiplicity of Primitives and Attributes for Texture Recognition in the Wild52
Bridging the Gap Between Computational Photography and Visual Recognition52
Learning on Attribute-Missing Graphs52
Towards Accurate and Compact Architectures via Neural Architecture Transformer52
0.46045708656311