IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The H4-Index of IEEE Transactions on Pattern Analysis and Machine Intelligence is 112. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields2297
Deep High-Resolution Representation Learning for Visual Recognition2045
Res2Net: A New Multi-Scale Backbone Architecture1742
Image Segmentation Using Deep Learning: A Survey1304
A Survey on Vision Transformer1246
Deep Learning for 3D Point Clouds: A Survey1024
Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey965
Deep Learning for Person Re-Identification: A Survey and Outlook911
Deep Learning for Image Super-Resolution: A Survey908
Event-Based Vision: A Survey907
GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild838
U2Fusion: A Unified Unsupervised Image Fusion Network829
Cascade R-CNN: High Quality Object Detection and Instance Segmentation779
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning682
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer585
Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection528
Residual Dense Network for Image Restoration528
Meta-Learning in Neural Networks: A Survey528
Image Super-Resolution Via Iterative Refinement507
A continual learning survey: Defying forgetting in classification tasks504
Normalizing Flows: An Introduction and Review of Current Methods465
Plug-and-Play Image Restoration With Deep Denoiser Prior419
Recent Advances in Open Set Recognition: A Survey410
Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition374
A Style-Based Generator Architecture for Generative Adversarial Networks370
Salient Object Detection in the Deep Learning Era: An In-Depth Survey323
Diffusion Models in Vision: A Survey317
Detection and Tracking Meet Drones Challenge310
A Review of Domain Adaptation without Target Labels310
Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks304
Deep Multi-View Enhancement Hashing for Image Retrieval303
Imbalance Problems in Object Detection: A Review296
Multi-Task Learning for Dense Prediction Tasks: A Survey294
Contextual Transformer Networks for Visual Recognition292
Prior Guided Feature Enrichment Network for Few-Shot Segmentation275
Domain Generalization: A Survey272
Convolutional Networks with Dense Connectivity263
Concealed Object Detection255
Deep Audio-Visual Speech Recognition252
NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and Localization252
High Speed and High Dynamic Range Video with an Event Camera250
Dynamic Neural Networks: A Survey249
YOLACT++ Better Real-Time Instance Segmentation235
Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era233
Semi-Supervised Semantic Segmentation With High- and Low-Level Consistency231
Low-Light Image and Video Enhancement Using Deep Learning: A Survey225
InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs220
ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training216
Deep Imbalanced Learning for Face Recognition and Attribute Prediction215
Beyond Self-Attention: External Attention Using Two Linear Layers for Visual Tasks214
Learning to Enhance Low-Light Image via Zero-Reference Deep Curve Estimation206
Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges204
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models202
FakeCatcher: Detection of Synthetic Portrait Videos using Biological Signals201
MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement198
Revisiting Video Saliency Prediction in the Deep Learning Era191
CCNet: Criss-Cross Attention for Semantic Segmentation191
Human Action Recognition From Various Data Modalities: A Review190
ArcFace: Additive Angular Margin Loss for Deep Face Recognition186
Maximum Density Divergence for Domain Adaptation182
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning178
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method178
Constructing Stronger and Faster Baselines for Skeleton-Based Action Recognition177
A Survey on Curriculum Learning174
Class-Incremental Learning: Survey and Performance Evaluation on Image Classification174
AbdomenCT-1K: Is Abdominal Organ Segmentation a Solved Problem?167
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D166
Real-Time Scene Text Detection With Differentiable Binarization and Adaptive Scale Fusion163
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time161
GAN Inversion: A Survey158
SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing157
Fine-Grained Image Analysis With Deep Learning: A Survey153
Effects of Image Degradation and Degradation Removal to CNN-Based Image Classification151
Spatiotemporal Co-Attention Recurrent Neural Networks for Human-Skeleton Motion Prediction150
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes149
Video Anomaly Detection with Sparse Coding Inspired Deep Neural Networks148
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation146
PaMIR: Parametric Model-Conditioned Implicit Representation for Image-Based Human Reconstruction145
Transfer Learning in Deep Reinforcement Learning: A Survey141
The Emerging Trends of Multi-Label Learning141
Self-Correction for Human Parsing141
MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video140
Single Image Deraining: From Model-Based to Data-Driven and Beyond139
Learning Enriched Features for Fast Image Restoration and Enhancement138
Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking137
Graph U-Nets137
Coherence Constrained Graph LSTM for Group Activity Recognition134
Weakly Supervised Object Localization and Detection: A Survey134
Robust Multi-View Clustering With Incomplete Information132
Direction-Aware Spatial Context Features for Shadow Detection and Removal131
Densely Residual Laplacian Super-Resolution131
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion130
Neural Image Compression for Gigapixel Histopathology Image Analysis127
Robust Low-Rank Tensor Recovery with Rectification and Alignment127
SpectralGPT: Spectral Remote Sensing Foundation Model127
Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods127
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval126
NAS-FAS: Static-Dynamic Central Difference Network Search for Face Anti-Spoofing126
Multimodal Learning With Transformers: A Survey124
From Show to Tell: A Survey on Deep Learning-Based Image Captioning124
Negation of the Quantum Mass Function for Multisource Quantum Information Fusion With its Application to Pattern Classification123
A Comprehensive Survey of Scene Graphs: Generation and Application123
Graph Neural Networks with Convolutional ARMA Filters122
Deep Long-Tailed Learning: A Survey122
MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation122
CTNet: Context-Based Tandem Network for Semantic Segmentation120
DeepFake Detection Based on Discrepancies Between Faces and Their Context119
Learning Generalisable Omni-Scale Representations for Person Re-Identification118
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images117
Disentangling Light Fields for Super-Resolution and Disparity Estimation114
RGB-D SLAM in Dynamic Environments Using Point Correlations114
Deep Convolutional Neural Network for Multi-Modal Image Restoration and Fusion113
Person Re-Identification by Contour Sketch Under Moderate Clothing Change112
0.040610074996948