IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The H4-Index of IEEE Transactions on Pattern Analysis and Machine Intelligence is 112. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-09-01 to 2024-09-01.)
ArticleCitations
OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields2220
Deep High-Resolution Representation Learning for Visual Recognition1931
Res2Net: A New Multi-Scale Backbone Architecture1674
Image Segmentation Using Deep Learning: A Survey1249
A Survey on Vision Transformer1130
Deep Learning for 3D Point Clouds: A Survey985
Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey924
Deep Learning for Image Super-Resolution: A Survey866
Deep Learning for Person Re-Identification: A Survey and Outlook853
Event-Based Vision: A Survey834
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding805
GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild799
U2Fusion: A Unified Unsupervised Image Fusion Network769
Cascade R-CNN: High Quality Object Detection and Instance Segmentation742
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning634
Residual Dense Network for Image Restoration506
Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection502
Meta-Learning in Neural Networks: A Survey501
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer486
A continual learning survey: Defying forgetting in classification tasks455
Normalizing Flows: An Introduction and Review of Current Methods445
Image Super-Resolution Via Iterative Refinement397
Recent Advances in Open Set Recognition: A Survey394
Plug-and-Play Image Restoration With Deep Denoiser Prior382
Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition373
A Style-Based Generator Architecture for Generative Adversarial Networks344
Salient Object Detection in the Deep Learning Era: An In-Depth Survey316
A Review of Domain Adaptation without Target Labels303
Deep Multi-View Enhancement Hashing for Image Retrieval298
Detection and Tracking Meet Drones Challenge294
Imbalance Problems in Object Detection: A Review293
The ApolloScape Open Dataset for Autonomous Driving and Its Application292
Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks287
Multi-Task Learning for Dense Prediction Tasks: A Survey275
Contextual Transformer Networks for Visual Recognition269
Diffusion Models in Vision: A Survey259
Convolutional Networks with Dense Connectivity255
Prior Guided Feature Enrichment Network for Few-Shot Segmentation247
NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and Localization244
Dynamic Neural Networks: A Survey240
Deep Audio-Visual Speech Recognition234
Domain Generalization: A Survey231
High Speed and High Dynamic Range Video with an Event Camera230
Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era230
YOLACT++ Better Real-Time Instance Segmentation227
Concealed Object Detection226
Semi-Supervised Semantic Segmentation With High- and Low-Level Consistency215
Deep Imbalanced Learning for Face Recognition and Attribute Prediction213
Low-Light Image and Video Enhancement Using Deep Learning: A Survey209
InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs209
Beyond Self-Attention: External Attention Using Two Linear Layers for Visual Tasks200
ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training197
FakeCatcher: Detection of Synthetic Portrait Videos using Biological Signals193
Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges192
MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement191
CCNet: Criss-Cross Attention for Semantic Segmentation187
Learning to Enhance Low-Light Image via Zero-Reference Deep Curve Estimation187
Revisiting Video Saliency Prediction in the Deep Learning Era186
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models186
Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos179
Maximum Density Divergence for Domain Adaptation178
Human Action Recognition From Various Data Modalities: A Review176
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method173
ArcFace: Additive Angular Margin Loss for Deep Face Recognition170
Constructing Stronger and Faster Baselines for Skeleton-Based Action Recognition167
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning163
Learning Depth with Convolutional Spatial Propagation Network159
A Survey on Curriculum Learning153
Every Pixel Counts ++: Joint Learning of Geometry and Motion with 3D Holistic Understanding152
A Comprehensive Analysis of Deep Regression151
SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing151
AbdomenCT-1K: Is Abdominal Organ Segmentation a Solved Problem?148
Spatiotemporal Co-Attention Recurrent Neural Networks for Human-Skeleton Motion Prediction146
Video Anomaly Detection with Sparse Coding Inspired Deep Neural Networks146
Effects of Image Degradation and Degradation Removal to CNN-Based Image Classification145
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes144
GAN Inversion: A Survey144
Class-Incremental Learning: Survey and Performance Evaluation on Image Classification142
Real-Time Scene Text Detection With Differentiable Binarization and Adaptive Scale Fusion141
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time139
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D139
Confidence Propagation through CNNs for Guided Sparse Depth Regression139
Fine-Grained Image Analysis With Deep Learning: A Survey138
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation138
Single Image Deraining: From Model-Based to Data-Driven and Beyond137
Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking136
MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video136
Graph U-Nets132
Coherence Constrained Graph LSTM for Group Activity Recognition131
The Emerging Trends of Multi-Label Learning130
Self-Correction for Human Parsing128
Robust Low-Rank Tensor Recovery with Rectification and Alignment127
Densely Residual Laplacian Super-Resolution126
Weakly Supervised Object Localization and Detection: A Survey125
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval125
Direction-Aware Spatial Context Features for Shadow Detection and Removal125
Neural Image Compression for Gigapixel Histopathology Image Analysis125
Robust Multi-View Clustering With Incomplete Information121
Negation of the Quantum Mass Function for Multisource Quantum Information Fusion With its Application to Pattern Classification121
NAS-FAS: Static-Dynamic Central Difference Network Search for Face Anti-Spoofing121
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion120
Learning Enriched Features for Fast Image Restoration and Enhancement120
PaMIR: Parametric Model-Conditioned Implicit Representation for Image-Based Human Reconstruction120
Transfer Learning in Deep Reinforcement Learning: A Survey118
A Comprehensive Survey of Scene Graphs: Generation and Application117
Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods117
Graph Neural Networks with Convolutional ARMA Filters117
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images116
Learning Generalisable Omni-Scale Representations for Person Re-Identification115
From Show to Tell: A Survey on Deep Learning-Based Image Captioning115
MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation115
DeepFake Detection Based on Discrepancies Between Faces and Their Context114
0.045639991760254