IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The median citation count of IEEE Transactions on Pattern Analysis and Machine Intelligence is 4. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
Article | Citations
Squeeze-and-Excitation Networks | 3141
OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields | 2006
Deep High-Resolution Representation Learning for Visual Recognition | 1571
Res2Net: A New Multi-Scale Backbone Architecture | 1399
Image Segmentation Using Deep Learning: A Survey | 1077
Deep Learning for 3D Point Clouds: A Survey | 826
Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey | 799
A Survey on Vision Transformer | 756
Deep Learning for Image Super-Resolution: A Survey | 753
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding | 698
Deep Learning for Person Re-Identification: A Survey and Outlook | 689
GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild | 664
Event-Based Vision: A Survey | 663
Cascade R-CNN: High Quality Object Detection and Instance Segmentation | 622
U2Fusion: A Unified Unsupervised Image Fusion Network | 580
Tensor Robust Principal Component Analysis with a New Tensor Nuclear Norm | 498
Residual Dense Network for Image Restoration | 441
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning | 437
Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection | 418
Meta-Learning in Neural Networks: A Survey | 397
Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs | 379
A continual learning survey: Defying forgetting in classification tasks | 378
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer | 373
Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition | 342
Recent Advances in Open Set Recognition: A Survey | 338
Normalizing Flows: An Introduction and Review of Current Methods | 337
Plug-and-Play Image Restoration With Deep Denoiser Prior | 290
Hierarchical Fully Convolutional Network for Joint Atrophy Localization and Alzheimer's Disease Diagnosis Using Structural MRI | 285
A Style-Based Generator Architecture for Generative Adversarial Networks | 272
Deep Multi-View Enhancement Hashing for Image Retrieval | 270
A Review of Domain Adaptation without Target Labels | 266
Salient Object Detection in the Deep Learning Era: An In-Depth Survey | 266
The ApolloScape Open Dataset for Autonomous Driving and Its Application | 256
Imbalance Problems in Object Detection: A Review | 255
Image Super-Resolution Via Iterative Refinement | 254
Convolutional Networks with Dense Connectivity | 236
Multi-Task Learning for Dense Prediction Tasks: A Survey | 232
Joint Rain Detection and Removal from a Single Image with Contextualized Deep Networks | 228
Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks | 216
NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and Localization | 207
Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era | 207
Detection and Tracking Meet Drones Challenge | 204
YOLACT++ Better Real-Time Instance Segmentation | 197
Prior Guided Feature Enrichment Network for Few-Shot Segmentation | 191
Contextual Transformer Networks for Visual Recognition | 190
Deep Audio-Visual Speech Recognition | 188
Deep Imbalanced Learning for Face Recognition and Attribute Prediction | 183
High Speed and High Dynamic Range Video with an Event Camera | 181
FakeCatcher: Detection of Synthetic Portrait Videos using Biological Signals | 179
MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement | 176
Semi-Supervised Semantic Segmentation With High- and Low-Level Consistency | 173
Dynamic Neural Networks: A Survey | 170
Revisiting Video Saliency Prediction in the Deep Learning Era | 165
Concealed Object Detection | 161
InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs | 161
CCNet: Criss-Cross Attention for Semantic Segmentation | 160
Low-Light Image and Video Enhancement Using Deep Learning: A Survey | 159
Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos | 156
Maximum Density Divergence for Domain Adaptation | 154
Beyond Self-Attention: External Attention Using Two Linear Layers for Visual Tasks | 152
Domain Generalization: A Survey | 149
Learning to Enhance Low-Light Image via Zero-Reference Deep Curve Estimation | 146
Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges | 143
Every Pixel Counts ++: Joint Learning of Geometry and Motion with 3D Holistic Understanding | 141
ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training | 141
Learning Depth with Convolutional Spatial Propagation Network | 139
A Comprehensive Analysis of Deep Regression | 139
Inferring Salient Objects from Human Fixations | 134
Spatiotemporal Co-Attention Recurrent Neural Networks for Human-Skeleton Motion Prediction | 133
Constructing Stronger and Faster Baselines for Skeleton-Based Action Recognition | 132
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method | 130
Confidence Propagation through CNNs for Guided Sparse Depth Regression | 128
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes | 128
Diffusion Models in Vision: A Survey | 128
Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking | 124
Effects of Image Degradation and Degradation Removal to CNN-Based Image Classification | 124
Unsupervised Tracklet Person Re-Identification | 124
Video Anomaly Detection with Sparse Coding Inspired Deep Neural Networks | 123
Single Image Deraining: From Model-Based to Data-Driven and Beyond | 122
Coherence Constrained Graph LSTM for Group Activity Recognition | 121
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models | 120
Robust Low-Rank Tensor Recovery with Rectification and Alignment | 119
MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video | 119
Human Action Recognition From Various Data Modalities: A Review | 118
Neural Image Compression for Gigapixel Histopathology Image Analysis | 116
Densely Residual Laplacian Super-Resolution | 115
GAN Inversion: A Survey | 115
ArcFace: Additive Angular Margin Loss for Deep Face Recognition | 115
Hiding Images within Images | 114
SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing | 112
Direction-Aware Spatial Context Features for Shadow Detection and Removal | 111
Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation | 110
Weakly Supervised Object Localization and Detection: A Survey | 106
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images | 104
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval | 104
Real-Time Scene Text Detection With Differentiable Binarization and Adaptive Scale Fusion | 104
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning | 104
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation | 103
Skeleton-Based Online Action Prediction Using Scale Selection Network | 103
A Survey on Curriculum Learning | 103
Unsupervised Person Re-Identification by Deep Asymmetric Metric Embedding | 102
Negation of the Quantum Mass Function for Multisource Quantum Information Fusion With its Application to Pattern Classification | 99
Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods | 98
NAS-FAS: Static-Dynamic Central Difference Network Search for Face Anti-Spoofing | 98
Self-Correction for Human Parsing | 98
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion | 98
AbdomenCT-1K: Is Abdominal Organ Segmentation a Solved Problem? | 98
Graph U-Nets | 97
Hierarchical Long Short-Term Concurrent Memory for Human Interaction Recognition | 97
The Emerging Trends of Multi-Label Learning | 95
Fine-Grained Image Analysis With Deep Learning: A Survey | 95
Graph Neural Networks with Convolutional ARMA Filters | 94
MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation | 94
Deep Residual Correction Network for Partial Domain Adaptation | 94
Learning Generalisable Omni-Scale Representations for Person Re-Identification | 92
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D | 92
PaMIR: Parametric Model-Conditioned Implicit Representation for Image-Based Human Reconstruction | 92
DeepFake Detection Based on Discrepancies Between Faces and Their Context | 92
Class-Incremental Learning: Survey and Performance Evaluation on Image Classification | 91
Unsupervised Learning of a Hierarchical Spiking Neural Network for Optical Flow Estimation: From Events to Global Motion Perception | 91
Spherical Kernel for Efficient Graph Convolution on 3D Point Clouds | 91
Adversarial Cross-Spectral Face Completion for NIR-VIS Face Recognition | 91
Learning Representations for Neural Network-Based Classification Using the Information Bottleneck Principle | 90
Siamese Network for RGB-D Salient Object Detection and Beyond | 88
A Curriculum Domain Adaptation Approach to the Semantic Segmentation of Urban Scenes | 87
Symbiotic Graph Neural Networks for 3D Skeleton-Based Human Action Recognition and Motion Prediction | 87
A Comprehensive Survey of Scene Graphs: Generation and Application | 86
Disentangling Light Fields for Super-Resolution and Disparity Estimation | 86
Person Re-Identification by Contour Sketch Under Moderate Clothing Change | 86
A Novel Approach to Large-Scale Dynamically Weighted Directed Network Representation | 86
RGB-D SLAM in Dynamic Environments Using Point Correlations | 85
CTNet: Context-Based Tandem Network for Semantic Segmentation | 84
A Review on Deep Learning Techniques for Video Prediction | 84
From Show to Tell: A Survey on Deep Learning-Based Image Captioning | 84
Saliency Prediction in the Deep Learning Era: Successes and Limitations | 84
A Lightweight Optical Flow CNN —Revisiting Data Fidelity and Regularization | 83
Video-based Facial Micro-Expression Analysis: A Survey of Datasets, Features and Algorithms | 82
Physics-Based Generative Adversarial Models for Image Restoration and Beyond | 82
Structured Knowledge Distillation for Dense Prediction | 82
Infinite Feature Selection: A Graph-based Feature Filtering Approach | 82
ZeroNAS: Differentiable Generative Adversarial Networks Search for Zero-Shot Learning | 81
Multiset Feature Learning for Highly Imbalanced Data Classification | 81
Dual Encoding for Video Retrieval by Text | 81
XSleepNet: Multi-View Sequential Model for Automatic Sleep Staging | 80
An End-to-End Learning Framework for Video Compression | 80
Learning to Match Anchors for Visual Object Detection | 80
Segmenting Objects From Relational Visual Data | 78
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild | 78
Synthesizing Supervision for Learning Deep Saliency Network without Human Annotation | 77
Where and How to Transfer: Knowledge Aggregation-Induced Transferability Perception for Unsupervised Domain Adaptation | 76
Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition | 76
Deep Convolutional Neural Network for Multi-Modal Image Restoration and Fusion | 76
High-Dimensional Dense Residual Convolutional Neural Network for Light Field Reconstruction | 76
Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation | 76
PhlatCam: Designed Phase-Mask Based Thin Lensless Camera | 75
Robust Multi-View Clustering With Incomplete Information | 75
Paying Attention to Video Object Pattern Understanding | 75
Salient Object Detection via Integrity Learning | 73
Augmentation Invariant and Instance Spreading Feature for Softmax Embedding | 73
Tensor Low-Rank Representation for Data Recovery and Clustering | 73
End-to-End Active Object Tracking and Its Real-World Deployment via Reinforcement Learning | 73
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time | 72
Enhanced Tensor RPCA and its Application | 70
Long-Term Visual Localization Revisited | 69
SensitiveNets: Learning Agnostic Representations with Application to Face Images | 68
Distilled Siamese Networks for Visual Tracking | 68
Divergence-Agnostic Unsupervised Domain Adaptation by Adversarial Attacks | 68
Context-Aware Visual Policy Network for Fine-Grained Image Captioning | 68
A Bayesian Formulation of Coherent Point Drift | 68
Weakly Supervised Object Detection Using Proposal- and Semantic-Level Relationships | 68
Semi-Supervised Multi-View Deep Discriminant Representation Learning | 67
P-CNN: Part-Based Convolutional Neural Networks for Fine-Grained Visual Categorization | 67
Source Data-absent Unsupervised Domain Adaptation through Hypothesis Transfer and Labeling Transfer | 67
Nonlinear Regression via Deep Negative Correlation Learning | 67
Learning a Fixed-Length Fingerprint Representation | 67
Multi-Source Causal Feature Selection | 66
Explainability in Graph Neural Networks: A Taxonomic Survey | 66
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution | 65
Adversarial Reciprocal Points Learning for Open Set Recognition | 65
Deep Hough Transform for Semantic Line Detection | 65
P2T: Pyramid Pooling Transformer for Scene Understanding | 65
Deep Long-Tailed Learning: A Survey | 65
VOLO: Vision Outlooker for Visual Recognition | 64
Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition | 64
A Review of Generalized Zero-Shot Learning Methods | 64
Deep ROC Analysis and AUC as Balanced Average Accuracy, for Improved Classifier Selection, Audit and Explanation | 64
Deep Clustering: On the Link Between Discriminative Models and K-Means | 63
Re-thinking Co-Salient Object Detection | 63
Neural Architecture Transfer | 63
Object Detection in Videos by High Quality Object Linking | 63
Parallax Attention for Unsupervised Stereo Correspondence Learning | 62
Uncertainty Inspired RGB-D Saliency Detection | 62
End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform | 62
Auto-Pytorch: Multi-Fidelity MetaLearning for Efficient and Robust AutoDL | 62
Heterogeneous Graph Attention Network for Unsupervised Multiple-Target Domain Adaptation | 62
Self-Supervised Learning of Graph Neural Networks: A Unified Review | 62
Learning Enriched Features for Fast Image Restoration and Enhancement | 62
Bayesian Temporal Factorization for Multidimensional Time Series Prediction | 62
Line Graph Neural Networks for Link Prediction | 61
Stereo Matching Using Multi-Level Cost Volume and Multi-Scale Feature Constancy | 59
Fast and Robust Iterative Closest Point | 59
Learning Part-based Convolutional Features for Person Re-Identification | 58
Trusted Multi-View Classification With Dynamic Evidential Fusion | 58
Neural Sensors: Learning Pixel Exposures for HDR Imaging and Video Compressive Sensing With Programmable Sensors | 58
Graph Neural Networks in Network Neuroscience | 57
Kernel-Based Density Map Generation for Dense Object Counting | 57
Cascaded Parsing of Human-Object Interaction Recognition | 57
Transfer Learning in Deep Reinforcement Learning: A Survey | 57
GaitSet: Cross-view Gait Recognition through Utilizing Gait as a Deep Set | 57
On Learning Disentangled Representations for Gait Recognition | 56
Real-World Image Denoising with Deep Boosting | 56
Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation | 56
The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines | 55
Towards Robust Discriminative Projections Learning via Non-Greedy ℓ2,1-Norm MinMax | 55
A Topological Loss Function for Deep-Learning Based Image Segmentation Using Persistent Homology | 55
BlockQNN: Efficient Block-Wise Neural Network Architecture Generation | 54
Learning to Compose and Reason with Language Tree Structures for Visual Grounding | 54
Intel® RealSense™ SR300 Coded Light Depth Camera | 54
Visual Camera Re-Localization from RGB and RGB-D Images Using DSAC | 54
CoRRN: Cooperative Reflection Removal Network | 53
Unsupervised Grouped Axial Data Modeling via Hierarchical Bayesian Nonparametric Models With Watson Distributions | 53
Towards A Weakly Supervised Framework for 3D Point Cloud Object Detection and Annotation | 52
Hypergraph Learning: Methods and Practices | 52
Deep Gait Recognition: A Survey | 52
Simultaneous Fidelity and Regularization Learning for Image Restoration | 52
DeepMIH: Deep Invertible Network for Multiple Image Hiding | 52
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition | 52
MobileSal: Extremely Efficient RGB-D Salient Object Detection | 52
Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling | 52
Attention-Based Dropout Layer for Weakly Supervised Single Object Localization and Semantic Segmentation | 51
The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers | 51
What and How: Generalized Lifelong Spectral Clustering via Dual Memory | 51
A Hybrid RNN-HMM Approach for Weakly Supervised Temporal Action Segmentation | 51
Map-Guided Curriculum Domain Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image Segmentation | 51
Higher-Order Explanations of Graph Neural Networks via Relevant Walks | 51
Neural Granger Causality | 51
Unsupervised Domain Adaptation for Depth Prediction from Images | 51
Symbiotic Attention for Egocentric Action Recognition With Object-Centric Alignment | 51
Subspace Clustering via Good Neighbors | 50
On the Synergies between Machine Learning and Binocular Stereo for Depth Estimation from Images: a Survey | 50
On the Convergence of Learning-Based Iterative Methods for Nonconvex Inverse Problems | 50
On Multi-Layer Basis Pursuit, Efficient Algorithms and Convolutional Neural Networks | 50
A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video | 50
Cross-Domain Facial Expression Recognition: A Unified Evaluation Benchmark and Adversarial Graph Learning | 50
MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network | 50
Inferring Point Cloud Quality via Graph Similarity | 50
Hyperbolic Deep Neural Networks: A Survey | 49
Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation | 49
Defocus Blur Detection via Multi-Stream Bottom-Top-Bottom Network | 49
Learning End-to-End Lossy Image Compression: A Benchmark | 49