IEEE Transactions on Circuits and Systems for Video Technology

Papers
(The median citation count of IEEE Transactions on Circuits and Systems for Video Technology is 4. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
Overview of the Versatile Video Coding (VVC) Standard and its Applications798
Image De-Raining Using a Conditional Generative Adversarial Network584
Real-World Underwater Enhancement: Challenges, Benchmarks, and Solutions Under Natural Light407
Multimodal Transformer With Multi-View Visual Representation for Image Captioning285
Channel-Wise and Spatial Feature Modulation Network for Single Image Super-Resolution212
Feature Refinement and Filter Network for Person Re-Identification188
Task-Adaptive Attention for Image Captioning183
SwinNet: Swin Transformer Drives Edge-Aware RGB-D and RGB-T Salient Object Detection176
RetinexDIP: A Unified Deep Framework for Low-Light Image Enhancement172
Multi-Scale Metric Learning for Few-Shot Learning172
Learning a Deep Multi-Scale Feature Ensemble and an Edge-Attention Guidance for Image Fusion164
Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals151
ECFFNet: Effective and Consistent Feature Fusion Network for RGB-T Salient Object Detection150
Low-Light Image Enhancement via Progressive-Recursive Network137
A Decade Survey of Content Based Image Retrieval Using Deep Learning134
Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition131
Drone-Based RGB-Infrared Cross-Modality Vehicle Detection Via Uncertainty-Aware Learning130
Richly Activated Graph Convolutional Network for Robust Skeleton-Based Action Recognition129
Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition123
Target Oriented Perceptual Adversarial Fusion Network for Underwater Image Enhancement123
Image Description With Polar Harmonic Fourier Moments121
Underwater Image Enhancement Quality Evaluation: Benchmark Dataset and Objective Metric111
Low-Rank Tensor Graph Learning for Multi-View Subspace Clustering110
Lossy Point Cloud Geometry Compression via End-to-End Learning109
Hierarchical Graph Neural Networks for Few-Shot Learning108
Detecting Compressed Deepfake Videos in Social Networks Using Frame-Temporality Two-Stream Convolutional Network108
Each Part Matters: Local Patterns Facilitate Cross-View Geo-Localization107
PSCC-Net: Progressive Spatio-Channel Correlation Network for Image Manipulation Detection and Localization107
Lightweight Image Super-Resolution With Expectation-Maximization Attention Mechanism102
Wireless Image Transmission Using Deep Source Channel Coding With Attention Modules102
A Simple Local Minimal Intensity Prior and an Improved Algorithm for Blind Image Deblurring100
Unified Information Fusion Network for Multi-Modal RGB-D and RGB-T Salient Object Detection98
Watermarking Neural Networks With Watermarked Images97
Perceptual Underwater Image Enhancement With Deep Learning and Physical Priors97
Efficient and Model-Based Infrared and Visible Image Fusion via Algorithm Unrolling97
CGFNet: Cross-Guided Fusion Network for RGB-T Salient Object Detection96
Revisiting Feature Fusion for RGB-T Salient Object Detection95
Real-Time Video Emotion Recognition Based on Reinforcement Learning and Domain Knowledge95
Edge-Guided Non-Local Fully Convolutional Network for Salient Object Detection92
Region-Aware Image Captioning via Interaction Learning92
Attention-Driven Loss for Anomaly Detection in Video Surveillance92
Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection for Autonomous Driving90
Double Parameters Fractal Sorting Matrix and Its Application in Image Encryption90
Grayscale Enhancement Colorization Network for Visible-Infrared Person Re-Identification88
Multi-Level Fusion and Attention-Guided CNN for Image Dehazing88
Dual-Level Representation Enhancement on Characteristic and Context for Image-Text Retrieval87
UrbanLF: A Comprehensive Light Field Dataset for Semantic Segmentation of Urban Scenes86
A Variational Framework for Underwater Image Dehazing and Deblurring86
A Survey of Multiple Pedestrian Tracking Based on Tracking-by-Detection Framework82
Underwater Image Co-Enhancement With Correlation Feature Matching and Joint Learning81
Camouflaged Object Detection via Context-Aware Cross-Level Fusion81
Infrared and Visible Image Fusion via Texture Conditional Generative Adversarial Network79
DATFuse: Infrared and Visible Image Fusion via Dual Attention Transformer78
Fuzzy Integral-Based CNN Classifier Fusion for 3D Skeleton Action Recognition78
RefineDet++: Single-Shot Refinement Neural Network for Object Detection76
Blind Omnidirectional Image Quality Assessment With Viewport Oriented Graph Convolutional Networks75
Intra Prediction and Mode Coding in VVC75
Concealed Attack for Robust Watermarking Based on Generative Model and Perceptual Loss75
A Transformer-Based Feature Segmentation and Region Alignment Method for UAV-View Geo-Localization74
Multi-Purpose Oriented Single Nighttime Image Haze Removal Based on Unified Variational Retinex Model74
Efficient Context-Guided Stacked Refinement Network for RGB-T Salient Object Detection73
Block Partitioning Structure in the VVC Standard73
Deep Joint Depth Estimation and Color Correction From Monocular Underwater Images Based on Unsupervised Adaptation Networks72
Deep Spatial-Spectral Subspace Clustering for Hyperspectral Image72
PQA-Net: Deep No Reference Point Cloud Quality Assessment via Multi-View Projection71
Recursive Neural Network for Video Deblurring71
Color Cast Dependent Image Dehazing via Adaptive Airlight Refinement and Non-Linear Color Balancing71
Normality Learning in Multispace for Video Anomaly Detection71
A Hybrid Compression Framework for Color Attributes of Static 3D Point Clouds71
Multiscale Densely-Connected Fusion Networks for Hyperspectral Images Classification70
VVC In-Loop Filters70
A Robust GAN-Generated Face Detection Method Based on Dual-Color Spaces and an Improved Xception70
MDCN: Multi-Scale Dense Cross Network for Image Super-Resolution68
Decomposition Makes Better Rain Removal: An Improved Attention-Guided Deraining Network67
Coverless Image Steganography Based on Multi-Object Recognition67
No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models67
UAV-Satellite View Synthesis for Cross-View Geo-Localization66
Causal Contextual Prediction for Learned Image Compression65
MSTA-Net: Forgery Detection by Generating Manipulation Trace Based on Multi-Scale Self-Texture Attention65
GUDCP: Generalization of Underwater Dark Channel Prior for Underwater Image Restoration65
No-Reference Light Field Image Quality Assessment Based on Spatial-Angular Measurement64
Encoder-Decoder With Cascaded CRFs for Semantic Segmentation64
Reversible Data Hiding With Hierarchical Embedding for Encrypted Images64
Learning Video Moment Retrieval Without a Single Annotated Video64
UNFusion: A Unified Multi-Scale Densely Connected Network for Infrared and Visible Image Fusion64
High-Order Interaction Learning for Image Captioning63
A Perception-Aware Decomposition and Fusion Framework for Underwater Image Enhancement63
Learning Dual Semantic Relations With Graph Attention for Image-Text Matching63
Transform Coding in the VVC Standard62
Attention-Guided Global-Local Adversarial Learning for Detail-Preserving Multi-Exposure Image Fusion62
Spatiotemporal Multimodal Learning With 3D CNNs for Video Action Recognition62
Implicit Dual-Domain Convolutional Network for Robust Color Image Compression Artifact Reduction62
Deep Template-Based Watermarking61
Robust High-Capacity Watermarking Over Online Social Network Shared Images61
Learning to Score Figure Skating Sport Videos60
Light-Guided and Cross-Fusion U-Net for Anti-Illumination Image Super-Resolution60
Multiscale Low-Light Image Enhancement Network With Illumination Constraint59
Cross-View Gait Recognition Using Pairwise Spatial Transformer Networks59
HF-TPE: High-Fidelity Thumbnail- Preserving Encryption59
SiamCDA: Complementarity- and Distractor-Aware RGB-T Tracking Based on Siamese Network58
Multi-View Spectral Clustering Tailored Tensor Low-Rank Representation58
E2I: Generative Inpainting From Edge to Image58
AO2-DETR: Arbitrary-Oriented Object Detection Transformer58
Adaptive Region Proposal With Channel Regularization for Robust Object Tracking57
IID-Net: Image Inpainting Detection Network via Neural Architecture Search and Attention57
Deep Multiscale Fusion Hashing for Cross-Modal Retrieval57
Stereoscopic Image Description With Trinion Fractional-Order Continuous Orthogonal Moments57
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection57
TrajectoryCNN: A New Spatio-Temporal Feature Learning Network for Human Motion Prediction57
Estimating Generalized Gaussian Blur Kernels for Out-of-Focus Image Deblurring56
Semantic-Aware Occlusion-Robust Network for Occluded Person Re-Identification56
AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization56
Deep Transfer Hashing for Image Retrieval56
Rethinking Triplet Loss for Domain Adaptation56
Meta-Learning-Based Incremental Few-Shot Object Detection56
Deep Texture-Aware Features for Camouflaged Object Detection56
Layer-Specific Optimization for Mixed Data Flow With Mixed Precision in FPGA Design for CNN-Based Object Detectors55
No-Reference Quality Assessment for 360-Degree Images by Analysis of Multifrequency Information and Local-Global Naturalness54
Color Transferred Convolutional Neural Networks for Image Dehazing54
SAC-Net: Spatial Attenuation Context for Salient Object Detection54
End-to-End Learning Deep CRF Models for Multi-Object Tracking Deep CRF Models53
Robust Reversible Watermarking in Encrypted Image With Secure Multi-Party Based on Lightweight Cryptography53
Exploring Stable Coefficients on Joint Sub-Bands for Robust Video Watermarking in DT CWT Domain52
SiamFPN: A Deep Learning Method for Accurate and Real-Time Maritime Ship Tracking52
Syntax-Guided Hierarchical Attention Network for Video Captioning52
Video Captioning Using Global-Local Representation52
Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection52
Student Network Learning via Evolutionary Knowledge Distillation52
Blindly Assess Quality of In-the-Wild Videos via Quality-Aware Pre-Training and Motion Perception52
Quantization and Entropy Coding in the Versatile Video Coding (VVC) Standard52
RGBT Salient Object Detection: Benchmark and A Novel Cooperative Ranking Approach51
Learning Generalized Spatial-Temporal Deep Feature Representation for No-Reference Video Quality Assessment51
Local-Global Temporal Difference Learning for Satellite Video Super-Resolution51
RGBT Tracking by Trident Fusion Network51
Triple Adversarial Learning and Multi-View Imaginative Reasoning for Unsupervised Domain Adaptation Person Re-Identification51
Underwater Image Enhancement via Weighted Wavelet Visual Perception Fusion50
A Two-Stage Attentive Network for Single Image Super-Resolution50
DBCFace: Towards Pure Convolutional Neural Network Face Detection50
Robust Texture Description Using Local Grouped Order Pattern and Non-Local Binary Pattern50
SACF-Net: Skip-Attention Based Correspondence Filtering Network for Point Cloud Registration50
Event-Centric Hierarchical Representation for Dense Video Captioning49
Image Steganography With Symmetric Embedding Using Gaussian Markov Random Field Model49
Context-Aware Mixup for Domain Adaptive Semantic Segmentation49
Multi-MSB Compression Based Reversible Data Hiding Scheme in Encrypted Images49
Multi-Grained Attention Networks for Single Image Super-Resolution49
Occupancy-Map-Based Rate Distortion Optimization and Partition for Video-Based Point Cloud Compression49
Multi-Scale Neighborhood Feature Extraction and Aggregation for Point Cloud Segmentation49
Deep Cross-Modal Representation Learning and Distillation for Illumination-Invariant Pedestrian Detection49
Source-Free Open Compound Domain Adaptation in Semantic Segmentation49
VVC Complexity and Software Implementation Analysis49
SCGAN: Saliency Map-Guided Colorization With Generative Adversarial Network48
AM³Net: Adaptive Mutual-Learning-Based Multimodal Data Fusion Network48
Multi-Graph Fusion and Learning for RGBT Image Saliency Detection48
Depth-Aware Multi-Grid Deep Homography Estimation With Contextual Correlation47
Tensorial Multi-View Clustering via Low-Rank Constrained High-Order Graph Learning47
TSAN: Synthesized View Quality Enhancement via Two-Stream Attention Network for 3D-HEVC47
Efficient Reversible Data Hiding for JPEG Images With Multiple Histograms Modification47
Sequential and Patch Analyses for Object Removal Video Forgery Detection and Localization47
Transformer3D-Det: Improving 3D Object Detection by Vote Refinement46
Cross-SRN: Structure-Preserving Super-Resolution Network With Cross Convolution46
Hierarchical Feature Fusion With Mixed Convolution Attention for Single Image Dehazing46
Model Compression Using Progressive Channel Pruning46
Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene Graph46
Revisiting Modality-Specific Feature Compensation for Visible-Infrared Person Re-Identification46
Cross-Domain Complementary Learning Using Pose for Multi-Person Part Segmentation46
Secure Robust JPEG Steganography Based on AutoEncoder With Adaptive BCH Encoding46
Hyperspectral Image Super-Resolution via Deep Prior Regularization With Parameter Estimation45
Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework With Spatio-Temporal Collaboration45
Rethinking Camouflaged Object Detection: Models and Datasets45
High-Quality R-CNN Object Detection Using Multi-Path Detection Calibration Network45
Bi-Directional Progressive Guidance Network for RGB-D Salient Object Detection44
Joint Anchor-Feature Refinement for Real-Time Accurate Object Detection in Images and Videos44
Deep Adversarial Data Augmentation for Extremely Low Data Regimes44
An Iterative Threshold Algorithm of Log-Sum Regularization for Sparse Problem44
DesnowGAN: An Efficient Single Image Snow Removal Framework Using Cross-Resolution Lateral Connection and GANs44
Facial Expression Recognition With Two-Branch Disentangled Generative Adversarial Network44
Deep Adaptively-Enhanced Hashing With Discriminative Similarity Guidance for Unsupervised Cross-Modal Retrieval43
RGB-T Semantic Segmentation With Location, Activation, and Sharpening43
Human-Centric Spatio-Temporal Video Grounding With Visual Transformers43
Incomplete Descriptor Mining With Elastic Loss for Person Re-Identification43
Learning From Synthetic Shadows for Shadow Detection and Removal43
Influence-Aware Attention Networks for Anomaly Detection in Surveillance Videos42
Asynchronous Updating Boolean Network Encryption Algorithm42
Wavelet-Based Deep Auto Encoder-Decoder (WDAED)-Based Image Compression42
Exploring Dense Context for Salient Object Detection42
INENet: Inliers Estimation Network With Similarity Learning for Partial Overlapping Registration42
Multimodal Local-Global Attention Network for Affective Video Content Analysis42
Perceptual Quality Assessment for Screen Content Images by Spatial Continuity42
Boosting Few-Shot Fine-Grained Recognition With Background Suppression and Foreground Alignment42
Progressive Meta-Learning With Curriculum42
Overview of the Screen Content Support in VVC: Applications, Coding Tools, and Performance41
Noise Augmented Double-Stream Graph Convolutional Networks for Image Captioning41
VDM-DA: Virtual Domain Modeling for Source Data-Free Domain Adaptation41
Multi-View Spatial Attention Embedding for Vehicle Re-Identification41
A Multi-Task Collaborative Network for Light Field Salient Object Detection41
Joint Expression Synthesis and Representation Learning for Facial Expression Recognition41
A Camera Shooting Resilient Watermarking Scheme for Underpainting Documents41
Bridge-GAN: Interpretable Representation Learning for Text-to-Image Synthesis41
Local and Global Perception Generative Adversarial Network for Facial Expression Synthesis41
CGMDRNet: Cross-Guided Modality Difference Reduction Network for RGB-T Salient Object Detection40
TOAN: Target-Oriented Alignment Network for Fine-Grained Image Categorization With Few Labeled Samples40
RAPT360: Reinforcement Learning-Based Rate Adaptation for 360-Degree Video Streaming With Adaptive Prediction and Tiling40
Detail-Preserving Multi-Exposure Fusion With Edge-Preserving Structural Patch Decomposition40
Subjective and Objective De-Raining Quality Assessment Towards Authentic Rain Image40
Adversarial Decoupling and Modality-Invariant Representation Learning for Visible-Infrared Person Re-Identification40
HG-FCN: Hierarchical Grid Fully Convolutional Network for Fast VVC Intra Coding40
Pyramid Global Context Network for Image Dehazing39
Stage-Aware Feature Alignment Network for Real-Time Semantic Segmentation of Street Scenes39
Single Image Brightening via Multi-Scale Exposure Fusion With Hybrid Learning39
Deep Sub-Region Network for Salient Object Detection39
Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds39
Lossless Coding of Point Cloud Geometry Using a Deep Generative Model39
Depth Estimation Using a Self-Supervised Network Based on Cross-Layer Feature Fusion and the Quadtree Constraint39
Occlusion-Sensitive Person Re-Identification via Attribute-Based Shift Attention39
A Weakly Supervised Learning Framework for Salient Object Detection via Hybrid Labels39
Perceptual Image Hashing for Content Authentication Based on Convolutional Neural Network With Multiple Constraints39
Configurable Fast Block Partitioning for VVC Intra Coding Using Light Gradient Boosting Machine39
Subjective Quality Database and Objective Study of Compressed Point Clouds With 6DoF Head-Mounted Display38
Food and Ingredient Joint Learning for Fine-Grained Recognition38
Adaptive Multilayer Perceptual Attention Network for Facial Expression Recognition38
Neural Video Coding Using Multiscale Motion Compensation and Spatiotemporal Context Model38
3D Face Anti-Spoofing With Factorized Bilinear Coding38
Learning Gated Non-Local Residual for Single-Image Rain Streak Removal38
Viewing Behavior Supported Visual Saliency Predictor for 360 Degree Videos38
A Common Method of Share Authentication in Image Secret Sharing38
Dynamic Attention Guided Multi-Trajectory Analysis for Single Object Tracking37
Complementary Discriminative Correlation Filters Based on Collaborative Representation for Visual Object Tracking37
Lightweight Modules for Efficient Deep Learning Based Image Restoration37
Multi-Person Hierarchical 3D Pose Estimation in Natural Videos37
Generalizable No-Reference Image Quality Assessment via Deep Meta-Learning37
Local Geometric Distortions Resilient Watermarking Scheme Based on Symmetry37
Feature Alignment and Aggregation Siamese Networks for Fast Visual Tracking37
Large-Scale Spatio-Temporal Person Re-Identification: Algorithms and Benchmark37
Video Compressed Sensing Using a Convolutional Neural Network37
Illumination Unification for Person Re-Identification37
Multi-Task SE-Network for Image Splicing Localization37
Photo-Realistic Image Super-Resolution via Variational Autoencoders36
Reversible Data Hiding in Encrypted Images Using Cipher-Feedback Secret Sharing36
Adaptive Path Selection for Dynamic Image Captioning36
Cryptanalysis of Image Ciphers With Permutation-Substitution Network and Chaos36
Long-Term Video Question Answering via Multimodal Hierarchical Memory Attentive Networks36
Adaptive Pairwise Prediction-Error Expansion and Multiple Histograms Modification for Reversible Data Hiding36
Incremental Learning of Multi-Domain Image-to-Image Translations36
MMMNet: An End-to-End Multi-Task Deep Convolution Neural Network With Multi-Scale and Multi-Hierarchy Fusion for Blind Image Quality Assessment36
MoADNet: Mobile Asymmetric Dual-Stream Networks for Real-Time and Lightweight RGB-D Salient Object Detection36
Motion Vector Coding and Block Merging in the Versatile Video Coding Standard36
Unsupervised Domain Adaptation via Importance Sampling36
Omnidirectional Image Quality Assessment by Distortion Discrimination Assisted Multi-Stream Network35
Reversible Data Hiding in Encrypted Image via Secret Sharing Based on GF(p) and GF(2⁸)35
Cross-View Recurrence-Based Self-Supervised Super-Resolution of Light Field35
0.17080807685852