IEEE Transactions on Circuits and Systems for Video Technology

Papers
(The median citation count of IEEE Transactions on Circuits and Systems for Video Technology is 4. The table below lists the papers whose CrossRef citation counts are above that threshold [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-03-01 to 2024-03-01.)
Article | Citations
Overview of the Versatile Video Coding (VVC) Standard and its Applications | 520
Image De-Raining Using a Conditional Generative Adversarial Network | 496
Real-World Underwater Enhancement: Challenges, Benchmarks, and Solutions Under Natural Light | 299
Multimodal Transformer With Multi-View Visual Representation for Image Captioning | 234
A New Payload Partition Strategy in Color Image Steganography | 217
Image and Video Compression With Neural Networks: A Review | 209
Data Augmentation Using Random Image Cropping and Patching for Deep CNNs | 201
Video Summarization With Attention-Based Encoder–Decoder Networks | 188
Channel-Wise and Spatial Feature Modulation Network for Single Image Super-Resolution | 184
Low-Complexity CTU Partition Structure Decision and Fast Intra Mode Decision for Versatile Video Coding | 178
Feature Refinement and Filter Network for Person Re-Identification | 161
A Survey of Open-World Person Re-Identification | 153
PCC Net: Perspective Crowd Counting via Spatial Convolutional Network | 148
Multi-Scale Metric Learning for Few-Shot Learning | 141
Task-Adaptive Attention for Image Captioning | 135
Saliency-Aware Convolution Neural Network for Ship Detection in Surveillance Video | 132
Small Object Detection in Unmanned Aerial Vehicle Images Using Feature Fusion and Scaling-Based Single Shot Detector With Spatial Context Analysis | 123
ECFFNet: Effective and Consistent Feature Fusion Network for RGB-T Salient Object Detection | 122
RetinexDIP: A Unified Deep Framework for Low-Light Image Enhancement | 119
SwinNet: Swin Transformer Drives Edge-Aware RGB-D and RGB-T Salient Object Detection | 112
Low-Light Image Enhancement via Progressive-Recursive Network | 111
Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals | 110
Image Description With Polar Harmonic Fourier Moments | 110
Deep Virtual Reality Image Quality Assessment With Human Perception Guider for Omnidirectional Image | 104
Learning a Deep Multi-Scale Feature Ensemble and an Edge-Attention Guidance for Image Fusion | 104
Richly Activated Graph Convolutional Network for Robust Skeleton-Based Action Recognition | 101
Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition | 97
A Decade Survey of Content Based Image Retrieval Using Deep Learning | 95
Multi-Temporal Ultra Dense Memory Network for Video Super-Resolution | 92
Hierarchical Graph Neural Networks for Few-Shot Learning | 84
Lossy Point Cloud Geometry Compression via End-to-End Learning | 83
Revisiting Feature Fusion for RGB-T Salient Object Detection | 82
Low-Rank Tensor Graph Learning for Multi-View Subspace Clustering | 81
Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition | 81
A Simple Local Minimal Intensity Prior and an Improved Algorithm for Blind Image Deblurring | 81
Real-Time Video Emotion Recognition Based on Reinforcement Learning and Domain Knowledge | 80
Detecting Compressed Deepfake Videos in Social Networks Using Frame-Temporality Two-Stream Convolutional Network | 79
Region-Aware Image Captioning via Interaction Learning | 78
Lightweight Image Super-Resolution With Expectation-Maximization Attention Mechanism | 77
SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval | 77
Dual-Level Representation Enhancement on Characteristic and Context for Image-Text Retrieval | 75
Low Rank Component Induced Spatial-Spectral Kernel Method for Hyperspectral Image Classification | 75
Edge-Guided Non-Local Fully Convolutional Network for Salient Object Detection | 74
Attention-Driven Loss for Anomaly Detection in Video Surveillance | 74
Drone-Based RGB-Infrared Cross-Modality Vehicle Detection Via Uncertainty-Aware Learning | 74
Multiple Histograms-Based Reversible Data Hiding: Framework and Realization | 74
Target Oriented Perceptual Adversarial Fusion Network for Underwater Image Enhancement | 74
Double Parameters Fractal Sorting Matrix and Its Application in Image Encryption | 73
Unified Information Fusion Network for Multi-Modal RGB-D and RGB-T Salient Object Detection | 73
Perceptual Underwater Image Enhancement With Deep Learning and Physical Priors | 72
CGFNet: Cross-Guided Fusion Network for RGB-T Salient Object Detection | 72
Occluded Face Recognition in the Wild by Identity-Diversity Inpainting | 71
Underwater Image Enhancement Quality Evaluation: Benchmark Dataset and Objective Metric | 70
Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection for Autonomous Driving | 70
Unsupervised Blind Image Quality Evaluation via Statistical Measurements of Structure, Naturalness, and Perception | 69
Each Part Matters: Local Patterns Facilitate Cross-View Geo-Localization | 68
Watermarking Neural Networks With Watermarked Images | 68
UrbanLF: A Comprehensive Light Field Dataset for Semantic Segmentation of Urban Scenes | 67
Grayscale Enhancement Colorization Network for Visible-Infrared Person Re-Identification | 66
Intra Prediction and Mode Coding in VVC | 65
RefineDet++: Single-Shot Refinement Neural Network for Object Detection | 64
Deep Spatial-Spectral Subspace Clustering for Hyperspectral Image | 64
Wireless Image Transmission Using Deep Source Channel Coding With Attention Modules | 64
Fine-Grained Age Estimation in the Wild With Attention LSTM Networks | 64
Efficient and Model-Based Infrared and Visible Image Fusion via Algorithm Unrolling | 63
Recursive Neural Network for Video Deblurring | 61
Multi-Level Fusion and Attention-Guided CNN for Image Dehazing | 61
Concealed Attack for Robust Watermarking Based on Generative Model and Perceptual Loss | 61
Fuzzy Integral-Based CNN Classifier Fusion for 3D Skeleton Action Recognition | 61
Color Cast Dependent Image Dehazing via Adaptive Airlight Refinement and Non-Linear Color Balancing | 60
Blind Omnidirectional Image Quality Assessment With Viewport Oriented Graph Convolutional Networks | 60
A Survey of Multiple Pedestrian Tracking Based on Tracking-by-Detection Framework | 59
Block Partitioning Structure in the VVC Standard | 59
Underwater Image Co-Enhancement With Correlation Feature Matching and Joint Learning | 58
VVC In-Loop Filters | 58
Efficient Context-Guided Stacked Refinement Network for RGB-T Salient Object Detection | 58
Deep Joint Depth Estimation and Color Correction From Monocular Underwater Images Based on Unsupervised Adaptation Networks | 58
SDL: Spectrum-Disentangled Representation Learning for Visible-Infrared Person Re-Identification | 56
Detail-Enhanced Multi-Scale Exposure Fusion in YUV Color Space | 55
Beyond Scalar Neuron: Adopting Vector-Neuron Capsules for Long-Term Person Re-Identification | 54
High-Level Semantic Networks for Multi-Scale Object Detection | 54
MDCN: Multi-Scale Dense Cross Network for Image Super-Resolution | 54
PSCC-Net: Progressive Spatio-Channel Correlation Network for Image Manipulation Detection and Localization | 54
A Survey of Human Action Analysis in HRI Applications | 54
A Variational Framework for Underwater Image Dehazing and Deblurring | 54
Decomposition Makes Better Rain Removal: An Improved Attention-Guided Deraining Network | 53
Low CP Rank and Tucker Rank Tensor Completion for Estimating Missing Components in Image Data | 53
No-Reference Light Field Image Quality Assessment Based on Spatial-Angular Measurement | 53
A Robust GAN-Generated Face Detection Method Based on Dual-Color Spaces and an Improved Xception | 53
Infrared and Visible Image Fusion via Texture Conditional Generative Adversarial Network | 52
Multiscale Densely-Connected Fusion Networks for Hyperspectral Images Classification | 52
Action Recognition Scheme Based on Skeleton Representation With DS-LSTM Network | 52
Stereoscopic Image Description With Trinion Fractional-Order Continuous Orthogonal Moments | 52
Adaptive Region Proposal With Channel Regularization for Robust Object Tracking | 52
High-Order Interaction Learning for Image Captioning | 52
Streaming Video QoE Modeling and Prediction: A Long Short-Term Memory Approach | 50
Coverless Image Steganography Based on Multi-Object Recognition | 50
Color Transferred Convolutional Neural Networks for Image Dehazing | 50
The Joint Exploration Model (JEM) for Video Compression With Capability Beyond HEVC | 50
Learning Dual Semantic Relations With Graph Attention for Image-Text Matching | 50
Coupled Bilinear Discriminant Projection for Cross-View Gait Recognition | 50
A Hybrid Compression Framework for Color Attributes of Static 3D Point Clouds | 49
A View Synthesis-Based 360° VR Caching System Over MEC-Enabled C-RAN | 49
Cross-View Gait Recognition Using Pairwise Spatial Transformer Networks | 49
Multiple Robustness Enhancements for Image Adaptive Steganography in Lossy Channels | 49
MSTA-Net: Forgery Detection by Generating Manipulation Trace Based on Multi-Scale Self-Texture Attention | 49
Normality Learning in Multispace for Video Anomaly Detection | 49
A High-Capacity Reversible Data Hiding in Encrypted Images Employing Local Difference Predictor | 48
Centralized Large Margin Cosine Loss for Open-Set Deep Palmprint Recognition | 48
SAC-Net: Spatial Attenuation Context for Salient Object Detection | 48
Implicit Dual-Domain Convolutional Network for Robust Color Image Compression Artifact Reduction | 48
Deep Transfer Hashing for Image Retrieval | 47
Transform Coding in the VVC Standard | 47
E2I: Generative Inpainting From Edge to Image | 47
End-to-End Learning Deep CRF Models for Multi-Object Tracking Deep CRF Models | 47
Rethinking Triplet Loss for Domain Adaptation | 47
Causal Contextual Prediction for Learned Image Compression | 46
Robust High-Capacity Watermarking Over Online Social Network Shared Images | 46
Deep Multiscale Fusion Hashing for Cross-Modal Retrieval | 46
Fully Homomorphic Encryption Encapsulated Difference Expansion for Reversible Data Hiding in Encrypted Domain | 46
DR-GAN: Automatic Radial Distortion Rectification Using Conditional GAN in Real-Time | 46
RGBT Salient Object Detection: Benchmark and A Novel Cooperative Ranking Approach | 46
Estimating Generalized Gaussian Blur Kernels for Out-of-Focus Image Deblurring | 45
Zero Shot Detection | 45
Deep Template-Based Watermarking | 45
PQA-Net: Deep No Reference Point Cloud Quality Assessment via Multi-View Projection | 45
Encoder-Decoder With Cascaded CRFs for Semantic Segmentation | 45
AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization | 44
Reversible Data Hiding With Hierarchical Embedding for Encrypted Images | 44
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection | 44
Robust Reversible Watermarking in Encrypted Image With Secure Multi-Party Based on Lightweight Cryptography | 44
Aggregating Attentional Dilated Features for Salient Object Detection | 44
Sequential and Patch Analyses for Object Removal Video Forgery Detection and Localization | 43
Occupancy-Map-Based Rate Distortion Optimization and Partition for Video-Based Point Cloud Compression | 43
Learning to Score Figure Skating Sport Videos | 43
Towards Effective Deep Embedding for Zero-Shot Learning | 43
Robust Texture Description Using Local Grouped Order Pattern and Non-Local Binary Pattern | 43
GUDCP: Generalization of Underwater Dark Channel Prior for Underwater Image Restoration | 43
Camouflaged Object Detection via Context-Aware Cross-Level Fusion | 43
Multi-Purpose Oriented Single Nighttime Image Haze Removal Based on Unified Variational Retinex Model | 43
Sports Video Captioning via Attentive Motion Representation and Group Relationship Modeling | 42
Reversible Data Hiding in JPEG Images With Multi-Objective Optimization | 41
SiamFPN: A Deep Learning Method for Accurate and Real-Time Maritime Ship Tracking | 41
Optimal Reversible Data Hiding Scheme Based on Multiple Histograms Modification | 41
Multi-Grained Attention Networks for Single Image Super-Resolution | 40
Quantization and Entropy Coding in the Versatile Video Coding (VVC) Standard | 40
Temporal–Spatial Mapping for Action Recognition | 40
Multi-View Spectral Clustering Tailored Tensor Low-Rank Representation | 40
Cross-Domain Complementary Learning Using Pose for Multi-Person Part Segmentation | 40
Multi-MSB Compression Based Reversible Data Hiding Scheme in Encrypted Images | 40
Semantic-Aware Occlusion-Robust Network for Occluded Person Re-Identification | 40
Detecting Double JPEG Compressed Color Images With the Same Quantization Matrix in Spherical Coordinates | 39
Image Steganography With Symmetric Embedding Using Gaussian Markov Random Field Model | 39
Efficient Reversible Data Hiding for JPEG Images With Multiple Histograms Modification | 39
Visual Distortions in 360° Videos | 39
Deep Adversarial Data Augmentation for Extremely Low Data Regimes | 39
No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models | 39
Layer-Specific Optimization for Mixed Data Flow With Mixed Precision in FPGA Design for CNN-Based Object Detectors | 39
TSAN: Synthesized View Quality Enhancement via Two-Stream Attention Network for 3D-HEVC | 38
Blindly Assess Quality of In-the-Wild Videos via Quality-Aware Pre-Training and Motion Perception | 38
Attribute-Identity Embedding and Self-Supervised Learning for Scalable Person Re-Identification | 38
No-Reference Quality Assessment for 360-Degree Images by Analysis of Multifrequency Information and Local-Global Naturalness | 38
HF-TPE: High-Fidelity Thumbnail-Preserving Encryption | 38
IID-Net: Image Inpainting Detection Network via Neural Architecture Search and Attention | 38
Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene Graph | 38
Multi-Scale Neighborhood Feature Extraction and Aggregation for Point Cloud Segmentation | 38
Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection | 38
Triple Adversarial Learning and Multi-View Imaginative Reasoning for Unsupervised Domain Adaptation Person Re-Identification | 38
Secure Robust JPEG Steganography Based on AutoEncoder With Adaptive BCH Encoding | 37
Machine Learning-Based Fast Intra Mode Decision for HEVC Screen Content Coding via Decision Trees | 37
Zero-Shot Cross-Media Embedding Learning With Dual Adversarial Distribution Network | 37
Meta-Learning-Based Incremental Few-Shot Object Detection | 37
Spatiotemporal Multimodal Learning With 3D CNNs for Video Action Recognition | 37
DBCFace: Towards Pure Convolutional Neural Network Face Detection | 37
Joint Anchor-Feature Refinement for Real-Time Accurate Object Detection in Images and Videos | 37
A VVC Proposal With Quaternary Tree Plus Binary-Ternary Tree Coding Block Structure and Advanced Coding Techniques | 37
Exploring Stable Coefficients on Joint Sub-Bands for Robust Video Watermarking in DT CWT Domain | 37
Syntax-Guided Hierarchical Attention Network for Video Captioning | 37
Attention-Guided Global-Local Adversarial Learning for Detail-Preserving Multi-Exposure Image Fusion | 37
A Common Method of Share Authentication in Image Secret Sharing | 36
Hyperspectral Image Super-Resolution via Deep Prior Regularization With Parameter Estimation | 36
Transformer3D-Det: Improving 3D Object Detection by Vote Refinement | 36
High-Quality R-CNN Object Detection Using Multi-Path Detection Calibration Network | 36
TrajectoryCNN: A New Spatio-Temporal Feature Learning Network for Human Motion Prediction | 36
Bridge-GAN: Interpretable Representation Learning for Text-to-Image Synthesis | 36
High Capacity Reversible Data Hiding Based on Multiple Histograms Modification | 36
Exemplar-Based Denoising: A Unified Low-Rank Recovery Framework | 36
Multi-Graph Fusion and Learning for RGBT Image Saliency Detection | 36
Hierarchical Feature Fusion With Mixed Convolution Attention for Single Image Dehazing | 36
Model Compression Using Progressive Channel Pruning | 36
Wavelet-Based Deep Auto Encoder-Decoder (WDAED)-Based Image Compression | 36
Event-Centric Hierarchical Representation for Dense Video Captioning | 35
SCGAN: Saliency Map-Guided Colorization With Generative Adversarial Network | 35
Perceptual Quality Assessment for Screen Content Images by Spatial Continuity | 35
UNFusion: A Unified Multi-Scale Densely Connected Network for Infrared and Visible Image Fusion | 35
Multiscale Low-Light Image Enhancement Network With Illumination Constraint | 35
UAV-Satellite View Synthesis for Cross-View Geo-Localization | 35
Learning Low-Rank and Sparse Discriminative Correlation Filters for Coarse-to-Fine Visual Object Tracking | 35
Active Transfer Learning | 34
A Transformer-Based Feature Segmentation and Region Alignment Method for UAV-View Geo-Localization | 34
A Tunable Selective Encryption Scheme for H.265/HEVC Based on Chroma IPM and Coefficient Scrambling | 34
Facial Expression Recognition With Two-Branch Disentangled Generative Adversarial Network | 34
A Two-Stage Attentive Network for Single Image Super-Resolution | 34
Deep Sub-Region Network for Salient Object Detection | 34
Downscaling Factor Estimation on Pre-JPEG Compressed Images | 34
3D Mapping and 6D Pose Computation for Real Time Augmented Reality on Cylindrical Objects | 34
Learning Gated Non-Local Residual for Single-Image Rain Streak Removal | 34
Multimodal Local-Global Attention Network for Affective Video Content Analysis | 34
Dynamic Attention Guided Multi-Trajectory Analysis for Single Object Tracking | 34
Progressive Cross-Camera Soft-Label Learning for Semi-Supervised Person Re-Identification | 34
Independent Embedding Domain Based Two-Stage Robust Reversible Watermarking | 34
Single Image Brightening via Multi-Scale Exposure Fusion With Hybrid Learning | 34
Learning Generalized Spatial-Temporal Deep Feature Representation for No-Reference Video Quality Assessment | 34
Detecting Small Objects Using a Channel-Aware Deconvolutional Network | 33
Human-Centric Spatio-Temporal Video Grounding With Visual Transformers | 33
Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds | 33
General Video Coding Technology in Responses to the Joint Call for Proposals on Video Compression With Capability Beyond HEVC | 33
Progressive Meta-Learning With Curriculum | 33
Depth Estimation Using a Self-Supervised Network Based on Cross-Layer Feature Fusion and the Quadtree Constraint | 33
Overview of the Screen Content Support in VVC: Applications, Coding Tools, and Performance | 32
A Multi-Task Collaborative Network for Light Field Salient Object Detection | 32
Food and Ingredient Joint Learning for Fine-Grained Recognition | 32
Student Network Learning via Evolutionary Knowledge Distillation | 32
RGBT Tracking by Trident Fusion Network | 32
Illumination Unification for Person Re-Identification | 32
ZoomCount: A Zooming Mechanism for Crowd Counting in Static Images | 32
Detail-Preserving Multi-Exposure Fusion With Edge-Preserving Structural Patch Decomposition | 32
Deep Cross-Modal Representation Learning and Distillation for Illumination-Invariant Pedestrian Detection | 32
VVC Complexity and Software Implementation Analysis | 32
Multi-Person Hierarchical 3D Pose Estimation in Natural Videos | 32
Task-Aware Attention Model for Clothing Attribute Prediction | 31
Fast 3D-HEVC Depth Map Encoding Using Machine Learning | 31
Tensorial Multi-View Clustering via Low-Rank Constrained High-Order Graph Learning | 31
BBC Net: Bounding-Box Critic Network for Occlusion-Robust Object Detection | 31
Perceptual Image Hashing for Content Authentication Based on Convolutional Neural Network With Multiple Constraints | 31
Cross-SRN: Structure-Preserving Super-Resolution Network With Cross Convolution | 31
Incomplete Descriptor Mining With Elastic Loss for Person Re-Identification | 31
Noise Augmented Double-Stream Graph Convolutional Networks for Image Captioning | 31
Lossless Coding of Point Cloud Geometry Using a Deep Generative Model | 30
Context-Aware Mixup for Domain Adaptive Semantic Segmentation | 30
3D Face Anti-Spoofing With Factorized Bilinear Coding | 30
DesnowGAN: An Efficient Single Image Snow Removal Framework Using Cross-Resolution Lateral Connection and GANs | 30
Learning From Synthetic Shadows for Shadow Detection and Removal | 30
Cryptanalysis of Image Ciphers With Permutation-Substitution Network and Chaos | 30
Part-based Tracking via Discriminative Correlation Filters | 30
Video Compressed Sensing Using a Convolutional Neural Network | 30
Feature Alignment and Aggregation Siamese Networks for Fast Visual Tracking | 30
CGMDRNet: Cross-Guided Modality Difference Reduction Network for RGB-T Salient Object Detection | 30
Complementary Discriminative Correlation Filters Based on Collaborative Representation for Visual Object Tracking | 30
Joint Expression Synthesis and Representation Learning for Facial Expression Recognition | 29