IEEE Transactions on Circuits and Systems for Video Technology

Papers
(The TQCC of IEEE Transactions on Circuits and Systems for Video Technology is 13. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
ArticleCitations
Overview of the Versatile Video Coding (VVC) Standard and its Applications545
Image De-Raining Using a Conditional Generative Adversarial Network510
Real-World Underwater Enhancement: Challenges, Benchmarks, and Solutions Under Natural Light314
Multimodal Transformer With Multi-View Visual Representation for Image Captioning243
Image and Video Compression With Neural Networks: A Review215
Data Augmentation Using Random Image Cropping and Patching for Deep CNNs205
Video Summarization With Attention-Based Encoder–Decoder Networks194
Channel-Wise and Spatial Feature Modulation Network for Single Image Super-Resolution187
Low-Complexity CTU Partition Structure Decision and Fast Intra Mode Decision for Versatile Video Coding181
Feature Refinement and Filter Network for Person Re-Identification161
A Survey of Open-World Person Re-Identification155
PCC Net: Perspective Crowd Counting via Spatial Convolutional Network154
Task-Adaptive Attention for Image Captioning146
Multi-Scale Metric Learning for Few-Shot Learning143
RetinexDIP: A Unified Deep Framework for Low-Light Image Enhancement126
Small Object Detection in Unmanned Aerial Vehicle Images Using Feature Fusion and Scaling-Based Single Shot Detector With Spatial Context Analysis124
ECFFNet: Effective and Consistent Feature Fusion Network for RGB-T Salient Object Detection124
SwinNet: Swin Transformer Drives Edge-Aware RGB-D and RGB-T Salient Object Detection120
Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals116
Low-Light Image Enhancement via Progressive-Recursive Network115
Learning a Deep Multi-Scale Feature Ensemble and an Edge-Attention Guidance for Image Fusion112
Image Description With Polar Harmonic Fourier Moments112
Deep Virtual Reality Image Quality Assessment With Human Perception Guider for Omnidirectional Image105
Richly Activated Graph Convolutional Network for Robust Skeleton-Based Action Recognition104
Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition102
A Decade Survey of Content Based Image Retrieval Using Deep Learning97
Multi-Temporal Ultra Dense Memory Network for Video Super-Resolution94
Low-Rank Tensor Graph Learning for Multi-View Subspace Clustering89
Hierarchical Graph Neural Networks for Few-Shot Learning88
Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition85
Lossy Point Cloud Geometry Compression via End-to-End Learning85
Target Oriented Perceptual Adversarial Fusion Network for Underwater Image Enhancement84
Drone-Based RGB-Infrared Cross-Modality Vehicle Detection Via Uncertainty-Aware Learning83
Lightweight Image Super-Resolution With Expectation-Maximization Attention Mechanism83
Revisiting Feature Fusion for RGB-T Salient Object Detection83
Detecting Compressed Deepfake Videos in Social Networks Using Frame-Temporality Two-Stream Convolutional Network82
A Simple Local Minimal Intensity Prior and an Improved Algorithm for Blind Image Deblurring82
Region-Aware Image Captioning via Interaction Learning81
Dual-Level Representation Enhancement on Characteristic and Context for Image-Text Retrieval80
Real-Time Video Emotion Recognition Based on Reinforcement Learning and Domain Knowledge80
SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval79
Each Part Matters: Local Patterns Facilitate Cross-View Geo-Localization77
Underwater Image Enhancement Quality Evaluation: Benchmark Dataset and Objective Metric77
Edge-Guided Non-Local Fully Convolutional Network for Salient Object Detection76
Double Parameters Fractal Sorting Matrix and Its Application in Image Encryption76
Perceptual Underwater Image Enhancement With Deep Learning and Physical Priors76
Unified Information Fusion Network for Multi-Modal RGB-D and RGB-T Salient Object Detection76
Occluded Face Recognition in the Wild by Identity-Diversity Inpainting75
Low Rank Component Induced Spatial-Spectral Kernel Method for Hyperspectral Image Classification75
Attention-Driven Loss for Anomaly Detection in Video Surveillance75
Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection for Autonomous Driving75
Multiple Histograms-Based Reversible Data Hiding: Framework and Realization75
Unsupervised Blind Image Quality Evaluation via Statistical Measurements of Structure, Naturalness, and Perception72
CGFNet: Cross-Guided Fusion Network for RGB-T Salient Object Detection72
Watermarking Neural Networks With Watermarked Images71
UrbanLF: A Comprehensive Light Field Dataset for Semantic Segmentation of Urban Scenes69
Deep Spatial-Spectral Subspace Clustering for Hyperspectral Image68
Wireless Image Transmission Using Deep Source Channel Coding With Attention Modules68
Efficient and Model-Based Infrared and Visible Image Fusion via Algorithm Unrolling67
Grayscale Enhancement Colorization Network for Visible-Infrared Person Re-Identification67
Intra Prediction and Mode Coding in VVC66
RefineDet++: Single-Shot Refinement Neural Network for Object Detection65
A Variational Framework for Underwater Image Dehazing and Deblurring64
Concealed Attack for Robust Watermarking Based on Generative Model and Perceptual Loss64
Fine-Grained Age Estimation in the Wild With Attention LSTM Networks64
Blind Omnidirectional Image Quality Assessment With Viewport Oriented Graph Convolutional Networks64
Block Partitioning Structure in the VVC Standard63
A Survey of Multiple Pedestrian Tracking Based on Tracking-by-Detection Framework63
Multi-Level Fusion and Attention-Guided CNN for Image Dehazing62
Fuzzy Integral-Based CNN Classifier Fusion for 3D Skeleton Action Recognition62
Underwater Image Co-Enhancement With Correlation Feature Matching and Joint Learning62
Color Cast Dependent Image Dehazing via Adaptive Airlight Refinement and Non-Linear Color Balancing61
Recursive Neural Network for Video Deblurring61
Deep Joint Depth Estimation and Color Correction From Monocular Underwater Images Based on Unsupervised Adaptation Networks59
VVC In-Loop Filters59
PSCC-Net: Progressive Spatio-Channel Correlation Network for Image Manipulation Detection and Localization58
Efficient Context-Guided Stacked Refinement Network for RGB-T Salient Object Detection58
Beyond Scalar Neuron: Adopting Vector-Neuron Capsules for Long-Term Person Re-Identification58
Detail-Enhanced Multi-Scale Exposure Fusion in YUV Color Space57
No-Reference Light Field Image Quality Assessment Based on Spatial-Angular Measurement56
SDL: Spectrum-Disentangled Representation Learning for Visible-Infrared Person Re-Identification56
MDCN: Multi-Scale Dense Cross Network for Image Super-Resolution56
A Robust GAN-Generated Face Detection Method Based on Dual-Color Spaces and an Improved Xception56
Low CP Rank and Tucker Rank Tensor Completion for Estimating Missing Components in Image Data56
High-Order Interaction Learning for Image Captioning55
Decomposition Makes Better Rain Removal: An Improved Attention-Guided Deraining Network55
Infrared and Visible Image Fusion via Texture Conditional Generative Adversarial Network55
Action Recognition Scheme Based on Skeleton Representation With DS-LSTM Network55
High-Level Semantic Networks for Multi-Scale Object Detection55
A Survey of Human Action Analysis in HRI Applications54
Multiscale Densely-Connected Fusion Networks for Hyperspectral Images Classification54
Stereoscopic Image Description With Trinion Fractional-Order Continuous Orthogonal Moments53
Cross-View Gait Recognition Using Pairwise Spatial Transformer Networks52
A View Synthesis-Based 360° VR Caching System Over MEC-Enabled C-RAN52
Adaptive Region Proposal With Channel Regularization for Robust Object Tracking52
Normality Learning in Multispace for Video Anomaly Detection51
MSTA-Net: Forgery Detection by Generating Manipulation Trace Based on Multi-Scale Self-Texture Attention51
The Joint Exploration Model (JEM) for Video Compression With Capability Beyond HEVC51
A Hybrid Compression Framework for Color Attributes of Static 3D Point Clouds51
Learning Dual Semantic Relations With Graph Attention for Image-Text Matching50
Reversible Data Hiding With Hierarchical Embedding for Encrypted Images50
Coverless Image Steganography Based on Multi-Object Recognition50
A High-Capacity Reversible Data Hiding in Encrypted Images Employing Local Difference Predictor50
PQA-Net: Deep No Reference Point Cloud Quality Assessment via Multi-View Projection50
Implicit Dual-Domain Convolutional Network for Robust Color Image Compression Artifact Reduction50
Multiple Robustness Enhancements for Image Adaptive Steganography in Lossy Channels50
GUDCP: Generalization of Underwater Dark Channel Prior for Underwater Image Restoration50
Color Transferred Convolutional Neural Networks for Image Dehazing50
Multi-Purpose Oriented Single Nighttime Image Haze Removal Based on Unified Variational Retinex Model50
Transform Coding in the VVC Standard49
Encoder-Decoder With Cascaded CRFs for Semantic Segmentation49
Camouflaged Object Detection via Context-Aware Cross-Level Fusion49
SAC-Net: Spatial Attenuation Context for Salient Object Detection49
E2I: Generative Inpainting From Edge to Image49
Centralized Large Margin Cosine Loss for Open-Set Deep Palmprint Recognition48
Deep Template-Based Watermarking48
Estimating Generalized Gaussian Blur Kernels for Out-of-Focus Image Deblurring47
Causal Contextual Prediction for Learned Image Compression47
Robust High-Capacity Watermarking Over Online Social Network Shared Images47
Deep Multiscale Fusion Hashing for Cross-Modal Retrieval47
End-to-End Learning Deep CRF Models for Multi-Object Tracking Deep CRF Models47
Fully Homomorphic Encryption Encapsulated Difference Expansion for Reversible Data Hiding in Encrypted Domain47
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection47
Deep Transfer Hashing for Image Retrieval47
Rethinking Triplet Loss for Domain Adaptation47
AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization46
RGBT Salient Object Detection: Benchmark and A Novel Cooperative Ranking Approach46
Learning to Score Figure Skating Sport Videos46
Robust Reversible Watermarking in Encrypted Image With Secure Multi-Party Based on Lightweight Cryptography46
Aggregating Attentional Dilated Features for Salient Object Detection45
Layer-Specific Optimization for Mixed Data Flow With Mixed Precision in FPGA Design for CNN-Based Object Detectors45
Spatiotemporal Multimodal Learning With 3D CNNs for Video Action Recognition45
Zero Shot Detection45
Multi-MSB Compression Based Reversible Data Hiding Scheme in Encrypted Images44
Occupancy-Map-Based Rate Distortion Optimization and Partition for Video-Based Point Cloud Compression44
No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models44
Reversible Data Hiding in JPEG Images With Multi-Objective Optimization43
Cross-Domain Complementary Learning Using Pose for Multi-Person Part Segmentation43
Sports Video Captioning via Attentive Motion Representation and Group Relationship Modeling43
Multi-View Spectral Clustering Tailored Tensor Low-Rank Representation43
Robust Texture Description Using Local Grouped Order Pattern and Non-Local Binary Pattern43
Sequential and Patch Analyses for Object Removal Video Forgery Detection and Localization43
Optimal Reversible Data Hiding Scheme Based on Multiple Histograms Modification43
Semantic-Aware Occlusion-Robust Network for Occluded Person Re-Identification43
Towards Effective Deep Embedding for Zero-Shot Learning43
UAV-Satellite View Synthesis for Cross-View Geo-Localization42
Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection42
A Transformer-Based Feature Segmentation and Region Alignment Method for UAV-View Geo-Localization42
Multi-Grained Attention Networks for Single Image Super-Resolution42
Attention-Guided Global-Local Adversarial Learning for Detail-Preserving Multi-Exposure Image Fusion41
SiamFPN: A Deep Learning Method for Accurate and Real-Time Maritime Ship Tracking41
Efficient Reversible Data Hiding for JPEG Images With Multiple Histograms Modification41
Detecting Double JPEG Compressed Color Images With the Same Quantization Matrix in Spherical Coordinates40
TrajectoryCNN: A New Spatio-Temporal Feature Learning Network for Human Motion Prediction40
Deep Adversarial Data Augmentation for Extremely Low Data Regimes40
Unsupervised Cross-Media Retrieval Using Domain Adaptation With Scene Graph40
TSAN: Synthesized View Quality Enhancement via Two-Stream Attention Network for 3D-HEVC40
Blindly Assess Quality of In-the-Wild Videos via Quality-Aware Pre-Training and Motion Perception40
Quantization and Entropy Coding in the Versatile Video Coding (VVC) Standard40
Exploring Stable Coefficients on Joint Sub-Bands for Robust Video Watermarking in DT CWT Domain40
No-Reference Quality Assessment for 360-Degree Images by Analysis of Multifrequency Information and Local-Global Naturalness40
Image Steganography With Symmetric Embedding Using Gaussian Markov Random Field Model40
IID-Net: Image Inpainting Detection Network via Neural Architecture Search and Attention40
Learning Generalized Spatial-Temporal Deep Feature Representation for No-Reference Video Quality Assessment39
Visual Distortions in 360° Videos39
HF-TPE: High-Fidelity Thumbnail- Preserving Encryption39
Multiscale Low-Light Image Enhancement Network With Illumination Constraint38
Syntax-Guided Hierarchical Attention Network for Video Captioning38
Exemplar-Based Denoising: A Unified Low-Rank Recovery Framework38
A VVC Proposal With Quaternary Tree Plus Binary-Ternary Tree Coding Block Structure and Advanced Coding Techniques38
Machine Learning-Based Fast Intra Mode Decision for HEVC Screen Content Coding via Decision Trees38
DBCFace: Towards Pure Convolutional Neural Network Face Detection38
A Two-Stage Attentive Network for Single Image Super-Resolution38
Attribute-Identity Embedding and Self-Supervised Learning for Scalable Person Re-Identification38
High Capacity Reversible Data Hiding Based on Multiple Histograms Modification38
Triple Adversarial Learning and Multi-View Imaginative Reasoning for Unsupervised Domain Adaptation Person Re-Identification38
Multi-Scale Neighborhood Feature Extraction and Aggregation for Point Cloud Segmentation38
Hierarchical Feature Fusion With Mixed Convolution Attention for Single Image Dehazing38
Secure Robust JPEG Steganography Based on AutoEncoder With Adaptive BCH Encoding37
SCGAN: Saliency Map-Guided Colorization With Generative Adversarial Network37
Model Compression Using Progressive Channel Pruning37
Joint Anchor-Feature Refinement for Real-Time Accurate Object Detection in Images and Videos37
Meta-Learning-Based Incremental Few-Shot Object Detection37
UNFusion: A Unified Multi-Scale Densely Connected Network for Infrared and Visible Image Fusion37
Transformer3D-Det: Improving 3D Object Detection by Vote Refinement37
Zero-Shot Cross-Media Embedding Learning With Dual Adversarial Distribution Network37
Event-Centric Hierarchical Representation for Dense Video Captioning37
Independent Embedding Domain Based Two-Stage Robust Reversible Watermarking37
Bridge-GAN: Interpretable Representation Learning for Text-to-Image Synthesis37
Deep Cross-Modal Representation Learning and Distillation for Illumination-Invariant Pedestrian Detection36
Wavelet-Based Deep Auto Encoder-Decoder (WDAED)-Based Image Compression36
Multi-Graph Fusion and Learning for RGBT Image Saliency Detection36
Hyperspectral Image Super-Resolution via Deep Prior Regularization With Parameter Estimation36
High-Quality R-CNN Object Detection Using Multi-Path Detection Calibration Network36
A Common Method of Share Authentication in Image Secret Sharing36
A Tunable Selective Encryption Scheme for H.265/HEVC Based on Chroma IPM and Coefficient Scrambling36
Dynamic Attention Guided Multi-Trajectory Analysis for Single Object Tracking35
3D Mapping and 6D Pose Computation for Real Time Augmented Reality on Cylindrical Objects35
Food and Ingredient Joint Learning for Fine-Grained Recognition35
Perceptual Quality Assessment for Screen Content Images by Spatial Continuity35
Learning Gated Non-Local Residual for Single-Image Rain Streak Removal35
Learning Low-Rank and Sparse Discriminative Correlation Filters for Coarse-to-Fine Visual Object Tracking35
VVC Complexity and Software Implementation Analysis35
Deep Sub-Region Network for Salient Object Detection34
RGBT Tracking by Trident Fusion Network34
Human-Centric Spatio-Temporal Video Grounding With Visual Transformers34
General Video Coding Technology in Responses to the Joint Call for Proposals on Video Compression With Capability Beyond HEVC34
Detecting Small Objects Using a Channel-Aware Deconvolutional Network34
Progressive Cross-Camera Soft-Label Learning for Semi-Supervised Person Re-Identification34
Progressive Meta-Learning With Curriculum34
Single Image Brightening via Multi-Scale Exposure Fusion With Hybrid Learning34
Active Transfer Learning34
A Multi-Task Collaborative Network for Light Field Salient Object Detection34
Facial Expression Recognition With Two-Branch Disentangled Generative Adversarial Network34
Multimodal Local-Global Attention Network for Affective Video Content Analysis34
Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds33
Subjective and Objective De-Raining Quality Assessment Towards Authentic Rain Image33
BBC Net: Bounding-Box Critic Network for Occlusion-Robust Object Detection33
Cross-SRN: Structure-Preserving Super-Resolution Network With Cross Convolution33
Bi-Directional Progressive Guidance Network for RGB-D Salient Object Detection33
Noise Augmented Double-Stream Graph Convolutional Networks for Image Captioning33
Detail-Preserving Multi-Exposure Fusion With Edge-Preserving Structural Patch Decomposition33
Incomplete Descriptor Mining With Elastic Loss for Person Re-Identification33
Illumination Unification for Person Re-Identification33
Depth Estimation Using a Self-Supervised Network Based on Cross-Layer Feature Fusion and the Quadtree Constraint33
Overview of the Screen Content Support in VVC: Applications, Coding Tools, and Performance33
Task-Aware Attention Model for Clothing Attribute Prediction32
Complementary Discriminative Correlation Filters Based on Collaborative Representation for Visual Object Tracking32
Perceptual Image Hashing for Content Authentication Based on Convolutional Neural Network With Multiple Constraints32
Joint Expression Synthesis and Representation Learning for Facial Expression Recognition32
Student Network Learning via Evolutionary Knowledge Distillation32
Deep Texture-Aware Features for Camouflaged Object Detection32
FilterNet: Adaptive Information Filtering Network for Accurate and Fast Image Super-Resolution32
ZoomCount: A Zooming Mechanism for Crowd Counting in Static Images32
Multi-View Spatial Attention Embedding for Vehicle Re-Identification32
3D Face Anti-Spoofing With Factorized Bilinear Coding32
DesnowGAN: An Efficient Single Image Snow Removal Framework Using Cross-Resolution Lateral Connection and GANs32
TOAN: Target-Oriented Alignment Network for Fine-Grained Image Categorization With Few Labeled Samples32
Multi-Person Hierarchical 3D Pose Estimation in Natural Videos32
A Camera Shooting Resilient Watermarking Scheme for Underpainting Documents31
Video Compressed Sensing Using a Convolutional Neural Network31
Lossless Coding of Point Cloud Geometry Using a Deep Generative Model31
AO2-DETR: Arbitrary-Oriented Object Detection Transformer31
Exploring Dense Context for Salient Object Detection31
Rethinking Camouflaged Object Detection: Models and Datasets31
Feature Alignment and Aggregation Siamese Networks for Fast Visual Tracking31
Tensorial Multi-View Clustering via Low-Rank Constrained High-Order Graph Learning31
Laplacian Regularized Nonnegative Representation for Clustering and Dimensionality Reduction31
Deep Semantic Reconstruction Hashing for Similarity Retrieval31
Unsupervised Domain Adaptation via Importance Sampling31
0.105220079422