IEEE Transactions on Circuits and Systems for Video Technology

Papers
(The TQCC of IEEE Transactions on Circuits and Systems for Video Technology is 16. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
ArticleCitations
Table of Contents1059
IEEE Transactions on Circuits and Systems for Video Technology publication information240
IEEE Transactions on Circuits and Systems for Video Technology publication information239
2022 Index IEEE Transactions on Circuits and Systems for Video Technology Vol. 32237
Multi-level Feature Fusion Network for Shadow Removal Detection235
UDTCWT-PHFMs Domain Statistical Image Watermarking Using Vector BW-Type R Distribution228
UAMD-Net: A Unified Adaptive Multimodal Neural Network for Dense Depth Completion206
Exploring and Exploiting High-Order Spatial–Temporal Dynamics for Long-Term Frame Prediction194
Future Feature-Based Supervised Contrastive Learning for Streaming Perception192
Representing Boundary-Ambiguous Scene Online With Scale-Encoded Cascaded Grids and Radiance Field Deblurring192
Relative Comparison-Based Consensus Learning for Multi-View Subspace Clustering189
Negative Class Guided Spatial Consistency Network for Sparsely Supervised Semantic Segmentation of Remote Sensing Images182
A Clinically Guided Graph Convolutional Network for Assessment of Parkinsonian Pronation-Supination Movements of Hands181
Towards Video Anomaly Detection in the Real World: A Binarization Embedded Weakly-Supervised Network180
Learning Spatio-Temporal Sharpness Map for Video Deblurring176
Fully Unsupervised Domain-Agnostic Image Retrieval173
Few-Shot Temporal Sentence Grounding via Memory-Guided Semantic Learning168
Ct-LVI: A Framework Toward Continuous-Time Laser-Visual-Inertial Odometry and Mapping162
RT3DHVC: A Real-time Human Holographic Video Conferencing System with a Consumer RGB-D Camera Array155
Active Spatial Positions Based Hierarchical Relation Inference for Group Activity Recognition147
Robust Monocular Pose Tracking of Less-Distinct Objects Based on Contour-Part Model142
Crowd-Powered Photo Enhancement Featuring an Active Learning Based Local Filter141
Representation Robustness and Feature Expansion for Exemplar-Free Class-Incremental Learning139
Plausible Proxy Mining With Credibility for Unsupervised Person Re-Identification138
Multi-Modal Multi-Grained Embedding Learning for Generalized Zero-Shot Video Classification133
Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification132
Fuzzified Contrast Enhancement for Nearly Invisible Images132
Relation-Aware Multi-Pass Comparison Deconfounded Network for Change Captioning132
MCCE-REC: MLLM-Driven Cross-Modal Contrastive Entropy Model for Zero-Shot Referring Expression Comprehension130
Deep Affine Motion Compensation Network for Inter Prediction in VVC127
Robust Image Watermarking With Synchronization Using Template Enhanced-Extracted Network127
Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network126
Block Diagonal Graph Embedded Discriminative Regression for Image Representation122
MMI-Det: Exploring Multi-Modal Integration for Visible and Infrared Object Detection121
Efficient Single-Object Tracker Based on Local-Global Feature Fusion121
IEEE Transactions on Circuits and Systems for Video Technology Publication Information120
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation119
DMRFlow: 4D Radar Scene Flow Estimation with Decoupled Matching and Refinement117
Semi-Supervised Crowd Counting via Multi-Task Pseudo-Label Self-Correction Strategy116
Highly-Parallel Hardwired Deep Convolutional Neural Network for 1-ms Dual-Hand Tracking114
CRP2-VCS: Contrast-Oriented Region-based Progressive Probabilistic Visual Cryptography Schemes110
Semantic-Aware Late-Stage Supervised Contrastive Learning for Fine-Grained Action Recognition110
Pose-Guided Transformer for Fine-Grained Action Quality Assessment109
Scene Prior Constrained Self-Paced Learning for Unsupervised Satellite Video Vehicle Detection107
Table of Contents105
DSC3D: Deformable Sampling Constraints in Stereo 3D Object Detection for Autonomous Driving105
Guest Editorial Introduction to the Special Issue on Label-Efficient Learning on Video Data105
Hierarchical Dynamic Programming Module for Human Pose Refinement105
Instance-Incremental Scene Graph Generation From Real-World Point Clouds via Normalizing Flows104
Learning Appearance-Motion Synergy via Memory-Guided Event Prediction for Video Anomaly Detection104
Toward Meta-Shape-Based Multi-View 3D Point Cloud Registration: An Evaluation104
IEEE Circuits and Systems Society Information104
VDTR: Video Deblurring With Transformer103
Harmony: An Eco-Friendly Adaptive Rate Control Scheme for Video-on-Demand in Low Earth Orbit Satellite Internet101
Learning to Capture the Query Distribution for Few-Shot Learning100
Stochastic Gradient Perturbation: An Implicit Regularizer for Person Re-Identification99
EIFNet: An Explicit and Implicit Feature Fusion Network for Finger Vein Verification98
Lightweight Neural Network for Enhancing Imaging Performance of Under-Display Camera94
SARGAN: Spatial Attention-Based Residuals for Facial Expression Manipulation93
Projected Generative Adversarial Network for Point Cloud Completion93
Iterative Self-Guided Image Filtering93
Cross-Level Multi-Modal Features Learning With Transformer for RGB-D Object Recognition93
Exploring Explicitly Disentangled Features for Domain Generalization93
Multi-Modal Attribute Prompting for Vision-Language Models92
FastAL: Fast Evaluation Module for Efficient Dynamic Deep Active Learning Using Broad Learning System91
Single Image Haze Removal With Haze Map Optimization for Various Haze Concentrations90
Uni3DA: Universal 3D Domain Adaptation for Object Recognition89
Dual-Stream Transformer With Distribution Alignment for Visible-Infrared Person Re-Identification89
Deep and Low-Rank Quaternion Priors for Color Image Processing88
Progressive Point Cloud Upsampling via Differentiable Rendering88
Truncated Robust Natural Watermarking With Hungarian Optimization88
Convolutional Neural Networks for Omnidirectional Image Quality Assessment: A Benchmark87
Spatial Attention-Guided Light Field Salient Object Detection Network With Implicit Neural Representation87
Local Attention Transformer-Based Full-View Finger-Vein Identification86
Pro-Tuning: Unified Prompt Tuning for Vision Tasks85
Enhancing Representation Learning With Spatial Transformation and Early Convolution for Reinforcement Learning-Based Small Object Detection84
Graph-Guided Unsupervised Multiview Representation Learning84
Spectral–Spatial Feature Extraction With Dual Graph Autoencoder for Hyperspectral Image Clustering84
Key Role Guided Transformer for Group Activity Recognition83
Adversarial Dual-Student With Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation82
SpiReco: Fast and Efficient Recognition of High-Speed Moving Objects With Spike Camera81
A Format Compliant Framework for HEVC Selective Encryption After Encoding81
Reversible Data Hiding in Encrypted Image via Secret Sharing Based on GF(p) and GF(2⁸)80
Edge and Skeleton Guidance Network for Salient Object Detection in Optical Remote Sensing Images80
AirSOD: A Lightweight Network for RGB-D Salient Object Detection80
Reversible Data Hiding Over Encrypted Images via Preprocessing-Free Matrix Secret Sharing80
Low-Light Image Enhancement via Progressive-Recursive Network80
Reliable Entropy-induced Anchor Learning for Incomplete Multi-view Subspace Clustering80
Equity in Unsupervised Domain Adaptation by Nuclear Norm Maximization79
Efficient Reversible Data Hiding for JPEG Images With Multiple Histograms Modification79
D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing78
Scalable and Robust Tensor Ring Decomposition for Large-scale Data with Missing Data and Outliers78
Learning Depth-Density Priors for Fourier-Based Unpaired Image Restoration78
Image Super-Resolution With Self-Similarity Prior Guided Network and Sample-Discriminating Learning77
Frequency Generation for Real-World Image Super-Resolution77
PhyDAA: Physiological Dataset Assessing Attention76
Multimodal Local-Global Attention Network for Affective Video Content Analysis76
IEEE Transactions on Circuits and Systems for Video Technology publication information73
Efficient Non-Blind Image Deblurring with Discriminative Shrinkage Deep Networks73
IEEE Transactions on Circuits and Systems for Video Technology publication information73
IEEE Transactions on Circuits and Systems for Video Technology publication information72
IEEE Circuits and Systems Society Information72
Low-Resolution Object Recognition With Cross-Resolution Relational Contrastive Distillation72
StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting71
Touchless Finger Vein and Fingerprint Verification via Exploiting Attention-Based Cross-Domain Fusion70
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering69
Question-Aware Global-Local Video Understanding Network for Audio-Visual Question Answering69
Balanced Teacher for Source-Free Object Detection69
Interlayer Restoration Deep Neural Network for Scalable High Efficiency Video Coding68
Dynamic Hypergraph Convolutional Network for No-Reference Point Cloud Quality Assessment67
Blind Image Quality Index for Authentic Distortions With Local and Global Deep Feature Aggregation67
Enhancing Robustness of Multi-Object Trackers With Temporal Feature Mix67
Multi-Scale Explicit Matching and Mutual Subject Teacher Learning for Generalizable Person Re-Identification67
Task-Specific Loss for Robust Instance Segmentation With Noisy Class Labels67
A Distortion-Aware Multi-Task Learning Framework for Fractional Interpolation in Video Coding66
Meta-Learning Based Domain Prior With Application to Optical-ISAR Image Translation66
Transformer-Based Multimodal Emotional Perception for Dynamic Facial Expression Recognition in the Wild66
Boosting Semi-Supervised Face Recognition With Noise Robustness65
Flow-Edge Guided Unsupervised Video Object Segmentation65
Forgery-aware Adaptive Learning with Vision Transformer for Generalized Face Forgery Detection65
Self-Supervised Adversarial Video Summarizer With Context Latent Sequence Learning64
Unsupervised Deep Hashing With Fine-Grained Similarity-Preserving Contrastive Learning for Image Retrieval64
G2LP-Net: Global to Local Progressive Video Inpainting Network64
Searching a Compact Architecture for Robust Multi-Exposure Image Fusion63
Robust Matrix Completion Based on Factorization and Truncated-Quadratic Loss Function63
Surveillance Video-and-Language Understanding: From Small to Large Multimodal Models63
Compact Interchannel Sampling Difference Descriptor for Color Texture Classification63
Optical Flow Reusing for High-Efficiency Space-Time Video Super Resolution62
Dynamic Particle Filter Framework for Robust Object Tracking62
Learning With Noisy Labels by Semantic and Feature Space Collaboration61
Texture-Aware Spherical Rotation for High Efficiency Omnidirectional Intra Video Coding61
A Novel Video Coding Strategy in HEVC for Object Detection61
VmambaIR: Visual State Space Model for Image Restoration61
CNN-Transformer Based Generative Adversarial Network for Copy-Move Source/ Target Distinguishment60
Diverse Batch Steganography Using Model-Based Selection and Double-Layered Payload Assignment60
Fixing Defect of Photometric Loss for Self-Supervised Monocular Depth Estimation60
Laplacian Pyramid Fusion Network With Hierarchical Guidance for Infrared and Visible Image Fusion59
Multi-Level Fusion and Attention-Guided CNN for Image Dehazing59
FDAC: Federated Domain Adaptation via Dual Contrastive Learning59
Enhanced Spatial-Temporal Salience for Cross-View Gait Recognition59
Conditional Dual Diffusion for Multimodal Clustering of Optical and SAR Images58
Target-Aware Tracking With Spatial-Temporal Context Attention57
Inter-Scale Similarity Guided Cost Aggregation for Stereo Matching57
Cloth-Imbalanced Gait Recognition via Hallucination57
Table of Contents57
A Universal Framework for Improving the Robustness of Coverless Image Steganography Based on Image Restoration57
DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and Authentication56
Mesh2Animation: Unsupervised Animating for Quadruped 3D Objects56
FDNet: Frequency Decomposition Network for Learned Image Compression56
MixSSC: Forward-Backward Mixture for Vision-based 3D Semantic Scene Completion56
WeaFU: Weather-Informed Image Blind Restoration via Multi-Weather Distribution Diffusion56
Monocular Depth Estimation on Adverse Weathers With Curriculum Domain Distribution Alignment56
Efficiently Exploiting Spatially Variant Knowledge for Video Deblurring56
Flow Visualization for Complex Fluid Flows via A Structure-enhanced Motion Estimator55
Facial Expression Recognition With Two-Branch Disentangled Generative Adversarial Network55
All-Inclusive Image Enhancement for Degraded Images Exhibiting Low-Frequency Corruption55
Recent Advances in Rate Control: From Optimization to Implementation and Beyond55
FaceGCN: Structured Priors Inspired Graph Convolutional Networks for Blind Face Restoration55
Depth Estimation From a Single Image of Blast Furnace Burden Surface Based on Edge Defocus Tracking55
TAKD: Target-Aware Knowledge Distillation for Remote Sensing Scene Classification55
A Novel Deep Learning Framework for Automatic Recognition of Thyroid Gland and Tissues of Neck in Ultrasound Image54
Regression-Based Motion Vector Field for Video Coding54
Semantic Disentanglement Adversarial Hashing for Cross-Modal Retrieval54
Appearance Matters, So Does Audio: Revealing the Hidden Face via Cross-Modality Transfer54
Table of Contents53
An Efficient Algorithm for Generating Harmonized Stereoscopic 360° VR Images53
VSOIQE: A Novel Viewport-Based Stitched 360° Omnidirectional Image Quality Evaluator53
SMR: Spatial-Guided Model-Based Regression for 3D Hand Pose and Mesh Reconstruction53
Low-Rank Tensor Graph Learning for Multi-View Subspace Clustering52
Efficient Selective Context Network for Accurate Object Detection52
MSGA-Net: Progressive Feature Matching via Multi-Layer Sparse Graph Attention52
Holistic Prototype Attention Network for Few-Shot Video Object Segmentation52
VVC In-Loop Filters52
One for All: A Unified Generative Framework for Image Emotion Classification51
DEP-Former: Multimodal Depression Recognition Based on Facial Expressions and Audio Features via Emotional Changes51
DAHP: Deep Attention-Guided Hashing With Pairwise Labels51
STAF: 3D Human Mesh Recovery From Video With Spatio-Temporal Alignment Fusion51
OraL: An Observational Learning Paradigm for Unsupervised Hyperspectral Change Detection51
Multi-Prior Driven Network for RGB-D Salient Object Detection51
Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification50
Matching Multi-Scale Feature Sets in Vision Transformer for Few-Shot Classification50
CFB-Then-ECB Mode-Based Image Encryption for an Efficient Correction of Noisy Encrypted Images50
Spike Camera Image Reconstruction Using Deep Spiking Neural Networks50
Table of Contents50
Deep Sparse Representation Based Image Restoration With Denoising Prior49
Exploiting Multiperspective Driven Hierarchical Content-Aware Network for Finger Vein Verification49
Efficient and Effective Nonconvex Low-Rank Subspace Clustering via SVT-Free Operators49
Generative Image Steganography Based on Text-to-Image Multimodal Generative Model49
Nested Fully-Connected Tensor Network Decomposition for Multi-Dimensional Visual Data Recovery49
POS-Trends Dynamic-Aware Model for Video Caption49
MtArtGPT: A Multi-Task Art Generation System With Pre-Trained Transformer48
Small Sample Image Segmentation by Coupling Convolutions and Transformers48
A Snippets Relation and Hard-Snippets Mask Network for Weakly-Supervised Temporal Action Localization48
Feature Alignment in Anchor-Free Object Detection48
PointOT: Interpretable Geometry-Inspired Point Cloud Generative Model via Optimal Transport48
A 3D Memristive Cubic Map with Dual Discrete Memristors: Design, Implementation, and Application in Image Encryption48
Globally Deformable Information Selection Transformer for Underwater Image Enhancement48
Dual-Path Feature Aware Network for Remote Sensing Image Semantic Segmentation48
Progressive Multi-Prompt learning for Vision-Language Models48
Sampling Propagation Attention With Trimap Generation Network for Natural Image Matting47
Real Image Denoising via Guided Residual Estimation and Noise Correction47
Exploiting Global Camera Network Constraints for Unsupervised Video Person Re-Identification47
Neuron-Based Spiking Transmission and Reasoning Network for Robust Image-Text Retrieval47
Collaborative Multi-Dynamic Pattern Modeling for Human Motion Prediction46
Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition46
Perceptual Underwater Image Enhancement With Deep Learning and Physical Priors46
Deep Adaptive Quadruplet Hashing With Probability Sampling for Large-Scale Image Retrieval46
Complementary Blind-Spot Network for Self-Supervised Real Image Denoising46
Dual-Constraint Coarse-to-Fine Network for Camouflaged Object Detection46
Content-Adaptive Rate Control Method for User-Generated Content Videos46
High-Resolution Feature Pyramid Network for Small Object Detection on Drone View46
Adaptive Memorization With Group Labels for Unsupervised Person Re-Identification46
Surface-continuous Scene Representation for Light Field Depth Estimation via Planarity Prior46
Contrastive Learning With Enhancing Detailed Information for Pre-Training Vision Transformer45
Semantic-Context Graph Network for Point-Based 3D Object Detection45
Enhancing Vision and Language Navigation With Prompt-Based Scene Knowledge45
Diffusion-Based Hypotheses Generation and Joint-Level Hypotheses Aggregation for 3D Human Pose Estimation45
U²-Former: Nested U-Shaped Transformer for Image Restoration via Multi-View Contrastive Learning45
A Novel Cross-Perturbation for Single Domain Generalization45
LGTrack: Exploiting Local and Global Properties for Robust Visual Tracking45
Optical Flow-Based Spatiotemporal Sketch for Video Representation: A Novel Framework44
Concept-Enhanced Relation Network for Video Visual Relation Inference44
Generalized Intra-Camera Supervised Person Re-Identification44
Dense Crosstalk Feature Aggregation for Classification and Localization in Object Detection44
PCTrack: Accurate Object Tracking for Live Video Analytics on Resource-Constrained Edge Devices44
Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning44
IEEE Transactions on Circuits and Systems for Video Technology publication information43
Special Issue on Segment Anything for Videos and Beyond43
DBVC: An End-to-End 3-D Deep Biomedical Video Coding Framework43
Glimpse and Zoom: Spatio-Temporal Focused Dynamic Network for Skeleton-Based Action Recognition43
Locality-Adaptive Structured Dictionary Learning for Cross-Domain Recognition43
Dual-Domain Feature Fusion and Multi-Level Memory-Enhanced Network for Spectral Compressive Imaging43
Curiosity-Driven Class-Incremental Learning via Adaptive Sample Selection43
Learning Physical-Spatio-Temporal Features for Video Shadow Removal43
CLSR: Cross-Layer Interaction Pyramid Super-Resolution Network43
A Pixel-Level Segmentation-Synthesis Framework for Dynamic Texture Video Compression43
Partially View-Aligned Representation Learning via Cross-View Graph Contrastive Network43
Toward Extreme Image Compression With Latent Feature Guidance and Diffusion Prior42
Towards Unconstrained Facial Landmark Detection Robust to Diverse Cropping Manners42
Enhancing Transparent Object Matting Using Predicted Definite Foreground and Background42
A Deep Reinforcement Learning Approach to Multiple Streams’ Joint Bitrate Allocation42
CRDH: Compatible Reversible Data Hiding With High Capacity and Generalization41
Flexible Temperature Parallel Distillation for Dense Object Detection: Make Response-based Knowledge Distillation Great Again41
Modality Fused Class-Proxy with Knowledge Distillation for Zero-Shot Sketch-based Image Retrieval41
CodingHomo: Bootstrapping Deep Homography With Video Coding41
M3CS: Multi-Target Masked Point Modeling with Learnable Codebook and Siamese Decoders41
Deep Video Super-Resolution Using Hybrid Imaging System41
MMatch: Semi-Supervised Discriminative Representation Learning for Multi-View Classification40
Corruption-Invariant Person Re-Identification via Coarse-to-Fine Feature Alignment40
Knowledge-Based Visual Question Generation40
0.11189198493958