IEEE Transactions on Circuits and Systems for Video Technology

Papers
(The TQCC of IEEE Transactions on Circuits and Systems for Video Technology is 19. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
Article | Citations
2022 Index IEEE Transactions on Circuits and Systems for Video Technology Vol. 32 | 526
IEEE Transactions on Circuits and Systems for Video Technology Publication Information | 464
Table of Contents | 338
IEEE Transactions on Circuits and Systems for Video Technology publication information | 334
IEEE Transactions on Circuits and Systems for Video Technology publication information | 302
Multi-Modal Multi-Grained Embedding Learning for Generalized Zero-Shot Video Classification | 277
SARGAN: Spatial Attention-Based Residuals for Facial Expression Manipulation | 260
DMRFlow: 4D Radar Scene Flow Estimation With Decoupled Matching and Refinement | 242
Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network | 235
PhyDAA: Physiological Dataset Assessing Attention | 224
Learning to Capture the Query Distribution for Few-Shot Learning | 210
Semantic Boosting via Knowledge Sharing and Feedback for Video Anomaly Detection | 209
DP-Retinex: Dual-Prior Guided Low-Light Image Enhancement with YUV-Domain Reflectance-Illumination Decomposition | 198
Table of Contents | 195
IEEE Circuits and Systems Society Information | 195
Guest Editorial Introduction to the Special Issue on Label-Efficient Learning on Video Data | 180
Harmony: An Eco-Friendly Adaptive Rate Control Scheme for Video-on-Demand in Low Earth Orbit Satellite Internet | 172
Synergistic Fusion Network of Microscopic Hyperspectral and RGB Images for Multi-Perspective Segmentation | 166
Subjective and Objective Quality Assessment of Display Content Videos | 165
RT3DHVC: A Real-Time Human Holographic Video Conferencing System With a Consumer RGB-D Camera Array | 163
Convolutional Neural Networks for Omnidirectional Image Quality Assessment: A Benchmark | 154
CRP2-VCS: Contrast-Oriented Region-Based Progressive Probabilistic Visual Cryptography Schemes | 150
Stochastic Gradient Perturbation: An Implicit Regularizer for Person Re-Identification | 149
Uni3DA: Universal 3D Domain Adaptation for Object Recognition | 148
Toward Meta-Shape-Based Multi-View 3D Point Cloud Registration: An Evaluation | 147
A Clinically Guided Graph Convolutional Network for Assessment of Parkinsonian Pronation-Supination Movements of Hands | 147
Spectral–Spatial Feature Extraction With Dual Graph Autoencoder for Hyperspectral Image Clustering | 143
Deep Affine Motion Compensation Network for Inter Prediction in VVC | 141
Representation Robustness and Feature Expansion for Exemplar-Free Class-Incremental Learning | 141
Relative Comparison-Based Consensus Learning for Multi-View Subspace Clustering | 138
SpiReco: Fast and Efficient Recognition of High-Speed Moving Objects With Spike Camera | 137
Cross-Level Multi-Modal Features Learning With Transformer for RGB-D Object Recognition | 136
DSC3D: Deformable Sampling Constraints in Stereo 3D Object Detection for Autonomous Driving | 136
Iterative Self-Guided Image Filtering | 131
Viewport Prediction for Volumetric Video Streaming by Exploring Video Saliency and User Trajectory Information | 130
Reversible Data Hiding Over Encrypted Images via Preprocessing-Free Matrix Secret Sharing | 129
Dual-Stream Transformer With Distribution Alignment for Visible-Infrared Person Re-Identification | 125
Filtering and Alternating Calibration: Spatiotemporal Context Alternating Fusion for Event-Based Monocular Depth Estimation | 125
TPCM-SegNet: A Text-Prompted Dual-Path Convolution-Mamba Network for Anomaly Segmentation | 125
FoV Prediction-Based Adaptive Bitrate Streaming With On-Demand Transcoding for 360° Videos | 123
LiveMatte: Dynamic Scene Background Restoration and Selective Portrait Patch Enhancement | 122
DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation | 117
Draw Like an Artist: Complex Scene Generation With Diffusion Model via Composition, Painting, and Retouching | 117
VDTR: Video Deblurring With Transformer | 116
Learning Spatio-Temporal Sharpness Map for Video Deblurring | 116
Exploring and Exploiting High-Order Spatial–Temporal Dynamics for Long-Term Frame Prediction | 115
Projected Generative Adversarial Network for Point Cloud Completion | 115
USVTrack: A Benchmark for Multi-Object Tracking in Complex Water Surface Scenes | 115
Few-Shot Temporal Sentence Grounding via Memory-Guided Semantic Learning | 114
Scene Prior Constrained Self-Paced Learning for Unsupervised Satellite Video Vehicle Detection | 114
Deep Convolutional Primal-Dual Network for Image Deblurring | 112
Reliable Entropy-Induced Anchor Learning for Incomplete Multi-View Subspace Clustering | 112
Instance-Incremental Scene Graph Generation From Real-World Point Clouds via Normalizing Flows | 111
Semantic-Aware Late-Stage Supervised Contrastive Learning for Fine-Grained Action Recognition | 109
Semi-Supervised Crowd Counting via Multi-Task Pseudo-Label Self-Correction Strategy | 109
Future Feature-Based Supervised Contrastive Learning for Streaming Perception | 108
UAMD-Net: A Unified Adaptive Multimodal Neural Network for Dense Depth Completion | 107
Unsupervised Action Segmentation via Multi-Scale Temporal-Interaction Enhancement | 103
Phase-Guided Cross-Frequency Integration Network for ISAR and Optical Image Fusion | 103
ProMoT: Progressive Prompting of Modality and Temporal Dynamics for RGB-T Tracking | 102
Dependability Feature Learning Based on Sample Generation for Unsupervised Text-to-Image Person Re-Identification | 101
Scalable and Robust Tensor Ring Decomposition for Large-Scale Data With Missing Data and Outliers | 101
Equity in Unsupervised Domain Adaptation by Nuclear Norm Maximization | 101
Negative Class Guided Spatial Consistency Network for Sparsely Supervised Semantic Segmentation of Remote Sensing Images | 100
Graph-Guided Unsupervised Multiview Representation Learning | 96
UDTCWT-PHFMs Domain Statistical Image Watermarking Using Vector BW-Type R Distribution | 96
A Format Compliant Framework for HEVC Selective Encryption After Encoding | 96
Fully Unsupervised Domain-Agnostic Image Retrieval | 95
Representing Boundary-Ambiguous Scene Online With Scale-Encoded Cascaded Grids and Radiance Field Deblurring | 95
NDM: Boosting Dataset Distillation via Nested Difficulty Matching | 94
Reconstructing Sparse-view Indoor Scenes in View Space with Global Monocular Prior Alignment | 94
Ct-LVI: A Framework Toward Continuous-Time Laser-Visual-Inertial Odometry and Mapping | 94
Boosting Video Object Segmentation with Discriminative Core Features and Adaptive Position Refinement | 93
CLIP-Based Class Incremental Semantic Segmentation Framework with Generalization-Preserving Knowledge Distillation | 93
Hierarchical Dynamic Programming Module for Human Pose Refinement | 92
Efficient Single-Object Tracker Based on Local-Global Feature Fusion | 92
Enhancing Representation Learning With Spatial Transformation and Early Convolution for Reinforcement Learning-Based Small Object Detection | 90
Multi-Stage Cross-Modality Feature Interaction for RGB-Thermal Multi-Object Tracking | 90
VPA: Multi-Modal Virtual Point Augmentation for 3D Object Detection | 89
MPCF: Multi-Phase Consolidated Fusion for Multi-Modal 3D Object Detection with Pseudo Point Cloud | 88
Local Attention Transformer-Based Full-View Finger-Vein Identification | 87
Universal Immunized Cover Construction for Secure Adaptive Steganography across Multiple Domains | 87
DS²VP: Dynamically-Selected Spatially Visual Prompting | 87
Plausible Proxy Mining With Credibility for Unsupervised Person Re-Identification | 85
Lossless Dynamic Point Cloud Geometry Compression via Rate-Distortion Optimized Motion Estimation | 85
Fuzzified Contrast Enhancement for Nearly Invisible Images | 85
EIFNet: An Explicit and Implicit Feature Fusion Network for Finger Vein Verification | 84
Dual Difficulty-Aware Adaptive Pseudo Labeling for Semi-Supervised CNV Segmentation | 84
FastAL: Fast Evaluation Module for Efficient Dynamic Deep Active Learning Using Broad Learning System | 84
Morphology-Guided Muscle Cell Detection & Counting based on Transfer Learning, FFD Augmentation and Density-Aware Loss Optimization | 84
Crowd-Powered Photo Enhancement Featuring an Active Learning Based Local Filter | 82
Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification | 82
TiGDistill-BEV: Multi-View BEV 3D Object Detection via Target Inner-Geometry Learning Distillation | 82
Exploring Explicitly Disentangled Features for Domain Generalization | 82
Adversarial Dual-Student With Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation | 81
Block Diagonal Graph Embedded Discriminative Regression for Image Representation | 81
MEF-GD: Multimodal Enhancement and Fusion Network for Garment Designer | 81
Highly-Parallel Hardwired Deep Convolutional Neural Network for 1-ms Dual-Hand Tracking | 80
PPIFuse: Physical Priors Injected Infrared and Visible Image Fusion | 80
MultiHuman: Leverage Multimodal Prompts for Controllable Multi-Person Image Synthesizing | 79
Active Spatial Positions Based Hierarchical Relation Inference for Group Activity Recognition | 79
Edge and Skeleton Guidance Network for Salient Object Detection in Optical Remote Sensing Images | 79
Multi-Level Feature Fusion Network for Shadow Removal Detection | 78
Learning Monocular Depth via Cascaded Iterative Refinement in Visual-Echo Scenes | 77
Learning Appearance-Motion Synergy via Memory-Guided Event Prediction for Video Anomaly Detection | 77
ASCFormer: An Adaptive Structure-Aware Cascaded Transformer for 3D Object Detection | 77
Towards Video Anomaly Detection in the Real World: A Binarization Embedded Weakly-Supervised Network | 77
D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing | 76
MCCE-REC: MLLM-Driven Cross-Modal Contrastive Entropy Model for Zero-Shot Referring Expression Comprehension | 76
Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture | 75
Deep and Low-Rank Quaternion Priors for Color Image Processing | 75
Robust Image Watermarking With Synchronization Using Template Enhanced-Extracted Network | 75
Spatial Attention-Guided Light Field Salient Object Detection Network With Implicit Neural Representation | 75
MMI-Det: Exploring Multi-Modal Integration for Visible and Infrared Object Detection | 74
Key Role Guided Transformer for Group Activity Recognition | 74
Relation-Aware Multi-Pass Comparison Deconfounded Network for Change Captioning | 74
Frequency Generation for Real-World Image Super-Resolution | 74
AirSOD: A Lightweight Network for RGB-D Salient Object Detection | 74
SMART: Semantic Matching Contrastive Learning for Partially View-Aligned Clustering | 74
Learning Depth-Density Priors for Fourier-Based Unpaired Image Restoration | 73
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | 73
Lightweight Neural Network for Enhancing Imaging Performance of Under-Display Camera | 73
Pro-Tuning: Unified Prompt Tuning for Vision Tasks | 72
Multi-Modal Attribute Prompting for Vision-Language Models | 72
Pose-Guided Transformer for Fine-Grained Action Quality Assessment | 72
Video Understanding With Large Language Models: A Survey | 72
IEEE Transactions on Circuits and Systems for Video Technology publication information | 71
IEEE Transactions on Circuits and Systems for Video Technology publication information | 71
WordCon: Word-level Typography Control in Visual Text Rendering | 71
All-Inclusive Image Enhancement for Degraded Images Exhibiting Low-Frequency Corruption | 70
IEEE Circuits and Systems Society Information | 70
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering | 69
Enhancing Robustness of Multi-Object Trackers With Temporal Feature Mix | 69
Multi-Scale Explicit Matching and Mutual Subject Teacher Learning for Generalizable Person Re-Identification | 69
Recent Advances in Rate Control: From Optimization to Implementation and Beyond | 69
WeaFU: Weather-Informed Image Blind Restoration via Multi-Weather Distribution Diffusion | 69
A Novel Deep Learning Framework for Automatic Recognition of Thyroid Gland and Tissues of Neck in Ultrasound Image | 68
Feature Evaluation and Joint Interaction for Audio-Visual Emotion Recognition | 68
MixSSC: Forward-Backward Mixture for Vision-Based 3D Semantic Scene Completion | 68
Dynamic Particle Filter Framework for Robust Object Tracking | 68
Interlayer Restoration Deep Neural Network for Scalable High Efficiency Video Coding | 68
Compensating for the Incomplete With the Complete: An Efficient Scene Text Detector | 68
Monocular Depth Estimation on Adverse Weathers With Curriculum Domain Distribution Alignment | 68
A Universal Framework for Improving the Robustness of Coverless Image Steganography Based on Image Restoration | 68
BIMM: Brain Inspired Masked Modeling for Video Representation Learning | 68
Mesh2Animation: Unsupervised Animating for Quadruped 3D Objects | 67
OraL: An Observational Learning Paradigm for Unsupervised Hyperspectral Change Detection | 67
DEP-Former: Multimodal Depression Recognition Based on Facial Expressions and Audio Features via Emotional Changes | 67
Dynamic Hypergraph Convolutional Network for No-Reference Point Cloud Quality Assessment | 67
MSGA-Net: Progressive Feature Matching via Multi-Layer Sparse Graph Attention | 67
AMTFusion: boosting 3D object detection by adaptive multi-modal temporal fusion and augmentation | 66
Meta-Learning Based Domain Prior With Application to Optical-ISAR Image Translation | 66
Inter-Scale Similarity Guided Cost Aggregation for Stereo Matching | 66
FaceGCN: Structured Priors Inspired Graph Convolutional Networks for Face Restoration With Unknown Degradations | 66
Forgery-Aware Adaptive Learning With Vision Transformer for Generalized Face Forgery Detection | 66
VSOIQE: A Novel Viewport-Based Stitched 360° Omnidirectional Image Quality Evaluator | 66
Blind Image Quality Index for Authentic Distortions With Local and Global Deep Feature Aggregation | 66
TAKD: Target-Aware Knowledge Distillation for Remote Sensing Scene Classification | 65
A Physical Model-Guided Framework for Underwater Image Enhancement and Depth Estimation | 65
STAF: 3D Human Mesh Recovery From Video With Spatio-Temporal Alignment Fusion | 65
Robust Matrix Completion Based on Factorization and Truncated-Quadratic Loss Function | 65
Unsupervised Deep Hashing With Fine-Grained Similarity-Preserving Contrastive Learning for Image Retrieval | 65
Texture-Aware Spherical Rotation for High Efficiency Omnidirectional Intra Video Coding | 64
CO + 3 : Improved Collaborative Consortium of Foundation Models for Open-World Few-Shot Learning | 64
Table of Contents | 64
Low-Resolution Object Recognition With Cross-Resolution Relational Contrastive Distillation | 64
Medical Data Security in Blockchain: A telemedicine data sharing scheme based on custom OPE and 4D-YG hyperchaotic | 64
Learning Scene-Invariant Distribution for Generalizable Blind Image Quality Assessment | 64
A Label-Free and Non-Monotonic Metric for Evaluating Denoising in Event Cameras | 64
Task-Specific Loss for Robust Instance Segmentation With Noisy Class Labels | 64
Efficiently Exploiting Spatially Variant Knowledge for Video Deblurring | 63
ImagingNet: A New Learnable SAR Imaging Method via Hierarchical U-Shaped Network | 63
Table of Contents | 63
Learning With Noisy Labels by Semantic and Feature Space Collaboration | 63
Touchless Finger Vein and Fingerprint Verification via Exploiting Attention-Based Cross-Domain Fusion | 63
Errata to “Local-Global Temporal Difference Learning for Satellite Video Super-Resolution” | 63
MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation | 62
MDSLCA: Multi-scale Dilated Spatial and Local Channel Attention for LiDAR Point Cloud Semantic Segmentation | 62
Enhancing Vision Transformer with Shift Expansion Linear Attention for Image Classification and Object Tracking | 62
CAIR-Net: Reliability-Aware Information Routing for Robust Multimodal Object Detection under Modality Degradation | 62
Enhanced Spatial-Temporal Salience for Cross-View Gait Recognition | 62
Question-Aware Global-Local Video Understanding Network for Audio-Visual Question Answering | 62
FedRSL: Representation Subspace Learning in Model-Heterogeneous Federated Learning | 62
Non-Local Guided Neural Fields for 4D CT Reconstruction | 62
SMR: Spatial-Guided Model-Based Regression for 3D Hand Pose and Mesh Reconstruction | 61
Conditional Dual Diffusion for Multimodal Clustering of Optical and SAR Images | 61
Optical Flow Reusing for High-Efficiency Space-Time Video Super Resolution | 61
G2LP-Net: Global to Local Progressive Video Inpainting Network | 61
HyPSAM: Hybrid Prompt-Driven Segment Anything Model for RGB-Thermal Salient Object Detection | 61
Flow Visualization for Complex Fluid Flows via a Structure-Enhanced Motion Estimator | 60
FDNet: Frequency Decomposition Network for Learned Image Compression | 60
Lightweight and Personalized Single-Eye Emotion Recognition via CNN-SNN Spatiotemporal Learning and Memory-Inferred Event Features | 60
Multimodal Industrial Anomaly Detection via Geometric Prior | 60
Semantic Disentanglement Adversarial Hashing for Cross-Modal Retrieval | 60
Explanation-Guided Adversarial Training for Robust and Interpretable Models | 59
DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and Authentication | 59
Laplacian Pyramid Fusion Network With Hierarchical Guidance for Infrared and Visible Image Fusion | 59
Diverse Batch Steganography Using Model-Based Selection and Double-Layered Payload Assignment | 59
Adaptive Mixture-of-Experts Distillation for Cross-Satellite Generalizable Incremental Remote Sensing Scene Classification | 58
Target-Aware Tracking With Spatial-Temporal Context Attention | 58
One for All: A Unified Generative Framework for Image Emotion Classification | 58
Multi-Prior Driven Network for RGB-D Salient Object Detection | 58
Transformer-Based Multimodal Emotional Perception for Dynamic Facial Expression Recognition in the Wild | 57
Balanced Teacher for Source-Free Object Detection | 57
Holistic Prototype Attention Network for Few-Shot Video Object Segmentation | 57
CNN-Transformer Based Generative Adversarial Network for Copy-Move Source/Target Distinguishment | 57
Searching a Compact Architecture for Robust Multi-Exposure Image Fusion | 57
Self-Supervised Adversarial Video Summarizer With Context Latent Sequence Learning | 56
Surveillance Video-and-Language Understanding: From Small to Large Multimodal Models | 56
Efficient Non-Blind Image Deblurring With Discriminative Shrinkage Deep Networks | 56
FDAC: Federated Domain Adaptation via Dual Contrastive Learning | 56
StreetSurfGS: Scalable Urban Street Surface Reconstruction With Planar-Based Gaussian Splatting | 56
Flow-Edge Guided Unsupervised Video Object Segmentation | 55
VmambaIR: Visual State Space Model for Image Restoration | 55
Depth Estimation From a Single Image of Blast Furnace Burden Surface Based on Edge Defocus Tracking | 55
Table of Contents | 55
Cloth-Imbalanced Gait Recognition via Hallucination | 55
M3CS: Multi-Target Masked Point Modeling With Learnable Codebook and Siamese Decoders | 55
DBVC: An End-to-End 3-D Deep Biomedical Video Coding Framework | 54
Surface-Continuous Scene Representation for Light Field Depth Estimation via Planarity Prior | 54
Concept-Enhanced Relation Network for Video Visual Relation Inference | 54
Content-Adaptive Rate Control Method for User-Generated Content Videos | 54
IEEE Transactions on Circuits and Systems for Video Technology publication information | 54
Robust Context Modeling for Unsupervised Non-rigid Point Cloud Correspondence | 54
SPCL: Semantic Polymorphism and Commonality Learning for Text-Based Person Retrieval | 53
Corruption-Invariant Person Re-Identification via Coarse-to-Fine Feature Alignment | 53
Diffusion-Based Hypotheses Generation and Joint-Level Hypotheses Aggregation for 3D Human Pose Estimation | 53
Improving Zero-Shot Generalization for CLIP With Prompt Ensemble Self-Distillation | 53
Contrastive Learning With Enhancing Detailed Information for Pre-Training Vision Transformer | 52
POS-Trends Dynamic-Aware Model for Video Caption | 52
PCTrack: Accurate Object Tracking for Live Video Analytics on Resource-Constrained Edge Devices | 52
Special Issue on Segment Anything for Videos and Beyond | 52
Learning Multi-View Stereo With Geometry-Aware Prior | 52
A Pixel-Level Segmentation-Synthesis Framework for Dynamic Texture Video Compression | 52
SmokePose: End-to-End Smoke Keypoint Detection | 52
Table of Contents | 52
DilatedTAD: Enhancing Adaptability to Actions of Varying Durations for Temporal Action Detection | 51
StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion | 51
VideoPure: Diffusion-Based Adversarial Purification for Video Recognition | 51
CRDH: Compatible Reversible Data Hiding With High Capacity and Generalization | 51
Flexible Temperature Parallel Distillation for Dense Object Detection: Make Response-Based Knowledge Distillation Great Again | 51
FastFace: Fast-Converging Scheduler for Large-Scale Face Recognition Training With One GPU | 51
HyperTrack: A Unified Network for Hyperspectral Video Object Tracking | 51
Semantic-Context Graph Network for Point-Based 3D Object Detection | 50
CodingHomo: Bootstrapping Deep Homography With Video Coding | 50
Neuromorphic Imaging With Super-Resolution | 50
CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental Learning | 50
Optical Flow-Based Spatiotemporal Sketch for Video Representation: A Novel Framework | 50
Dense Crosstalk Feature Aggregation for Classification and Localization in Object Detection | 50
U²-Former: Nested U-Shaped Transformer for Image Restoration via Multi-View Contrastive Learning | 50