Journal of Visual Communication and Image Representation

Papers
(The median citation count of Journal of Visual Communication and Image Representation is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
ArticleCitations
Register assisted aggregation for visual place recognition348
SIM-MFR: Spatial interactions mechanisms based multi-feature representation for background modeling123
Advancing white balance correction through deep feature statistics and feature distribution matching110
FormerPose: An efficient multi-scale fusion Transformer network based on RGB-D for 6D pose estimation90
SICNet: Learning selective inter-slice context via Mask-Guided Self-knowledge distillation for NPC segmentation81
Distance distributions and runtime analysis of perceptual hashing algorithms79
Inter-image Token Relation Learning for weakly supervised semantic segmentation77
Fast HEVC inter-frame coding based on LSTM neural network technology61
DB-TASNet for disease diagnosis and lesion segmentation in medical images55
U-TPE: A universal approximate thumbnail-preserving encryption method for lossless recovery54
A fast intra CU partition algorithm in Versatile Video Coding for 360-degree video54
Faster-slow network fused with enhanced fine-grained features for action recognition50
Heterogeneity constrained color ellipsoid prior image dehazing algorithm46
A no-reference panoramic image quality assessment with hierarchical perception and color features45
Aligning computational and human perceptions of image complexity: A dual-task framework for prediction and localization43
DDFusion: An efficient multi-exposure fusion network with dense pyramidal convolution and de-correlation fusion41
All-in-focus image fusion using graph wavelet transform for multi-modal light field39
Learning informative and discriminative semantic features for robust facial expression recognition39
PTR-CNN for in-loop filtering in video coding39
Multi-image super-resolution based low complexity deep network for image compressive sensing reconstruction35
Curriculum-Meta learning for unbiased Multimodal Relation Extraction34
DBCFNet: Underwater image enhancement network based on dual branch convolution and cross level feature fusion33
3D human mesh recovery: Comparative review, models, and prospects33
Reversible data hiding based on automatic contrast enhancement using histogram expansion32
Edge-aware object pixel-level representation tracking32
Dense-sparse representation matters: A point-based method for volumetric medical image segmentation29
Capsule network with using shifted windows for 3D human pose estimation29
FlareDiffusion: Conditional diffusion model for nighttime flare removal29
DA4NeRF: Depth-aware Augmentation technique for Neural Radiance Fields29
Corner-to-Center long-range context model for efficient learned image compression29
High-capacity reversible data hiding in encrypted images based on adaptive block coding selection27
Image contrast enhancement based on the Schrödinger operator spectrum27
Real-world image dehazing with improved joint enhancement and exposure fusion26
A robust coverless image-synthesized video steganography based on asymmetric structure26
Editorial Board26
Locality-constraint Representation with Minkowski distance metric for an effective Face Hallucination25
GLST-Net: Global and local spatio-temporal feature fusion network for skeleton-based action recognition25
Blind deblurring with fractional-order calculus and local minimal pixel prior25
Pedestrian trajectory prediction using multi-cue transformer25
DetailCaptureYOLO: Accurately Detecting Small Targets in UAV Aerial Images24
Person re-identification based on improved attention mechanism and global pooling method24
Action density based frame sampling for human action recognition in videos24
A hierarchical multi-modal cross-attention model for face anti-spoofing22
AI-assisted deepfake detection using adaptive blind image watermarking22
Dual-Branch Wavelet Diffusion models with Dual-Prior Refinement for Underwater Image Enhancement21
Exploring training data-free video generation from a single image via a stable diffusion model21
Semantic similarity guided contrastive hashing for unsupervised cross-modal retrieval21
Reflectance oriented diffusion with normalizing flow illumination enhancement for the low-light images21
MSTG: Multi-Scale Transformer with Gradient for joint spatio-temporal enhancement21
Detection of HEVC double compression based on boundary effect of TU and non-zero DCT coefficient distribution20
Multiple transformation function estimation for image enhancement19
HD-YOLO: Using radius-aware loss function for head detection in top-view fisheye images19
Multi-task learning for video anomaly detection19
Masked latent transformer with random masking ratio to advance the diagnosis of dental fluorosis19
TransGANomaly: Transformer based Generative Adversarial Network for Video Anomaly Detection19
Editorial Board19
Lightweight JPEG image steganalysis using dilated blind-spot network18
Zero-CSC: Low-light image enhancement with zero-reference color self-calibration18
Personality modeling from image aesthetic attribute-aware graph representation learning18
Global–local dual-branch network with local feature enhancement for visual tracking18
DiffEEGBooth: A diffusion-based EEG generation framework for motor imagery with temporal consistency and neurophysiological constraint18
Robust text watermarking based on average skeleton mass of characters against cross-media attacks17
Towards fast and effective low-light image enhancement via adaptive Gamma correction and detail refinement17
End-to-end wavelet block feature purification network for efficient and effective UAV object tracking17
PRA-TPE: Perfectly Recoverable Approximate Thumbnail-Preserving Image Encryption17
SR4KVQA: Video quality assessment database and metric for 4K super-resolution16
Editorial Board16
Learning-based JNCD prediction for quality-wise perceptual quantization in HEVC16
EERCA-ViT: Enhanced Effective Region and Context-Aware Vision Transformers for image sentiment analysis16
Multi-modal semantic embedding network for 3D shape recognition and retrieval16
An active contour model based on Jeffreys divergence and clustering technology for image segmentation16
Opinion-unaware blind quality assessment of AI-generated omnidirectional images based on deep feature statistics16
Lite transformer with medium self attention for efficient traffic sign recognition15
Texture-aware fast mode decision and complexity allocation for VVC based point cloud compression15
Multi-scale convolutional neural networks and saliency weight maps for infrared and visible image fusion15
CCNet: CNN model with channel attention and convolutional pooling mechanism for spatial image steganalysis15
AMCFNet: Asymmetric multiscale and crossmodal fusion network for RGB-D semantic segmentation in indoor service robots15
OODNet: A deep blind JPEG image compression deblocking network using out-of-distribution detection15
ADPNet: Attention based dual path network for lane detection15
A non-extended 3D mesh secret sharing scheme adapted for FPGA processing15
Bi-READ: Bi-Residual AutoEncoder based feature enhancement for video anomaly detection15
Stacked deformable convolution network with weighted non-local attention and branch residual connection for image quality assessment15
Multiscale residual gradient attention for face anti-spoofing14
Uncertainty-aware self-supervised motion deblurring: mitigating physical priors noise via probabilistic learning14
Virtualized three-dimensional reference tables for efficient data embedding14
A novel and efficient image dehazing technique for Advanced Driver Assistance Systems14
SRI-Net: Similarity retrieval-based inference network for light field salient object detection14
Green learning: Introduction, examples and outlook14
Efficient image dehazing algorithm using multiple priors constraints14
EMCFN: Edge-based Multi-scale Cross Fusion Network for video frame interpolation14
Dictionary-based histogram packing technique for lossless image compression14
WAGAN: Bi-orthogonal Wavelet-Guided Attention Network for image and video dehazing14
Progressive enhancement network with pseudo labels for weakly supervised temporal action localization14
Iterative decoupling deconvolution network for image restoration13
Part-attentive kinematic chain-based regressor for 3D human modeling13
Multiple integration model for single-source domain generalizable person re-identification13
Action recognition method based on lightweight network and rough-fine keyframe extraction13
Human gait recognition using joint spatiotemporal modulation in deep convolutional neural networks13
PVT2DNet: Polyp segmentation with vision transformer and dual decoder refinement strategy13
Face reconstruction with detailed skin features via three selfie images13
Aethra-net: Single image and video dehazing using autoencoder13
High-capacity multi-MSB predictive reversible data hiding in encrypted domain for triangular mesh models13
Time series analysis using memory enhanced liquid neural network13
Scientific mapping and bibliometric analysis of research advancements in underwater image enhancement13
Depth error points optimization for 3D Gaussian Splatting in few-shot synthesis13
A super-resolution-based license plate recognition method for remote surveillance13
Corrigendum to “Generative detect for occlusion object based on occlusion generation and feature completing” [J. Visual Commun. Image Represent. 78 (2021) 103189]13
ADcFNet-deep learning based facial expression identification using FER vision transformer13
EFI-YOLO: An enhanced framework for industrial object detection12
Knowledge-guided quantization-aware training for EEG-based emotion recognition12
Robust reversible image watermarking scheme based on spread spectrum12
UnifiedTT: Visual tracking with unified transformer12
Context-dependent emotion recognition12
A two-step enhanced tensor denoising framework based on noise position prior and adaptive ring rank12
Image downscaling via co-occurrence learning12
Neighbor2Global: Self-supervised image denoising for Poisson-Gaussian noise12
Locality sensitive hashing scheme based on online-learning12
Deep chroma prediction of Wyner–Ziv frames in distributed video coding of wireless capsule endoscopy video12
Editorial Board12
Contrastive Deep Supervision Meets self-knowledge distillation12
Image cropping based on order learning12
Survey: 3D watermarking techniques12
A Transformer-based invertible neural network for robust image watermarking12
GSD-YOLOX: Lightweight and more accurate object detection models12
Joint strong edge and multi-stream adaptive fusion network for non-uniform image deblurring12
MIEI:A KID-based quality assessment metric for grayscale industrial equipment images12
Perceptually diverse visual saliency prediction with global context attention12
Decomposing style, content, and motion for videos12
A simple transformer-based baseline for crowd tracking with Sequential Feature Aggregation and Hybrid Group Training11
Residual spatiotemporal convolutional networks for face anti-spoofing11
An efficient optimization of measurement matrix for compressive sensing11
Image copy-move forgery detection using three-stage matching with constraints11
MemFlow-AD: An anomaly detection and localization model based on memory module and normalizing flow11
Salient object detection enhanced pseudo-labels for weakly supervised semantic segmentation11
Multi-scale Superpixel based Hierarchical Attention model for brain CT classification11
Multiple correlation filters with gaussian constraint for fast online tracking11
Multi-scale and multi-patch transformer for sandstorm image enhancement11
P-NOC: Adversarial training of CAM generating networks for robust weakly supervised semantic segmentation priors11
Improved inter-view correlations for low complexity MV-HEVC11
Dual-channel prior-based deep unfolding with contrastive learning for underwater image enhancement11
SiamMBFAN: Siamese tracker with multi-branch feature aggregation network11
Res2former: A multi-scale fusion based transformer feature extraction method11
Intermediate deep feature coding for human–machine vision collaboration11
DRC: Chromatic aberration intensity priors for underwater image enhancement11
Improved threat item detection in baggage X-ray imagery through image projection11
SemMatcher: Semantic-aware feature matching with neighborhood consensus11
CC-SMC: Chain coding-based segmentation map lossless compression11
Object semantic-guided graph attention feature fusion network for Siamese visual tracking11
Joint multi-scale transformers and pose equivalence constraints for 3D human pose estimation10
RQVR: A multi-exposure image fusion network that optimizes rendering quality and visual realism10
Multi-dimensional human preference assessment for AI-generated images with supervised contrastive learning10
Dual-branch manifold information consistency for unsupervised visible–infrared person re-identification10
Cell tracking-by-detection using elliptical bounding boxes10
Compressive Spectral Video Sensing using the Convolutional Sparse Coding framework CSC4D10
Research on a face recognition algorithm based on 3D face data and 2D face image matching10
MG-SSAF: An advanced vision Transformer10
Corrigendum to “Lightweight macro-pixel quality enhancement network for light field images compressed by versatile video coding” [J. Vis. Commun. Image Represent. 105 (2024) 104329]10
Weakly supervised semantic segmentation based on superpixel affinity10
KF-GS: Kalman filter-guided Gaussian splatting for real-time high-quality dynamic scene reconstruction10
Screen-shooting resistant image watermarking based on lightweight neural network in frequency domain10
Detecting Water in Visual Image Streams from UAV with Flight Constraints10
Chosen plaintext attack on JPEG image encryption with adaptive key and run consistency10
A novel high-fidelity reversible data hiding scheme based on multi-classification pixel value ordering10
A channel-wise contextual module for learned intra video compression10
Retrieval augmented generation for smart calorie estimation in complex food scenarios10
LFSimCC: Spatial fusion lightweight network for human pose estimation9
A dual-task region-boundary aware neural network for accurate pulmonary nodule segmentation9
Self2Channel: Self-supervised denoising of different regions using coalition game based channel mask9
Incremental pseudo-labeling for black-box unsupervised domain adaptation9
Gradient degradation-aware rate control for VVC using Nash equilibrium9
A robust and adaptive framework with space–time memory networks for Visual Object Tracking9
Multi-scale features and attention guided for brain tumor segmentation9
NCC-FDM: Frequency-domain diffusion model driven by non-physical-domain color correction for underwater image enhancement9
Human object interaction detection based on feature optimization and key human-object enhancement9
DCPNet: Deformable Control Point Network for image enhancement9
CPA-YOLOv7: Contextual and pyramid attention-based improvement of YOLOv7 for drones scene target detection9
Transferable targeted adversarial attack via multi-source perturbation generation and integration9
Editorial Board9
Vision-language tracking with attention-based optimization9
Machine learning and transformers for thyroid carcinoma diagnosis9
Document forgery detection based on spatial-frequency and multi-scale feature network9
Gesture image recognition method based on DC-Res2Net and a feature fusion attention module9
Image watermarking using DNST-PHFMs magnitude domain vector AGGM-HMT9
BAO: Background-aware activation map optimization for weakly supervised semantic segmentation without background threshold9
Low-complexity ℓ∞-compression of light field images with a deep-decompression stage9
3D human model guided pose transfer via progressive flow prediction network8
HEVC’s intra mode process expedited using Histogram of Oriented Gradients8
Blind quality assessment of light field image based on view and focus stacks8
Reversible data hiding in encrypted 3D mesh models via ripple prediction8
Applying usability assessment method for surveillance video anomaly detection with multiple distortion8
Editorial Board8
Low-complexity content-aware encoding optimization of batch video8
On the multi-level embedding of crypto-image reversible data hiding8
A novel approach in quality assessment for inpainted images8
Infrared small UAV target detection via depthwise separable residual dense attention network8
Transformer-based weakly supervised 3D human pose estimation8
Effective sparse tracking with convolution-based discriminative sparse appearance model8
Offline writer identification approach using moment features and high-order correlation functions8
Dynamic gesture recognition using 3D central difference separable residual LSTM coordinate attention networks8
Unknown Sample Selection and Discriminative Classifier Learning for Generalized Category Discovery8
Night vision self-supervised Reflectance-Aware Depth Estimation based on reflectance8
Hierarchical Feature Difference-guided Network for domain adaptation object detection8
Reversible data hiding for color images based on prediction-error value ordering and adaptive embedding8
Towards real-world haze removal with uncorrelated graph model8
Multi-stream feature refinement network for human object interaction detection8
Correlation-attention guided regression network for efficient crowd counting8
Densely aggregated U-net with spatial-spectral interaction transformer for hyperspectral compressed imaging reconstruction8
Accumulated micro-motion representations for lightweight online action detection in real-time8
TD3Net: A temporal densely connected multi-dilated convolutional network for lipreading8
Improving small objects detection using transformer8
A multi-stage spatio-temporal adaptive network for video super-resolution8
DCAM: Disturbed class activation maps for weakly supervised semantic segmentation8
Multi-branch Segmentation-guided Attention Network for crowd counting8
3D hand reconstruction via aggregating intra and inter graphs guided by prior knowledge for hand-object interaction scenario7
M-YOLOv8s: An improved small target detection algorithm for UAV aerial photography7
Enhanced monocular depth estimation using novel scale-invariant Error Structure Similarity Index measure optimization in Convolutional Neural network architecture7
JPEG image encryption with grouping coefficients based on entropy coding7
DAGNet: Depth-aware Glass-like objects segmentation via cross-modal attention7
SAFA: Lifelong Person Re-Identification learning by statistics-aware feature alignment7
Hierarchical boundary feature alignment network for video salient object detection7
Fast intra coding in AVS3 based on direct non-first pre-coding skip7
A no-reference underwater image quality evaluator via quality-aware features7
Enhancement-suppression driven lightweight fine-grained micro-expression recognition7
Unbiased feature generating for generalized zero-shot learning7
A no-reference perceptual image quality assessment database for learned image codecs7
Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection7
Multi-scale sampling and feature fusion for dynamic human rendering7
An illumination-guided dual-domain network for image exposure correction7
Similarity-aware generative adversarial network for facial expression image translation7
Lightweight whole-body mesh recovery with joints and depth aware hand detail optimization7
LaDeL: Lane detection via multimodal large language model with visual instruction tuning7
A convolutional autoencoder model with weighted multi-scale attention modules for 3D skeleton-based action recognition7
Exploring a Non-Parametric Uncertain Adaptive training method for facial expression recognition7
Quality assessment of windowed 6DoF video with viewpoint switching7
Information entropy induced graph convolutional network for semantic segmentation7
Depth from focus using directional spherical difference filter and vector to scalar fusion7
Adaptive smoothness evaluation and multiple asymmetric histogram modification for reversible data hiding7
Learn decision trees with deep visual primitives7
Copy Move Forgery detection and localisation robust to rotation using block based Discrete Cosine Transform and eigenvalues7
Category-based depth incorporation for salient object ranking7
Efficient object tracking on edge devices with MobileTrack7
Editorial Board7
Texture-aware and color-consistent learning for underwater image enhancement7
DAIRNet: Degradation-aware All-in-one Image Restoration Network with cross-channel feature interaction7
Multiscale Global-Aware Channel Attention for Person Re-identification7
From synthetic to natural — single natural image dehazing deep networks using synthetic dataset domain randomization7
Non-local feature aggregation quaternion network for single image deraining7
Knowledge NeRF: Few-shot novel view synthesis for dynamic articulated objects7
Infrared dim and small target detection based on U-Transformer7
Multi-stage feature-fusion dense network for motion deblurring7
0.080672025680542