OOIR: Observatory of International Research

Papers

(The median citation count of the Journal of Visual Communication and Image Representation is 2. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)

Article	Citations
Register assisted aggregation for visual place recognition	278
Dense-sparse representation matters: A point-based method for volumetric medical image segmentation	132
Multi-image super-resolution based low complexity deep network for image compressive sensing reconstruction	107
Edge-aware object pixel-level representation tracking	97
SIM-MFR: Spatial interactions mechanisms based multi-feature representation for background modeling	91
DDFusion: An efficient multi-exposure fusion network with dense pyramidal convolution and de-correlation fusion	79
Single-image depth estimation using relative depths	72
PTR-CNN for in-loop filtering in video coding	65
Advancing white balance correction through deep feature statistics and feature distribution matching	63
Capsule network with using shifted windows for 3D human pose estimation	56
SICNet: Learning selective inter-slice context via Mask-Guided Self-knowledge distillation for NPC segmentation	49
Reversible data hiding based on automatic contrast enhancement using histogram expansion	48
Faster-slow network fused with enhanced fine-grained features for action recognition	46
Learning informative and discriminative semantic features for robust facial expression recognition	45
A robust coverless image-synthesized video steganography based on asymmetric structure	42
Real-world image dehazing with improved joint enhancement and exposure fusion	38
Corner-to-Center long-range context model for efficient learned image compression	38
Inter-image Token Relation Learning for weakly supervised semantic segmentation	37
DA4NeRF: Depth-aware Augmentation technique for Neural Radiance Fields	37
Fast HEVC inter-frame coding based on LSTM neural network technology	36
A fast intra CU partition algorithm in Versatile Video Coding for 360-degree video	35
Distance distributions and runtime analysis of perceptual hashing algorithms	35
Aligning computational and human perceptions of image complexity: A dual-task framework for prediction and localization	35
High-capacity reversible data hiding in encrypted images based on adaptive block coding selection	34
FormerPose: An efficient multi-scale fusion Transformer network based on RGB-D for 6D pose estimation	33
U-TPE: A universal approximate thumbnail-preserving encryption method for lossless recovery	32
A no-reference panoramic image quality assessment with hierarchical perception and color features	31
DB-TASNet for disease diagnosis and lesion segmentation in medical images	30
Neural Style Transfer for image within images and conditional GANs for destylization	29
Editorial Board	29
Heterogeneity constrained color ellipsoid prior image dehazing algorithm	29
GLST-Net: Global and local spatio-temporal feature fusion network for skeleton-based action recognition	28
Locality-constraint Representation with Minkowski distance metric for an effective Face Hallucination	27
Masked latent transformer with random masking ratio to advance the diagnosis of dental fluorosis	27
Detection of HEVC double compression based on boundary effect of TU and non-zero DCT coefficient distribution	26
Learning-based JNCD prediction for quality-wise perceptual quantization in HEVC	26
Multi-task learning for video anomaly detection	26
MSTG: Multi-Scale Transformer with Gradient for joint spatio-temporal enhancement	26
Editorial Board	25
Blind deblurring with fractional-order calculus and local minimal pixel prior	24
AI-assisted deepfake detection using adaptive blind image watermarking	24
Robust text watermarking based on average skeleton mass of characters against cross-media attacks	24
Dual-Branch Wavelet Diffusion models with Dual-Prior Refinement for Underwater Image Enhancement	23
Zero-CSC: Low-light image enhancement with zero-reference color self-calibration	23
A hierarchical multi-modal cross-attention model for face anti-spoofing	23
Lightweight JPEG image steganalysis using dilated blind-spot network	22
Exploring training data-free video generation from a single image via a stable diffusion model	22
Action density based frame sampling for human action recognition in videos	22
HD-YOLO: Using radius-aware loss function for head detection in top-view fisheye images	21
DetailCaptureYOLO: Accurately Detecting Small Targets in UAV Aerial Images	21
An active contour model based on Jeffreys divergence and clustering technology for image segmentation	21
Attention mechanism enhancement algorithm based on cycle consistent generative adversarial networks for single image dehazing	20
Person re-identification based on improved attention mechanism and global pooling method	20
PRA-TPE: Perfectly Recoverable Approximate Thumbnail-Preserving Image Encryption	20
TransGANomaly: Transformer based Generative Adversarial Network for Video Anomaly Detection	20
Personality modeling from image aesthetic attribute-aware graph representation learning	20
End-to-end wavelet block feature purification network for efficient and effective UAV object tracking	20
Multiple transformation function estimation for image enhancement	20
ADPNet: Attention based dual path network for lane detection	19
A novel and efficient image dehazing technique for Advanced Driver Assistance Systems	19
SpyGAN sketch: Heterogeneous Face Matching in video for crime investigation	19
Editorial Board	19
EERCA-ViT: Enhanced Effective Region and Context-Aware Vision Transformers for image sentiment analysis	19
AMCFNet: Asymmetric multiscale and crossmodal fusion network for RGB-D semantic segmentation in indoor service robots	19
LRHW-AP: Using ranking-based metric as loss for Person Re-Identification	18
Dictionary-based histogram packing technique for lossless image compression	18
OODNet: A deep blind JPEG image compression deblocking network using out-of-distribution detection	17
Multiscale residual gradient attention for face anti-spoofing	17
Multi-modal semantic embedding network for 3D shape recognition and retrieval	17
Reversal of pixel rotation: A reversible data hiding system towards cybersecurity in encrypted images	17
SR4KVQA: Video quality assessment database and metric for 4K super-resolution	17
CCNet: CNN model with channel attention and convolutional pooling mechanism for spatial image steganalysis	17
Opinion-unaware blind quality assessment of AI-generated omnidirectional images based on deep feature statistics	17
EMCFN: Edge-based Multi-scale Cross Fusion Network for video frame interpolation	16
Security measurement of a medical communication scheme based on chaos and DNA coding	16
Green learning: Introduction, examples and outlook	16
Cross-layer progressive attention bilinear fusion method for fine-grained visual classification	16
Bi-READ: Bi-Residual AutoEncoder based feature enhancement for video anomaly detection	15
SRI-Net: Similarity retrieval-based inference network for light field salient object detection	15
Multi-scale convolutional neural networks and saliency weight maps for infrared and visible image fusion	15
Stacked deformable convolution network with weighted non-local attention and branch residual connection for image quality assessment	15
Texture-aware fast mode decision and complexity allocation for VVC based point cloud compression	15
A non-extended 3D mesh secret sharing scheme adapted for FPGA processing	15
Lite transformer with medium self attention for efficient traffic sign recognition	15
Corrigendum to “Generative detect for occlusion object based on occlusion generation and feature completing” [J. Visual Commun. Image Represent. 78 (2021) 103189]	14
Neighbor2Global: Self-supervised image denoising for Poisson-Gaussian noise	14
Scientific mapping and bibliometric analysis of research advancements in underwater image enhancement	14
Progressive enhancement network with pseudo labels for weakly supervised temporal action localization	14
UnifiedTT: Visual tracking with unified transformer	14
Human gait recognition using joint spatiotemporal modulation in deep convolutional neural networks	14
Part-attentive kinematic chain-based regressor for 3D human modeling	14
Locality sensitive hashing scheme based on online-learning	14
Virtualized three-dimensional reference tables for efficient data embedding	14
Knowledge-guided quantization-aware training for EEG-based emotion recognition	13
High-capacity multi-MSB predictive reversible data hiding in encrypted domain for triangular mesh models	13
Face reconstruction with detailed skin features via three selfie images	13
A super-resolution-based license plate recognition method for remote surveillance	13
Efficient image dehazing algorithm using multiple priors constraints	13
Robust reversible image watermarking scheme based on spread spectrum	13
Iterative decoupling deconvolution network for image restoration	13
PVT2DNet: Polyp segmentation with vision transformer and dual decoder refinement strategy	13
Action recognition method based on lightweight network and rough-fine keyframe extraction	13
A Transformer-based invertible neural network for robust image watermarking	13
Depth error points optimization for 3D Gaussian Splatting in few-shot synthesis	12
Accurate bounding-box regression with distance-IoU loss for visual tracking	12
A two-step enhanced tensor denoising framework based on noise position prior and adaptive ring rank	12
Joint strong edge and multi-stream adaptive fusion network for non-uniform image deblurring	12
DRC: Chromatic aberration intensity priors for underwater image enhancement	12
Image cropping based on order learning	12
Improved threat item detection in baggage X-ray imagery through image projection	12
Multiple integration model for single-source domain generalizable person re-identification	12
GSD-YOLOX: Lightweight and more accurate object detection models	12
Image downscaling via co-occurrence learning	12
Improved inter-view correlations for low complexity MV-HEVC	12
Editorial Board	12
Survey: 3D watermarking techniques	12
Context-dependent emotion recognition	12
Aethra-net: Single image and video dehazing using autoencoder	12
ADcFNet-deep learning based facial expression identification using FER vision transformer	12
Deep chroma prediction of Wyner–Ziv frames in distributed video coding of wireless capsule endoscopy video	12
Decomposing style, content, and motion for videos	12
Contrastive Deep Supervision Meets self-knowledge distillation	12
MemFlow-AD: An anomaly detection and localization model based on memory module and normalizing flow	12
Image watermarking using DNST-PHFMs magnitude domain vector AGGM-HMT	11
Multiple correlation filters with gaussian constraint for fast online tracking	11
SemMatcher: Semantic-aware feature matching with neighborhood consensus	11
Decomposition and replacement: Spatial knowledge distillation for monocular depth estimation	11
Exemplar-based image inpainting using adaptive two-stage structure-tensor based priority function and nonlocal filtering	11
Transferable targeted adversarial attack via multi-source perturbation generation and integration	11
Verifiable varying sized (m,	11
A channel-wise contextual module for learned intra video compression	11
An efficient optimization of measurement matrix for compressive sensing	11
MIEI: A KID-based quality assessment metric for grayscale industrial equipment images	11
Dual-channel prior-based deep unfolding with contrastive learning for underwater image enhancement	11
A simple transformer-based baseline for crowd tracking with Sequential Feature Aggregation and Hybrid Group Training	11
SiamMBFAN: Siamese tracker with multi-branch feature aggregation network	11
CC-SMC: Chain coding-based segmentation map lossless compression	11
Compressive Spectral Video Sensing using the Convolutional Sparse Coding framework CSC4D	11
P-NOC: Adversarial training of CAM generating networks for robust weakly supervised semantic segmentation priors	11
Salient object detection enhanced pseudo-labels for weakly supervised semantic segmentation	11
RQVR: A multi-exposure image fusion network that optimizes rendering quality and visual realism	11
Cell tracking-by-detection using elliptical bounding boxes	11
Dual-branch manifold information consistency for unsupervised visible–infrared person re-identification	10
Chosen plaintext attack on JPEG image encryption with adaptive key and run consistency	10
Residual spatiotemporal convolutional networks for face anti-spoofing	10
Joint multi-scale transformers and pose equivalence constraints for 3D human pose estimation	10
Low-complexity ℓ∞	10
Res2former: A multi-scale fusion based transformer feature extraction method	10
CPA-YOLOv7: Contextual and pyramid attention-based improvement of YOLOv7 for drones scene target detection	10
Retrieval augmented generation for smart calorie estimation in complex food scenarios	10
Research on a face recognition algorithm based on 3D face data and 2D face image matching	10
A novel high-fidelity reversible data hiding scheme based on multi-classification pixel value ordering	10
Multi-scale features and attention guided for brain tumor segmentation	10
Intermediate deep feature coding for human–machine vision collaboration	10
MG-SSAF: An advanced vision Transformer	10
Multi-scale and multi-patch transformer for sandstorm image enhancement	10
Object semantic-guided graph attention feature fusion network for Siamese visual tracking	10
LFSimCC: Spatial fusion lightweight network for human pose estimation	10
Multi-scale Superpixel based Hierarchical Attention model for brain CT classification	10
A dual-task region-boundary aware neural network for accurate pulmonary nodule segmentation	10
Vision-language tracking with attention-based optimization	10
Screen-shooting resistant image watermarking based on lightweight neural network in frequency domain	10
Editorial Board	9
Corrigendum to “Lightweight macro-pixel quality enhancement network for light field images compressed by versatile video coding” [J. Vis. Commun. Image Represent. 105 (2024) 104329]	9
Document forgery detection based on spatial-frequency and multi-scale feature network	9
BAO: Background-aware activation map optimization for weakly supervised semantic segmentation without background threshold	9
3D human model guided pose transfer via progressive flow prediction network	9
Applying usability assessment method for surveillance video anomaly detection with multiple distortion	9
DCPNet: Deformable Control Point Network for image enhancement	9
Reversible data hiding in encrypted 3D mesh models via ripple prediction	9
Human object interaction detection based on feature optimization and key human-object enhancement	9
Weakly supervised semantic segmentation based on superpixel affinity	9
Effective sparse tracking with convolution-based discriminative sparse appearance model	9
HEVC’s intra mode process expedited using Histogram of Oriented Gradients	9
Self2Channel: Self-supervised denoising of different regions using coalition game based channel mask	9
Dynamic gesture recognition using 3D central difference separable residual LSTM coordinate attention networks	9
Detecting Water in Visual Image Streams from UAV with Flight Constraints	9
A robust and adaptive framework with space–time memory networks for Visual Object Tracking	9
Visual secret sharing scheme with (n,n) threshold based on WeChat Mini Program codes	9
Machine learning and transformers for thyroid carcinoma diagnosis	9
Towards real-world haze removal with uncorrelated graph model	9
Unknown Sample Selection and Discriminative Classifier Learning for Generalized Category Discovery	9
Multi-stream feature refinement network for human object interaction detection	9
Transformer-based weakly supervised 3D human pose estimation	8
Enhancement-suppression driven lightweight fine-grained micro-expression recognition	8
Learn decision trees with deep visual primitives	8
Category-based depth incorporation for salient object ranking	8
Improving small objects detection using transformer	8
Blind quality assessment of light field image based on view and focus stacks	8
Multi-branch Segmentation-guided Attention Network for crowd counting	8
Gesture image recognition method based on DC-Res2Net and a feature fusion attention module	8
Reversible data hiding for color images based on prediction-error value ordering and adaptive embedding	8
Cross-dataset emotion recognition from facial expressions through convolutional neural networks	8
JPEG image encryption with grouping coefficients based on entropy coding	8
Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection	8
Efficient object tracking on edge devices with MobileTrack	8
SecureDL: A privacy preserving deep learning model for image recognition over cloud	8
On the multi-level embedding of crypto-image reversible data hiding	8
Offline writer identification approach using moment features and high-order correlation functions	8
CDGAN: Cyclic Discriminative Generative Adversarial Networks for image-to-image transformation	8
Correlation-attention guided regression network for efficient crowd counting	8
Accumulated micro-motion representations for lightweight online action detection in real-time	8
Low-complexity content-aware encoding optimization of batch video	8
Enhanced monocular depth estimation using novel scale-invariant Error Structure Similarity Index measure optimization in Convolutional Neural network architecture	8
Quality assessment of windowed 6DoF video with viewpoint switching	8
An illumination-guided dual-domain network for image exposure correction	8
3D hand reconstruction via aggregating intra and inter graphs guided by prior knowledge for hand-object interaction scenario	8
TD3Net: A temporal densely connected multi-dilated convolutional network for lipreading	8
Editorial Board	8
Editorial Board	8
Incremental pseudo-labeling for black-box unsupervised domain adaptation	8
A novel dynamic gesture understanding algorithm fusing convolutional neural networks with hand-crafted features	8
Unbiased feature generating for generalized zero-shot learning	7
Fast intra coding in AVS3 based on direct non-first pre-coding skip	7
Modification in spatial, extraction from transform: Keyless side-information steganography for JPEG	7
Deep semantic image compression via cooperative network pruning	7
Contour enhanced image super-resolution	7
Infrared dim and small target detection based on U-Transformer	7
Non-local feature aggregation quaternion network for single image deraining	7
Similarity-aware generative adversarial network for facial expression image translation	7
A no-reference perceptual image quality assessment database for learned image codecs	7
Night vision self-supervised Reflectance-Aware Depth Estimation based on reflectance	7
Adaptive smoothness evaluation and multiple asymmetric histogram modification for reversible data hiding	7
SCPNet: Self-constrained parallelism network for keypoint-based lightweight object detection	7
Feature generation based on relation learning and image partition for occluded person re-identification	7
Knowledge NeRF: Few-shot novel view synthesis for dynamic articulated objects	7
From synthetic to natural — single natural image dehazing deep networks using synthetic dataset domain randomization	7
Undirected graph representing strategy for general room layout estimation	7
Lightweight three-stream encoder–decoder network for multi-modal salient object detection	7
Copy Move Forgery detection and localisation robust to rotation using block based Discrete Cosine Transform and eigenvalues	7
Editorial Board	7
DCAM: Disturbed class activation maps for weakly supervised semantic segmentation	7
A novel complex-valued convolutional network for real-world single image dehazing	7
Editorial Board	7
ThermalDiff: A diffusion architecture for thermal image synthesis	7
A small object detection algorithm based on feature interaction and guided learning	7
Hierarchical boundary feature alignment network for video salient object detection	7
Multiscale Global-Aware Channel Attention for Person Re-identification	7
DAGNet: Depth-aware Glass-like objects segmentation via cross-modal attention	7
Human skeleton representation for 3D action recognition based on complex network coding and LSTM	7
SAFA: Lifelong Person Re-Identification learning by statistics-aware feature alignment	7
M-YOLOv8s: An improved small target detection algorithm for UAV aerial photography	7
A multi-stage spatio-temporal adaptive network for video super-resolution	7
Editorial Board	7
Shareability-Exclusivity Representation on Product Grassmann Manifolds for Multi-camera video clustering	7
Real-time facial expression recognition via quaternion Gabor convolutional neural network	6
Measuring dense false positive regions from segmentation result for whole slide tissue histology image	6
GoLDFormer: A global–local deformable window transformer for efficient image restoration	6
Coarse-to-fine underwater image enhancement with lightweight CNN and attention-based refinement	6
PMDNet: A multi-stage approach to single image dehazing with contextual and spatial feature preservation	6
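
The selection rule stated in the note at the top of the table (papers above the journal's median citation count, within a publication window, capped at 250 rows) can be sketched roughly as follows. This is only an illustration, not OOIR's actual code: the `Paper` class, the `select_papers` function, the inclusive window bounds, taking the median over the papers in the window, and the descending sort are all assumptions, and the sample rows are made up.

```python
from dataclasses import dataclass
from datetime import date
from statistics import median


@dataclass
class Paper:
    # Hypothetical record type; the field names are illustrative only.
    title: str
    citations: int
    published: date


def select_papers(papers, start, end, limit=250):
    """Return papers above the median citation count, most cited first."""
    # Keep papers published inside the window (inclusive bounds are an assumption).
    in_window = [p for p in papers if start <= p.published <= end]
    if not in_window:
        return []
    # Assumption: the median is taken over the papers inside the window.
    threshold = median(p.citations for p in in_window)
    # "Above that threshold" is read here as strictly greater than the median.
    above = [p for p in in_window if p.citations > threshold]
    # Assumption: rows are ordered by citation count, descending (stable sort).
    above.sort(key=lambda p: p.citations, reverse=True)
    return above[:limit]


# Made-up sample data, not taken from the table above.
sample = [
    Paper("Paper A", 10, date(2023, 5, 1)),
    Paper("Paper B", 2, date(2022, 3, 1)),
    Paper("Paper C", 1, date(2024, 7, 1)),
    Paper("Paper D", 5, date(2025, 1, 1)),
    Paper("Paper E", 50, date(2020, 1, 1)),  # outside the window
]
selected = select_papers(sample, date(2022, 1, 1), date(2026, 1, 1))
for p in selected:
    print(f"{p.title}\t{p.citations}")
```

With a strict "greater than" comparison, papers sitting exactly at the median are excluded; whether the real listing treats ties this way is not stated in the source.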