OOIR: Observatory of International Research

Papers

(The median citation count of Visual Computer is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)

Article	Citations
Robust object recognition via context-driven reliability assessment	124
Adaptively weighted discrete Laplacian for inverse rendering	113
Learning shape abstraction by cropping positive cuboid primitives with negative ones	81
Edge-priority-extraction network using re-parameterization for real-time super-resolution	78
Still room for improvement in traditional 3D interaction: selecting the fixed axis in the virtual trackball	72
Region-based adaptive association learning for robust image scene recognition	72
Multi-modal co-attention relation networks for visual question answering	63
Depth-guided color correction and multi-scale Retinex network for underwater image enhancement	59
Deep learning in chronic wound segmentation: a comprehensive review and meta-analysis	59
Joint attribute soft-sharing and contextual local: a multi-level features learning network for person re-identification	58
Hybrid annotation alignment-based multi-region crop model for high-resolution image	57
Enhancing green screen matting with group normalization and perceptual loss for color overflow and complex edges	55
Virtual object sizes for efficient and convenient mid-air manipulation	50
V$$^2$$MLP: an accurate and simple multi-view MLP network for fine-grained 3D shape recognition	45
Developing an augmented reality framework with embedded objects and adaptive optical models for advanced lighting simulation	44
Image encryption algorithm based on cross-scrambling and rapid-mode diffusion	43
Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement	43
Enhanced optical flow estimation via multiscale kernel selection and super-resolution integration	42
Feature decomposition and structural learning for multi-diverse and multi-view data clustering	41
Lightweight subpixel sampling network for image super-resolution	38
A two-stage model for spatial downscaling of daily precipitation data	38
Adaptive box-level supervision with superpixel shape guidance for ultrasound image segmentation	37
Exploring Structural Lines for Interior Floorplan Segmentation	37
Camera calibration for the surround-view system: a benchmark and dataset	36
Monocular human depth estimation with 3D motion flow and surface normals	36

Nighttime driver behavior prediction using taillight signal recognition via CNN-SVM classifier	35
Robust corner detection in continuous space	35
MoCoSys: human motion correction based on deep learning coupled with 3D+t Laplacian motion representation	35
MAPD: multi-receptive field and attention mechanism for multispectral pedestrian detection	34
Enhanced Temporal Representation and Spatial Alignment for High-Fidelity Talking Video Generation	33
PoseNorm-PCN: pose-normalized human point cloud completion from a single front view	33
SES-yolov5: small object graphics detection and visualization applications	33
Generative artificial intelligence for ophthalmic images: developments, applications and challenges	32
Gaze-contingent adaptation of VR stereo parameters for cybersickness prevention	31
A new chaotic image encryption algorithm based on dynamic DNA coding and RNA computing	31
Weighted and truncated $$L_1$$ image smoothing based on unsupervised learning	30
A multi-target cow face detection model in complex scenes	30
Using transfer learning to determine the type of mathematical fractals image of Islamic geometric patterns	30
SCAKD: a knowledge distillation framework based on spatial-corner attention for infrared and visible image fusion	29
Cfseg-Net: context feature extraction network for medical image segmentation	29
Point cloud quality assessment: unifying projection, geometry, and texture similarity	28
AttentionDIP: attention-based deep image prior model to restore satellite and aerial images from gamma distributed speckle interference	28
Road crack detection using pixel classification and intensity-based distinctive fuzzy C-means clustering	27
Self-knowledge distillation through ensemble model averaging: a novel approach for image classification	27
Secure management of retinal imaging based on deep learning, zero-watermarking and reversible data hiding	27
Study on the methods of hyperspectral image saliency detection based on MBCNN	26
Picking out the bad apples: unsupervised biometric data filtering for refined age estimation	26
CLFormer: a unified transformer-based framework for weakly supervised crowd counting and localization	26
ImpRes: implicit residual diffusion models for image super-resolution	26
Preface (Vol 39. Issue 6, June 2023)	26
Hybrid Mamba-Transformer Multi-Agent Reinforcement Learning for scalable coordination in complex environments	25
PlayNet: real-time handball play classification with Kalman embeddings and neural networks	25
Generating natural pedestrian crowds by learning real crowd trajectories through a transformer-based GAN	25
ARP$$\Delta $$: Accelerated ray-tracing photon differentials for real-time global illumination with combined specular and diffuse solutions	25
PL-MCT: pseudo-labeling and multi-frame consistency training for semi-supervised visual tracking	24
Two-stream inter-class variation enhancement network for facial expression recognition	24
A modified fuzzy clustering algorithm based on dynamic relatedness model for image segmentation	24
Boosting vision transformer for low-resolution borehole image stitching through algebraic multigrid	24
Correction: 3D reconstruction method based on N-step phase unwrapping	24
Lane line detection and departure estimation in a complex environment by using an asymmetric kernel convolution algorithm	23
Regions of interest selection in histopathological images using subspace and multi-objective stream clustering	23
Boosted verification using siamese neural network with DiffBlock	23
Personalized hairstyle and hair color editing based on multi-feature fusion	23
Correction: DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision	23
Compact storage of additively weighted Voronoi diagrams	23
Icg: intensity and color gradient operator on RGB images for visual object tracking	23
Enhanced visual perception for underwater images based on multistage generative adversarial network	22
Sound signatures for images and geometric shapes	22
MF-SAM: enhancing multi-modal fusion with Mamba in SAM-Med3D for GPi segmentation	22
Artificial intelligence-assisted cervical dysplasia detection using papanicolaou smear images	22
Arbitrary style transfer via content consistency and style consistency	21
Application of VR to ikebana education	21
Enhancing multi-scale information exchange and feature fusion for human pose estimation	21
ConvFormer: parameter reduction in transformer models for 3D human pose estimation by leveraging dynamic multi-headed convolutional attention	21
DMINet: dense multi-scale inference network for salient object detection	21

Adversarial-based refinement dual-branch network for semi-supervised salient object detection of strip steel surface defects	21
Dsf-net: a dual-stream fusion network integrating structural and detailed features for fundus-based diabetic retinopathy classification	21
Liver segmentation based on complementary features U-Net	21
Double-handed dynamic gesture recognition using contour-based hand tracking and maximum mean probability ensembling (MMPE) for Indian Sign Language	21
Enhancing 3D human pose estimation via spatio-temporal dual-stream fusion	21
WeedGan: a novel generative adversarial network for cotton weed identification	21
High-fidelity facial expression transfer using part-based local–global conditional gans	20
An unsupervised denoising model for poisson noise using GSURE-driven deep image prior with multi-order regularization for medical imaging	20
Distribution-decouple learning network: an innovative approach for single image dehazing with spatial and frequency decoupling	20
Bridging realities: training visuo-haptic object recognition models for robots using 3D virtual simulations	20
Attention-enhanced controllable disentanglement for cloth-changing person re-identification	20
SSoB: searching a scene-oriented architecture for underwater object detection	20
DeMaskGAN: a de-masking generative adversarial network guided by semantic segmentation	20
Decoupled spatio-temporal grouping transformer for skeleton-based action recognition	20
Capturing spatiotemporal dependencies with competitive set attention for video summarization	20
Optimal feature selection and classification of Indian classical dance hand gesture dataset	20
Vision transformers (ViT) and deep convolutional neural network (D-CNN)-based models for MRI brain primary tumors images multi-classification supported by explainable artificial intelligence (XAI)	20
A novel robust image watermarking algorithm based on polar decomposition and image geometric correction	20
Dynamic underwater cognition: aligned detection networks for enhanced underwater object recognition	20
MCGFF-Net: a multi-scale context-aware and global feature fusion network for enhanced polyp and skin lesion segmentation	19
Dual-attention U-Net and multi-convolution network for single-image rain removal	19
Anti-counterfeiting textured pattern	19
PMGAN: pretrained model-based generative adversarial network for text-to-image generation	19
A novel robust digital image watermarking scheme based on attention U-Net++ structure	19
An automatic framework for quadrilateral surface reconstruction with partitions from 3D point clouds	19
A point cloud self-learning network based on contrastive learning for classification and segmentation	19
HSNet: hierarchical semantics network for scene parsing	19
Topology-preserved human reconstruction with details	19
Locality-constrained double-layer structure scaled simplex multi-view subspace clustering	18
Branch aware assignment for object detection	18
Prior-based privacy-assured compressed sensing scheme in cloud	18
Detail-aware image denoising via structure preserved network and residual diffusion model	18
Explaining away results in more robust visual tracking	18
A dynamic range adjustable inverse tone mapping operator based on human visual system	18
Multi-camera tracking of mechanically thrown objects for automated in-plant logistics by cognitive robots in Industry 4.0	18
Skin scar segmentation based on saliency detection	17
Cross-modal collaborative propagation for RGB–T saliency detection	17
Latent diffusion transformer for point cloud generation	17
Learning to sculpt neural cityscapes	17
LKSMN: Large Kernel Spatial Modulation Network for Lightweight Image Super-Resolution	17
Light field depth estimation using occlusion-aware consistency analysis	17
Correction: Material-aware Cross-channel Interaction Attention (MCIA) for occluded prohibited item detection	17
Blind image quality assessment by simulating the visual cortex	17
Salient-aware multiple instance learning optimized network for weakly supervised object detection	17
OSH-Splat: optimizable semantic hyperplanes for enhanced 3D language feature Gaussian splatting	17
CF-GAN: cross-domain feature fusion generative adversarial network for text-to-image synthesis	17
Deep channel-spatial attention networks for enhancing super-resolution of high-magnification SEM images	17
Point-voxel dual stream transformer for 3d point cloud learning	17
TDGar-Ani: temporal motion fusion model and deformation correction network for enhancing garment animation details	16
The devil in the details: simple and effective optical flow synthetic data generation	16
An enhanced multi-scale weight assignment strategy of two-exposure fusion	16
A workflow to systematically design uncertainty-aware visual analytics applications	16
LVDIF: a framework for real-time interaction with large volume data	16
A coarse-to-fine ghost removal scheme for HDR imaging	16
Patch excitation network for boxless action recognition in still images	16
Label-guided 4D Gaussian splatting for high-fidelity dynamic scene reconstruction	15
Fusiform multi-scale pixel self-attention network for hyperspectral images reconstruction from a single RGB image	15
Occlusion-aware segmentation via RCF-Pix2Pix generative network	15
ViT-SIR: vision transformer-based shoe image retrieval with enhanced feature representation	15
Toward robust visual tracking for UAV with adaptive spatial-temporal weighted regularization	15
Multimodal biometrics authentication using extreme learning machine with feature reduction by adaptive particle swarm optimization	15
A self-attention model for viewport prediction based on distance constraint	15
Semi-supervised multi-view clustering by label relaxation based non-negative matrix factorization	15
Virtual simulation for the dynamic response of concrete blocks under blast loading	15
Facial expression recognition based on local–global information reasoning and spatial distribution of landmark features	15
Outfit compatibility model using fully connected self-adjusting graph neural network	15
HEU-Net: hybrid attention residual block-based network with external skip connections for metal corrosion semantic segmentation	15
FFANet: dual attention-based flow field-aware network for wall identification	15
A mixed reality framework for microsurgery simulation with visual-tactile perception	15
3D human body reconstruction based on SMPL model	15
A cascaded graph convolutional network for point cloud completion	14
A two-stage network with wavelet transformation for single-image deraining	14
DICNet: achieve low-light image enhancement with image decomposition, illumination enhancement, and color restoration	14
Wall segmentation in house plans: fusion of deep learning and traditional methods	14
E-FPN: an enhanced feature pyramid network for UAV scenarios detection	14
Fairing-PIA: progressive-iterative approximation for fairing curve and surface generation	14
Virtual reality support for thoracoscopic surgery design	14
CLAC-Net: a composite medical image segmentation framework using self-attention and cross-layer asymmetric connections	14
3D printer vision calibration system based on embedding Sobel bilateral filter in least squares filtering algorithm	14
Semantic-Orthogonal Multi-modal Attention Network for RGB-D Salient Object Detection	14

EdgeLF: edge-guided registration with loftr for visible and infrared images	14
Suspect face retrieval using visual and linguistic information	13
Attention-guided self-supervised distinctive region detection in point clouds	13
A novel infrared and visible image fusion method based on multi-level saliency integration	13
Internal and external transmission encoder–decoder network for single-image deraining	13
Edge-aware texture filtering with superpixels constraint	13
STVDNet: spatio-temporal interactive video de-raining network	13
Graph matching based on feature and spatial location information	13
Fast harmonic tetrahedral mesh optimization	13
A hue preserving uniform illumination image enhancement via triangle similarity criterion in HSI color space	13
Msc-Net: multi-stage colorization network for real-world images with specular highlights	13
Neural network adaption for depth sensor replication	13
GCAENet: global-class context with advanced edge network for single human parsing	13
Multi-view clustering based on graph learning and view diversity learning	13
Editorial June 2024 ( Vol 40, Issue 6)	13
A survey on soccer player detection and tracking with videos	13
A neural builder for spatial subdivision hierarchies	13
Adaptive cascaded and parallel feature fusion for visual object tracking	13
The crowd cooperation approach for formation maintenance and collision avoidance using multi-agent deep reinforcement learning	13
The infinite doodler: expanding textures within tightly constrained manifolds	13
PDFT: parameter-diminish fine-tuning for transformer-based models	13
GlcMatch: global and local constraints for reliable feature matching	13
Enhancing high-vocabulary image annotation with a novel attention-based pooling	13
Generalized unsupervised functional map learning for dense correspondence	13
Boosting remote semantic segmentation using vision-and-language foundation model	13
Enhanced fine-grained visual classification through lightweight Transformer integration and auxiliary information fusion	12
Logical reasoning-enhanced interactive clustering: an efficient algorithm for large-scale datasets	12
A novel artificial bee colony clustering algorithm with comprehensive improvement	12
CSNet: a ConvNeXt-based Siamese network for RGB-D salient object detection	12
Hierarchical feature fusion network for light field spatial super-resolution	12
MPA-Det: multi-path aggregation-based object detection framework for aerial visual computing	12
M-GAN: multiattribute learning and multimodal feature fusion-based generative adversarial network for text-to-image synthesis	12
Recycling/upcycling graphic design: automatic design elements extraction and vectorization	12
TRAIL: Simulating the impact of human locomotion on natural landscapes	12
ODRP: a new approach for spatial street sign detection from EXIF using deep learning-based object detection, distance estimation, rotation and projection system	12
Semantically Enhanced Dual Visual Fusion Transformer for accurate image captioning	12
LiteMSNet: a lightweight semantic segmentation network with multi-scale feature extraction for urban streetscape scenes	12
MS-GAN: multi-scale GAN with parallel class activation maps for image reconstruction	12
Multi-channel correlated diffusion for text-driven artistic style transfer	12
An algorithm for cross-fiber separation in yarn hairiness image processing	12
Regularity-constrained point cloud reconstruction of building models via global alignment	12
Data privacy protection domain adaptation by roughing and finishing stage	12
Adaptive fourier-enhanced vision transformer with self-learning smoothing masks for accurate cat face recognition	12
A new face presentation attack detection method based on face-weighted multi-color multi-level texture features	12
AQPnP: an accurate and quaternion-based solution for the Perspective-n-Point problem	12
Arpotcam: augmented reality-driven honeypot for enhancing security in IoT surveillance systems	12
Ellipsoid-SLAM: enhancing dynamic scene understanding through ellipsoidal object representation and trajectory tracking	12
Adaptive frequency time-distribution network—a multiscale deblurring technique	12
Digital human and embodied intelligence for sports science: advancements, opportunities and prospects	12
Adt-net: adaptive transformation-driven text-based person search network for enhancing cross-modal retrieval robustness	12
TransDehaze: transformer-enhanced texture attention for end-to-end single image dehaze	12
High-frequency channel attention and contrastive learning for image super-resolution	12
RSFace: subject agnostic face swapping with expression high fidelity	12
Stroke-based semantic segmentation for scene-level free-hand sketches	12
Privacy-aware Real-Time Target Person Matting in Multi-Person Scenes Using Dual Encoder-Decoder Networks	12
GPT-ZSS: a unified zero-shot segmentation framework leveraging GPT-generated semantic embeddings and relationship alignment	12
ROMOT: Referring-expression-comprehension open-set multi-object tracking	12
Robust 3D watermarking with high imperceptibility based on EMD on surfaces	12
MFFN: image super-resolution via multi-level features fusion network	12
SATD: syntax-aware handwritten mathematical expression recognition based on tree-structured transformer decoder	12
TSNet: Task-specific network for joint diabetic retinopathy grading and lesion segmentation of ultra-wide optical coherence tomography angiography images	12
Enhanced small-target detection in SAR images via SIE-YOLO11: a deep learning approach	11
Cross-modal and multi-level feature refinement network for RGB-D salient object detection	11
Unsupervised inner-point-pairs model for unseen-scene and online moving object detection	11
Adaptive arc area inpainting and image enhancement method based on AI-DLC model	11
OrthopedVR: clinical assessment and pre-operative planning of paediatric patients with lower limb rotational abnormalities in virtual reality	11
Effective multi-scale enhancement fusion method for low-light images based on interest-area perception OCTM and “pixel healthiness” evaluation	11
Image-only place recognition based on regional aggregating ConvNet features for underground parking lots	11
InstantTrace: fast parallel neuron tracing on GPUs	11
Group emotion recognition based on psychological principles using a fuzzy system	11
ADMM optimizer for integrating wavelet-patch and group-based sparse representation for image inpainting	11
Grayscale uncertainty and errors of tomographic reconstructions based on projection geometries and projection sets	11
Convex hull regression strategy for people detection on top-view fisheye images	11
Dtsr: detail-enhanced transformer for image super-resolution	11
Vehicle object counting network based on feature pyramid split attention mechanism	11
IDA: an improved dual attention module for pollen classification	11
Psanet: prototype-guided salient attention for few-shot segmentation	11
An image fusion algorithm based on image clustering theory	11
Enhancing image–text matching through multi-level semantic consistency alignment	11
LSDNet: lightweight stochastic depth network for human pose estimation	11
Light field salient object detection based on discrete viewpoint selection and multi-feature fusion	11
Semantic guidance incremental network for efficiency video super-resolution	11
Preface	11
Multi-scale defect detection for plaid fabrics using scale sequence feature fusion and triple encoding	11
Robust and fast QR code images deblurring via local maximum and minimum intensity prior	11
PCCFormer: Parallel coupled convolutional transformer for image super-resolution	11
Harnessing deep learning for faster water quality assessment: identifying bacterial contaminants in real time	11
Generate anomalies from normal: a partial pseudo-anomaly augmented approach for video anomaly detection	10
Enhancing multiple-style image colorization through context-aware codebook and multi-stage learning	10
Transforming time and space: efficient video super-resolution with hybrid attention and deformable transformers	10
Zero-shot learning via categorization-relevant disentanglement and discriminative samples synthesis	10
Domain-flexible selective image encryption based on genetic operations and chaotic maps	10
Fast image recoloring for red–green anomalous trichromacy with contrast enhancement and naturalness preservation	10
ParaLkResNet: an efficient multi-scale image classification network	10
Entanglement inspired approach for determining the preeminent arrangement of static cameras in a multi-view computer vision system	10