Visual Computer

Papers
(The median citation count of Visual Computer is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
ArticleCitations
Robust object recognition via context-driven reliability assessment119
Adaptively weighted discrete Laplacian for inverse rendering105
Learning shape abstraction by cropping positive cuboid primitives with negative ones78
AttentionDIP: attention-based deep image prior model to restore satellite and aerial images from gamma distributed speckle interference75
Edge-priority-extraction network using re-parameterization for real-time super-resolution73
Region-based adaptive association learning for robust image scene recognition70
Still room for improvement in traditional 3D interaction: selecting the fixed axis in the virtual trackball66
MAPD: multi-receptive field and attention mechanism for multispectral pedestrian detection62
Robust corner detection in continuous space57
Multi-modal co-attention relation networks for visual question answering56
Cfseg-Net: context feature extraction network for medical image segmentation54
Joint attribute soft-sharing and contextual local: a multi-level features learning network for person re-identification53
Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement53
Point cloud quality assessment: unifying projection, geometry, and texture similarity44
A new chaotic image encryption algorithm based on dynamic DNA coding and RNA computing42
Weighted and truncated $$L_1$$ image smoothing based on unsupervised learning42
Self-knowledge distillation through ensemble model averaging: a novel approach for image classification41
Enhanced Temporal Representation and Spatial Alignment for High-Fidelity Talking Video Generation40
Enhancing green screen matting with group normalization and perceptual loss for color overflow and complex edges40
Hybrid annotation alignment-based multi-region crop model for high-resolution image40
MoCoSys: human motion correction based on deep learning coupled with 3D+t Laplacian motion representation35
Virtual object sizes for efficient and convenient mid-air manipulation34
Generative artificial intelligence for ophthalmic images: developments, applications and challenges34
Road crack detection using pixel classification and intensity-based distinctive fuzzy C-means clustering33
Secure management of retinal imaging based on deep learning, zero-watermarking and reversible data hiding33
Image encryption algorithm based on cross-scrambling and rapid-mode diffusion32
Developing an augmented reality framework with embedded objects and adaptive optical models for advanced lighting simulation32
V$$^2$$MLP: an accurate and simple multi-view MLP network for fine-grained 3D shape recognition32
A two-stage model for spatial downscaling of daily precipitation data31
SCAKD: a knowledge distillation framework based on spatial-corner attention for infrared and visible image fusion31
Feature decomposition and structural learning for multi-diverse and multi-view data clustering31
Lightweight subpixel sampling network for image super-resolution31
Adaptive box-level supervision with superpixel shape guidance for ultrasound image segmentation30
Exploring Structural Lines for Interior Floorplan Segmentation29
Nighttime driver behavior prediction using taillight signal recognition via CNN-SVM classifier29
Monocular human depth estimation with 3D motion flow and surface normals29
Gaze-contingent adaptation of VR stereo parameters for cybersickness prevention28
A multi-target cow face detection model in complex scenes28
Camera calibration for the surround-view system: a benchmark and dataset28
Deep learning in chronic wound segmentation: a comprehensive review and meta-analysis27
SES-yolov5: small object graphics detection and visualization applications27
Depth-guided color correction and multi-scale Retinex network for underwater image enhancement27
Preface (Vol 39. Issue 6, June 2023)25
CLFormer: a unified transformer-based framework for weakly supervised crowd counting and localization25
Liver segmentation based on complementary features U-Net25
A novel robust digital image watermarking scheme based on attention U-Net++ structure25
Study on the methods of hyperspectral image saliency detection based on MBCNN24
Picking out the bad apples: unsupervised biometric data filtering for refined age estimation24
ARP$$\Delta $$: Accelerated ray-tracing photon differentials for real-time global illumination with combined specular and diffuse solutions24
ImpRes: implicit residual diffusion models for image super-resolution24
PlayNet: real-time handball play classification with Kalman embeddings and neural networks24
Enhancing 3D human pose estimation via spatio-temporal dual-stream fusion24
Capturing spatiotemporal dependencies with competitive set attention for video summarization24
Hybrid Mamba-Transformer Multi-Agent Reinforcement Learning for scalable coordination in complex environments23
Generating natural pedestrian crowds by learning real crowd trajectories through a transformer-based GAN23
Two-stream inter-class variation enhancement network for facial expression recognition23
Decoupled spatio-temporal grouping transformer for skeleton-based action recognition23
High-fidelity facial expression transfer using part-based local–global conditional gans22
PL-MCT: pseudo-labeling and multi-frame consistency training for semi-supervised visual tracking22
A modified fuzzy clustering algorithm based on dynamic relatedness model for image segmentation22
Boosting vision transformer for low-resolution borehole image stitching through algebraic multigrid22
Correction: DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision22
Correction: 3D reconstruction method based on N-step phase unwrapping22
Personalized hairstyle and hair color editing based on multi-feature fusion21
HSNet: hierarchical semantics network for scene parsing21
Regions of interest selection in histopathological images using subspace and multi-objective stream clustering21
Icg: intensity and color gradient operator on RGB images for visual object tracking21
Enhanced visual perception for underwater images based on multistage generative adversarial network20
Lane line detection and departure estimation in a complex environment by using an asymmetric kernel convolution algorithm20
MF-SAM: enhancing multi-modal fusion with Mamba in SAM-Med3D for GPi segmentation20
Boosted verification using siamese neural network with DiffBlock20
A point cloud self-learning network based on contrastive learning for classification and segmentation20
Compact storage of additively weighted Voronoi diagrams20
Arbitrary style transfer via content consistency and style consistency19
DeMaskGAN: a de-masking generative adversarial network guided by semantic segmentation19
Application of VR to ikebana education19
Artificial intelligence-assisted cervical dysplasia detection using papanicolaou smear images19
Optimal feature selection and classification of Indian classical dance hand gesture dataset19
DMINet: dense multi-scale inference network for salient object detection19
Attention-enhanced controllable disentanglement for cloth-changing person re-identification19
Double-handed dynamic gesture recognition using contour-based hand tracking and maximum mean probability ensembling (MMPE) for Indian Sign Language19
WeedGan: a novel generative adversarial network for cotton weed identification19
Bridging realities: training visuo-haptic object recognition models for robots using 3D virtual simulations19
Anti-counterfeiting textured pattern19
Adversarial-based refinement dual-branch network for semi-supervised salient object detection of strip steel surface defects19
A novel robust image watermarking algorithm based on polar decomposition and image geometric correction19
Sound signatures for images and geometric shapes19
ConvFormer: parameter reduction in transformer models for 3D human pose estimation by leveraging dynamic multi-headed convolutional attention18
Vision transformers (ViT) and deep convolutional neural network (D-CNN)-based models for MRI brain primary tumors images multi-classification supported by explainable artificial intelligence (XAI)18
MCGFF-Net: a multi-scale context-aware and global feature fusion network for enhanced polyp and skin lesion segmentation18
Dsf-net: a dual-stream fusion network integrating structural and detailed features for fundus-based diabetic retinopathy classification18
PMGAN: pretrained model-based generative adversarial network for text-to-image generation18
SSoB: searching a scene-oriented architecture for underwater object detection18
Enhancing multi-scale information exchange and feature fusion for human pose estimation18
Distribution-decouple learning network: an innovative approach for single image dehazing with spatial and frequency decoupling17
Dual-attention U-Net and multi-convolution network for single-image rain removal17
Prior-based privacy-assured compressed sensing scheme in cloud17
Branch aware assignment for object detection17
Multi-camera tracking of mechanically thrown objects for automated in-plant logistics by cognitive robots in Industry 4.017
An automatic framework for quadrilateral surface reconstruction with partitions from 3D point clouds17
Explaining away results in more robust visual tracking17
Locality-constrained double-layer structure scaled simplex multi-view subspace clustering17
A survey on soccer player detection and tracking with videos17
A dynamic range adjustable inverse tone mapping operator based on human visual system17
Topology-preserved human reconstruction with details17
Virtual simulation for the dynamic response of concrete blocks under blast loading17
Outfit compatibility model using fully connected self-adjusting graph neural network17
Detail-aware image denoising via structure preserved network and residual diffusion model17
3D human body reconstruction based on SMPL model16
CF-GAN: cross-domain feature fusion generative adversarial network for text-to-image synthesis16
Unsupervised deep learning based ego motion estimation with a downward facing camera16
LKSMN: Large Kernel Spatial Modulation Network for Lightweight Image Super-Resolution16
Latent diffusion transformer for point cloud generation16
OSH-Splat: optimizable semantic hyperplanes for enhanced 3D language feature Gaussian splatting16
Light field depth estimation using occlusion-aware consistency analysis16
Cross-modal collaborative propagation for RGB–T saliency detection16
Semantic-Orthogonal Multi-modal Attention Network for RGB-D Salient Object Detection16
Skin scar segmentation based on saliency detection16
Correction: Material-aware Cross-channel Interaction Attention (MCIA) for occluded prohibited item detection16
Human pose estimation with gated multi-scale feature fusion and spatial mutual information16
A coarse-to-fine ghost removal scheme for HDR imaging15
3D printer vision calibration system based on embedding Sobel bilateral filter in least squares filtering algorithm15
Facial expression recognition based on local–global information reasoning and spatial distribution of landmark features15
TDGar-Ani: temporal motion fusion model and deformation correction network for enhancing garment animation details15
Toward robust visual tracking for UAV with adaptive spatial-temporal weighted regularization15
Salient-aware multiple instance learning optimized network for weakly supervised object detection15
STVDNet: spatio-temporal interactive video de-raining network15
A workflow to systematically design uncertainty-aware visual analytics applications15
Learning to sculpt neural cityscapes15
Fairing-PIA: progressive-iterative approximation for fairing curve and surface generation15
A cascaded graph convolutional network for point cloud completion15
A mixed reality framework for microsurgery simulation with visual-tactile perception14
FFANet: dual attention-based flow field-aware network for wall identification14
The devil in the details: simple and effective optical flow synthetic data generation14
LVDIF: a framework for real-time interaction with large volume data14
DICNet: achieve low-light image enhancement with image decomposition, illumination enhancement, and color restoration14
An enhanced multi-scale weight assignment strategy of two-exposure fusion14
Wall segmentation in house plans: fusion of deep learning and traditional methods14
Patch excitation network for boxless action recognition in still images14
Blind image quality assessment by simulating the visual cortex13
ViT-SIR: vision transformer-based shoe image retrieval with enhanced feature representation13
E-FPN: an enhanced feature pyramid network for UAV scenarios detection13
Semi-supervised multi-view clustering by label relaxation based non-negative matrix factorization13
Editorial June 2024 ( Vol 40, Issue 6)13
CLAC-Net: a composite medical image segmentation framework using self-attention and cross-layer asymmetric connections13
Occlusion-aware segmentation via RCF-Pix2Pix generative network13
A self-attention model for viewport prediction based on distance constraint13
Intuitionistic fuzzy information-driven total Bregman divergence fuzzy clustering with multiple local information constraints for image segmentation13
Deep channel-spatial attention networks for enhancing super-resolution of high-magnification SEM images13
Enhancing high-vocabulary image annotation with a novel attention-based pooling13
Fusiform multi-scale pixel self-attention network for hyperspectral images reconstruction from a single RGB image13
Point-voxel dual stream transformer for 3d point cloud learning13
Multimodal biometrics authentication using extreme learning machine with feature reduction by adaptive particle swarm optimization13
A two-stage network with wavelet transformation for single-image deraining13
HEU-Net: hybrid attention residual block-based network with external skip connections for metal corrosion semantic segmentation13
Graph matching based on feature and spatial location information13
Generalized unsupervised functional map learning for dense correspondence12
A neural builder for spatial subdivision hierarchies12
Digital human and embodied intelligence for sports science: advancements, opportunities and prospects12
A novel infrared and visible image fusion method based on multi-level saliency integration12
RSFace: subject agnostic face swapping with expression high fidelity12
The infinite doodler: expanding textures within tightly constrained manifolds12
GPT-ZSS: a unified zero-shot segmentation framework leveraging GPT-generated semantic embeddings and relationship alignment12
ROMOT: Referring-expression-comprehension open-set multi-object tracking12
GCAENet: global-class context with advanced edge network for single human parsing12
Suspect face retrieval using visual and linguistic information12
Attention-guided self-supervised distinctive region detection in point clouds12
Privacy-aware Real-Time Target Person Matting in Multi-Person Scenes Using Dual Encoder-Decoder Networks12
The crowd cooperation approach for formation maintenance and collision avoidance using multi-agent deep reinforcement learning12
PDFT: parameter-diminish fine-tuning for transformer-based models12
ODRP: a new approach for spatial street sign detection from EXIF using deep learning-based object detection, distance estimation, rotation and projection system12
Neural network adaption for depth sensor replication12
A hue preserving uniform illumination image enhancement via triangle similarity criterion in HSI color space12
Recycling/upcycling graphic design: automatic design elements extraction and vectorization12
OrthopedVR: clinical assessment and pre-operative planning of paediatric patients with lower limb rotational abnormalities in virtual reality12
TRAIL: Simulating the impact of human locomotion on natural landscapes12
GlcMatch: global and local constraints for reliable feature matching12
Regularity-constrained point cloud reconstruction of building models via global alignment12
Edge-aware texture filtering with superpixels constraint12
SATD: syntax-aware handwritten mathematical expression recognition based on tree-structured transformer decoder12
MS-GAN: multi-scale GAN with parallel class activation maps for image reconstruction12
Boosting remote semantic segmentation using vision-and-language foundation model12
Ellipsoid-SLAM: enhancing dynamic scene understanding through ellipsoidal object representation and trajectory tracking12
Fast harmonic tetrahedral mesh optimization12
A new face presentation attack detection method based on face-weighted multi-color multi-level texture features12
LiteMSNet: a lightweight semantic segmentation network with multi-scale feature extraction for urban streetscape scenes12
Multi-view clustering based on graph learning and view diversity learning12
Adaptive cascaded and parallel feature fusion for visual object tracking12
Internal and external transmission encoder–decoder network for single-image deraining12
Msc-Net: multi-stage colorization network for real-world images with specular highlights12
Adaptive fourier-enhanced vision transformer with self-learning smoothing masks for accurate cat face recognition11
Zero-shot learning via categorization-relevant disentanglement and discriminative samples synthesis11
Adaptive arc area inpainting and image enhancement method based on AI-DLC model11
Semantically Enhanced Dual Visual Fusion Transformer for accurate image captioning11
MFFN: image super-resolution via multi-level features fusion network11
High-frequency channel attention and contrastive learning for image super-resolution11
Multi-channel correlated diffusion for text-driven artistic style transfer11
TSNet: Task-specific network for joint diabetic retinopathy grading and lesion segmentation of ultra-wide optical coherence tomography angiography images11
MPA-Det: multi-path aggregation-based object detection framework for aerial visual computing11
Robust 3D watermarking with high imperceptibility based on EMD on surfaces11
Preface11
Dtsr: detail-enhanced transformer for image super-resolution11
An algorithm for cross-fiber separation in yarn hairiness image processing11
Stroke-based semantic segmentation for scene-level free-hand sketches11
CSNet: a ConvNeXt-based Siamese network for RGB-D salient object detection11
Enhanced fine-grained visual classification through lightweight Transformer integration and auxiliary information fusion11
Logical reasoning-enhanced interactive clustering: an efficient algorithm for large-scale datasets11
M-GAN: multiattribute learning and multimodal feature fusion-based generative adversarial network for text-to-image synthesis11
Arpotcam: augmented reality-driven honeypot for enhancing security in IoT surveillance systems11
Multi-scale defect detection for plaid fabrics using scale sequence feature fusion and triple encoding11
TransDehaze: transformer-enhanced texture attention for end-to-end single image dehaze11
AQPnP: an accurate and quaternion-based solution for the Perspective-n-Point problem11
A novel artificial bee colony clustering algorithm with comprehensive improvement11
Data privacy protection domain adaptation by roughing and finishing stage11
Hierarchical feature fusion network for light field spatial super-resolution11
Cross-modal and multi-level feature refinement network for RGB-D salient object detection10
Correction to: Feedback through emotion extraction using logistic regression and CNN10
LSDNet: lightweight stochastic depth network for human pose estimation10
Image-only place recognition based on regional aggregating ConvNet features for underground parking lots10
Transforming time and space: efficient video super-resolution with hybrid attention and deformable transformers10
Enhancing image–text matching through multi-level semantic consistency alignment10
ADMM optimizer for integrating wavelet-patch and group-based sparse representation for image inpainting10
Enhanced small-target detection in SAR images via SIE-YOLO11: a deep learning approach10
Effective multi-scale enhancement fusion method for low-light images based on interest-area perception OCTM and “pixel healthiness” evaluation10
Grayscale uncertainty and errors of tomographic reconstructions based on projection geometries and projection sets10
Unsupervised inner-point-pairs model for unseen-scene and online moving object detection10
Semantic guidance incremental network for efficiency video super-resolution10
Disentangled representations: towards interpretation of sex determination from hip bone10
Expression-driven monocular 3D face reconstruction based on cross-modal guidance10
Histogram equalization using a selective filter10
Fast image recoloring for red–green anomalous trichromacy with contrast enhancement and naturalness preservation10
Vehicle object counting network based on feature pyramid split attention mechanism10
Light field salient object detection based on discrete viewpoint selection and multi-feature fusion10
Convex hull regression strategy for people detection on top-view fisheye images10
Action detection with two-stream enhanced detector10
CMT-6D: a lightweight iterative 6DoF pose estimation network based on cross-modal Transformer10
InstantTrace: fast parallel neuron tracing on GPUs10
Lightweight head pose estimation without keypoints based on multi-scale lightweight neural network10
Robust and fast QR code images deblurring via local maximum and minimum intensity prior10
When CNN meet with ViT: decision-level feature fusion for camouflaged object detection9
Cross-resolution feature attention network for image super-resolution9
WaveUIR: wavelet-based guided transformer model for efficient universal image restoration9
Visible-to-infrared image translation based on an improved CGAN9
Context-Aware Enhanced Virtual Try-On Network with fabric adaptive registration9
DPDTRN: a dynamic pixel-level difficulty-aware texture reconstruction network for document super-resolution9
Enhanced fine-grained relearning for skeleton-based action recognition9
PCCFormer: Parallel coupled convolutional transformer for image super-resolution9
An adaptive loss weighting multi-task network with attention-guide proposal generation for small size defect inspection9
A dual-branch feature fusion neural network for fish image fine-grained recognition9
A multi-color and multistage collaborative network guided by refined transmission prior for underwater image enhancement9
0.075174808502197