Visual Computer

Papers
(The TQCC of Visual Computer is 6. The table below lists the papers whose CrossRef citation counts are above that threshold [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
Article | Citations
Robust object recognition via context-driven reliability assessment | 132
Adaptively weighted discrete Laplacian for inverse rendering | 100
Learning shape abstraction by cropping positive cuboid primitives with negative ones | 89
Edge-priority-extraction network using re-parameterization for real-time super-resolution | 83
Enhancing green screen matting with group normalization and perceptual loss for color overflow and complex edges | 63
V$^2$MLP: an accurate and simple multi-view MLP network for fine-grained 3D shape recognition | 58
Multi-modal co-attention relation networks for visual question answering | 54
Deep learning in chronic wound segmentation: a comprehensive review and meta-analysis | 54
Joint attribute soft-sharing and contextual local: a multi-level features learning network for person re-identification | 52
Enhanced Temporal Representation and Spatial Alignment for High-Fidelity Talking Video Generation | 48
Feature decomposition and structural learning for multi-diverse and multi-view data clustering | 48
Adaptive box-level supervision with superpixel shape guidance for ultrasound image segmentation | 47
Camera calibration for the surround-view system: a benchmark and dataset | 45
Monocular human depth estimation with 3D motion flow and surface normals | 45
TaiChiGPT: complex sports action generation based on large language models | 44
Image encryption algorithm based on cross-scrambling and rapid-mode diffusion | 43
SCAKD: a knowledge distillation framework based on spatial-corner attention for infrared and visible image fusion | 43
Self-knowledge distillation through ensemble model averaging: a novel approach for image classification | 41
A multi-target cow face detection model in complex scenes | 41
Developing an augmented reality framework with embedded objects and adaptive optical models for advanced lighting simulation | 40
PoseNorm-PCN: pose-normalized human point cloud completion from a single front view | 40
Lightweight subpixel sampling network for image super-resolution | 39
MoCoSys: human motion correction based on deep learning coupled with 3D+t Laplacian motion representation | 38
MAPD: multi-receptive field and attention mechanism for multispectral pedestrian detection | 38
AttentionDIP: attention-based deep image prior model to restore satellite and aerial images from gamma distributed speckle interference | 37
Virtual object sizes for efficient and convenient mid-air manipulation | 37
Using transfer learning to determine the type of mathematical fractals image of Islamic geometric patterns | 34
Enhanced optical flow estimation via multiscale kernel selection and super-resolution integration | 34
ER: Extract-regress network for precise 3D reconstruction of interacting hands from monocular images | 33
Uncertainty-guided time–frequency feature enhancement for emotion-aware speech-driven 3D facial animation | 32
A two-stage model for spatial downscaling of daily precipitation data | 31
Road crack detection using pixel classification and intensity-based distinctive fuzzy C-means clustering | 31
Nighttime driver behavior prediction using taillight signal recognition via CNN-SVM classifier | 31
Weighted and truncated $L_1$ image smoothing based on unsupervised learning | 30
Exploring Structural Lines for Interior Floorplan Segmentation | 30
Depth-guided color correction and multi-scale Retinex network for underwater image enhancement | 30
Hybrid annotation alignment-based multi-region crop model for high-resolution image | 30
A new chaotic image encryption algorithm based on dynamic DNA coding and RNA computing | 30
Robust corner detection in continuous space | 30
Gaze-contingent adaptation of VR stereo parameters for cybersickness prevention | 30
Cfseg-Net: context feature extraction network for medical image segmentation | 29
SpatioGS: spatiotemporal-aware density control for dynamic scene rendering with Gaussian splatting | 29
Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement | 28
EMA-U-Net: efficient multi-attention U-Net for skin lesion segmentation | 28
SES-yolov5: small object graphics detection and visualization applications | 28
Secure management of retinal imaging based on deep learning, zero-watermarking and reversible data hiding | 27
Generative artificial intelligence for ophthalmic images: developments, applications and challenges | 27
Study on the methods of hyperspectral image saliency detection based on MBCNN | 26
CLFormer: a unified transformer-based framework for weakly supervised crowd counting and localization | 26
ARP$\Delta$: Accelerated ray-tracing photon differentials for real-time global illumination with combined specular and diffuse solutions | 26
Preface (Vol 39. Issue 6, June 2023) | 26
Two-stream inter-class variation enhancement network for facial expression recognition | 25
Dsf-net: a dual-stream fusion network integrating structural and detailed features for fundus-based diabetic retinopathy classification | 25
A point cloud self-learning network based on contrastive learning for classification and segmentation | 25
Adversarial-based refinement dual-branch network for semi-supervised salient object detection of strip steel surface defects | 25
Bridging realities: training visuo-haptic object recognition models for robots using 3D virtual simulations | 25
Distribution-decouple learning network: an innovative approach for single image dehazing with spatial and frequency decoupling | 25
Boosting vision transformer for low-resolution borehole image stitching through algebraic multigrid | 25
Icg: intensity and color gradient operator on RGB images for visual object tracking | 25
Enhanced visual perception for underwater images based on multistage generative adversarial network | 25
PL-MCT: pseudo-labeling and multi-frame consistency training for semi-supervised visual tracking | 25
Personalized hairstyle and hair color editing based on multi-feature fusion | 25
MCGFF-Net: a multi-scale context-aware and global feature fusion network for enhanced polyp and skin lesion segmentation | 24
PMGAN: pretrained model-based generative adversarial network for text-to-image generation | 24
Double-handed dynamic gesture recognition using contour-based hand tracking and maximum mean probability ensembling (MMPE) for Indian Sign Language | 24
A novel robust digital image watermarking scheme based on attention U-Net++ structure | 24
DeMaskGAN: a de-masking generative adversarial network guided by semantic segmentation | 24
Compact storage of additively weighted Voronoi diagrams | 24
Application of VR to ikebana education | 23
Enhancing 3D human pose estimation via spatio-temporal dual-stream fusion | 23
AdverFuse: robust fusion of multimodal images based on dynamic attention and adversarial learning | 23
Sound signatures for images and geometric shapes | 23
PlayNet: real-time handball play classification with Kalman embeddings and neural networks | 23
Vision transformers (ViT) and deep convolutional neural network (D-CNN)-based models for MRI brain primary tumors images multi-classification supported by explainable artificial intelligence (XAI) | 23
Arbitrary style transfer via content consistency and style consistency | 23
Correction: DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision | 22
DMINet: dense multi-scale inference network for salient object detection | 22
MF-SAM: enhancing multi-modal fusion with Mamba in SAM-Med3D for GPi segmentation | 22
Optimal feature selection and classification of Indian classical dance hand gesture dataset | 22
An unsupervised denoising model for poisson noise using GSURE-driven deep image prior with multi-order regularization for medical imaging | 22
ConvFormer: parameter reduction in transformer models for 3D human pose estimation by leveraging dynamic multi-headed convolutional attention | 22
Enhancing hyperspectral image classification through spectral-spatial synergy: SSFSNet | 22
ImpRes: implicit residual diffusion models for image super-resolution | 21
Attention-enhanced controllable disentanglement for cloth-changing person re-identification | 21
Enhancing multi-scale information exchange and feature fusion for human pose estimation | 21
Liver segmentation based on complementary features U-Net | 21
A novel robust image watermarking algorithm based on polar decomposition and image geometric correction | 20
WeedGan: a novel generative adversarial network for cotton weed identification | 20
Correction: 3D reconstruction method based on N-step phase unwrapping | 20
SSoB: searching a scene-oriented architecture for underwater object detection | 20
CrowdImprint: decomposing context-aware interactions | 20
High-fidelity facial expression transfer using part-based local–global conditional gans | 20
Boosted verification using siamese neural network with DiffBlock | 20
Dynamic underwater cognition: aligned detection networks for enhanced underwater object recognition | 20
Capturing spatiotemporal dependencies with competitive set attention for video summarization | 20
Anti-counterfeiting textured pattern | 20
Hybrid Mamba-Transformer Multi-Agent Reinforcement Learning for scalable coordination in complex environments | 19
An automatic framework for quadrilateral surface reconstruction with partitions from 3D point clouds | 19
Topology-preserved human reconstruction with details | 19
Generating natural pedestrian crowds by learning real crowd trajectories through a transformer-based GAN | 19
Dual-attention U-Net and multi-convolution network for single-image rain removal | 19
Decoupled spatio-temporal grouping transformer for skeleton-based action recognition | 19
Branch aware assignment for object detection | 19
ViT-SIR: vision transformer-based shoe image retrieval with enhanced feature representation | 18
Cross-modal collaborative propagation for RGB–T saliency detection | 18
3D printer vision calibration system based on embedding Sobel bilateral filter in least squares filtering algorithm | 18
Light field depth estimation using occlusion-aware consistency analysis | 18
Toward robust visual tracking for UAV with adaptive spatial-temporal weighted regularization | 18
Point-voxel dual stream transformer for 3d point cloud learning | 18
TDGar-Ani: temporal motion fusion model and deformation correction network for enhancing garment animation details | 18
Salient-aware multiple instance learning optimized network for weakly supervised object detection | 18
Hash-NURF: efficient nested transparent object reconstruction using multi-resolution hash encoding | 18
Detail-aware image denoising via structure preserved network and residual diffusion model | 18
Patch excitation network for boxless action recognition in still images | 18
Facial expression recognition based on local–global information reasoning and spatial distribution of landmark features | 17
Skin scar segmentation based on saliency detection | 17
The devil in the details: simple and effective optical flow synthetic data generation | 17
A workflow to systematically design uncertainty-aware visual analytics applications | 17
E-FPN: an enhanced feature pyramid network for UAV scenarios detection | 17
Occlusion-aware segmentation via RCF-Pix2Pix generative network | 17
LVDIF: a framework for real-time interaction with large volume data | 17
OSH-Splat: optimizable semantic hyperplanes for enhanced 3D language feature Gaussian splatting | 17
Learning to sculpt neural cityscapes | 17
FFANet: dual attention-based flow field-aware network for wall identification | 17
Virtual simulation for the dynamic response of concrete blocks under blast loading | 16
Deep channel-spatial attention networks for enhancing super-resolution of high-magnification SEM images | 16
A mixed reality framework for microsurgery simulation with visual-tactile perception | 16
A cascaded graph convolutional network for point cloud completion | 16
ResNet-OSD: an optimized hybrid deep learning framework for oil spill detection in coastal drone imagery | 16
HEU-Net: hybrid attention residual block-based network with external skip connections for metal corrosion semantic segmentation | 16
Label-guided 4D Gaussian splatting for high-fidelity dynamic scene reconstruction | 16
Blind image quality assessment by simulating the visual cortex | 16
Wall segmentation in house plans: fusion of deep learning and traditional methods | 16
Locality-constrained double-layer structure scaled simplex multi-view subspace clustering | 16
Multi-camera tracking of mechanically thrown objects for automated in-plant logistics by cognitive robots in Industry 4.0 | 16
STVDNet: spatio-temporal interactive video de-raining network | 16
An enhanced multi-scale weight assignment strategy of two-exposure fusion | 16
A self-attention model for viewport prediction based on distance constraint | 16
Latent diffusion transformer for point cloud generation | 16
LKSMN: Large Kernel Spatial Modulation Network for Lightweight Image Super-Resolution | 16
Fusiform multi-scale pixel self-attention network for hyperspectral images reconstruction from a single RGB image | 16
EdgeLF: edge-guided registration with loftr for visible and infrared images | 15
Semantic-Orthogonal Multi-modal Attention Network for RGB-D Salient Object Detection | 15
A survey on soccer player detection and tracking with videos | 15
A two-stage network with wavelet transformation for single-image deraining | 15
Multimodal biometrics authentication using extreme learning machine with feature reduction by adaptive particle swarm optimization | 15
DICNet: achieve low-light image enhancement with image decomposition, illumination enhancement, and color restoration | 15
Virtual reality support for thoracoscopic surgery design | 15
Prior-based privacy-assured compressed sensing scheme in cloud | 15
Outfit compatibility model using fully connected self-adjusting graph neural network | 15
Gtfpose: a unified framework with double-chain GCN–transformer fusion for 3D human pose estimation | 15
Internal and external transmission encoder–decoder network for single-image deraining | 14
The infinite doodler: expanding textures within tightly constrained manifolds | 14
PDFT: parameter-diminish fine-tuning for transformer-based models | 14
Adaptive transformer-based detection: enhancing infrared image target recognition | 14
Editorial June 2024 (Vol 40, Issue 6) | 14
A neural builder for spatial subdivision hierarchies | 14
Neural network adaption for depth sensor replication | 14
GCAENet: global-class context with advanced edge network for single human parsing | 14
Multi-view clustering based on graph learning and view diversity learning | 14
CLAC-Net: a composite medical image segmentation framework using self-attention and cross-layer asymmetric connections | 14
Enhancing high-vocabulary image annotation with a novel attention-based pooling | 14
Generalized unsupervised functional map learning for dense correspondence | 14
Boosting remote semantic segmentation using vision-and-language foundation model | 14
GDPNet: a hybrid GNN-Transformer with position–density-modulated attention for 3D point cloud semantic segmentation | 14
Fairing-PIA: progressive-iterative approximation for fairing curve and surface generation | 14
MPA-Det: multi-path aggregation-based object detection framework for aerial visual computing | 13
A new face presentation attack detection method based on face-weighted multi-color multi-level texture features | 13
ROMOT: Referring-expression-comprehension open-set multi-object tracking | 13
Robust 3D watermarking with high imperceptibility based on EMD on surfaces | 13
Data privacy protection domain adaptation by roughing and finishing stage | 13
CSNet: a ConvNeXt-based Siamese network for RGB-D salient object detection | 13
Adt-net: adaptive transformation-driven text-based person search network for enhancing cross-modal retrieval robustness | 13
Adaptive frequency time-distribution network—a multiscale deblurring technique | 13
Msc-Net: multi-stage colorization network for real-world images with specular highlights | 13
LiteMSNet: a lightweight semantic segmentation network with multi-scale feature extraction for urban streetscape scenes | 13
Adaptive cascaded and parallel feature fusion for visual object tracking | 13
Logical reasoning-enhanced interactive clustering: an efficient algorithm for large-scale datasets | 13
M-GAN: multiattribute learning and multimodal feature fusion-based generative adversarial network for text-to-image synthesis | 13
Correction: Fast and high-quality scale-aware filtering for 3D images | 13
DXAI: explaining classification by image decomposition | 13
RSFace: subject agnostic face swapping with expression high fidelity | 13
ODRP: a new approach for spatial street sign detection from EXIF using deep learning-based object detection, distance estimation, rotation and projection system | 13
Privacy-aware Real-Time Target Person Matting in Multi-Person Scenes Using Dual Encoder-Decoder Networks | 13
GPT-ZSS: a unified zero-shot segmentation framework leveraging GPT-generated semantic embeddings and relationship alignment | 13
Ellipsoid-SLAM: enhancing dynamic scene understanding through ellipsoidal object representation and trajectory tracking | 13
High-frequency channel attention and contrastive learning for image super-resolution | 13
Fast harmonic tetrahedral mesh optimization | 13
Multi-channel correlated diffusion for text-driven artistic style transfer | 12
Advanced detection and segmentation of parabolic trough collector and Fresnel mirrors for CSP maintenance using YOLOv8 and segment anything model | 12
A hue preserving uniform illumination image enhancement via triangle similarity criterion in HSI color space | 12
Regularity-constrained point cloud reconstruction of building models via global alignment | 12
Arpotcam: augmented reality-driven honeypot for enhancing security in IoT surveillance systems | 12
Robust and fast QR code images deblurring via local maximum and minimum intensity prior | 12
The crowd cooperation approach for formation maintenance and collision avoidance using multi-agent deep reinforcement learning | 12
Semantic guidance incremental network for efficiency video super-resolution | 12
Touching spaces: interactive physicalization for exploring spatial information | 12
Edge-aware texture filtering with superpixels constraint | 12
MaFIR: high-fidelity fisheye image rectification via Manhattan self-attention and dynamic feature optimization | 12
TransDehaze: transformer-enhanced texture attention for end-to-end single image dehaze | 12
Multi-scale defect detection for plaid fabrics using scale sequence feature fusion and triple encoding | 12
Enhancing remote sensing image segmentation with SGDC-DeepLab: a lightweight approach using Gaussian filters | 12
Adaptive fourier-enhanced vision transformer with self-learning smoothing masks for accurate cat face recognition | 12
SATD: syntax-aware handwritten mathematical expression recognition based on tree-structured transformer decoder | 12
Digital human and embodied intelligence for sports science: advancements, opportunities and prospects | 12
Cross-resolution feature attention network for image super-resolution | 12
Image-only place recognition based on regional aggregating ConvNet features for underground parking lots | 12
Adaptive arc area inpainting and image enhancement method based on AI-DLC model | 12
MFFN: image super-resolution via multi-level features fusion network | 12
An algorithm for cross-fiber separation in yarn hairiness image processing | 12
Stroke-based semantic segmentation for scene-level free-hand sketches | 12
TRAIL: Simulating the impact of human locomotion on natural landscapes | 12
AQPnP: an accurate and quaternion-based solution for the Perspective-n-Point problem | 12
OrthopedVR: clinical assessment and pre-operative planning of paediatric patients with lower limb rotational abnormalities in virtual reality | 12
Enhanced fine-grained visual classification through lightweight Transformer integration and auxiliary information fusion | 12
Recycling/upcycling graphic design: automatic design elements extraction and vectorization | 12
Context-Aware Enhanced Virtual Try-On Network with fabric adaptive registration | 12
Vehicle object counting network based on feature pyramid split attention mechanism | 12
Convex hull regression strategy for people detection on top-view fisheye images | 12
Semantically Enhanced Dual Visual Fusion Transformer for accurate image captioning | 12
Attention-guided self-supervised distinctive region detection in point clouds | 12
TSNet: Task-specific network for joint diabetic retinopathy grading and lesion segmentation of ultra-wide optical coherence tomography angiography images | 12
PGLRNet: target pose-guided and feature loss-reduced network for oriented object detection in remote sensing images | 11
Psanet: prototype-guided salient attention for few-shot segmentation | 11
Multimodal and multi-time-point fusion approach for automated diagnosis and grading of carotid atherosclerosis using bilateral ultrasound images and metadata | 11
Enhancing scene text script identification through multi-task self-supervised learning | 11
CSI-DMT: multi-focus image fusion via cross-task semantic interaction and dual-attention mixing transformer | 11
Coarse-to-fine multi-scale attention-guided network for multi-exposure image fusion | 11
Image recoloring for Red-Green dichromats with compensation range-based naturalness preservation and refined dichromacy gamut | 11
A multi-color and multistage collaborative network guided by refined transmission prior for underwater image enhancement | 11
Adaptive objectness learning for enhanced unknown object detection | 11
ADMM optimizer for integrating wavelet-patch and group-based sparse representation for image inpainting | 11
Group emotion recognition based on psychological principles using a fuzzy system | 11
Enhancing multiple-style image colorization through context-aware codebook and multi-stage learning | 11
Visible-to-infrared image translation based on an improved CGAN | 11
Zero-shot learning via categorization-relevant disentanglement and discriminative samples synthesis | 11
Progressive region exchange: enhancing semi-supervised medical image segmentation through incremental complexity | 11
Dtsr: detail-enhanced transformer for image super-resolution | 11
Modality-aware graph CNN for cross-modal person reidentification | 11
A dual-branch feature fusion neural network for fish image fine-grained recognition | 11
PCCFormer: Parallel coupled convolutional transformer for image super-resolution | 11
Expression-driven monocular 3D face reconstruction based on cross-modal guidance | 11
Enhanced small-target detection in SAR images via SIE-YOLO11: a deep learning approach | 11
Coarse-to-fine blind image deblurring based on K-means clustering | 11
InstantTrace: fast parallel neuron tracing on GPUs | 11
GenYOLO-leaf: a data-centric and open source framework for generalizable leaf instance segmentation across diverse datasets | 11
Cross-modal and multi-level feature refinement network for RGB-D salient object detection | 11
MCAM-Net: multi-scale convolutional attention for enhanced industrial surface defect detection | 11
CMT-6D: a lightweight iterative 6DoF pose estimation network based on cross-modal Transformer | 11
CAMFNet: complex camouflaged object detection via context-aware and adaptive multilevel feature fusion network | 11