Computational Visual Media

Papers
(The median citation count of Computational Visual Media is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
Ultra-High Resolution Facial Texture Reconstruction from a Single Image1892
MA2Net: Multi-Scale Adaptive Mixed Attention Network for Image Demoiréing1709
Towards harmonized regional style transfer and manipulation for facial images687
Geometry-aware 3D pose transfer using transformer autoencoder121
Front Cover111
3D Indoor Scene Geometry Estimation from a Single Omnidirectional Image: A Comprehensive Survey72
3D face recognition: A comprehensive survey in 202266
Heuristic Weakly Supervised 3D Human Pose Estimation59
Neighborhood co-occurrence modeling in 3D point cloud segmentation58
Controllable multi-domain semantic artwork synthesis55
FRNeRF: Fusion and Regularization Fields for Dynamic View Synthesis54
A Biophysical-Based Skin Model for Heterogeneous Volume Rendering52
Front Cover38
Recent advances in glinty appearance rendering35
Front cover33
Anchor-Regularized GAN Priors32
Practical construction of globally injective parameterizations with positional constraints30
Multi-granularity sequence generation for hierarchical image classification27
Towards robustness and generalization of point cloud representation: A geometry coding method and a large-scale object-level dataset26
Central similarity consistency hashing for asymmetric image retrieval25
Image-guided color mapping for categorical data visualization23
Temporal vectorized visibility for direct illumination of animated models23
A causal convolutional neural network for multi-subject motion modeling and generation22
D2ANet: Difference-aware attention network for multi-level change detection from satellite imagery21
Real-time distance field acceleration based free-viewpoint video synthesis for large sports fields21
ARM3D: Attention-based relation module for indoor 3D object detection20
MusicFace: Music-driven expressive singing face synthesis20
Message from Guest Editors of the CVM 2025 Special Issue19
IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis19
Watertight surface reconstruction method for CAD models based on optimal transport16
DepthGAN: GAN-based depth generation from semantic layouts16
A survey of urban visual analytics: Advances and future directions15
NeuS-PIR: Learning Relightable Neural Surface Using Pre-Integrated Rendering15
Sphere face model: A 3D morphable model with hypersphere manifold latent space using joint 2D/3D training15
Reference-guided structure-aware deep sketch colorization for cartoons14
Let's all dance: Enhancing amateur dance motions14
Sem-iNeRF: Camera Pose Refinement by Inverting Neural Radiance Fields with Semantic Feature Consistency13
Constructing self-supporting surfaces with planar quadrilateral elements13
Multi3D: 3D-aware multimodal image synthesis13
MMRelief: Modeling Multi-Human Relief from a Single Photograph13
Global video object segmentation with spatial constraint module12
MDFP-Net: A Model-Driven Deep Neural Network for Fourier Ptychography12
Uncertainty Aware Multiple View Stereo Network with Accurate Supervision12
Co-occurrence based texture synthesis12
Mindstorms in Natural Language-Based Societies of Mind11
Addressing Missing Modality Challenges in MRI Images: A Comprehensive Review11
Benchmarking visual SLAM methods in mirror environments11
Deep unfolding multi-scale regularizer network for image denoising11
Prediction of Scene Plausibility11
Front cover10
Front cover10
Exploring Contextual Priors for Real-World Image Super-Resolution10
Deep panoramic depth prediction and completion for indoor scenes10
A Voronoi diagram approach for detecting defects in 3D printed fiber-reinforced polymers from microscope images10
EFECL: Feature encoding enhancement with contrastive learning for indoor 3D object detection10
Continuous Indexed Points for Multivariate Volume Visualization9
Front cover9
A two-step surface-based 3D deep learning pipeline for segmentation of intracranial aneurysms9
Deep image synthesis from intuitive user input: A review and perspectives9
Contents8
SGformer: Boosting transformers for indoor lighting estimation from a single image8
Message from the editor-in-chief8
Hybrid Mesh-Neural Representation for 3D Transparent Object Reconstruction8
FCDFusion: A Fast, Low Color Deviation Method for Fusing Visible and Infrared Image Pairs8
Contents7
Erroneous pixel prediction for semantic image segmentation7
Point cloud completion via structured feature maps using a feedback network7
Front cover7
Visual perception driven collage synthesis7
Dynamic ocean inverse modeling based on differentiable rendering7
PuzzleSorter: Certainty-Aware Visual Restoration of Multiple Cultural Artifacts7
Immersive analytics meets artificial intelligence: A systematic review7
A survey on facial image deblurring6
Towards uniform point distribution in feature-preserving point cloud filtering6
Message from the editor-in-chief6
Focusing on your subject: Deep subject-aware image composition recommendation networks6
Joint specular highlight detection and removal in single images via Unet-Transformer6
An anisotropic Chebyshev descriptor and its optimization for deformable shape correspondence6
Non-dominated sorting based multi-page photo collage6
An efficient algorithm for approximate Voronoi diagram construction on triangulated surfaces6
A visual modeling method for spatiotemporal and multidimensional features in epidemiological analysis: Applied COVID-19 aggregated datasets6
FilterGNN: Image feature matching with cascaded outlier filters and linear attention5
Cross-modal learning using privileged information for long-tailed image classification5
Message from the editor-in-chief5
A Simple and Effective Filtering Scheme for Improving Neural Fields5
Super-resolution reconstruction of single image for latent features5
PVT v2: Improved baselines with pyramid vision transformer5
Front cover5
Revitalizing Image Dehazing in the Real World: A High-Quality Dataset and a Customized Method5
MagicTalk: Implicit and Explicit Correlation Learning for Diffusion-Based Emotional Talking Face Generation5
Attention mechanisms in computer vision: A survey5
Continual few-shot patch-based learning for anime-style colorization5
Noise4Denoise: Leveraging noise for unsupervised point cloud denoising4
Front Cover4
ImVoxelENet: Image to Voxels Epipolar Transformer for Multi-View RGB-Based 3D Object Detection4
3D corrective nose reconstruction from a single image4
Learning physically based material and lighting decompositions for face editing4
Exploring a Hierarchical Cross-Attention Transformer for High-Speed Tracking4
Autocompletion of repetitive stroking with image guidance4
NPRportrait 1.0: A three-level benchmark for non-photorealistic rendering of portraits4
Lossless Intrinsic Image Decomposition via Learning Shading Feature Filtering4
LucIE: Language-Guided Local Image Editing for Fashion Images4
JNeRF: An efficient heterogeneous NeRF model zoo based on Jittor4
SAM-driven MAE pre-training and background-aware meta-learning for unsupervised vehicle re-identification4
Point Mask Transformer for Outdoor Point Cloud Semantic Segmentation4
Polygonal finite element-based content-aware image warping4
AR assistance for efficient dynamic target search4
LDSwap: A Semantic-Related Latent Code Disentangling Method in StyleSpace Towards High-Resolution Face Swapping3
BLNet: Bidirectional learning network for point clouds3
Emotion Amplification of Facial Videos Using a Fine-Tuned StyleGAN3
Multi-modal visual tracking: Review and experimental comparison3
PMSSC: Parallelizable multi-subset based self-expressive model for subspace clustering3
A survey on rendering homogeneous participating media3
Foundation models meet visualizations: Challenges and opportunities3
Multi-Task Gradual Inference with a Single Encoder–Decoder Network for Automatic Portrait Matting3
A comprehensive survey on the research and development of RGB-T salient object detection3
Message from the best paper award committee3
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding3
Class Incremental Learning via Feature Space Calibration3
Angle-uniform parallel coordinates3
CLIP-SP: Vision-language model with adaptive prompting for scene parsing3
Full-duplex strategy for video object segmentation3
GRIG: Data-Efficient Generative Residual Image Inpainting3
ARNet: Attribute Artifact Reduction for G-PCC Compressed Point Clouds3
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding2
Recent advances in 3D Gaussian splatting2
RecStitchNet: Learning to stitch images with rectangular boundaries2
Active self-training for weakly supervised 3D scene semantic segmentation2
Specificity-preserving RGB-D saliency detection2
JVCSR+: Adaptively Learned Video Compressive Sensing Reconstruction with Joint in-Loop Reference Enhancement and Out-Loop Super-Resolution2
HoLens: A visual analytics design for higher-order movement modeling and visualization2
Spatiotemporal Fusion Transformer for Video Demoiréing2
Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module2
Remote Sensing Tuning: A Survey2
Decoupled Two-Stage Talking Head Generation via Gaussian-Landmark-Based Neural Radiance Fields2
Contents2
Sequential interactive image segmentation2
Discriminative feature encoding for intrinsic image decomposition2
Dance2MIDI: Dance-driven multi-instrument music generation2
Taming diffusion model for exemplar-based image translation2
Progressive edge-sensing dynamic scene deblurring2
FastMAE: Efficient Masked Autoencoder with Offline Tokenizer2
Audio-guided implicit neural representation for local image stylization2
Transformers in computational visual media: A survey2
Computer-Aided Layout Generation for Building Design: A Review2
Rectangling irregular videos by optimal spatio-temporal warping2
Class-conditional domain adaptation for semantic segmentation2
Contents2
0.038713932037354