Computational Visual Media

Papers
(The median citation count of Computational Visual Media is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
ArticleCitations
3D Indoor Scene Geometry Estimation from a Single Omnidirectional Image: A Comprehensive Survey1786
Ultra-High Resolution Facial Texture Reconstruction from a Single Image1597
MA2Net: Multi-Scale Adaptive Mixed Attention Network for Image Demoiréing641
3D face recognition: A comprehensive survey in 2022108
Geometry-aware 3D pose transfer using transformer autoencoder94
Towards harmonized regional style transfer and manipulation for facial images67
Heuristic weakly supervised 3D human pose estimation58
Front Cover50
Controllable multi-domain semantic artwork synthesis48
Neighborhood co-occurrence modeling in 3D point cloud segmentation47
A Biophysical-Based Skin Model for Heterogeneous Volume Rendering42
Front Cover41
Recent advances in glinty appearance rendering38
FRNeRF: Fusion and regularization fields for dynamic view synthesis34
Temporal vectorized visibility for direct illumination of animated models31
Multi-granularity sequence generation for hierarchical image classification29
Image-guided color mapping for categorical data visualization27
A causal convolutional neural network for multi-subject motion modeling and generation26
Front cover24
Practical construction of globally injective parameterizations with positional constraints24
Central similarity consistency hashing for asymmetric image retrieval23
Anchor-Regularized GAN Priors21
Towards robustness and generalization of point cloud representation: A geometry coding method and a large-scale object-level dataset20
IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis19
Real-time distance field acceleration based free-viewpoint video synthesis for large sports fields19
ARM3D: Attention-based relation module for indoor 3D object detection19
MusicFace: Music-driven expressive singing face synthesis18
D2ANet: Difference-aware attention network for multi-level change detection from satellite imagery17
Sphere face model: A 3D morphable model with hypersphere manifold latent space using joint 2D/3D training16
Message from Guest Editors of the CVM 2025 Special Issue15
Multi3D: 3D-aware multimodal image synthesis14
NeuS-PIR: Learning Relightable Neural Surface Using Pre-Integrated Rendering14
A survey of urban visual analytics: Advances and future directions13
Reference-guided structure-aware deep sketch colorization for cartoons13
DepthGAN: GAN-based depth generation from semantic layouts13
Watertight surface reconstruction method for CAD models based on optimal transport13
Sem-iNeRF: Camera Pose Refinement by Inverting Neural Radiance Fields with Semantic Feature Consistency12
Let's all dance: Enhancing amateur dance motions12
Mindstorms in Natural Language-Based Societies of Mind12
Global video object segmentation with spatial constraint module12
MMRelief: Modeling Multi-Human Relief from a Single Photograph12
Constructing self-supporting surfaces with planar quadrilateral elements12
Benchmarking visual SLAM methods in mirror environments11
MDFP-Net: A model-driven deep neural network for Fourier ptychography11
Prediction of scene plausibility10
Addressing Missing Modality Challenges in MRI Images: A Comprehensive Review10
Co-occurrence based texture synthesis10
Uncertainty aware multiple view stereo network with accurate supervision10
A two-step surface-based 3D deep learning pipeline for segmentation of intracranial aneurysms9
Deep panoramic depth prediction and completion for indoor scenes9
Front cover9
Deep image synthesis from intuitive user input: A review and perspectives9
Exploring Contextual Priors for Real-World Image Super-Resolution9
A Voronoi diagram approach for detecting defects in 3D printed fiber-reinforced polymers from microscope images9
EFECL: Feature encoding enhancement with contrastive learning for indoor 3D object detection9
Front cover9
Deep unfolding multi-scale regularizer network for image denoising9
Hybrid Mesh-Neural Representation for 3D Transparent Object Reconstruction8
Front cover8
SGformer: Boosting transformers for indoor lighting estimation from a single image7
Contents7
Point cloud completion via structured feature maps using a feedback network7
Dynamic ocean inverse modeling based on differentiable rendering7
PuzzleSorter: Certainty-aware visual restoration of multiple cultural artifacts7
Visual perception driven collage synthesis7
Continuous indexed points for multivariate volume visualization7
FCDFusion: A Fast, Low Color Deviation Method for Fusing Visible and Infrared Image Pairs7
Message from the editor-in-chief7
Erroneous pixel prediction for semantic image segmentation6
A survey on facial image deblurring6
Front cover6
An efficient algorithm for approximate Voronoi diagram construction on triangulated surfaces6
Towards uniform point distribution in feature-preserving point cloud filtering6
Non-dominated sorting based multi-page photo collage6
Immersive analytics meets artificial intelligence: A systematic review6
Joint specular highlight detection and removal in single images via Unet-Transformer6
Focusing on your subject: Deep subject-aware image composition recommendation networks6
Message from the editor-in-chief5
Revitalizing Image Dehazing in the Real World: A High-Quality Dataset and a Customized Method5
An anisotropic Chebyshev descriptor and its optimization for deformable shape correspondence5
Super-resolution reconstruction of single image for latent features5
A Simple and Effective Filtering Scheme for Improving Neural Fields5
A visual modeling method for spatiotemporal and multidimensional features in epidemiological analysis: Applied COVID-19 aggregated datasets5
FilterGNN: Image feature matching with cascaded outlier filters and linear attention5
Continual few-shot patch-based learning for anime-style colorization4
Autocompletion of repetitive stroking with image guidance4
MagicTalk: Implicit and Explicit Correlation Learning for Diffusion-Based Emotional Talking Face Generation4
NPRportrait 1.0: A three-level benchmark for non-photorealistic rendering of portraits4
PVT v2: Improved baselines with pyramid vision transformer4
Message from the editor-in-chief4
Cross-modal learning using privileged information for long-tailed image classification4
Front cover4
Attention mechanisms in computer vision: A survey4
SAM-driven MAE pre-training and background-aware meta-learning for unsupervised vehicle re-identification3
Lossless Intrinsic Image Decomposition via Learning Shading Feature Filtering3
Full-duplex strategy for video object segmentation3
BLNet: Bidirectional learning network for point clouds3
LDSwap: A semantic-related latent code disentangling method in StyleSpace towards high-resolution face swapping3
CLIP-SP: Vision-language model with adaptive prompting for scene parsing3
Point Mask Transformer for Outdoor Point Cloud Semantic Segmentation3
Noise4Denoise: Leveraging noise for unsupervised point cloud denoising3
LucIE: Language-Guided Local Image Editing for Fashion Images3
AR assistance for efficient dynamic target search3
Multi-modal visual tracking: Review and experimental comparison3
Message from the best paper award committee3
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding3
Learning physically based material and lighting decompositions for face editing3
ImVoxelENet: Image to Voxels Epipolar Transformer for Multi-View RGB-Based 3D Object Detection3
JNeRF: An efficient heterogeneous NeRF model zoo based on Jittor3
3D corrective nose reconstruction from a single image3
Emotion Amplification of Facial Videos Using a Fine-Tuned StyleGAN3
PMSSC: Parallelizable multi-subset based self-expressive model for subspace clustering3
Angle-uniform parallel coordinates3
Polygonal finite element-based content-aware image warping3
Exploring a hierarchical cross-attention transformer for high-speed tracking3
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding2
Audio-guided implicit neural representation for local image stylization2
A survey on rendering homogeneous participating media2
Taming diffusion model for exemplar-based image translation2
Decoupled Two-Stage Talking Head Generation via Gaussian-Landmark-Based Neural Radiance Fields2
Class-conditional domain adaptation for semantic segmentation2
Recent advances in 3D Gaussian splatting2
Class incremental learning via feature space calibration2
Remote sensing tuning: A survey2
Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module2
Contents2
Transformers in computational visual media: A survey2
FastMAE: Efficient Masked Autoencoder with Offline Tokenizer2
Foundation models meet visualizations: Challenges and opportunities2
ARNet: Attribute Artifact Reduction for G-PCC Compressed Point Clouds2
Progressive edge-sensing dynamic scene deblurring2
Spatiotemporal Fusion Transformer for Video Demoiréing2
RecStitchNet: Learning to stitch images with rectangular boundaries2
0.039666891098022