Computational Visual Media

Papers
(The median citation count of Computational Visual Media is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
ArticleCitations
Geometry-aware 3D pose transfer using transformer autoencoder2123
Ultra-High Resolution Facial Texture Reconstruction from a Single Image1998
MA2Net: Multi-Scale Adaptive Mixed Attention Network for Image Demoiréing811
Front Cover179
3D face recognition: A comprehensive survey in 202282
3D Indoor Scene Geometry Estimation from a Single Omnidirectional Image: A Comprehensive Survey82
Heuristic Weakly Supervised 3D Human Pose Estimation75
Towards harmonized regional style transfer and manipulation for facial images73
Recent advances in glinty appearance rendering67
Front Cover63
FRNeRF: Fusion and Regularization Fields for Dynamic View Synthesis63
A Biophysical-Based Skin Model for Heterogeneous Volume Rendering47
Controllable multi-domain semantic artwork synthesis41
Neighborhood co-occurrence modeling in 3D point cloud segmentation40
Real-Time Woven Fabric Rendering Using SGGX Fitting34
Central similarity consistency hashing for asymmetric image retrieval29
Practical construction of globally injective parameterizations with positional constraints27
Multi-granularity sequence generation for hierarchical image classification26
Temporal vectorized visibility for direct illumination of animated models26
Anchor-Regularized GAN Priors24
A causal convolutional neural network for multi-subject motion modeling and generation23
Neural Scene Baking for Permutation Invariant Transparency Rendering with Real-Time Global Illumination22
See More, Know More: Richer Prior Knowledge for Novel Class Discovery22
Neural Reconstruction and Super-Resolution for Foveated Real-Time Rendering21
Image-guided color mapping for categorical data visualization20
Self-supervised learning for pre-training 3D point clouds: A survey19
Real-time distance field acceleration based free-viewpoint video synthesis for large sports fields18
IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis18
MusicFace: Music-driven expressive singing face synthesis18
Towards robustness and generalization of point cloud representation: A geometry coding method and a large-scale object-level dataset18
Photorealistic fire scene video generation via multimodal large language model and pre-trained video diffusion model17
Contents17
D2ANet: Difference-aware attention network for multi-level change detection from satellite imagery16
Message from Guest Editors of the CVM 2025 Special Issue15
A survey of urban visual analytics: Advances and future directions15
ARM3D: Attention-based relation module for indoor 3D object detection15
Watertight surface reconstruction method for CAD models based on optimal transport13
Multi3D: 3D-aware multimodal image synthesis13
NeuS-PIR: Learning Relightable Neural Surface Using Pre-Integrated Rendering13
DepthGAN: GAN-based depth generation from semantic layouts13
MDFP-Net: A Model-Driven Deep Neural Network for Fourier Ptychography12
MMRelief: Modeling Multi-Human Relief from a Single Photograph12
Sphere face model: A 3D morphable model with hypersphere manifold latent space using joint 2D/3D training12
Benchmarking visual SLAM methods in mirror environments12
Uncertainty Aware Multiple View Stereo Network with Accurate Supervision12
Sem-iNeRF: Camera Pose Refinement by Inverting Neural Radiance Fields with Semantic Feature Consistency12
Prediction of Scene Plausibility12
Constructing self-supporting surfaces with planar quadrilateral elements12
Let's all dance: Enhancing amateur dance motions12
Global video object segmentation with spatial constraint module11
Addressing Missing Modality Challenges in MRI Images: A Comprehensive Review11
Co-occurrence based texture synthesis11
Mindstorms in Natural Language-Based Societies of Mind11
Front cover10
Deep unfolding multi-scale regularizer network for image denoising10
Continuous Indexed Points for Multivariate Volume Visualization9
EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View9
Deep panoramic depth prediction and completion for indoor scenes9
SGformer: Boosting transformers for indoor lighting estimation from a single image9
Front cover9
EFECL: Feature encoding enhancement with contrastive learning for indoor 3D object detection9
Exploring Contextual Priors for Real-World Image Super-Resolution9
A two-step surface-based 3D deep learning pipeline for segmentation of intracranial aneurysms9
A Voronoi diagram approach for detecting defects in 3D printed fiber-reinforced polymers from microscope images9
Neural video field editing9
Front cover9
Hybrid Mesh-Neural Representation for 3D Transparent Object Reconstruction8
Contents8
Point cloud completion via structured feature maps using a feedback network8
Contents8
FCDFusion: A Fast, Low Color Deviation Method for Fusing Visible and Infrared Image Pairs8
PuzzleSorter: Certainty-Aware Visual Restoration of Multiple Cultural Artifacts8
SMixNet: Style mixture network for exemplar-based image translation8
Dynamic ocean inverse modeling based on differentiable rendering8
Front cover8
Towards uniform point distribution in feature-preserving point cloud filtering7
Non-dominated sorting based multi-page photo collage7
A survey on facial image deblurring7
An efficient algorithm for approximate Voronoi diagram construction on triangulated surfaces7
Message from the editor-in-chief7
Focusing on your subject: Deep subject-aware image composition recommendation networks7
Immersive Analytics Meets Artificial Intelligence: A Systematic Review7
Joint specular highlight detection and removal in single images via Unet-Transformer7
Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models6
MagicTalk: Implicit and Explicit Correlation Learning for Diffusion-Based Emotional Talking Face Generation6
A Simple and Effective Filtering Scheme for Improving Neural Fields6
A visual modeling method for spatiotemporal and multidimensional features in epidemiological analysis: Applied COVID-19 aggregated datasets6
An anisotropic Chebyshev descriptor and its optimization for deformable shape correspondence6
Revitalizing Image Dehazing in the Real World: A High-Quality Dataset and a Customized Method6
Polygonal finite element-based content-aware image warping5
Cross-modal learning using privileged information for long-tailed image classification5
Attention mechanisms in computer vision: A survey5
NPRportrait 1.0: A three-level benchmark for non-photorealistic rendering of portraits5
ImVoxelENet: Image to Voxels Epipolar Transformer for Multi-View RGB-Based 3D Object Detection5
Front cover5
Super-resolution reconstruction of single image for latent features5
Continual few-shot patch-based learning for anime-style colorization5
Message from the editor-in-chief5
Front Cover5
Lossless Intrinsic Image Decomposition via Learning Shading Feature Filtering5
FilterGNN: Image feature matching with cascaded outlier filters and linear attention5
PVT v2: Improved baselines with pyramid vision transformer5
Contents5
Exploring a Hierarchical Cross-Attention Transformer for High-Speed Tracking5
Autocompletion of repetitive stroking with image guidance4
Pyramid-Angular-Constraint Network for Light Field Super-Resolution4
JNeRF: An efficient heterogeneous NeRF model zoo based on Jittor4
LucIE: Language-Guided Local Image Editing for Fashion Images4
Full-duplex strategy for video object segmentation4
Learning physically based material and lighting decompositions for face editing4
PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation4
FEDNet: A Feature-Enhanced Diffusion Network for Efficient and Universal Texture Synthesis4
Point Mask Transformer for Outdoor Point Cloud Semantic Segmentation4
3D corrective nose reconstruction from a single image4
AR assistance for efficient dynamic target search4
A Comprehensive Survey on the Research and Development of RGB-T Salient Object Detection4
SAM-driven MAE pre-training and background-aware meta-learning for unsupervised vehicle re-identification4
BoostPoint: Boosting Point Cloud Backbones with Image Pre-Training for 3D Understanding4
Multi-Color Compressive Hologram Synthesis with Learned Wave Propagation4
Noise4Denoise: Leveraging noise for unsupervised point cloud denoising4
Emotion Amplification of Facial Videos Using a Fine-Tuned StyleGAN4
Multi-modal visual tracking: Review and experimental comparison4
Audio-guided implicit neural representation for local image stylization3
GRIG: Data-Efficient Generative Residual Image Inpainting3
PMSSC: Parallelizable multi-subset based self-expressive model for subspace clustering3
A survey on rendering homogeneous participating media3
ARNet: Attribute Artifact Reduction for G-PCC Compressed Point Clouds3
Recent advances in 3D Gaussian splatting3
CLIP-SP: Vision-language model with adaptive prompting for scene parsing3
Multi-Task Gradual Inference with a Single Encoder–Decoder Network for Automatic Portrait Matting3
Angle-uniform parallel coordinates3
Class Incremental Learning via Feature Space Calibration3
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding3
FastMAE: Efficient Masked Autoencoder with Offline Tokenizer3
Message from the best paper award committee3
BLNet: Bidirectional learning network for point clouds3
Foundation models meet visualizations: Challenges and opportunities3
LDSwap: A Semantic-Related Latent Code Disentangling Method in StyleSpace Towards High-Resolution Face Swapping3
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding3
Contents2
Taming diffusion model for exemplar-based image translation2
RecStitchNet: Learning to stitch images with rectangular boundaries2
JVCSR+: Adaptively Learned Video Compressive Sensing Reconstruction with Joint in-Loop Reference Enhancement and Out-Loop Super-Resolution2
Dance2MIDI: Dance-driven multi-instrument music generation2
FloW-Deformation-Aware Point Cloud Completion Network for 3D Metal Bent Tube2
Class-conditional domain adaptation for semantic segmentation2
Progressive edge-sensing dynamic scene deblurring2
Contents2
Spatiotemporal Fusion Transformer for Video Demoiréing2
Specificity-preserving RGB-D saliency detection2
HoLens: A visual analytics design for higher-order movement modeling and visualization2
Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module2
Remote Sensing Tuning: A Survey2
Decoupled Two-Stage Talking Head Generation via Gaussian-Landmark-Based Neural Radiance Fields2
Language Interprets Vision: Adaptive Encoding and Decoding for Referring Image Segmentation2
Discriminative feature encoding for intrinsic image decomposition2
DragTex: Generative Point-Based Texture Editing on 3D Mesh2
0.14413785934448