Topic

NeRF

Updated 2026.03.31 · 328 papers

EmoTaG: Emotion-Aware Talking Head Synthesis on Gaussian Splatting with Few-Shot Personalization Haolan Xu, Keli Cheng, Lei Wang, Ning Bi, Xiaoming Liu Updated 2026-03-28

Audio-driven 3D talking head synthesis has advanced rapidly with Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS). By leveraging rich pre-trained priors, few-shot methods enable instant personalization from just a few seconds of video. However, under expressive facial motion, existing few-shot approaches often suffer from geometric instability and audio-emotion mismatch, highlighting the need for more effective emotion-aware motion modeling. In this work, we present EmoTaG, a few-shot emotion-aware 3D talking head synthesis framework built on the Pretrain-and-Adapt paradigm. Our key insight is to reformulate motion prediction in a structured FLAME parameter space rather than directly deforming 3D Gaussians, thereby introducing explicit geometric priors that improve motion stability. Building upon this, we propose a Gated Residual Motion Network (GRMN), which captures emotional prosody from audio while supplementing head pose and upper-face cues absent from audio, enabling expressive and coherent motion generation. Extensive experiments demonstrate that EmoTaG achieves state-of-the-art performance in emotional expressiveness, lip synchronization, visual realism, and motion stability.

Few TensoRF: Enhance the Few-shot on Tensorial Radiance Fields Thanh-Hai Le, Hoang-Hau Tran, Trong-Nghia Vu Updated 2026-03-27

This paper presents Few TensoRF, a 3D reconstruction framework that combines TensoRF's efficient tensor-based representation with FreeNeRF's frequency-driven few-shot regularization. Using TensoRF to significantly accelerate rendering speed and introducing frequency and occlusion masks, the method improves stability and reconstruction quality under sparse input views. Experiments on the Synthetic NeRF benchmark show that Few TensoRF improves the average PSNR from 21.45 dB (TensoRF) to 23.70 dB, with the fine-tuned version reaching 24.52 dB, while maintaining TensoRF's fast \(\approx10-15\) minute training time. Experiments on the THuman 2.0 dataset further demonstrate competitive performance in human body reconstruction, achieving 27.37 to 34.00 dB with only eight input images. These results highlight Few TensoRF as an efficient and data-effective solution for real-time 3D reconstruction across diverse scenes.
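
As a rough illustration of the frequency mask idea mentioned in this abstract, the sketch below reveals positional-encoding dimensions gradually, from low to high frequency. The linear schedule and the names `frequency_mask` and `apply_mask` are assumptions made for illustration; the abstract does not state the schedule that Few TensoRF actually uses.

```python
def frequency_mask(num_freq_dims, step, total_steps):
    """Per-dimension weights in [0, 1] for a positional encoding.

    Dimensions are assumed to be ordered from low to high frequency.
    The visible fraction grows linearly with training progress
    (an assumed schedule, not taken from the paper).
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    progress = min(max(step / total_steps, 0.0), 1.0)
    visible = progress * num_freq_dims  # may be fractional
    # Fully visible dims get 1, the partially revealed dim gets the
    # fractional remainder, and everything above stays at 0.
    return [min(max(visible - i, 0.0), 1.0) for i in range(num_freq_dims)]


def apply_mask(encoding, mask):
    """Element-wise product of an encoded feature vector and the mask."""
    return [e * m for e, m in zip(encoding, mask)]
```

Early in training only the lowest frequencies pass through, which is the usual motivation for this kind of regularizer under sparse views: high-frequency components are the ones most prone to overfitting a handful of images.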

FluidGaussian: Propagating Simulation-Based Uncertainty Toward Functionally-Intelligent 3D Reconstruction Yuqiu Liu, Jialin Song, Marissa Ramirez de Chanlatte, Rochishnu Chowdhury, Rushil Paresh Desai, Wuyang Chen, Daniel Martin, Michael W. Mahoney Updated 2026-03-27

Real objects that inhabit the physical world follow physical laws and thus behave plausibly during interaction with other physical objects. However, current methods that perform 3D reconstructions of real-world scenes from multi-view 2D images optimize primarily for visual fidelity, i.e., they train with photometric losses and reason about uncertainty in the image or representation space. This appearance-centric view overlooks body contacts and couplings, conflates function-critical regions (e.g., aerodynamic or hydrodynamic surfaces) with ornamentation, and reconstructs structures suboptimally, even when physical regularizers are added. All these can lead to unphysical and implausible interactions. To address this, we consider the question: How can 3D reconstruction become aware of real-world interactions and underlying object functionality, beyond visual cues? To answer this question, we propose FluidGaussian, a plug-and-play method that tightly couples geometry reconstruction with ubiquitous fluid-structure interactions to assess surface quality at high granularity. We define a simulation-based uncertainty metric induced by fluid simulations and integrate it with active learning to prioritize views that improve both visual and physical fidelity. In an empirical evaluation on NeRF Synthetic (Blender), Mip-NeRF 360, and DrivAerNet++, our FluidGaussian method yields up to +8.6% visual PSNR (Peak Signal-to-Noise Ratio) and -62.3% velocity divergence during fluid simulations. Our code is available at https://github.com/delta-lab-ai/FluidGaussian.

3D Gaussian Splatting with Self-Constrained Priors for High Fidelity Surface Reconstruction Takeshi Noda, Yu-Shen Liu, Zhizhong Han Updated 2026-03-26

Rendering 3D surfaces has been revolutionized within the modeling of radiance fields through either 3DGS or NeRF. Although 3DGS has shown advantages over NeRF in terms of rendering quality or speed, there is still room for improvement in recovering high fidelity surfaces through 3DGS. To resolve this issue, we propose a self-constrained prior to constrain the learning of 3D Gaussians, aiming for more accurate depth rendering. Our self-constrained prior is derived from a TSDF grid that is obtained by fusing the depth maps rendered with current 3D Gaussians. The prior measures a distance field around the estimated surface, offering a band centered at the surface for imposing more specific constraints on 3D Gaussians, such as removing Gaussians outside the band, moving Gaussians closer to the surface, and encouraging larger or smaller opacity in a geometry-aware manner. More importantly, our prior can be regularly updated by the most recent depth images which are usually more accurate and complete. In addition, the prior can also progressively narrow the band to tighten the imposed constraints. We justify our idea and report our superiority over the state-of-the-art methods in evaluations on widely used benchmarks.

UniQueR: Unified Query-based Feedforward 3D Reconstruction Chensheng Peng, Quentin Herau, Jiezhi Yang, Yichen Xie, Yihan Hu, Wenzhao Zheng, Matthew Strong, Masayoshi Tomizuka, Wei Zhan Updated 2026-03-24

We present UniQueR, a unified query-based feedforward framework for efficient and accurate 3D reconstruction from unposed images. Existing feedforward models such as DUSt3R, VGGT, and AnySplat typically predict per-pixel point maps or pixel-aligned Gaussians, which remain fundamentally 2.5D and limited to visible surfaces. In contrast, UniQueR formulates reconstruction as a sparse 3D query inference problem. Our model learns a compact set of 3D anchor points that act as explicit geometric queries, enabling the network to infer scene structure, including geometry in occluded regions, in a single forward pass. Each query encodes spatial and appearance priors directly in global 3D space (instead of per-frame camera space) and spawns a set of 3D Gaussians for differentiable rendering. By leveraging unified query interactions across multi-view features and a decoupled cross-attention design, UniQueR achieves strong geometric expressiveness while substantially reducing memory and computational cost. Experiments on Mip-NeRF 360 and VR-NeRF demonstrate that UniQueR surpasses state-of-the-art feedforward methods in both rendering quality and geometric accuracy, using an order of magnitude fewer primitives than dense alternatives.

SatGeo-NeRF: Geometrically Regularized NeRF for Satellite Imagery Valentin Wagner, Sebastian Bullinger, Michael Arens, Rainer Stiefelhagen Updated 2026-03-23

We present SatGeo-NeRF, a geometrically regularized NeRF for satellite imagery that mitigates overfitting-induced geometric artifacts observed in current state-of-the-art models using three model-agnostic regularizers. Gravity-Aligned Planarity Regularization aligns depth-inferred, approximated surface normals with the gravity axis to promote local planarity, coupling adjacent rays via a corresponding surface approximation to facilitate cross-ray gradient flow. Granularity Regularization enforces a coarse-to-fine geometry-learning scheme, and Depth-Supervised Regularization stabilizes early training for improved geometric accuracy. On the DFC2019 satellite reconstruction benchmark, SatGeo-NeRF improves the Mean Altitude Error by 13.9% and 11.7% relative to state-of-the-art baselines such as EO-NeRF and EO-GS.

RefracGS: Novel View Synthesis Through Refractive Water Surfaces with 3D Gaussian Ray Tracing Yiming Shao, Qiyu Dai, Chong Gao, Guanbin Li, Yeqiang Wang, He Sun, Qiong Zeng, Baoquan Chen, Wenzheng Chen Updated 2026-03-23

Novel view synthesis (NVS) through non-planar refractive surfaces presents fundamental challenges due to severe, spatially varying optical distortions. While recent representations like NeRF and 3D Gaussian Splatting (3DGS) excel at NVS, their assumption of straight-line ray propagation fails under these conditions, leading to significant artifacts. To overcome this limitation, we introduce RefracGS, a framework that jointly reconstructs the refractive water surface and the scene beneath the interface. Our key insight is to explicitly decouple the refractive boundary from the target objects: the refractive surface is modeled via a neural height field, capturing wave geometry, while the underlying scene is represented as a 3D Gaussian field. We formulate a refraction-aware Gaussian ray tracing approach that accurately computes non-linear ray trajectories using Snell's law and efficiently renders the underlying Gaussian field while backpropagating the loss gradients to the parameterized refractive surface. Through end-to-end joint optimization of both representations, our method ensures high-fidelity NVS and view-consistent surface recovery. Experiments on both synthetic and real-world scenes with complex waves demonstrate that RefracGS outperforms prior refractive methods in visual quality, while achieving 15x faster training and real-time rendering at 200 FPS. The project page for RefracGS is available at https://yimgshao.github.io/refracgs/.
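
The ray bending that this abstract attributes to Snell's law can be sketched with the standard vector form of refraction. This is a minimal illustration, not the RefracGS implementation: the function names are hypothetical, and the water surface is assumed to be a height field z = h(x, y) whose slopes are already known.

```python
import math


def normalize(v):
    n = math.sqrt(sum(c * c for c in v))
    return tuple(c / n for c in v)


def height_field_normal(dh_dx, dh_dy):
    """Upward unit normal of a surface z = h(x, y) from its slopes."""
    return normalize((-dh_dx, -dh_dy, 1.0))


def refract(direction, normal, n1, n2):
    """Refract a ray crossing from refractive index n1 into index n2.

    direction: incident ray direction (any length).
    normal: surface normal pointing toward the incident side.
    Returns the unit refracted direction, or None on total internal
    reflection.
    """
    d = normalize(direction)
    n = normalize(normal)
    eta = n1 / n2
    cos_i = -sum(a * b for a, b in zip(d, n))
    sin2_t = eta * eta * (1.0 - cos_i * cos_i)  # Snell: sin_t = eta * sin_i
    if sin2_t > 1.0:
        return None
    cos_t = math.sqrt(1.0 - sin2_t)
    return tuple(eta * a + (eta * cos_i - cos_t) * b for a, b in zip(d, n))
```

For example, `refract((0.0, 0.0, -1.0), height_field_normal(0.0, 0.0), 1.0, 1.33)` models a camera ray entering flat water at normal incidence, which passes straight through. Every operation here is differentiable wherever refraction occurs, which is what allows gradients to reach the surface parameters in an optimization setting.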

GaussianPile: A Unified Sparse Gaussian Splatting Framework for Slice-based Volumetric Reconstruction Di Kong, Yikai Wang, Wenjie Guo, Yifan Bu, Boya Zhang, Yuexin Duan, Xiawei Yue, Wenbiao Du, Yiman Zhong, Yuwen Chen, Cheng Ma Updated 2026-03-21

Slice-based volumetric imaging is widely applied and it demands representations that compress aggressively while preserving internal structure for analysis. We introduce GaussianPile, unifying 3D Gaussian splatting with an imaging system-aware focus model to address this challenge. Our proposed method introduces three key innovations: (i) a slice-aware piling strategy that positions anisotropic 3D Gaussians to model through-slice contributions, (ii) a differentiable projection operator that encodes the finite-thickness point spread function of the imaging acquisition system, and (iii) a compact encoding and joint optimization pipeline that simultaneously reconstructs and compresses the Gaussian sets. Our CUDA-based design retains the compression and real-time rendering efficiency of Gaussian primitives while preserving high-frequency internal volumetric detail. Experiments on microscopy and ultrasound datasets demonstrate that our method reduces storage and reconstruction cost, sustains diagnostic fidelity, and enables fast 2D visualization, along with 3D voxelization. In practice, it delivers high-quality results in as few as 3 minutes, up to 11x faster than NeRF-based approaches, and achieves consistent 16x compression over voxel grids, offering a practical path to deployable compression and exploration of slice-based volumetric datasets.

Benchmarking Efficient & Effective Camera Pose Estimation Strategies for Novel View Synthesis Jhacson Meza, Martin R. Oswald, Torsten Sattler Updated 2026-03-20

Novel view synthesis (NVS) approaches such as NeRFs or 3DGS can produce photo-realistic 3D scene representations from a set of images with known extrinsic and intrinsic parameters. The necessary camera poses and calibrations are typically obtained from the images via Structure-from-Motion (SfM). Classical SfM approaches rely on local feature matches between the images to estimate both the poses and a sparse 3D model of the scene, using bundle adjustment to refine initial pose, intrinsics, and geometry estimates. In order to increase run-time efficiency, recent SfM systems forgo optimization via bundle adjustment. Instead, they train feed-forward (transformer-based) neural networks to directly regress camera parameters and the 3D structure. While orders of magnitude more efficient, such recent works produce significantly less accurate estimates. To stimulate research on developing SfM approaches that are both efficient and effective, this paper develops a benchmark focused on SfM for novel view synthesis. Using existing datasets and two simple strategies for making the reconstruction process more efficient, we show that (1) simply using fewer features already significantly accelerates classical SfM methods while maintaining high pose accuracy, and (2) using feed-forward networks to obtain initial estimates and refining them using classical SfM techniques leads to the best efficiency-effectiveness trade-off. We will make our benchmark and code publicly available.

Generalizable NGP-SR: Generalizable Neural Radiance Fields Super-Resolution via Neural Graph Primitives Wanqi Yuan, Omkar Sharad Mayekar, Connor Pennington, Nianyi Li Updated 2026-03-20

Neural Radiance Fields (NeRF) achieve photorealistic novel view synthesis but become costly when high-resolution (HR) rendering is required, as HR outputs demand dense sampling and higher-capacity models. Moreover, naively super-resolving per-view renderings in 2D often breaks multi-view consistency. We propose Generalizable NGP-SR, a 3D-aware super-resolution framework that reconstructs an HR radiance field directly from low-resolution (LR) posed images. Built on Neural Graphics Primitives (NGP), NGP-SR conditions radiance prediction on 3D coordinates and learned local texture tokens, enabling recovery of high-frequency details within the radiance field and producing view-consistent HR novel views without external HR references or post-hoc 2D upsampling. Importantly, our model is generalizable: once trained, it can be applied to unseen scenes and rendered from novel viewpoints without per-scene optimization. Experiments on multiple datasets show that NGP-SR consistently improves both reconstruction quality and runtime efficiency over prior NeRF-based super-resolution methods, offering a practical solution for scalable high-resolution novel view synthesis.

Fast and Generalizable NeRF Architecture Selection for Satellite Scene Reconstruction Devjyoti Chakraborty, Zaki Sukma, Rakandhiya D. Rachmanto, Kriti Ghosh, In Kee Kim, Suchendra M. Bhandarkar, Lakshmish Ramaswamy, Nancy K. O'Hare, Deepak Mishra Updated 2026-03-18

Neural Radiance Fields (NeRF) have emerged as a powerful approach for photorealistic 3D reconstruction from multi-view images. However, deploying NeRF for satellite imagery remains challenging. Each scene requires individual training, and optimizing architectures via Neural Architecture Search (NAS) demands hours to days of GPU time. While existing approaches focus on architectural improvements, our SHAP analysis reveals that multi-view consistency, rather than model architecture, determines reconstruction quality. Based on this insight, we develop PreSCAN, a predictive framework that estimates NeRF quality prior to training using lightweight geometric and photometric descriptors. PreSCAN selects suitable architectures in under 30 seconds with under 1 dB prediction error, achieving a 1000$\times$ speedup over NAS. We further demonstrate PreSCAN's deployment utility on edge platforms (Jetson Orin), where combining its predictions with offline cost profiling reduces inference power by 26% and latency by 43% with minimal quality loss. Experiments on DFC2019 datasets confirm that PreSCAN generalizes across diverse satellite scenes without retraining.

Neural Radiance Maps for Extraterrestrial Navigation and Path Planning Adam Dai, Shubh Gupta, Grace Gao Updated 2026-03-18

Autonomous vehicles such as the Mars rovers currently lead the vanguard of surface exploration on extraterrestrial planets and moons. In order to accelerate the pace of exploration and science objectives, it is critical to plan safe and efficient paths for these vehicles. However, current rover autonomy is limited by a lack of global maps which can be easily constructed and stored for onboard re-planning. Recently, Neural Radiance Fields (NeRFs) have been introduced as a detailed 3D scene representation which can be trained from sparse 2D images and efficiently stored. We propose to use NeRFs to construct maps for online use in autonomous navigation, and present a planning framework which leverages the NeRF map to integrate local and global information. Our approach interpolates local cost observations across global regions using kernel ridge regression over terrain features extracted from the NeRF map, allowing the rover to re-route itself around untraversable areas discovered during online operation. We validate our approach in high-fidelity simulation and demonstrate lower path cost and a higher planning success rate compared to various baselines.
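
The kernel ridge regression step can be illustrated in a few lines. The sketch below is written under assumptions: an RBF kernel, a small dense solver, and made-up feature vectors standing in for terrain features extracted from the NeRF map. The abstract does not specify the kernel or the features, and all function names are hypothetical.

```python
import math


def rbf(a, b, length_scale):
    """Radial basis function kernel between two feature vectors."""
    sq = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return math.exp(-sq / (2.0 * length_scale ** 2))


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def krr_fit(X, y, lam, length_scale):
    """Dual coefficients alpha = (K + lam * I)^-1 y."""
    n = len(X)
    K = [[rbf(X[i], X[j], length_scale) + (lam if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    return solve(K, y)


def krr_predict(X, alpha, x_new, length_scale):
    """Predicted cost at x_new: sum_i alpha_i * k(x_i, x_new)."""
    return sum(a * rbf(xi, x_new, length_scale) for a, xi in zip(alpha, X))
```

In this reading, `X` holds features of terrain patches the rover has already observed, `y` holds their observed traversal costs, and `krr_predict` extends those costs to unvisited patches with similar features. The regularizer `lam` controls how closely predictions follow the individual observations.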

E2EGS: Event-to-Edge Gaussian Splatting for Pose-Free 3D Reconstruction Yunsoo Kim, Changki Sung, Dasol Hong, Hyun Myung Updated 2026-03-16

The emergence of neural radiance fields (NeRF) and 3D Gaussian splatting (3DGS) has advanced novel view synthesis (NVS). These methods, however, require high-quality RGB inputs and accurate corresponding poses, limiting robustness under real-world conditions such as fast camera motion or adverse lighting. Event cameras, which capture brightness changes at each pixel with high temporal resolution and wide dynamic range, enable precise sensing of dynamic scenes and offer a promising solution. However, existing event-based NVS methods either assume known poses or rely on depth estimation models that are bounded by their initial observations, failing to generalize as the camera traverses previously unseen regions. We present E2EGS, a pose-free framework operating solely on event streams. Our key insight is that edge information provides rich structural cues essential for accurate trajectory estimation and high-quality NVS. To extract edges from noisy event streams, we exploit the distinct spatio-temporal characteristics of edges and non-edge regions. The event camera's movement induces consistent events along edges, while non-edge regions produce sparse noise. We leverage this through a patch-based temporal coherence analysis that measures local variance to extract edges while robustly suppressing noise. The extracted edges guide structure-aware Gaussian initialization and enable edge-weighted losses throughout initialization, tracking, and bundle adjustment. Extensive experiments on both synthetic and real datasets demonstrate that E2EGS achieves superior reconstruction quality and trajectory accuracy, establishing a fully pose-free paradigm for event-based 3D reconstruction.

Spectral-Geometric Neural Fields for Pose-Free LiDAR View Synthesis Yinuo Jiang, Jun Cheng, Yiran Wang, Cheng Cheng Updated 2026-03-13

Neural Radiance Fields (NeRF) have shown remarkable success in image novel view synthesis (NVS), inspiring extensions to LiDAR NVS. However, most methods heavily rely on accurate camera poses for scene reconstruction. The sparsity and textureless nature of LiDAR data also present distinct challenges, leading to geometric holes and discontinuous surfaces. To address these issues, we propose SG-NLF, a pose-free LiDAR NeRF framework that integrates spectral information with geometric consistency. Specifically, we design a hybrid representation based on spectral priors to reconstruct smooth geometry. For pose optimization, we construct a confidence-aware graph based on feature compatibility to achieve global alignment. In addition, an adversarial learning strategy is introduced to enforce cross-frame consistency, thereby enhancing reconstruction quality. Comprehensive experiments demonstrate the effectiveness of our framework, especially in challenging low-frequency scenarios. Compared to previous state-of-the-art methods, SG-NLF improves reconstruction quality and pose accuracy by over 35.8% and 68.8%, respectively. Our work can provide a novel perspective for LiDAR view synthesis.

Catalyst4D: High-Fidelity 3D-to-4D Scene Editing via Dynamic Propagation Shifeng Chen, Yihui Li, Jun Liao, Hongyu Yang, Di Huang Updated 2026-03-13

Recent advances in 3D scene editing using NeRF and 3DGS enable high-quality static scene editing. In contrast, dynamic scene editing remains challenging, as methods that directly extend 2D diffusion models to 4D often produce motion artifacts, temporal flickering, and inconsistent style propagation. We introduce Catalyst4D, a framework that transfers high-quality 3D edits to dynamic 4D Gaussian scenes while maintaining spatial and temporal coherence. At its core, Anchor-based Motion Guidance (AMG) builds a set of structurally stable and spatially representative anchors from both original and edited Gaussians. These anchors serve as robust region-level references, and their correspondences are established via optimal transport to enable consistent deformation propagation without cross-region interference or motion drift. Complementarily, Color Uncertainty-guided Appearance Refinement (CUAR) preserves temporal appearance consistency by estimating per-Gaussian color uncertainty and selectively refining regions prone to occlusion-induced artifacts. Extensive experiments demonstrate that Catalyst4D achieves temporally stable, high-fidelity dynamic scene editing and outperforms existing methods in both visual quality and motion coherence.

Node-RF: Learning Generalized Continuous Space-Time Scene Dynamics with Neural ODE-based NeRFs Hiran Sarkar, Liming Kuang, Yordanka Velikova, Benjamin Busam Updated 2026-03-13

Predicting scene dynamics from visual observations is challenging. Existing methods capture dynamics only within observed boundaries, failing to extrapolate far beyond the training sequence. Node-RF (Neural ODE-based NeRF) overcomes this limitation by integrating Neural Ordinary Differential Equations (NODEs) with dynamic Neural Radiance Fields (NeRFs), enabling a continuous-time, spatiotemporal representation that generalizes beyond observed trajectories at constant memory cost. From visual input, Node-RF learns an implicit scene state that evolves over time via an ODE solver, propagating feature embeddings via differential calculus. A NeRF-based renderer interprets calculated embeddings to synthesize arbitrary views for long-range extrapolation. Training on multiple motion sequences with shared dynamics allows for generalization to unseen conditions. Our experiments demonstrate that Node-RF can characterize abstract system behavior without an explicit model to identify critical points for future predictions.
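
To make "a scene state that evolves over time via an ODE solver" concrete, the sketch below integrates a latent state with a classical fourth-order Runge-Kutta scheme. It is illustrative only: in a neural ODE the function `f` would be a learned network, whereas here it is any Python callable, and the abstract does not say which solver Node-RF uses.

```python
def rk4_step(f, z, t, dt):
    """One Runge-Kutta 4 step for dz/dt = f(z, t), with z a list of floats."""
    k1 = f(z, t)
    k2 = f([zi + 0.5 * dt * ki for zi, ki in zip(z, k1)], t + 0.5 * dt)
    k3 = f([zi + 0.5 * dt * ki for zi, ki in zip(z, k2)], t + 0.5 * dt)
    k4 = f([zi + dt * ki for zi, ki in zip(z, k3)], t + dt)
    return [zi + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for zi, a, b, c, d in zip(z, k1, k2, k3, k4)]


def integrate(f, z0, t0, t1, num_steps):
    """Evolve the state from t0 to t1; t1 may lie beyond observed times."""
    dt = (t1 - t0) / num_steps
    z, t = list(z0), t0
    for _ in range(num_steps):
        z = rk4_step(f, z, t, dt)
        t += dt
    return z
```

Only the current state is kept while stepping forward, so memory does not grow with the prediction horizon. The evolved state would then be passed to a renderer to produce views at the queried time.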

DenoiseSplat: Feed-Forward Gaussian Splatting for Noisy 3D Scene Reconstruction Fuzhen Jiang, Zhuoran Li, Yinlin Zhang Updated 2026-03-10

3D scene reconstruction and novel-view synthesis are fundamental for VR, robotics, and content creation. However, most NeRF and 3D Gaussian Splatting pipelines assume clean inputs and degrade under real noise and artifacts. We therefore propose DenoiseSplat, a feed-forward 3D Gaussian splatting method for noisy multi-view images. We build a large-scale, scene-consistent noisy-clean benchmark on RE10K by injecting Gaussian, Poisson, speckle, and salt-and-pepper noise with controlled intensities. With a lightweight MVSplat-style feed-forward backbone, we train end-to-end using only clean 2D renderings as supervision and no 3D ground truth. On noisy RE10K, DenoiseSplat outperforms vanilla MVSplat and a strong two-stage baseline (IDF + MVSplat) in PSNR/SSIM and LPIPS across noise types and levels.
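
The four noise families named in this abstract have simple textbook definitions, sketched below for a flat list of pixel intensities in [0, 1]. The parameter names (`sigma`, `peak`, `amount`) and their ranges are assumptions for illustration; the abstract does not give the intensities used to build the benchmark.

```python
import math
import random


def clip01(x):
    return min(max(x, 0.0), 1.0)


def gaussian_noise(pixels, sigma, rng):
    """Additive noise: p + n, with n ~ N(0, sigma^2)."""
    return [clip01(p + rng.gauss(0.0, sigma)) for p in pixels]


def poisson_sample(lam, rng):
    """Knuth's method; adequate for the small rates used here."""
    limit, k, prod = math.exp(-lam), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def poisson_noise(pixels, peak, rng):
    """Shot noise: each pixel becomes a Poisson count scaled by `peak`."""
    return [clip01(poisson_sample(p * peak, rng) / peak) for p in pixels]


def speckle_noise(pixels, sigma, rng):
    """Multiplicative noise: p * (1 + n), with n ~ N(0, sigma^2)."""
    return [clip01(p * (1.0 + rng.gauss(0.0, sigma))) for p in pixels]


def salt_and_pepper(pixels, amount, rng):
    """Replace a fraction `amount` of pixels with pure black or white."""
    out = []
    for p in pixels:
        if rng.random() < amount:
            out.append(1.0 if rng.random() < 0.5 else 0.0)
        else:
            out.append(p)
    return out
```

Passing a seeded `random.Random` instance as `rng` keeps the corruption reproducible, which matters when the same noise settings need to be applied to every view of a scene.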

Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists Jiaqi Liu, Zhizhong Han Updated 2026-03-10

3D Gaussian splatting (3DGS) has become a vital tool for learning a radiance field from multiple posed images. Although 3DGS shows great advantages over NeRF in terms of rendering quality and efficiency, it remains a research challenge to further improve the efficiency of learning 3D Gaussians. To overcome this challenge, we propose novel training strategies and losses to shorten each Gaussian list used to render a pixel, which speeds up the splatting by involving fewer Gaussians along a ray. Specifically, we shrink the size of each Gaussian by resetting their scales regularly, encouraging smaller Gaussians to cover fewer nearby pixels, which shortens the Gaussian lists of pixels. Additionally, we introduce an entropy constraint on the alpha blending procedure to sharpen the weight distribution of Gaussians along each ray, which drives dominant weights larger while making minor weights smaller. As a result, each Gaussian becomes more focused on the pixels where it is dominant, which reduces its impact on nearby pixels, leading to even shorter Gaussian lists. Eventually, we integrate our method into a rendering resolution scheduler which further improves efficiency through progressive resolution increase. We evaluate our method by comparing it with state-of-the-art methods on widely used benchmarks. Our results show significant advantages over others in efficiency without sacrificing rendering quality.
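
The quantity that an entropy constraint on alpha blending would act on can be written down directly. The sketch below computes front-to-back blending weights along one ray and the Shannon entropy of their normalized distribution. The exact loss used in the paper is not given in the abstract, so the normalization and the function names are assumptions.

```python
import math


def blending_weights(alphas):
    """Front-to-back alpha-blending weights along one ray.

    w_i = alpha_i * prod_{j < i} (1 - alpha_j)
    """
    weights = []
    transmittance = 1.0
    for a in alphas:
        weights.append(a * transmittance)
        transmittance *= 1.0 - a
    return weights


def weight_entropy(alphas, eps=1e-12):
    """Shannon entropy (in nats) of the normalized blending weights."""
    w = blending_weights(alphas)
    total = sum(w)
    if total <= eps:
        return 0.0
    entropy = 0.0
    for wi in w:
        p = wi / total
        if p > eps:
            entropy -= p * math.log(p)
    return entropy
```

A ray dominated by a single Gaussian has entropy near zero, while a ray whose colour is spread evenly over many Gaussians has high entropy. Penalizing this value therefore pushes toward the sharper weight distributions the abstract describes.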

SkipGS: Post-Densification Backward Skipping for Efficient 3DGS Training Jingxing Li, Yongjae Lee, Deliang Fan Updated 2026-03-09

3D Gaussian Splatting (3DGS) achieves real-time novel-view synthesis by optimizing millions of anisotropic Gaussians, yet its training remains expensive, with the backward pass dominating runtime in the post-densification refinement phase. We observe substantial update redundancy in this phase: many sampled views have near-plateaued losses and provide diminishing gradient benefits, but standard training still runs full backpropagation. We propose SkipGS with a novel view-adaptive backward gating mechanism for efficient post-densification training. SkipGS always performs the forward pass to update per-view loss statistics, and selectively skips backward passes when the sampled view's loss is consistent with its recent per-view baseline, while enforcing a minimum backward budget for stable optimization. On Mip-NeRF 360, compared to 3DGS, SkipGS reduces end-to-end training time by 23.1%, driven by a 42.0% reduction in post-densification time, with comparable reconstruction quality. Because it only changes when to backpropagate -- without modifying the renderer, representation, or loss -- SkipGS is plug-and-play and compatible with other complementary efficiency strategies for additive speedups.
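
One way to picture the gating rule described above is a small bookkeeping class that tracks a running per-view loss baseline and decides whether to run the backward pass. Everything specific here is an assumption made for illustration: the exponential moving average, the relative tolerance `tol`, the form of the budget check, and the class name `BackwardGate`. The abstract describes the behaviour, not these details.

```python
class BackwardGate:
    """Decide per iteration whether to run backpropagation (hypothetical)."""

    def __init__(self, tol=0.02, momentum=0.9, min_backward_ratio=0.3):
        self.tol = tol                      # relative deviation treated as "consistent"
        self.momentum = momentum            # EMA factor for the per-view baseline
        self.min_backward_ratio = min_backward_ratio
        self.baseline = {}                  # view_id -> running loss baseline
        self.steps = 0
        self.backwards = 0

    def should_backward(self, view_id, loss):
        """Call after every forward pass with that view's loss value."""
        self.steps += 1
        prev = self.baseline.get(view_id)
        if prev is None:
            run = True                      # no history yet: always update
            self.baseline[view_id] = loss
        else:
            consistent = abs(loss - prev) <= self.tol * prev
            under_budget = self.backwards < self.min_backward_ratio * self.steps
            run = (not consistent) or under_budget
            self.baseline[view_id] = (self.momentum * prev
                                      + (1.0 - self.momentum) * loss)
        if run:
            self.backwards += 1
        return run
```

In a training loop, the forward pass and loss would be computed as usual, and the backward pass and optimizer step would run only when `should_backward` returns `True`. The renderer, the representation, and the loss itself stay untouched.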

Fast Low-light Enhancement and Deblurring for 3D Dark Scenes Feng Zhang, Jinglong Wang, Ze Li, Yanghong Zhou, Yang Chen, Lei Chen, Xiatian Zhu Updated 2026-03-09

Novel view synthesis from low-light, noisy, and motion-blurred imagery remains a valuable and challenging task. Current volumetric rendering methods struggle with compound degradation, and sequential 2D preprocessing introduces artifacts due to interdependencies. In this work, we introduce FLED-GS, a fast low-light enhancement and deblurring framework that reformulates 3D scene restoration as an alternating cycle of enhancement and reconstruction. Specifically, FLED-GS inserts several intermediate brightness anchors to enable progressive recovery, preventing noise blow-up from harming deblurring or geometry. Each iteration sharpens inputs with an off-the-shelf 2D deblurrer and then performs noise-aware 3DGS reconstruction that estimates and suppresses noise while producing clean priors for the next level. Experiments show FLED-GS outperforms state-of-the-art LuSh-NeRF, achieving 21$\times$ faster training and 11$\times$ faster rendering.

Virtual Intraoperative CT (viCT): Sequential Anatomic Updates for Modeling Tissue Resection Throughout Endoscopic Sinus Surgery Nicole M. Gunderson, Graham J. Harris, Jeremy S. Ruthberg, Pengcheng Chen, Di Mao, Randall A. Bly, Waleed M. Abuzeid, Eric J. Seibel Updated 2026-03-07

Purpose: Incomplete dissection is a common cause of persistent disease and revision endoscopic sinus surgery (ESS) in chronic rhinosinusitis. Current image-guided surgery systems typically reference static preoperative CT (pCT), and do not model evolving resection boundaries. We present Virtual Intraoperative CT (viCT), a method for sequentially updating pCT throughout ESS using intraoperative 3D reconstructions from monocular endoscopic video to enable visualization of evolving anatomy in CT format. Methods: Monocular endoscopic video is processed using a depth-supervised NeRF framework with virtual stereo synthesis to generate metrically scaled 3D reconstructions at multiple surgical intervals. Reconstructions undergo rigid, landmark-based registration in 3D Slicer guided by anatomical correspondences, and are then voxelized into the pCT grid. viCT volumes are generated using a ray-based occupancy comparison between pCT and reconstruction to delete outdated voxels and remap preserved anatomy and updated boundaries. Performance is evaluated in a cadaveric feasibility study of four specimens across four ESS stages using volumetric overlap (DSC, Jaccard) and surface metrics (HD95, Chamfer, MSD, RMSD), and qualitative comparisons to ground-truth CT. Results: viCT updates show agreement with ground-truth anatomy across surgical stages, with submillimeter mean surface errors: Dice Similarity Coefficient (DSC) = 0.88 +/- 0.05, Jaccard Index = 0.79 +/- 0.07, Hausdorff Distance 95% (HD95) = 0.69 +/- 0.28 mm, Chamfer Distance = 0.09 +/- 0.05 mm, Mean Surface Distance (MSD) = 0.11 +/- 0.05 mm, and Root Mean Square Distance (RMSD) = 0.32 +/- 0.10 mm. Conclusion: viCT enables CT-format anatomic updating in an ESS setting without ancillary hardware. Future work will focus on fully automating registration, validation in live cases, and optimizing runtime for real-time deployment.
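
The two volumetric overlap metrics reported above have standard definitions. The sketch below applies them to sets of occupied voxel indices, which is an assumed data layout chosen for brevity and not necessarily the one used in the study.

```python
def dice(a, b):
    """Dice Similarity Coefficient: 2|A ∩ B| / (|A| + |B|)."""
    if not a and not b:
        return 1.0
    return 2.0 * len(a & b) / (len(a) + len(b))


def jaccard(a, b):
    """Jaccard Index: |A ∩ B| / |A ∪ B|."""
    union = a | b
    if not union:
        return 1.0
    return len(a & b) / len(union)
```

For a single pair of volumes the two scores are tied by J = D / (2 - D), so they rank any two reconstructions the same way. Reporting both mainly helps comparison with prior work that uses one or the other.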

FTSplat: Feed-forward Triangle Splatting Network Xiong Jinlin, Li Can, Shen Jiawei, Qi Zhigang, Sun Lei, Zhao Dongyang Updated 2026-03-06

High-fidelity three-dimensional (3D) reconstruction is essential for robotics and simulation. While Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) achieve impressive rendering quality, their reliance on time-consuming per-scene optimization limits real-time deployment. Emerging feed-forward Gaussian splatting methods improve efficiency but often lack explicit, manifold geometry required for direct simulation. To address these limitations, we propose a feed-forward framework for triangle primitive generation that directly predicts continuous triangle surfaces from calibrated multi-view images. Our method produces simulation-ready models in a single forward pass, obviating the need for per-scene optimization or post-processing. We introduce a pixel-aligned triangle generation module and incorporate relative 3D point cloud supervision to enhance geometric learning stability and consistency. Experiments demonstrate that our method achieves efficient reconstruction while maintaining seamless compatibility with standard graphics and robotic simulators.

Towards 3D Scene Understanding of Gas Plumes in LWIR Hyperspectral Images Using Neural Radiance Fields Scout Jarman, Zigfried Hampel-Arias, Adra Carr, Kevin R. Moon Updated 2026-03-05

Hyperspectral images (HSI) have many applications, ranging from environmental monitoring to national security, and can be used for material detection and identification. Longwave infrared (LWIR) HSI can be used for gas plume detection and analysis. Oftentimes, only a few images of a scene of interest are available and are analyzed individually. The ability to combine information from multiple images into a single, cohesive representation could enhance analysis by providing more context on the scene's geometry and spectral properties. Neural radiance fields (NeRFs) create a latent neural representation of volumetric scene properties that enable novel-view rendering and geometry reconstruction, offering a promising avenue for hyperspectral 3D scene reconstruction. We explore the possibility of using NeRFs to create 3D scene reconstructions from LWIR HSI and demonstrate that the model can be used for the basic downstream analysis task of gas plume detection. The physics-based DIRSIG software suite was used to generate a synthetic multi-view LWIR HSI dataset of a simple facility with a strong sulfur hexafluoride gas plume. Our method, built on the standard Mip-NeRF architecture, combines state-of-the-art methods for hyperspectral NeRFs and sparse-view NeRFs, along with a novel adaptive weighted MSE loss. Our final NeRF method requires around 50% fewer training images than the standard Mip-NeRF and achieves an average PSNR of 39.8 dB with as few as 30 training images. Gas plume detection applied to NeRF-rendered test images using the adaptive coherence estimator achieves an average AUC of 0.821 when compared with detection masks generated from ground-truth test images.

GloSplat: Joint Pose-Appearance Optimization for Faster and More Accurate 3D Reconstruction Tianyu Xiong, Rui Li, Linjie Li, Jiaqi Yang Updated 2026-03-05

Feature extraction, matching, structure from motion (SfM), and novel view synthesis (NVS) have traditionally been treated as separate problems with independent optimization objectives. We present GloSplat, a framework that performs joint pose-appearance optimization during 3D Gaussian Splatting training. Unlike prior joint optimization methods (BARF, NeRF--, 3RGS) that rely purely on photometric gradients for pose refinement, GloSplat preserves explicit SfM feature tracks as first-class entities throughout training: track 3D points are maintained as separate optimizable parameters from Gaussian primitives, providing persistent geometric anchors via a reprojection loss that operates alongside photometric supervision. This architectural choice prevents early-stage pose drift while enabling fine-grained refinement, a capability absent in photometric-only approaches. We introduce two pipeline variants: (1) GloSplat-F, a COLMAP-free variant using retrieval-based pair selection for efficient reconstruction, and (2) GloSplat-A, an exhaustive matching variant for maximum quality. Both employ global SfM initialization followed by joint photometric-geometric optimization during 3DGS training. Experiments demonstrate that GloSplat-F achieves state-of-the-art results among COLMAP-free methods while GloSplat-A surpasses all COLMAP-based baselines.

R3GW: Relightable 3D Gaussians for Outdoor Scenes in the Wild Margherita Lea Corona, Wieland Morgenstern, Peter Eisert, Anna Hilsmann Updated 2026-03-03

3D Gaussian Splatting (3DGS) has established itself as a leading technique for 3D reconstruction and novel view synthesis of static scenes, achieving outstanding rendering quality and fast training. However, the method does not explicitly model the scene illumination, making it unsuitable for relighting tasks. Furthermore, 3DGS struggles to reconstruct scenes captured in the wild by unconstrained photo collections featuring changing lighting conditions. In this paper, we present R3GW, a novel method that learns a relightable 3DGS representation of an outdoor scene captured in the wild. Our approach separates the scene into a relightable foreground and a non-reflective background (the sky), using two distinct sets of Gaussians. R3GW models view-dependent lighting effects in the foreground reflections by combining Physically Based Rendering with the 3DGS scene representation in a varying illumination setting. We evaluate our method quantitatively and qualitatively on the NeRF-OSR dataset, offering state-of-the-art performance and enhanced support for physically-based relighting of unconstrained scenes. Our method synthesizes photorealistic novel views under arbitrary illumination conditions. Additionally, our representation of the sky mitigates depth reconstruction artifacts, improving rendering quality at the sky-foreground boundary.

Neural Electromagnetic Fields for High-Resolution Material Parameter Reconstruction Zhe Chen, Peilin Zheng, Wenshuo Chen, Xiucheng Wang, Yutao Yue, Nan Cheng Updated 2026-03-03

Creating functional Digital Twins, simulatable 3D replicas of the real world, is a central challenge in computer vision. Current methods like NeRF produce visually rich but functionally incomplete twins. The key barrier is the lack of underlying material properties (e.g., permittivity, conductivity). Acquiring this information for every point in a scene via non-contact, non-invasive sensing is a primary goal, but it demands solving a notoriously ill-posed physical inversion problem. Standard remote signals, like images and radio frequencies (RF), deeply entangle the unknown geometry, ambient field, and target materials. We introduce NEMF, a novel framework for dense, non-invasive physical inversion designed to build functional digital twins. Our key insight is a systematic disentanglement strategy. NEMF leverages high-fidelity geometry from images as a powerful anchor, which first enables the resolution of the ambient field. By constraining both geometry and field using only non-invasive data, the original ill-posed problem transforms into a well-posed, physics-supervised learning task. This transformation unlocks our core inversion module: a decoder. Guided by ambient RF signals and a differentiable layer incorporating physical reflection models, it learns to explicitly output a continuous, spatially-varying field of the scene's underlying material parameters. We validate our framework on high-fidelity synthetic datasets. Experiments show our non-invasive inversion reconstructs these material maps with high accuracy, and the resulting functional twin enables high-fidelity physical simulation. This advance moves beyond passive visual replicas, enabling the creation of truly functional and simulatable models of the physical world.

Sapling-NeRF: Geo-Localised Sapling Reconstruction in Forests for Ecological Monitoring Miguel Ángel Muñoz-Bañón, Nived Chebrolu, Sruthi M. Krishna Moorthy, Yifu Tao, Fernando Torres, Roberto Salguero-Gómez, Maurice Fallon Updated 2026-02-26

Saplings are key indicators of forest regeneration and overall forest health. However, their fine-scale architectural traits are difficult to capture with existing 3D sensing methods, which makes quantitative evaluation difficult. Terrestrial Laser Scanners (TLS), Mobile Laser Scanners (MLS), and traditional photogrammetry approaches poorly reconstruct thin branches and dense foliage, and lack the scale consistency needed for long-term monitoring. Implicit 3D reconstruction methods such as Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) are promising alternatives, but cannot recover the true scale of a scene and lack any means to be accurately geo-localised. In this paper, we present a pipeline which fuses NeRF, LiDAR SLAM, and GNSS to enable repeatable, geo-localised ecological monitoring of saplings. Our system proposes a three-level representation: (i) coarse Earth-frame localisation using GNSS, (ii) LiDAR-based SLAM for centimetre-accurate localisation and reconstruction, and (iii) NeRF-derived object-centric dense reconstruction of individual saplings. This approach enables repeatable quantitative evaluation and long-term monitoring of sapling traits. Our experiments in forest plots in Wytham Woods (Oxford, UK) and Evo (Finland) show that stem height, branching patterns, and leaf-to-wood ratios can be captured with increased accuracy as compared to TLS. We demonstrate that accurate stem skeletons and leaf distributions can be measured for saplings with heights between 0.5m and 2m in situ, giving ecologists access to richer structural and quantitative data for analysing forest dynamics.

Event-Aided Sharp Radiance Field Reconstruction for Fast-Flying Drones Rong Zou, Marco Cannici, Davide Scaramuzza Updated 2026-02-26

Fast-flying aerial robots promise rapid inspection under limited battery constraints, with direct applications in infrastructure inspection, terrain exploration, and search and rescue. However, high speeds lead to severe motion blur in images and induce significant drift and noise in pose estimates, making dense 3D reconstruction with Neural Radiance Fields (NeRFs) particularly challenging due to their high sensitivity to such degradations. In this work, we present a unified framework that leverages asynchronous event streams alongside motion-blurred frames to reconstruct high-fidelity radiance fields from agile drone flights. By embedding event-image fusion into NeRF optimization and jointly refining event-based visual-inertial odometry priors using both event and frame modalities, our method recovers sharp radiance fields and accurate camera trajectories without ground-truth supervision. We validate our approach on both synthetic data and real-world sequences captured by a fast-flying drone. Despite highly dynamic drone flights, where RGB frames are severely degraded by motion blur and pose priors become unreliable, our method reconstructs high-fidelity radiance fields and preserves fine scene details, delivering a performance gain of over 50% on real-world data compared to state-of-the-art methods.

Lie Flow: Video Dynamic Fields Modeling and Predicting with Lie Algebra as Geometric Physics Principle Weidong Qiao, Wangmeng Zuo, Hui Li Updated 2026-02-25

Modeling 4D scenes requires capturing both spatial structure and temporal motion, which is challenging due to the need for physically consistent representations of complex rigid and non-rigid motions. Existing approaches mainly rely on translational displacements, which struggle to represent rotations and articulated transformations, often leading to spatial inconsistency and physically implausible motion. We present LieFlow, a dynamic radiance representation framework that explicitly models motion within the SE(3) Lie group, enabling coherent learning of translation and rotation in a unified geometric space. The SE(3) transformation field enforces physically inspired constraints to maintain motion continuity and geometric consistency. The evaluation includes a synthetic dataset with rigid-body trajectories and two real-world datasets capturing complex motion under natural lighting and occlusions. Across all datasets, LieFlow consistently improves view-synthesis fidelity, temporal coherence, and physical realism over NeRF-based baselines. These results confirm that SE(3)-based motion modeling offers a robust and physically grounded framework for representing dynamic 4D scenes.
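
To make the SE(3) idea concrete, the sketch below maps a 6-D twist (rotation vector `omega`, translational part `v`) to a rigid transform with the standard closed-form exponential map (Rodrigues' formula). It is a generic textbook construction, not LieFlow's implementation; the helper names `hat`, `matmul3`, `se3_exp`, and `apply_transform` are assumptions made for this example.

```python
import math


def hat(w):
    """Skew-symmetric matrix of a 3-vector, so that hat(w) @ p = w x p."""
    x, y, z = w
    return [[0.0, -z, y], [z, 0.0, -x], [-y, x, 0.0]]


def matmul3(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def se3_exp(omega, v):
    """Exponential map from a twist (omega, v) to (rotation, translation)."""
    theta = math.sqrt(sum(c * c for c in omega))
    if theta < 1e-8:
        # Leading Taylor terms avoid dividing by a near-zero angle.
        a, b, c = 1.0, 0.5, 1.0 / 6.0
    else:
        a = math.sin(theta) / theta
        b = (1.0 - math.cos(theta)) / theta ** 2
        c = (theta - math.sin(theta)) / theta ** 3
    k = hat(omega)
    k2 = matmul3(k, k)
    eye = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    # Rodrigues' formula: R = I + a*K + b*K^2.
    rot = [[eye[i][j] + a * k[i][j] + b * k2[i][j] for j in range(3)]
           for i in range(3)]
    # The translation couples with the rotation: t = (I + b*K + c*K^2) v.
    v_mat = [[eye[i][j] + b * k[i][j] + c * k2[i][j] for j in range(3)]
             for i in range(3)]
    trans = [sum(v_mat[i][j] * v[j] for j in range(3)) for i in range(3)]
    return rot, trans


def apply_transform(rot, trans, p):
    """Apply the rigid transform to a 3D point: R p + t."""
    return [sum(rot[i][j] * p[j] for j in range(3)) + trans[i]
            for i in range(3)]


# A quarter turn about the z-axis with no translation.
rot, trans = se3_exp((0.0, 0.0, math.pi / 2), (0.0, 0.0, 0.0))
moved = apply_transform(rot, trans, (1.0, 0.0, 0.0))
```

Because rotation and translation come from one twist vector, a motion field that predicts twists can represent rotations directly. A purely translational displacement field has to approximate a rotation with a different offset at every point.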

Monocular Endoscopic Tissue 3D Reconstruction with Multi-Level Geometry Regularization Yangsen Chen, Hao Wang Updated 2026-02-24

Reconstructing deformable endoscopic tissues is crucial for achieving robot-assisted surgery. However, 3D Gaussian Splatting-based approaches encounter challenges in achieving consistent tissue surface reconstruction, while existing NeRF-based methods lack real-time rendering capabilities. In pursuit of both smooth deformable surfaces and real-time rendering, we introduce a novel approach based on 3D Gaussian Splatting. Specifically, we introduce surface-aware reconstruction, initially employing a Signed Distance Field-based method to construct a mesh, subsequently utilizing this mesh to constrain the Gaussian Splatting reconstruction process. Furthermore, to ensure the generation of physically plausible deformations, we incorporate local rigidity and global non-rigidity restrictions to guide Gaussian deformation, tailored for the highly deformable nature of soft endoscopic tissue. Based on 3D Gaussian Splatting, our proposed method delivers a fast rendering process and smooth surface appearances. Quantitative and qualitative analysis against alternative methodologies shows that our approach achieves solid reconstruction quality in both textures and geometries.

Large-scale Photorealistic Outdoor 3D Scene Reconstruction from UAV Imagery Using Gaussian Splatting Techniques Christos Maikos, Georgios Angelidis, Georgios Th. Papadopoulos Updated 2026-02-23

In this study, we present an end-to-end pipeline capable of converting drone-captured video streams into high-fidelity 3D reconstructions with minimal latency. Unmanned aerial vehicles (UAVs) are extensively used in aerial real-time perception applications. Moreover, recent advances in 3D Gaussian Splatting (3DGS) have demonstrated significant potential for real-time neural rendering. However, their integration into end-to-end UAV-based reconstruction and visualization systems remains underexplored. Our goal is to propose an efficient architecture that combines live video acquisition via RTMP streaming, synchronized sensor fusion, camera pose estimation, and 3DGS optimization, achieving continuous model updates and low-latency deployment within interactive visualization environments that support immersive augmented and virtual reality (AR/VR) applications. Experimental results demonstrate that the proposed method achieves competitive visual fidelity, while delivering significantly higher rendering performance and substantially reduced end-to-end latency, compared to NeRF-based approaches. Reconstruction quality remains within 4-7% of high-fidelity offline references, confirming the suitability of the proposed system for real-time, scalable augmented perception from aerial platforms.

Augmented Radiance Field: A General Framework for Enhanced Gaussian Splatting Yixin Yang, Bojian Wu, Yang Zhou, Hui Huang Updated 2026-02-23

Due to the real-time rendering performance, 3D Gaussian Splatting (3DGS) has emerged as the leading method for radiance field reconstruction. However, its reliance on spherical harmonics for color encoding inherently limits its ability to separate diffuse and specular components, making it challenging to accurately represent complex reflections. To address this, we propose a novel enhanced Gaussian kernel that explicitly models specular effects through view-dependent opacity. Meanwhile, we introduce an error-driven compensation strategy to improve rendering quality in existing 3DGS scenes. Our method begins with 2D Gaussian initialization and then adaptively inserts and optimizes enhanced Gaussian kernels, ultimately producing an augmented radiance field. Experiments demonstrate that our method not only surpasses state-of-the-art NeRF methods in rendering performance but also achieves greater parameter efficiency. Project page at: https://xiaoxinyyx.github.io/augs.

PhysConvex: Physics-Informed 3D Dynamic Convex Radiance Fields for Reconstruction and Simulation Dan Wang, Xinrui Cui, Serge Belongie, Ravi Ramamoorthi Updated 2026-02-21

Reconstructing and simulating dynamic 3D scenes with both visual realism and physical consistency remains a fundamental challenge. Existing neural representations, such as NeRFs and 3DGS, excel in appearance reconstruction but struggle to capture complex material deformation and dynamics. We propose PhysConvex, a Physics-informed 3D Dynamic Convex Radiance Field that unifies visual rendering and physical simulation. PhysConvex represents deformable radiance fields using physically grounded convex primitives governed by continuum mechanics. We introduce a boundary-driven dynamic convex representation that models deformation through vertex and surface dynamics, capturing spatially adaptive, non-uniform deformation, and evolving boundaries. To efficiently simulate complex geometries and heterogeneous materials, we further develop a reduced-order convex simulation that advects dynamic convex fields using neural skinning eigenmodes as shape- and material-aware deformation bases with time-varying reduced DOFs under Newtonian dynamics. Convex dynamics also offers compact, gap-free volumetric coverage, enhancing both geometric efficiency and simulation fidelity. Experiments demonstrate that PhysConvex achieves high-fidelity reconstruction of geometry, appearance, and physical properties from videos, outperforming existing methods.

Unifying Color and Lightness Correction with View-Adaptive Curve Adjustment for Robust 3D Novel View Synthesis Ziteng Cui, Shuhong Liu, Xiaoyu Dong, Xuangeng Chu, Lin Gu, Ming-Hsuan Yang, Tatsuya Harada Updated 2026-02-20

High-quality image acquisition in real-world environments remains challenging due to complex illumination variations and inherent limitations of camera imaging pipelines. These issues are exacerbated in multi-view capture, where differences in lighting, sensor responses, and image signal processor (ISP) configurations introduce photometric and chromatic inconsistencies that violate the assumptions of photometric consistency underlying modern 3D novel view synthesis (NVS) methods, including Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS), leading to degraded reconstruction and rendering quality. We propose Luminance-GS++, a 3DGS-based framework for robust NVS under diverse illumination conditions. Our method combines a globally view-adaptive lightness adjustment with a local pixel-wise residual refinement for precise color correction. We further design unsupervised objectives that jointly enforce lightness correction and multi-view geometric and photometric consistency. Extensive experiments demonstrate state-of-the-art performance across challenging scenarios, including low-light, overexposure, and complex luminance and chromatic variations. Unlike prior approaches that modify the underlying representation, our method preserves the explicit 3DGS formulation, improving reconstruction fidelity while maintaining real-time rendering efficiency.

HS-3D-NeRF: 3D Surface and Hyperspectral Reconstruction From Stationary Hyperspectral Images Using Multi-Channel NeRFs Kibon Ku, Talukder Z. Jubery, Adarsh Krishnamurthy, Baskar Ganapathysubramanian Updated 2026-02-18

Advances in hyperspectral imaging (HSI) and 3D reconstruction have enabled accurate, high-throughput characterization of agricultural produce quality and plant phenotypes, both essential for advancing agricultural sustainability and breeding programs. HSI captures detailed biochemical features of produce, while 3D geometric data substantially improves morphological analysis. However, integrating these two modalities at scale remains challenging, as conventional approaches involve complex hardware setups incompatible with automated phenotyping systems. Recent advances in neural radiance fields (NeRF) offer computationally efficient 3D reconstruction but typically require moving-camera setups, limiting throughput and reproducibility in standard indoor agricultural environments. To address these challenges, we introduce HSI-SC-NeRF, a stationary-camera multi-channel NeRF framework for high-throughput hyperspectral 3D reconstruction targeting postharvest inspection of agricultural produce. Multi-view hyperspectral data is captured using a stationary camera while the object rotates within a custom-built Teflon imaging chamber providing diffuse, uniform illumination. Object poses are estimated via ArUco calibration markers and transformed to the camera frame of reference through simulated pose transformations, enabling standard NeRF training on stationary-camera data. A multi-channel NeRF formulation optimizes reconstruction across all hyperspectral bands jointly using a composite spectral loss, supported by a two-stage training protocol that decouples geometric initialization from radiometric refinement. Experiments on three agricultural produce samples demonstrate high spatial reconstruction accuracy and strong spectral fidelity across the visible and near-infrared spectrum, confirming the suitability of HSI-SC-NeRF for integration into automated agricultural workflows.

Subtractive Modulative Network with Learnable Periodic Activations Tiou Wang, Zhuoqian Yang, Markus Flierl, Mathieu Salzmann, Sabine Süsstrunk Updated 2026-02-18

We propose the Subtractive Modulative Network (SMN), a novel, parameter-efficient Implicit Neural Representation (INR) architecture inspired by classical subtractive synthesis. The SMN is designed as a principled signal processing pipeline, featuring a learnable periodic activation layer (Oscillator) that generates a multi-frequency basis, and a series of modulative mask modules (Filters) that actively generate high-order harmonics. We provide both theoretical analysis and empirical validation for our design. Our SMN achieves a PSNR of 40+ dB on two image datasets, comparing favorably against state-of-the-art methods in terms of both reconstruction accuracy and parameter efficiency. Furthermore, a consistent advantage is observed on the challenging 3D NeRF novel view synthesis task. Supplementary materials are available at https://inrainbws.github.io/smn/.

High-fidelity 3D reconstruction for planetary exploration Alfonso Martínez-Petersen, Levin Gerdes, David Rodríguez-Martínez, C. J. Pérez-del-Pulgar Updated 2026-02-14

Planetary exploration increasingly relies on autonomous robotic systems capable of perceiving, interpreting, and reconstructing their surroundings in the absence of global positioning or real-time communication with Earth. Rovers operating on planetary surfaces must navigate under severe environmental constraints, limited visual redundancy, and communication delays, making onboard spatial awareness and visual localization key components for mission success. Traditional techniques based on Structure-from-Motion (SfM) and Simultaneous Localization and Mapping (SLAM) provide geometric consistency but struggle to capture radiometric detail or to scale efficiently in unstructured, low-texture terrains typical of extraterrestrial environments. This work explores the integration of radiance field-based methods - specifically Neural Radiance Fields (NeRF) and Gaussian Splatting - into a unified, automated environment reconstruction pipeline for planetary robotics. Our system combines the Nerfstudio and COLMAP frameworks with a ROS2-compatible workflow capable of processing raw rover data directly from rosbag recordings. This approach enables the generation of dense, photorealistic, and metrically consistent 3D representations from minimal visual input, supporting improved perception and planning for autonomous systems operating in planetary-like conditions. The resulting pipeline establishes a foundation for future research in radiance field-based mapping, bridging the gap between geometric and neural representations in planetary exploration.

Nighttime Autonomous Driving Scene Reconstruction with Physically-Based Gaussian Splatting Tae-Kyeong Kim, Xingxin Chen, Guile Wu, Chengjie Huang, Dongfeng Bai, Bingbing Liu Updated 2026-02-14

This paper focuses on scene reconstruction under nighttime conditions in autonomous driving simulation. Recent methods based on Neural Radiance Fields (NeRFs) and 3D Gaussian Splatting (3DGS) have achieved photorealistic modeling in autonomous driving scene reconstruction, but they primarily focus on normal-light conditions. Low-light driving scenes are more challenging to model due to their complex lighting and appearance conditions, which often causes performance degradation of existing methods. To address this problem, this work presents a novel approach that integrates physically based rendering into 3DGS to enhance nighttime scene reconstruction for autonomous driving. Specifically, our approach integrates physically based rendering into composite scene Gaussian representations and jointly optimizes Bidirectional Reflectance Distribution Function (BRDF) based material properties. We explicitly model diffuse components through a global illumination module and specular components by anisotropic spherical Gaussians. As a result, our approach improves reconstruction quality for outdoor nighttime driving scenes, while maintaining real-time rendering. Extensive experiments across diverse nighttime scenarios on two real-world autonomous driving datasets, including nuScenes and Waymo, demonstrate that our approach outperforms the state-of-the-art methods both quantitatively and qualitatively.

From Implicit Ambiguity to Explicit Solidity: Diagnosing Interior Geometric Degradation in Neural Radiance Fields for Dense 3D Scene Understanding Jiangsan Zhao, Jakob Geipel, Kryzysztof Kusnierek Updated 2026-02-12

Neural Radiance Fields (NeRFs) have emerged as a powerful paradigm for multi-view reconstruction, complementing classical photogrammetric pipelines based on Structure-from-Motion (SfM) and Multi-View Stereo (MVS). However, their reliability for quantitative 3D analysis in dense, self-occluding scenes remains poorly understood. In this study, we identify a fundamental failure mode of implicit density fields under heavy occlusion, which we term Interior Geometric Degradation (IGD). We show that transmittance-based volumetric optimization satisfies photometric supervision by reconstructing hollow or fragmented structures rather than solid interiors, leading to systematic instance undercounting. Through controlled experiments on synthetic datasets with increasing occlusion, we demonstrate that state-of-the-art mask-supervised NeRFs saturate at approximately 89% instance recovery in dense scenes, despite improved surface coherence and mask quality. To overcome this limitation, we introduce an explicit geometric pipeline based on Sparse Voxel Rasterization (SVRaster), initialized from SfM feature geometry. By projecting 2D instance masks onto an explicit voxel grid and enforcing geometric separation via recursive splitting, our approach preserves physical solidity and achieves a 95.8% recovery rate in dense clusters. A sensitivity analysis using degraded segmentation masks further shows that explicit SfM-based geometry is substantially more robust to supervision failure, recovering 43% more instances than implicit baselines. These results demonstrate that explicit geometric priors are a prerequisite for reliable quantitative analysis in highly self-occluding 3D scenes.
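
The failure mode described above follows from how volumetric rendering weights samples along a ray. The sketch below uses the standard NeRF quadrature (per-sample opacity multiplied by accumulated transmittance) to compare a solid and a hollow density profile. It is an illustration under assumed toy values, not the paper's code; `render_weights` and the two density profiles are hypothetical.

```python
import math


def render_weights(sigmas, deltas):
    """Per-sample compositing weights along one ray.

    w_i = T_i * (1 - exp(-sigma_i * delta_i)), where T_i is the
    transmittance accumulated over all samples in front of sample i.
    """
    weights = []
    transmittance = 1.0
    for sigma, delta in zip(sigmas, deltas):
        alpha = 1.0 - math.exp(-sigma * delta)
        weights.append(transmittance * alpha)
        transmittance *= 1.0 - alpha
    return weights


deltas = [0.1] * 6
# Two empty samples, then an object that is either filled or hollow inside.
solid = render_weights([0.0, 0.0, 50.0, 50.0, 50.0, 50.0], deltas)
hollow = render_weights([0.0, 0.0, 50.0, 0.0, 0.0, 50.0], deltas)

# The first surface sample absorbs almost all the weight in both cases,
# so the interior samples barely influence the rendered pixel.
opacity_gap = abs(sum(solid) - sum(hollow))
```

In this toy setup the accumulated opacity of the two profiles differs by less than 1e-4. A purely photometric loss therefore gives the optimizer almost no signal for filling interiors, consistent with the hollow or fragmented structures the abstract reports.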

Dynamic Black-hole Emission Tomography with Physics-informed Neural Fields Berthy T. Feng, Andrew A. Chael, David Bromley, Aviad Levis, William T. Freeman, Katherine L. Bouman Updated 2026-02-08

With the success of static black-hole imaging, the next frontier is the dynamic and 3D imaging of black holes. Recovering the dynamic 3D gas near a black hole would reveal previously-unseen parts of the universe and inform new physics models. However, only sparse radio measurements from a single viewpoint are possible, making the dynamic 3D reconstruction problem significantly ill-posed. Previously, BH-NeRF addressed the ill-posed problem by assuming Keplerian dynamics of the gas, but this assumption breaks down near the black hole, where the strong gravitational pull of the black hole and increased electromagnetic activity complicate fluid dynamics. To overcome the restrictive assumptions of BH-NeRF, we propose PI-DEF, a physics-informed approach that uses differentiable neural rendering to fit a 4D (time + 3D) emissivity field given EHT measurements. Our approach jointly reconstructs the 3D velocity field with the 4D emissivity field and enforces the velocity as a soft constraint on the dynamics of the emissivity. In experiments on simulated data, we find significantly improved reconstruction accuracy over both BH-NeRF and a physics-agnostic approach. We demonstrate how our method may be used to estimate other physics parameters of the black hole, such as its spin.

Deepfake Synthesis vs. Detection: An Uneven Contest Md. Tarek Hasan, Sanjay Saha, Shaojing Fan, Swakkhar Shatabda, Terence Sim Updated 2026-02-08

The rapid advancement of deepfake technology has significantly elevated the realism and accessibility of synthetic media. Emerging techniques, such as diffusion-based models and Neural Radiance Fields (NeRF), alongside enhancements in traditional Generative Adversarial Networks (GANs), have contributed to the sophisticated generation of deepfake videos. Concurrently, deepfake detection methods have seen notable progress, driven by innovations in Transformer architectures, contrastive learning, and other machine learning approaches. In this study, we conduct a comprehensive empirical analysis of state-of-the-art deepfake detection techniques, including human evaluation experiments against cutting-edge synthesis methods. Our findings highlight a concerning trend: many state-of-the-art detection models exhibit markedly poor performance when challenged with deepfakes produced by modern synthesis techniques, including poor performance by human participants against the best quality deepfakes. Through extensive experimentation, we provide evidence that underscores the urgent need for continued refinement of detection models to keep pace with the evolving capabilities of deepfake generation technologies. This research emphasizes the critical gap between current detection methodologies and the sophistication of new generation techniques, calling for intensified efforts in this crucial area of study.

NVS-HO: A Benchmark for Novel View Synthesis of Handheld Objects Musawar Ali, Manuel Carranza-García, Nicola Fioraio, Samuele Salti, Luigi Di Stefano Updated 2026-02-05

We propose NVS-HO, the first benchmark designed for novel view synthesis of handheld objects in real-world environments using only RGB inputs. Each object is recorded in two complementary RGB sequences: (1) a handheld sequence, where the object is manipulated in front of a static camera, and (2) a board sequence, where the object is fixed on a ChArUco board to provide accurate camera poses via marker detection. The goal of NVS-HO is to learn an NVS model that captures the full appearance of an object from (1), whereas (2) provides the ground-truth images used for evaluation. To establish baselines, we consider both a classical SfM pipeline and a state-of-the-art pre-trained feed-forward neural network (VGGT) as pose estimators, and train NVS models based on NeRF and Gaussian Splatting. Our experiments reveal significant performance gaps in current methods under unconstrained handheld conditions, highlighting the need for more robust approaches. NVS-HO thus offers a challenging real-world benchmark to drive progress in RGB-based novel view synthesis of handheld objects.

NeVStereo: A NeRF-Driven NVS-Stereo Architecture for High-Fidelity 3D Tasks Pengcheng Chen, Yue Hu, Wenhao Li, Nicole M Gunderson, Andrew Feng, Zhenglong Sun, Peter Beerel, Eric J Seibel Updated 2026-02-05

In modern dense 3D reconstruction, feed-forward systems (e.g., VGGT, pi3) focus on end-to-end matching and geometry prediction but do not explicitly output novel view synthesis (NVS). Neural rendering-based approaches offer high-fidelity NVS and detailed geometry from posed images, yet they typically assume fixed camera poses and can be sensitive to pose errors. As a result, it remains non-trivial to obtain a single framework that can offer accurate poses, reliable depth, high-quality rendering, and accurate 3D surfaces from casually captured views. We present NeVStereo, a NeRF-driven NVS-stereo architecture that aims to jointly deliver camera poses, multi-view depth, novel view synthesis, and surface reconstruction from multi-view RGB-only inputs. NeVStereo combines NeRF-based NVS for stereo-friendly renderings, confidence-guided multi-view depth estimation, NeRF-coupled bundle adjustment for pose refinement, and an iterative refinement stage that updates both depth and the radiance field to improve geometric consistency. This design mitigates common NeRF-based issues such as surface stacking, artifacts, and pose-depth coupling. Across indoor, outdoor, tabletop, and aerial benchmarks, our experiments indicate that NeVStereo achieves consistently strong zero-shot performance, with up to 36% lower depth error, 10.4% improved pose accuracy, 4.5% higher NVS fidelity, and state-of-the-art mesh quality (F1 91.93%, Chamfer 4.35 mm) compared to existing leading methods.

Beyond Cropping and Rotation: Automated Evolution of Powerful Task-Specific Augmentations with Generative Models Judah Goldfeder, Shreyes Kaliyur, Vaibhav Sourirajan, Patrick Minwan Puma, Philippe Martin Wyder, Yuhang Hu, Jiong Lin, Hod Lipson Updated 2026-02-03

Data augmentation has long been a cornerstone for reducing overfitting in vision models, with methods like AutoAugment automating the design of task-specific augmentations. Recent advances in generative models, such as conditional diffusion and few-shot NeRFs, offer a new paradigm for data augmentation by synthesizing data with significantly greater diversity and realism. However, unlike traditional augmentations like cropping or rotation, these methods introduce substantial changes that enhance robustness but also risk degrading performance if the augmentations are poorly matched to the task. In this work, we present EvoAug, an automated augmentation learning pipeline, which leverages these generative models alongside an efficient evolutionary algorithm to learn optimal task-specific augmentations. Our pipeline introduces a novel approach to image augmentation that learns stochastic augmentation trees that hierarchically compose augmentations, enabling more structured and adaptive transformations. We demonstrate strong performance across fine-grained classification and few-shot learning tasks. Notably, our pipeline discovers augmentations that align with domain knowledge, even in low-data settings. These results highlight the potential of learned generative augmentations, unlocking new possibilities for robust model training.

Under-Canopy Terrain Reconstruction in Dense Forests Using RGB Imaging and Neural 3D Reconstruction Refael Sheffer, Chen Pinchover, Haim Zisman, Dror Ozeri, Roee Litman Updated 2026-02-02

Mapping the terrain and understory hidden beneath dense forest canopies is of great interest for numerous applications such as search and rescue, trail mapping, forest inventory tasks, and more. Existing solutions rely on specialized sensors: either heavy, costly airborne LiDAR, or Airborne Optical Sectioning (AOS), which uses thermal synthetic aperture photography and is tailored for person detection. We introduce a novel approach for the reconstruction of canopy-free, photorealistic ground views using only conventional RGB images. Our solution is based on the celebrated Neural Radiance Fields (NeRF), a recent 3D reconstruction method. Additionally, we include specific image capture considerations, which dictate the illumination needed to successfully expose the scene beneath the canopy. To better cope with the poorly lit understory, we employ a low light loss. Finally, we propose two complementary approaches to remove occluding canopy elements by controlling the per-ray integration procedure. To validate the value of our approach, we present two possible downstream tasks. For the task of search and rescue (SAR), we demonstrate that our method, using only RGB images, enables person detection with promising results compared to thermal AOS. Additionally, we show the potential of our approach for forest inventory tasks like tree counting. These results position our approach as a cost-effective, high-resolution alternative to specialized sensors for SAR, trail mapping, and forest-inventory tasks.
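
As a rough illustration of what controlling per-ray integration can mean, the sketch below uses the standard NeRF quadrature, in which each sample contributes its colour weighted by the accumulated transmittance and its own opacity, and simply starts compositing after a chosen sample index. The `skip_before` parameter, the toy sample values, and the idea of skipping by index are assumptions made for illustration; the abstract does not specify the two approaches the authors propose.

```python
import math

def composite_ray(sigmas, colors, deltas, skip_before=0):
    """Alpha-composite grey-scale samples along one ray, front to back.

    skip_before is a hypothetical knob: samples with index < skip_before
    are ignored, as if integration started below the canopy.
    """
    transmittance = 1.0
    radiance = 0.0
    for i, (sigma, color, delta) in enumerate(zip(sigmas, colors, deltas)):
        if i < skip_before:
            continue
        alpha = 1.0 - math.exp(-sigma * delta)   # opacity of this segment
        radiance += transmittance * alpha * color
        transmittance *= 1.0 - alpha             # light left for later samples
    return radiance

# Toy ray: two dense, dark canopy samples followed by two bright ground samples.
sigmas = [8.0, 8.0, 20.0, 20.0]
colors = [0.2, 0.2, 0.8, 0.8]
deltas = [0.5, 0.5, 0.5, 0.5]

with_canopy = composite_ray(sigmas, colors, deltas)
ground_only = composite_ray(sigmas, colors, deltas, skip_before=2)
```

With the full ray, the dense canopy samples absorb almost all transmittance and the pixel stays dark; starting integration past them lets the ground colour dominate.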

EAG-PT: Emission-Aware Gaussians and Path Tracing for Indoor Scene Reconstruction and Editing Xijie Yang, Mulin Yu, Changjian Jiang, Kerui Ren, Tao Lu, Jiangmiao Pang, Dahua Lin, Bo Dai, Linning Xu Updated 2026-01-30

Recent reconstruction methods based on radiance fields such as NeRF and 3DGS reproduce indoor scenes with high visual fidelity, but break down under scene editing due to baked illumination and the lack of explicit light transport. In contrast, physically based inverse rendering relies on mesh representations and path tracing, which enforce correct light transport but place strong requirements on geometric fidelity, becoming a practical bottleneck for real indoor scenes. In this work, we propose Emission-Aware Gaussians and Path Tracing (EAG-PT), aiming for physically based light transport with a unified 2D Gaussian representation. Our design is based on three cores: (1) using 2D Gaussians as a unified scene representation and transport-friendly geometry proxy that avoids a reconstructed mesh, (2) explicitly separating emissive and non-emissive components during reconstruction for further scene editing, and (3) decoupling reconstruction from final rendering by using efficient single-bounce optimization and high-quality multi-bounce path tracing after scene editing. Experiments on synthetic and real indoor scenes show that EAG-PT produces more natural and physically consistent renders after editing than radiant scene reconstructions, while preserving finer geometric detail and avoiding mesh-induced artifacts compared to mesh-based inverse path tracing. These results suggest promising directions for future use in interior design, XR content creation, and embodied AI.

Diachronic Stereo Matching for Multi-Date Satellite Imagery Elías Masquil, Luca Savant Aira, Roger Marí, Thibaud Ehret, Pablo Musé, Gabriele Facciolo Updated 2026-01-30

Recent advances in image-based satellite 3D reconstruction have progressed along two complementary directions. On one hand, multi-date approaches using NeRF or Gaussian-splatting jointly model appearance and geometry across many acquisitions, achieving accurate reconstructions on opportunistic imagery with numerous observations. On the other hand, classical stereoscopic reconstruction pipelines deliver robust and scalable results for simultaneous or quasi-simultaneous image pairs. However, when the two images are captured months apart, strong seasonal, illumination, and shadow changes violate standard stereoscopic assumptions, causing existing pipelines to fail. This work presents the first Diachronic Stereo Matching method for satellite imagery, enabling reliable 3D reconstruction from temporally distant pairs. Two advances make this possible: (1) fine-tuning a state-of-the-art deep stereo network that leverages monocular depth priors, and (2) exposing it to a dataset specifically curated to include a diverse set of diachronic image pairs. In particular, we start from a pretrained MonSter model, trained initially on a mix of synthetic and real datasets such as SceneFlow and KITTI, and fine-tune it on a set of stereo pairs derived from the DFC2019 remote sensing challenge. This dataset contains both synchronic and diachronic pairs under diverse seasonal and illumination conditions. Experiments on multi-date WorldView-3 imagery demonstrate that our approach consistently surpasses classical pipelines and unadapted deep stereo models on both synchronic and diachronic settings. Fine-tuning on temporally diverse images, together with monocular priors, proves essential for enabling 3D reconstruction from previously incompatible acquisition dates.

Figure 1. Output geometry for a winter-autumn image pair from Omaha (OMA 331 test scene). Panels: left image (winter), right image (autumn), and DSM geometry for ours (1.23 m), zero-shot (3.99 m), and LiDAR ground truth. Our method recovers accurate geometry despite the diachronic nature of the pair, whose strong appearance changes cause existing zero-shot methods to fail. Missing values due to perspective are shown in black. Mean altitude error in parentheses; lower is better.

Lightweight High-Fidelity Low-Bitrate Talking Face Compression for 3D Video Conference Jianglong Li, Jun Xu, Bingcong Lu, Zhengxue Cheng, Hongwei Hu, Ronghua Wu, Li Song Updated 2026-01-29

The demand for immersive and interactive communication has driven advancements in 3D video conferencing, yet achieving high-fidelity 3D talking face representation at low bitrates remains a challenge. Traditional 2D video compression techniques fail to preserve fine-grained geometric and appearance details, while implicit neural rendering methods like NeRF suffer from prohibitive computational costs. To address these challenges, we propose a lightweight, high-fidelity, low-bitrate 3D talking face compression framework that integrates FLAME-based parametric modeling with 3DGS neural rendering. Our approach transmits only essential facial metadata in real time, enabling efficient reconstruction with a Gaussian-based head model. Additionally, we introduce a compact representation and compression scheme, including Gaussian attribute compression and MLP optimization, to enhance transmission efficiency. Experimental results demonstrate that our method achieves superior rate-distortion performance, delivering high-quality facial rendering at extremely low bitrates, making it well-suited for real-time 3D video conferencing applications.

WaterClear-GS: Optical-Aware Gaussian Splatting for Underwater Reconstruction and Restoration Xinrui Zhang, Yufeng Wang, Shuangkang Fang, Zesheng Wang, Dacheng Qi, Wenrui Ding Updated 2026-01-27

Underwater 3D reconstruction and appearance restoration are hindered by the complex optical properties of water, such as wavelength-dependent attenuation and scattering. Existing Neural Radiance Fields (NeRF)-based methods struggle with slow rendering speeds and suboptimal color restoration, while 3D Gaussian Splatting (3DGS) inherently lacks the capability to model complex volumetric scattering effects. To address these issues, we introduce WaterClear-GS, the first pure 3DGS-based framework that explicitly integrates underwater optical properties of local attenuation and scattering into Gaussian primitives, eliminating the need for an auxiliary medium network. Our method employs a dual-branch optimization strategy to ensure underwater photometric consistency while naturally recovering water-free appearances. This strategy is enhanced by depth-guided geometry regularization and perception-driven image loss, together with exposure constraints, spatially-adaptive regularization, and physically guided spectral regularization, which collectively enforce local 3D coherence and maintain natural visual perception. Experiments on standard benchmarks and our newly collected dataset demonstrate that WaterClear-GS achieves outstanding performance on both novel view synthesis (NVS) and underwater image restoration (UIR) tasks, while maintaining real-time rendering. The code will be available at https://buaaxrzhang.github.io/WaterClear-GS/.

Bridging Visual and Wireless Sensing: A Unified Radiation Field for 3D Radio Map Construction Chaozheng Wen, Jingwen Tong, Zehong Lin, Chenghong Bian, Jun Zhang Updated 2026-01-27

The emerging applications of next-generation wireless networks (e.g., immersive 3D communication, low-altitude networks, and integrated sensing and communication) necessitate high-fidelity environmental intelligence. 3D radio maps have emerged as a critical tool for this purpose, enabling spectrum-aware planning and environment-aware sensing by bridging the gap between physical environments and electromagnetic signal propagation. However, constructing accurate 3D radio maps requires fine-grained 3D geometric information and a profound understanding of electromagnetic wave propagation. Existing approaches typically treat optical and wireless knowledge as distinct modalities, failing to exploit the fundamental physical principles governing both light and electromagnetic propagation. To bridge this gap, we propose URF-GS, a unified radio-optical radiation field representation framework for accurate and generalizable 3D radio map construction based on 3D Gaussian splatting (3D-GS) and inverse rendering. By fusing visual and wireless sensing observations, URF-GS recovers scene geometry and material properties while accurately predicting radio signal behavior at arbitrary transmitter-receiver (Tx-Rx) configurations. Experimental results demonstrate that URF-GS achieves up to a 24.7% improvement in spatial spectrum prediction accuracy and a 10x increase in sample efficiency for 3D radio map construction compared with neural radiance field (NeRF)-based methods. This work establishes a foundation for next-generation wireless networks by integrating perception, interaction, and communication through holistic radiation field reconstruction.

Audio-Driven Talking Face Generation with Blink Embedding and Hash Grid Landmarks Encoding Yuhui Zhang, Hui Yu, Wei Liang, Sunjie Zhang Updated 2026-01-26

Dynamic Neural Radiance Fields (NeRF) have demonstrated considerable success in generating high-fidelity 3D models of talking portraits. Despite significant advancements in rendering speed and generation quality, challenges persist in accurately and efficiently capturing mouth movements in talking portraits. To tackle this challenge, we propose an automatic method based on blink embedding and hash grid landmarks encoding, which can substantially enhance the fidelity of talking faces. Specifically, we leverage facial features encoded as conditional features and integrate audio features as residual terms into our model through a Dynamic Landmark Transformer. Furthermore, we employ neural radiance fields to model the entire face, resulting in a lifelike face representation. Experimental evaluations have validated the superiority of our approach over existing methods.

MV-SAM: Multi-view Promptable Segmentation using Pointmap Guidance Yoonwoo Jeong, Cheng Sun, Yu-Chiang Frank Wang, Minsu Cho, Jaesung Choe Updated 2026-01-25

Promptable segmentation has emerged as a powerful paradigm in computer vision, enabling users to guide models in parsing complex scenes with prompts such as clicks, boxes, or textual cues. Recent advances, exemplified by the Segment Anything Model (SAM), have extended this paradigm to videos and multi-view images. However, the lack of 3D awareness often leads to inconsistent results, necessitating costly per-scene optimization to enforce 3D consistency. In this work, we introduce MV-SAM, a framework for multi-view segmentation that achieves 3D consistency using pointmaps -- 3D points reconstructed from unposed images by recent visual geometry models. Leveraging the pixel-point one-to-one correspondence of pointmaps, MV-SAM lifts images and prompts into 3D space, eliminating the need for explicit 3D networks or annotated 3D data. Specifically, MV-SAM extends SAM by lifting image embeddings from its pretrained encoder into 3D point embeddings, which are decoded by a transformer using cross-attention with 3D prompt embeddings. This design aligns 2D interactions with 3D geometry, enabling the model to implicitly learn consistent masks across views through 3D positional embeddings. Trained on the SA-1B dataset, our method generalizes well across domains, outperforming SAM2-Video and achieving comparable performance with per-scene optimization baselines on NVOS, SPIn-NeRF, ScanNet++, uCo3D, and DL3DV benchmarks. Code will be released.

NeRF-MIR: Towards High-Quality Restoration of Masked Images with Neural Radiance Fields Xianliang Huang, Zhizhou Zhong, Shuhang Chen, Yi Xu, Juhong Guan, Shuigeng Zhou Updated 2026-01-24

Neural Radiance Fields (NeRF) have demonstrated remarkable performance in novel view synthesis. However, there is considerable room for improvement in restoring NeRF-based 3D scenes from corrupted images, which are common in natural scene captures and can significantly impact the effectiveness of NeRF. This paper introduces NeRF-MIR, a novel neural rendering approach specifically proposed for the restoration of masked images, demonstrating the potential of NeRF in this domain. Recognizing that randomly emitting rays to pixels in NeRF may not effectively learn intricate image textures, we propose a Patch-based Entropy for Ray Emitting (PERE) strategy to distribute emitted rays properly. This enables NeRF-MIR to fuse comprehensive information from images of different views. Additionally, we introduce a Progressively Iterative REstoration (PIRE) mechanism to restore the masked regions in a self-training process. Furthermore, we design a dynamically-weighted loss function that automatically recalibrates the loss weights for masked regions. As existing datasets do not support NeRF-based masked image restoration, we construct three masked datasets to simulate corrupted scenarios. Extensive experiments on real data and constructed datasets demonstrate the superiority of NeRF-MIR over its counterparts in masked image restoration.
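
The idea of distributing rays by patch entropy can be sketched as follows. This is only one plausible reading: it scores each patch by the Shannon entropy of its quantised grey levels and splits a ray budget in proportion to the scores. The function names, the 16-bin quantisation, and the small `floor` that keeps flat patches from receiving zero rays are all assumptions; the abstract does not give the actual PERE formulation.

```python
import math
from collections import Counter

def patch_entropy(patch, bins=16):
    """Shannon entropy (in bits) of a patch whose grey levels lie in [0, 1]."""
    values = [v for row in patch for v in row]
    # Quantise each value into one of `bins` levels; 1.0 falls in the last bin.
    counts = Counter(min(int(v * bins), bins - 1) for v in values)
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

def allocate_rays(patches, ray_budget, floor=1e-3):
    """Split ray_budget across patches in proportion to entropy + floor.

    Rounding means the counts may not sum exactly to ray_budget.
    """
    scores = [patch_entropy(p) + floor for p in patches]
    total = sum(scores)
    return [round(ray_budget * s / total) for s in scores]

# A flat patch (no texture) versus a checkerboard patch (high texture).
flat = [[0.5] * 4 for _ in range(4)]
checker = [[float((x + y) % 2) for x in range(4)] for y in range(4)]

rays = allocate_rays([flat, checker], ray_budget=1000)
```

Under this scheme the textured patch receives nearly the whole budget, which matches the stated motivation that uniformly random rays under-sample intricate textures.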

Multi-View Consistent Wound Segmentation With Neural Fields Remi Chierchia, Léo Lebrat, David Ahmedt-Aristizabal, Yulia Arzhaeva, Olivier Salvado, Clinton Fookes, Rodrigo Santa Cruz Updated 2026-01-23

Wound care is often challenged by the economic and logistical burdens that consistently afflict patients and hospitals worldwide. In recent decades, healthcare professionals have sought support from computer vision and machine learning algorithms. In particular, wound segmentation has gained interest due to its ability to provide professionals with fast, automatic tissue assessment from standard RGB images. Some approaches have extended segmentation to 3D, enabling more complete and precise healing progress tracking. However, inferring multi-view consistent 3D structures from 2D images remains a challenge. In this paper, we evaluate WoundNeRF, a NeRF SDF-based method for estimating robust wound segmentations from automatically generated annotations. We demonstrate the potential of this paradigm in recovering accurate segmentations by comparing it against state-of-the-art Vision Transformer networks and conventional rasterisation-based algorithms. The code will be released to facilitate further development in this promising paradigm.

Seeing through Light and Darkness: Sensor-Physics Grounded Deblurring HDR NeRF from Single-Exposure Images and Events Yunshan Qi, Lin Zhu, Nan Bao, Yifan Zhao, Jia Li Updated 2026-01-21

Novel view synthesis from low dynamic range (LDR) blurry images, which are common in the wild, struggles to recover high dynamic range (HDR) and sharp 3D representations in extreme lighting conditions. Although existing methods employ event data to address this issue, they ignore the sensor-physics mismatches between the camera output and physical world radiance, resulting in suboptimal HDR and deblurring results. To cope with this problem, we propose a unified sensor-physics grounded NeRF framework for sharp HDR novel view synthesis from single-exposure blurry LDR images and corresponding events. We employ NeRF to directly represent the actual radiance of the 3D scene in the HDR domain and model raw HDR scene rays hitting the sensor pixels as in the physical world. A pixel-wise RGB mapping field is introduced to align the above rendered pixel values with the sensor-recorded LDR pixel values of the input images. A novel event mapping field is also designed to bridge the physical scene dynamics and actual event sensor output. The two mapping fields are jointly optimized with the NeRF network, leveraging the spatial and temporal dynamic information in events to enhance the sharp HDR 3D representation learning. Experiments on the collected and public datasets demonstrate that our method can achieve state-of-the-art deblurring HDR novel view synthesis results with single-exposure blurry LDR images and corresponding events.

GAT-NeRF: Geometry-Aware-Transformer Enhanced Neural Radiance Fields for High-Fidelity 4D Facial Avatars Zhe Chang, Haodong Jin, Ying Sun, Yan Song, Hui Yu Updated 2026-01-21

High-fidelity 4D dynamic facial avatar reconstruction from monocular video is a critical yet challenging task, driven by increasing demands for immersive virtual human applications. While Neural Radiance Fields (NeRF) have advanced scene representation, their capacity to capture high-frequency facial details, such as dynamic wrinkles and subtle textures from information-constrained monocular streams, requires significant enhancement. To tackle this challenge, we propose a novel hybrid neural radiance field framework, called Geometry-Aware-Transformer Enhanced NeRF (GAT-NeRF) for high-fidelity and controllable 4D facial avatar reconstruction, which integrates the Transformer mechanism into the NeRF pipeline. GAT-NeRF synergistically combines a coordinate-aligned Multilayer Perceptron (MLP) with a lightweight Transformer module, termed as Geometry-Aware-Transformer (GAT) due to its processing of multi-modal inputs containing explicit geometric priors. The GAT module is enabled by fusing multi-modal input features, including 3D spatial coordinates, 3D Morphable Model (3DMM) expression parameters, and learnable latent codes to effectively learn and enhance feature representations pertinent to fine-grained geometry. The Transformer's effective feature learning capabilities are leveraged to significantly augment the modeling of complex local facial patterns like dynamic wrinkles and acne scars. Comprehensive experiments unequivocally demonstrate GAT-NeRF's state-of-the-art performance in visual fidelity and high-frequency detail recovery, forging new pathways for creating realistic dynamic digital humans for multimedia applications.

POTR: Post-Training 3DGS Compression Bert Ramlot, Martijn Courteaux, Peter Lambert, Glenn Van Wallendael Updated 2026-01-21

3D Gaussian Splatting (3DGS) has recently emerged as a promising contender to Neural Radiance Fields (NeRF) in 3D scene reconstruction and real-time novel view synthesis. 3DGS outperforms NeRF in training and inference speed but has substantially higher storage requirements. To remedy this downside, we propose POTR, a post-training 3DGS codec built on two novel techniques. First, POTR introduces a novel pruning approach that uses a modified 3DGS rasterizer to efficiently calculate every splat's individual removal effect simultaneously. This technique results in 2-4x fewer splats than other post-training pruning techniques and as a result also significantly accelerates inference with experiments demonstrating 1.5-2x faster inference than other compressed models. Second, we propose a novel method to recompute lighting coefficients, significantly reducing their entropy without using any form of training. Our fast and highly parallel approach especially increases AC lighting coefficient sparsity, with experiments demonstrating increases from 70% to 97%, with minimal loss in quality. Finally, we extend POTR with a simple fine-tuning scheme to further enhance pruning, inference, and rate-distortion performance. Experiments demonstrate that POTR, even without fine-tuning, consistently outperforms all other post-training compression techniques in both rate-distortion performance and inference speed.

TreeDGS: Aerial Gaussian Splatting for Distant DBH Measurement Belal Shaheen, Minh-Hieu Nguyen, Bach-Thuan Bui, Shubham, Tim Wu, Michael Fairley, Matthew David Zane, Michael Wu, James Tompkin Updated 2026-01-19

Aerial remote sensing enables efficient large-area surveying, but accurate direct object-level measurement remains difficult in complex natural scenes. Recent advancements in 3D vision, particularly learned radiance-field representations such as NeRF and 3D Gaussian Splatting, have begun to raise the ceiling on reconstruction fidelity and densifiable geometry from posed imagery. Nevertheless, direct aerial measurement of important natural attributes such as tree diameter at breast height (DBH) remains challenging. Trunks in aerial forest scans are distant and sparsely observed in image views: at typical operating altitudes, stems may span only a few pixels. With these constraints, conventional reconstruction methods leave breast-height trunk geometry weakly constrained. We present TreeDGS, an aerial image reconstruction method that leverages 3D Gaussian Splatting as a continuous, densifiable scene representation for trunk measurement. After SfM-MVS initialization and Gaussian optimization, we extract a dense point set from the Gaussian field using RaDe-GS's depth-aware cumulative-opacity integration and associate each sample with a multi-view opacity reliability score. We then estimate DBH from trunk-isolated points using opacity-weighted solid-circle fitting. Evaluated on 10 plots with field-measured DBH, TreeDGS reaches 4.79 cm RMSE (about 2.6 pixels at this GSD) and outperforms a state-of-the-art LiDAR baseline (7.91 cm RMSE), demonstrating that densified splat-based geometry can enable accurate, low-cost aerial DBH measurement.
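
To make the final measurement step concrete, the sketch below estimates a trunk diameter from 2D points in a breast-height slice, with each point weighted by a reliability score. The abstract does not define its solid-circle fitting, so this substitutes a weighted algebraic (Kåsa-style) least-squares circle fit; the function names, the synthetic points, and the weights are hypothetical.

```python
import math

def solve3(m, v):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    a = [row[:] + [rhs] for row, rhs in zip(m, v)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, 3):
            f = a[r][col] / a[col][col]
            for c in range(col, 4):
                a[r][c] -= f * a[col][c]
    x = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):
        x[r] = (a[r][3] - sum(a[r][c] * x[c] for c in range(r + 1, 3))) / a[r][r]
    return x

def weighted_circle_fit(points, weights):
    """Minimise sum_i w_i * (x^2 + y^2 + D*x + E*y + F)^2 over D, E, F."""
    m = [[0.0] * 3 for _ in range(3)]
    v = [0.0] * 3
    for (x, y), w in zip(points, weights):
        row = (x, y, 1.0)
        b = x * x + y * y
        for i in range(3):
            v[i] -= w * row[i] * b
            for j in range(3):
                m[i][j] += w * row[i] * row[j]
    d, e, f = solve3(m, v)
    cx, cy = -d / 2.0, -e / 2.0
    return cx, cy, math.sqrt(cx * cx + cy * cy - f)

# Synthetic slice: 12 points on a trunk of radius 0.15 m centred at (1, 2).
points = [(1.0 + 0.15 * math.cos(2 * math.pi * k / 12),
           2.0 + 0.15 * math.sin(2 * math.pi * k / 12)) for k in range(12)]
weights = [0.2 + 0.05 * k for k in range(12)]  # stand-in opacity scores

cx, cy, r = weighted_circle_fit(points, weights)
dbh_cm = 2.0 * r * 100.0
```

On noisy data the weights would pull the fit toward points with higher reliability; on these noise-free points any positive weights recover the same circle.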

Bayesian Monocular Depth Refinement via Neural Radiance Fields Arun Muthukkumar Updated 2026-01-15

Monocular depth estimation has applications in many fields, such as autonomous navigation and extended reality, making it an essential computer vision task. However, current methods often produce smooth depth maps that lack the fine geometric detail needed for accurate scene understanding. We propose MDENeRF, an iterative framework that refines monocular depth estimates using depth information from Neural Radiance Fields (NeRFs). MDENeRF consists of three components: (1) an initial monocular estimate for global structure, (2) a NeRF trained on perturbed viewpoints, with per-pixel uncertainty, and (3) Bayesian fusion of the noisy monocular and NeRF depths. We derive NeRF uncertainty from the volume rendering process to iteratively inject high-frequency fine details. Meanwhile, our monocular prior maintains global structure. We demonstrate improvements on key metrics and experiments using indoor scenes from the SUN RGB-D dataset.
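
A minimal sketch of components (2) and (3), assuming both depth estimates are treated as independent Gaussians: the NeRF depth and its variance are taken as the mean and variance of sample distances under the volume-rendering weights, and fusion is the usual precision-weighted average. These modelling choices, the function names, and the numbers are assumptions for illustration; the abstract does not state the exact uncertainty or fusion formulas.

```python
def render_depth_stats(ts, weights):
    """Mean and variance of sample distances ts under rendering weights."""
    total = sum(weights)
    mean = sum(w * t for w, t in zip(weights, ts)) / total
    var = sum(w * (t - mean) ** 2 for w, t in zip(weights, ts)) / total
    return mean, var

def fuse_gaussian(d_a, var_a, d_b, var_b, min_var=1e-8):
    """Precision-weighted fusion of two independent Gaussian depth estimates."""
    var_a = max(var_a, min_var)  # guard against division by zero
    var_b = max(var_b, min_var)
    prec_a, prec_b = 1.0 / var_a, 1.0 / var_b
    fused_var = 1.0 / (prec_a + prec_b)
    fused_depth = fused_var * (prec_a * d_a + prec_b * d_b)
    return fused_depth, fused_var

# One pixel: a smooth monocular estimate and a sharper NeRF estimate.
d_mono, var_mono = 2.0, 0.04
d_nerf, var_nerf = render_depth_stats([1.8, 1.9, 2.0], [0.1, 0.8, 0.1])

fused_depth, fused_var = fuse_gaussian(d_mono, var_mono, d_nerf, var_nerf)
```

Because the NeRF weights here are sharply peaked, its variance is small and the fused depth lands close to the NeRF value, while a pixel with diffuse weights would fall back toward the monocular prior.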

Radiant Foam Rendering on a Graph Processor Zulkhuu Tuya, Ignacio Alzugaray, Nicholas Fry, Andrew J. Davison Updated 2026-01-11

Many emerging many-core accelerators replace a single large device memory with hundreds to thousands of lightweight cores, each owning only a small local SRAM and exchanging data via explicit on-chip communication. This organization offers high aggregate bandwidth, but it breaks a key assumption behind many volumetric rendering techniques: that rays can randomly access a large, unified scene representation. Rendering efficiently on such hardware therefore requires distributing both data and computation, keeping ray traversal mostly local, and structuring communication into predictable routes. We present a fully in-SRAM, distributed renderer for the Radiant Foam Voronoi-cell volumetric representation on the Graphcore Mk2 IPU (Intelligence Processing Unit), a many-core accelerator with tile-local SRAM and explicit inter-tile communication. Our system shards the scene across tiles and forwards rays between shards through a hierarchical routing overlay, enabling ray marching entirely from on-chip SRAM with predictable communication. On Mip-NeRF 360 scenes, the system attains near-interactive throughput of approximately 1 fps at 640x480 with image and depth map quality close to the original GPU-based Radiant Foam implementation, while keeping all scene data and ray state in on-chip SRAM. Beyond demonstrating feasibility, we analyze routing, memory, and scheduling bottlenecks that inform how future distributed-memory accelerators can better support irregular, data-movement-heavy rendering workloads.

HOSC: A Periodic Activation with Saturation Control for High-Fidelity Implicit Neural Representations Michal Jan Wlodarczyk, Danzel Serrano, Przemyslaw Musialski Updated 2026-01-10

Periodic activations such as sine preserve high-frequency information in implicit neural representations (INRs) through their oscillatory structure, but often suffer from gradient instability and limited control over multi-scale behavior. We introduce the Hyperbolic Oscillator with Saturation Control (HOSC) activation, $\text{HOSC}(x) = \tanh\bigl(\beta\sin(\omega_0 x)\bigr)$, which exposes an explicit parameter $\beta$ that controls the Lipschitz bound of the activation by $\beta\omega_0$. This provides a direct mechanism to tune gradient magnitudes while retaining a periodic carrier. We provide a mathematical analysis and conduct a comprehensive empirical study across images, audio, video, NeRFs, and SDFs using standardized training protocols. Comparative analysis against SIREN, FINER, and related methods shows where HOSC provides substantial benefits and where it achieves competitive parity. Results establish HOSC as a practical periodic activation for INR applications, with domain-specific guidance on hyperparameter selection. For code, visit the project page at https://hosc-nn.github.io/
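
The Lipschitz claim follows from the chain rule: the derivative of tanh(β sin(ω₀x)) is β ω₀ cos(ω₀x) sech²(β sin(ω₀x)), and both the cosine and the sech² factor have magnitude at most 1, so the slope never exceeds βω₀. The sketch below checks this numerically with the standard library only; the values β = 2 and ω₀ = 30 and the sampling grid are arbitrary choices for illustration, not settings from the paper.

```python
import math

def hosc(x, beta, omega0):
    """HOSC(x) = tanh(beta * sin(omega0 * x))."""
    return math.tanh(beta * math.sin(omega0 * x))

def hosc_grad(x, beta, omega0):
    """Analytic derivative: beta * omega0 * cos(omega0 x) * sech^2(beta sin(omega0 x))."""
    s = math.sin(omega0 * x)
    sech2 = 1.0 - math.tanh(beta * s) ** 2
    return beta * omega0 * math.cos(omega0 * x) * sech2

beta, omega0 = 2.0, 30.0                       # illustrative values only
xs = [i * 1e-3 for i in range(-2000, 2001)]    # grid on [-2, 2], includes x = 0

max_grad = max(abs(hosc_grad(x, beta, omega0)) for x in xs)
lipschitz_bound = beta * omega0
```

The bound is attained at x = 0, where the sine vanishes and the cosine equals 1, so on this grid `max_grad` equals `lipschitz_bound`. Larger β sharpens the transitions between the saturated plateaus and raises the bound proportionally.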

QNeRF: Neural Radiance Fields on a Simulated Gate-Based Quantum Computer Daniele Lizzio Bosco, Shuteng Wang, Giuseppe Serra, Vladislav Golyanik Updated 2026-01-08

Recently, Quantum Visual Fields (QVFs) have shown promising improvements in model compactness and convergence speed for learning the provided 2D or 3D signals. Meanwhile, novel-view synthesis has seen major advances with Neural Radiance Fields (NeRFs), where models learn a compact representation from 2D images to render 3D scenes, albeit at the cost of larger models and intensive training. In this work, we extend the approach of QVFs by introducing QNeRF, the first hybrid quantum-classical model designed for novel-view synthesis from 2D images. QNeRF leverages parameterised quantum circuits to encode spatial and view-dependent information via quantum superposition and entanglement, resulting in more compact models compared to the classical counterpart. We present two architectural variants. Full QNeRF maximally exploits all quantum amplitudes to enhance representational capabilities. In contrast, Dual-Branch QNeRF introduces a task-informed inductive bias by branching spatial and view-dependent quantum state preparations, drastically reducing the complexity of this operation and ensuring scalability and potential hardware compatibility. Our experiments demonstrate that -- when trained on images of moderate resolution -- QNeRF matches or outperforms classical NeRF baselines while using less than half the number of parameters. These results suggest that quantum machine learning can serve as a competitive alternative for continuous signal representation in mid-level tasks in computer vision, such as 3D representation learning from 2D observations.

DivAS: Interactive 3D Segmentation of NeRFs via Depth-Weighted Voxel Aggregation Ayush Pande Updated 2026-01-08

Existing methods for segmenting Neural Radiance Fields (NeRFs) are often optimization-based, requiring slow per-scene training that sacrifices the zero-shot capabilities of 2D foundation models. We introduce DivAS (Depth-interactive Voxel Aggregation Segmentation), an optimization-free, fully interactive framework that addresses these limitations. Our method operates via a fast GUI-based workflow where 2D SAM masks, generated from user point prompts, are refined using NeRF-derived depth priors to improve geometric accuracy and foreground-background separation. The core of our contribution is a custom CUDA kernel that aggregates these refined multi-view masks into a unified 3D voxel grid in under 200ms, enabling real-time visual feedback. This optimization-free design eliminates the need for per-scene training. Experiments on Mip-NeRF 360° and LLFF show that DivAS achieves segmentation quality comparable to optimization-based methods, while being 2-2.5x faster end-to-end, and up to an order of magnitude faster when excluding user prompting time.

EdgeNeRF: Edge-Guided Regularization for Neural Radiance Fields from Sparse Views Weiqi Yu, Yiyang Yao, Lin He, Jianming Lv Updated 2026-01-04

Neural Radiance Fields (NeRF) achieve remarkable performance in dense multi-view scenarios, but their reconstruction quality degrades significantly under sparse inputs due to geometric artifacts. Existing methods utilize global depth regularization to mitigate artifacts, leading to the loss of geometric boundary details. To address this problem, we propose EdgeNeRF, an edge-guided sparse-view 3D reconstruction algorithm. Our method leverages the prior that abrupt changes in depth and normals generate edges. Specifically, we first extract edges from input images, then apply depth and normal regularization constraints to non-edge regions, enhancing geometric consistency while preserving high-frequency details at boundaries. Experiments on LLFF and DTU datasets demonstrate EdgeNeRF's superior performance, particularly in retaining sharp geometric boundaries and suppressing artifacts. Additionally, the proposed edge-guided depth regularization module can be seamlessly integrated into other methods in a plug-and-play manner, significantly improving their performance without substantially increasing training time. Code is available at https://github.com/skyhigh404/edgenerf.
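The edge-guided idea can be illustrated with a minimal NumPy sketch, assuming a rendered depth patch and a binary edge map are already available. The function name `edge_masked_depth_smoothness` and the first-order smoothness penalty are illustrative assumptions, not the authors' implementation; the linked repository is the reference for that.

```python
import numpy as np

def edge_masked_depth_smoothness(depth, edge_mask):
    """Mean absolute depth difference over neighbouring non-edge pixel pairs.

    depth: (H, W) float array of rendered depths (hypothetical input).
    edge_mask: (H, W) bool array, True where an image edge was detected.
    """
    depth = np.asarray(depth, dtype=np.float64)
    edge_mask = np.asarray(edge_mask, dtype=bool)

    # Finite differences between horizontal and vertical neighbours.
    dx = np.abs(depth[:, 1:] - depth[:, :-1])
    dy = np.abs(depth[1:, :] - depth[:-1, :])

    # A neighbour pair is penalised only if neither pixel lies on an edge.
    keep_x = ~(edge_mask[:, 1:] | edge_mask[:, :-1])
    keep_y = ~(edge_mask[1:, :] | edge_mask[:-1, :])

    n = keep_x.sum() + keep_y.sum()
    if n == 0:
        return 0.0
    return float((dx[keep_x].sum() + dy[keep_y].sum()) / n)
```

With a depth step that coincides with a masked edge, the penalty vanishes, so the discontinuity is left sharp; without the mask, the same step is penalised.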

CropNeRF: A Neural Radiance Field-Based Framework for Crop Counting Md Ahmed Al Muzaddid, William J. Beksi Updated 2026-01-01

Rigorous crop counting is crucial for effective agricultural management and informed intervention strategies. However, in outdoor field environments, partial occlusions combined with inherent ambiguity in distinguishing clustered crops from individual viewpoints pose an immense challenge for image-based segmentation methods. To address these problems, we introduce a novel crop counting framework designed for exact enumeration via 3D instance segmentation. Our approach utilizes 2D images captured from multiple viewpoints and associates independent instance masks for neural radiance field (NeRF) view synthesis. We introduce crop visibility and mask consistency scores, which are incorporated alongside 3D information from a NeRF model. This results in an effective segmentation of crop instances in 3D and highly accurate crop counts. Furthermore, our method eliminates the dependence on crop-specific parameter tuning. We validate our framework on three agricultural datasets consisting of cotton bolls, apples, and pears, and demonstrate consistent counting performance despite major variations in crop color, shape, and size. A comparative analysis against the state of the art highlights superior performance on crop counting tasks. Lastly, we contribute a cotton plant dataset to advance further research on this topic.

UniC-Lift: Unified 3D Instance Segmentation via Contrastive Learning Ankit Dhiman, Srinath R, Jaswanth Reddy, Lokesh R Boregowda, Venkatesh Babu Radhakrishnan Updated 2025-12-31

3D Gaussian Splatting (3DGS) and Neural Radiance Fields (NeRF) have advanced novel-view synthesis. Recent methods extend multi-view 2D segmentation to 3D, enabling instance/semantic segmentation for better scene understanding. A key challenge is the inconsistency of 2D instance labels across views, leading to poor 3D predictions. Existing methods use a two-stage approach in which some rely on contrastive learning with hyperparameter-sensitive clustering, while others preprocess labels for consistency. We propose a unified framework that merges these steps, reducing training time and improving performance by introducing a learnable feature embedding for segmentation in Gaussian primitives. This embedding is then efficiently decoded into instance labels through a novel "Embedding-to-Label" process, effectively integrating the optimization. While this unified framework offers substantial benefits, we observed artifacts at the object boundaries. To address the object boundary issues, we propose hard-mining samples along these boundaries. However, directly applying hard mining to the feature embeddings proved unstable. Therefore, we apply a linear layer to the rasterized feature embeddings before calculating the triplet loss, which stabilizes training and significantly improves performance. Our method outperforms baselines qualitatively and quantitatively on the ScanNet, Replica3D, and Messy-Rooms datasets.

ShinyNeRF: Digitizing Anisotropic Appearance in Neural Radiance Fields Albert Barreiro, Roger Marí, Rafael Redondo, Gloria Haro, Carles Bosch Updated 2025-12-25

Recent advances in digitization technologies have transformed the preservation and dissemination of cultural heritage. In this vein, Neural Radiance Fields (NeRF) have emerged as a leading technology for 3D digitization, delivering representations with exceptional realism. However, existing methods struggle to accurately model anisotropic specular surfaces, typically observed, for example, on brushed metals. In this work, we introduce ShinyNeRF, a novel framework capable of handling both isotropic and anisotropic reflections. Our method is capable of jointly estimating surface normals, tangents, specular concentration, and anisotropy magnitudes of an Anisotropic Spherical Gaussian (ASG) distribution, by learning an approximation of the outgoing radiance as an encoded mixture of isotropic von Mises-Fisher (vMF) distributions. Experimental results show that ShinyNeRF not only achieves state-of-the-art performance on digitizing anisotropic specular reflections, but also offers plausible physical interpretations and editing of material properties compared to existing methods.

Dreamcrafter: Immersive Editing of 3D Radiance Fields Through Flexible, Generative Inputs and Outputs Cyrus Vachha, Yixiao Kang, Zach Dive, Ashwat Chidambaram, Anik Gupta, Eunice Jun, Bjoern Hartmann Updated 2025-12-25

Authoring 3D scenes is a central task for spatial computing applications. Competing visions for lowering existing barriers are to (1) focus on immersive, direct manipulation of 3D content or (2) leverage AI techniques that capture real scenes (3D Radiance Fields such as NeRFs and 3D Gaussian Splatting) and modify them at a higher level of abstraction, at the cost of high latency. We unify the complementary strengths of these approaches and investigate how to integrate generative AI advances into real-time, immersive 3D Radiance Field editing. We introduce Dreamcrafter, a VR-based 3D scene editing system that: (1) provides a modular architecture to integrate generative AI algorithms; (2) combines different levels of control for creating objects, including natural language and direct manipulation; and (3) introduces proxy representations that support interaction during high-latency operations. We contribute empirical findings on control preferences and discuss how generative AI interfaces beyond text input enhance creativity in scene editing and world building.

Neural Brain Fields: A NeRF-Inspired Approach for Generating Nonexistent EEG Electrodes Shahar Ain Kedem, Itamar Zimerman, Eliya Nachmani Updated 2025-12-20

Electroencephalography (EEG) data present unique modeling challenges because recordings vary in length, exhibit very low signal-to-noise ratios, differ significantly across participants, drift over time within sessions, and are rarely available in large and clean datasets. Consequently, developing deep learning methods that can effectively process EEG signals remains an open and important research problem. To tackle this problem, this work presents a new method inspired by Neural Radiance Fields (NeRF). In computer vision, NeRF techniques train a neural network to memorize the appearance of a 3D scene and then use its learned parameters to render and edit the scene from any viewpoint. We draw an analogy between the discrete images captured from different viewpoints used to learn a continuous 3D scene in NeRF, and EEG electrodes positioned at different locations on the scalp, which are used to infer the underlying representation of continuous neural activity. Building on this connection, we show that a neural network can be trained on a single EEG sample in a NeRF-style manner to produce a fixed-size and informative weight vector that encodes the entire signal. Moreover, via this representation we can render the EEG signal at previously unseen time steps and spatial electrode positions. We demonstrate that this approach enables continuous visualization of brain activity at any desired resolution, including ultra-high resolution, and reconstruction of raw EEG signals. Finally, our empirical analysis shows that this method can effectively simulate data from nonexistent electrodes in EEG recordings, allowing the reconstructed signal to be fed into standard EEG processing networks to improve performance.

Joint Learning of Depth, Pose, and Local Radiance Field for Large Scale Monocular 3D Reconstruction Shahram Najam Syed, Yitian Hu, Yuchao Yao Updated 2025-12-20

Photorealistic 3-D reconstruction from monocular video collapses in large-scale scenes when depth, pose, and radiance are solved in isolation: scale-ambiguous depth yields ghost geometry, long-horizon pose drift corrupts alignment, and a single global NeRF cannot model hundreds of metres of content. We introduce a joint learning framework that couples all three factors and demonstrably overcomes each failure case. Our system begins with a Vision-Transformer (ViT) depth network trained with metric-scale supervision, giving globally consistent depths despite wide field-of-view variations. A multi-scale feature bundle-adjustment (BA) layer refines camera poses directly in feature space--leveraging learned pyramidal descriptors instead of brittle keypoints--to suppress drift on unconstrained trajectories. For scene representation, we deploy an incremental local-radiance-field hierarchy: new hash-grid NeRFs are allocated and frozen on-the-fly when view overlap falls below a threshold, enabling city-block-scale coverage on a single GPU. Evaluated on the Tanks and Temples benchmark, our method reduces Absolute Trajectory Error to 0.001-0.021 m across eight indoor-outdoor sequences--up to 18x lower than BARF and 2x lower than NoPe-NeRF--while maintaining sub-pixel Relative Pose Error. These results demonstrate that metric-scale, drift-free 3-D reconstruction and high-fidelity novel-view synthesis are achievable from a single uncalibrated RGB camera.

SDFoam: Signed-Distance Foam for explicit surface reconstruction Antonella Rech, Nicola Conci, Nicola Garau Updated 2025-12-18

Neural radiance fields (NeRF) have driven impressive progress in view synthesis by using ray-traced volumetric rendering. Splatting-based methods such as 3D Gaussian Splatting (3DGS) provide faster rendering by rasterizing 3D primitives. RadiantFoam (RF) brought ray tracing back, achieving throughput comparable to Gaussian Splatting by organizing radiance with an explicit Voronoi Diagram (VD). Yet, all the mentioned methods still struggle with precise mesh reconstruction. We address this gap by jointly learning an explicit VD with an implicit Signed Distance Field (SDF). The scene is optimized via ray tracing and regularized by an Eikonal objective. The SDF introduces metric-consistent isosurfaces, which, in turn, bias near-surface Voronoi cell faces to align with the zero level set. The resulting model produces crisper, view-consistent surfaces with fewer floaters and improved topology, while preserving photometric quality and maintaining training speed on par with RadiantFoam. Across diverse scenes, our hybrid implicit-explicit formulation, which we name SDFoam, substantially improves mesh reconstruction accuracy (Chamfer distance) with comparable appearance (PSNR, SSIM), without sacrificing efficiency.
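For reference, an Eikonal regularizer is commonly written as below, where $f_\theta$ denotes the learned signed distance function and $\Omega$ the set of sampled points. Both symbols are notation assumed here; the abstract does not state the exact form or weighting used in SDFoam.

```latex
\mathcal{L}_{\text{eik}}
  = \mathbb{E}_{\mathbf{x} \sim \Omega}
    \left[ \left( \lVert \nabla_{\mathbf{x}} f_\theta(\mathbf{x}) \rVert_2 - 1 \right)^2 \right]
```

The term penalises deviations of the gradient norm from 1, the property a true signed distance field satisfies almost everywhere.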

Using Gaussian Splats to Create High-Fidelity Facial Geometry and Texture Haodi He, Jihun Yu, Ronald Fedkiw Updated 2025-12-18

We leverage increasingly popular three-dimensional neural representations in order to construct a unified and consistent explanation of a collection of uncalibrated images of the human face. Our approach utilizes Gaussian Splatting, since it is more explicit and thus more amenable to constraints than NeRFs. We leverage segmentation annotations to align the semantic regions of the face, facilitating the reconstruction of a neutral pose from only 11 images (as opposed to requiring a long video). We soft constrain the Gaussians to an underlying triangulated surface in order to provide a more structured Gaussian Splat reconstruction, which in turn informs subsequent perturbations to increase the accuracy of the underlying triangulated surface. The resulting triangulated surface can then be used in a standard graphics pipeline. In addition, and perhaps most impactful, we show how accurate geometry enables the Gaussian Splats to be transformed into texture space where they can be treated as a view-dependent neural texture. This allows one to use high visual fidelity Gaussian Splatting on any asset in a scene without the need to modify any other asset or any other aspect (geometry, lighting, renderer, etc.) of the graphics pipeline. We utilize a relightable Gaussian model to disentangle texture from lighting in order to obtain a delit high-resolution albedo texture that is also readily usable in a standard graphics pipeline. The flexibility of our system allows for training with disparate images, even with incompatible lighting, facilitating robust regularization. Finally, we demonstrate the efficacy of our approach by illustrating its use in a text-driven asset creation pipeline.

NAP3D: NeRF Assisted 3D-3D Pose Alignment for Autonomous Vehicles Gaurav Bansal Updated 2025-12-17

Accurate localization is essential for autonomous vehicles, yet sensor noise and drift over time can lead to significant pose estimation errors, particularly in long-horizon environments. A common strategy for correcting accumulated error is visual loop closure in SLAM, which adjusts the pose graph when the agent revisits previously mapped locations. These techniques typically rely on identifying visual mappings between the current view and previously observed scenes and often require fusing data from multiple sensors. In contrast, this work introduces NeRF-Assisted 3D-3D Pose Alignment (NAP3D), a complementary approach that leverages 3D-3D correspondences between the agent's current depth image and a pre-trained Neural Radiance Field (NeRF). By directly aligning 3D points from the observed scene with synthesized points from the NeRF, NAP3D refines the estimated pose even from novel viewpoints, without relying on revisiting previously observed locations. This robust 3D-3D formulation provides advantages over conventional 2D-3D localization methods while remaining comparable in accuracy and applicability. Experiments demonstrate that NAP3D achieves camera pose correction within 5 cm on a custom dataset, robustly outperforming a 2D-3D Perspective-N-Point baseline. On TUM RGB-D, NAP3D consistently improves 3D alignment RMSE by approximately 6 cm compared to this baseline given varying noise, despite PnP achieving lower raw rotation and translation parameter error in some regimes, highlighting NAP3D's improved geometric consistency in 3D space. By providing a lightweight, dataset-agnostic tool, NAP3D complements existing SLAM and localization pipelines when traditional loop closure is unavailable.
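Once 3D-3D correspondences are available, a rigid pose correction can be estimated in closed form. The sketch below uses the standard Kabsch/SVD solution as a stand-in; the abstract does not say which solver NAP3D uses, and the function name `rigid_align` is hypothetical.

```python
import numpy as np

def rigid_align(src, dst):
    """Least-squares rigid transform (R, t) such that dst ≈ src @ R.T + t.

    src, dst: (N, 3) arrays of corresponding 3D points, N >= 3.
    """
    src = np.asarray(src, dtype=np.float64)
    dst = np.asarray(dst, dtype=np.float64)
    src_c = src.mean(axis=0)
    dst_c = dst.mean(axis=0)

    # Cross-covariance of the centred point sets.
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)

    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t
```

In a setting like the one described, `src` would hold points back-projected from the observed depth image and `dst` the matching points synthesized from the NeRF. Noisy correspondences would typically call for an outlier-robust wrapper, which is omitted here.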

Broadening View Synthesis of Dynamic Scenes from Constrained Monocular Videos Le Jiang et al. Updated 2025-12-16

Abstract unavailable in cached data. It will appear after the next refresh.

HGS: Hybrid Gaussian Splatting with Static-Dynamic Decomposition for Compact Dynamic View Synthesis Kaizhe Zhang et al. Updated 2025-12-16

Abstract unavailable in cached data. It will appear after the next refresh.

AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation Sisi Dai et al. Updated 2025-12-16

Abstract unavailable in cached data. It will appear after the next refresh.

Quantum Implicit Neural Representations for 3D Scene Reconstruction and Novel View Synthesis Yeray Cordero et al. Updated 2025-12-14

Abstract unavailable in cached data. It will appear after the next refresh.

Physically Aware 360° View Generation from a Single Image using Disentangled Scene Embeddings Karthikeya KV et al. Updated 2025-12-11

Abstract unavailable in cached data. It will appear after the next refresh.

Relightable and Dynamic Gaussian Avatar Reconstruction from Monocular Video Seonghwa Choi et al. Updated 2025-12-11

Abstract unavailable in cached data. It will appear after the next refresh.

Log NeRF: Comparing Spaces for Learning Radiance Fields Sihe Chen et al. Updated 2025-12-10

Abstract unavailable in cached data. It will appear after the next refresh.

AGORA: Adversarial Generation Of Real-time Animatable 3D Gaussian Head Avatars Ramazan Fazylov et al. Updated 2025-12-10

Abstract unavailable in cached data. It will appear after the next refresh.

HybridSplat: Fast Reflection-baked Gaussian Tracing using Hybrid Splatting Chang Liu et al. Updated 2025-12-09

Abstract unavailable in cached data. It will appear after the next refresh.

Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement Chia-Hern Lai et al. Updated 2025-12-09

Abstract unavailable in cached data. It will appear after the next refresh.

From Orbit to Ground: Generative City Photogrammetry from Extreme Off-Nadir Satellite Images Fei Yu et al. Updated 2025-12-09

Abstract unavailable in cached data. It will appear after the next refresh.

Radiance-Field Reinforced Pretraining: Scaling Localization Models with Unlabeled Wireless Signals Guosheng Wang et al. Updated 2025-12-08

Abstract unavailable in cached data. It will appear after the next refresh.

Gaussian Entropy Fields: Driving Adaptive Sparsity in 3D Gaussian Optimization Hong Kuang et al. Updated 2025-12-04

Abstract unavailable in cached data. It will appear after the next refresh.

Radiance Meshes for Volumetric Reconstruction Alexander Mai et al. Updated 2025-12-03

Abstract unavailable in cached data. It will appear after the next refresh.

What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models Tianchen Deng et al. Updated 2025-12-03

Abstract unavailable in cached data. It will appear after the next refresh.

Flux4D: Flow-based Unsupervised 4D Reconstruction Jingkang Wang et al. Updated 2025-12-02

Abstract unavailable in cached data. It will appear after the next refresh.

PolarGuide-GSDR: 3D Gaussian Splatting Driven by Polarization Priors and Deferred Reflection for Real-World Reflective Scenes Derui Shan et al. Updated 2025-12-02

Abstract unavailable in cached data. It will appear after the next refresh.

SplatSuRe: Selective Super-Resolution for Multi-view Consistent 3D Gaussian Splatting Pranav Asthana et al. Updated 2025-12-01

Abstract unavailable in cached data. It will appear after the next refresh.

EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly Xiaokun Pan et al. Updated 2025-12-01

Abstract unavailable in cached data. It will appear after the next refresh.

Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer Dong In Lee et al. Updated 2025-11-30

Abstract unavailable in cached data. It will appear after the next refresh.

SplatFont3D: Structure-Aware Text-to-3D Artistic Font Generation with Part-Level Style Control Ji Gan et al. Updated 2025-11-29

Abstract unavailable in cached data. It will appear after the next refresh.

Image Valuation in NeRF-based 3D reconstruction Grigorios Aris Cheimariotis et al. Updated 2025-11-28

Abstract unavailable in cached data. It will appear after the next refresh.

Δ-NeRF: Incremental Refinement of Neural Radiance Fields through Residual Control and Knowledge Transfer Kriti Ghosh et al. Updated 2025-11-25

Abstract unavailable in cached data. It will appear after the next refresh.

Proxy-Free Gaussian Splats Deformation with Splat-Based Surface Estimation Jaeyeong Kim et al. Updated 2025-11-24

Abstract unavailable in cached data. It will appear after the next refresh.

MapRF: Weakly Supervised Online HD Map Construction via NeRF-Guided Self-Training Hongyu Lyu et al. Updated 2025-11-24

Abstract unavailable in cached data. It will appear after the next refresh.

TPG-INR: Target Prior-Guided Implicit 3D CT Reconstruction for Enhanced Sparse-view Imaging Qinglei Cao et al. Updated 2025-11-24

Abstract unavailable in cached data. It will appear after the next refresh.

ReCoGS: Real-time ReColoring for Gaussian Splatting scenes Lorenzo Rutayisire et al. Updated 2025-11-23

Abstract unavailable in cached data. It will appear after the next refresh.

AIA-UltraNeRF: Acoustic-Impedance-Aware Neural Radiance Field with Hash Encodings for Robotic Ultrasound Reconstruction and Localization Shuai Zhang et al. Updated 2025-11-23

Abstract unavailable in cached data. It will appear after the next refresh.

NoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior Dongbo Shi et al. Updated 2025-11-21

Abstract unavailable in cached data. It will appear after the next refresh.

EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering Pierrick Bournez et al. Updated 2025-11-20

Abstract unavailable in cached data. It will appear after the next refresh.

iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion Hao Wang et al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

PFAvatar: Pose-Fusion 3D Personalized Avatar Reconstruction from Real-World Outfit-of-the-Day Photos Dianbing Xi et al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding Artem Moroz et al. Updated 2025-11-16

Abstract unavailable in cached data. It will appear after the next refresh.

LiDAR-GS++: Improving LiDAR Gaussian Reconstruction via Diffusion Priors Qifeng Chen et al. Updated 2025-11-15

Abstract unavailable in cached data. It will appear after the next refresh.

RePose-NeRF: Robust Radiance Fields for Mesh Reconstruction under Noisy Camera Poses Sriram Srinivasan et al. Updated 2025-11-11

Abstract unavailable in cached data. It will appear after the next refresh.

Is It Truly Necessary to Process and Fit Minutes-Long Reference Videos for Personalized Talking Face Generation? Rui-Qing Sun et al. Updated 2025-11-11

Abstract unavailable in cached data. It will appear after the next refresh.

Sparse4DGS: 4D Gaussian Splatting for Sparse-Frame Dynamic Scene Reconstruction Changyue Shi et al. Updated 2025-11-10

Abstract unavailable in cached data. It will appear after the next refresh.

Inpaint360GS: Efficient Object-Aware 3D Inpainting via Gaussian Splatting for 360° Scenes Shaoxiang Wang et al. Updated 2025-11-09

Abstract unavailable in cached data. It will appear after the next refresh.

VDNeRF: Vision-only Dynamic Neural Radiance Field for Urban Scenes Zhengyu Zou et al. Updated 2025-11-09

Abstract unavailable in cached data. It will appear after the next refresh.

4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos Mengqi Guo et al. Updated 2025-11-07

Abstract unavailable in cached data. It will appear after the next refresh.

Efficient representation of 3D spatial data for defense-related applications Benjamin Kahl et al. Updated 2025-11-07

Abstract unavailable in cached data. It will appear after the next refresh.

3D Gaussian Point Encoders Jim James et al. Updated 2025-11-06

Abstract unavailable in cached data. It will appear after the next refresh.

FastGS: Training 3D Gaussian Splatting in 100 Seconds Shiwei Ren et al. Updated 2025-11-06

Abstract unavailable in cached data. It will appear after the next refresh.

LiteVoxel: Low-memory Intelligent Thresholding for Efficient Voxel Rasterization Jee Won Lee et al. Updated 2025-11-04

Abstract unavailable in cached data. It will appear after the next refresh.

Object-Centric 3D Gaussian Splatting for Strawberry Plant Reconstruction and Phenotyping Jiajia Li et al. Updated 2025-11-04

Abstract unavailable in cached data. It will appear after the next refresh.

GauSSmart: Enhanced 3D Reconstruction through 2D Foundation Models and Geometric Filtering Alexander Valverde et al. Updated 2025-11-03

Abstract unavailable in cached data. It will appear after the next refresh.

SAGS: Self-Adaptive Alias-Free Gaussian Splatting for Dynamic Surgical Endoscopic Reconstruction Wenfeng Huang et al. Updated 2025-10-31

Abstract unavailable in cached data. It will appear after the next refresh.

4-Doodle: Text to 3D Sketches that Move! Hao Chen et al. Updated 2025-10-29

Abstract unavailable in cached data. It will appear after the next refresh.

I2-NeRF: Learning Neural Radiance Fields Under Physically-Grounded Media Interactions Shuhong Liu et al. Updated 2025-10-25

Abstract unavailable in cached data. It will appear after the next refresh.

From Far and Near: Perceptual Evaluation of Crowd Representations Across Levels of Detail Xiaohan Sun et al. Updated 2025-10-23

Abstract unavailable in cached data. It will appear after the next refresh.

Extreme Views: 3DGS Filter for Novel View Synthesis from Out-of-Distribution Camera Poses Damian Bowness et al. Updated 2025-10-22

Abstract unavailable in cached data. It will appear after the next refresh.

AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields Woo Jae Kim et al. Updated 2025-10-22

Abstract unavailable in cached data. It will appear after the next refresh.

Advances in 4D Representation: Geometry, Motion, and Interaction Mingrui Zhao et al. Updated 2025-10-22

Abstract unavailable in cached data. It will appear after the next refresh.

SimULi: Real-Time LiDAR and Camera Simulation with Unscented Transforms Haithem Turki et al. Updated 2025-10-16

Abstract unavailable in cached data. It will appear after the next refresh.

Perspective-aware 3D Gaussian Inpainting with Multi-view Consistency Yuxin Cheng et al. Updated 2025-10-13

Abstract unavailable in cached data. It will appear after the next refresh.

Opacity-Gradient Driven Density Control for Compact and Efficient Few-Shot 3D Gaussian Splatting Abdelrhman Elrawy et al. Updated 2025-10-11

Abstract unavailable in cached data. It will appear after the next refresh.

Gesplat: Robust Pose-Free 3D Reconstruction via Geometry-Guided Gaussian Splatting Jiahui Lu et al. Updated 2025-10-11

Abstract unavailable in cached data. It will appear after the next refresh.

Geometry-Aware Scene Configurations for Novel View Synthesis Minkwan Kim et al. Updated 2025-10-10

Abstract unavailable in cached data. It will appear after the next refresh.

Vision Language Models: A Survey of 26K Papers Fengming Lin et al. Updated 2025-10-10

Abstract unavailable in cached data. It will appear after the next refresh.

HERO: Hardware-Efficient RL-based Optimization Framework for NeRF Quantization Yipu Zhang et al. Updated 2025-10-10

Abstract unavailable in cached data. It will appear after the next refresh.

An Energy-Efficient Edge Coprocessor for Neural Rendering with Explicit Data Reuse Strategies Binzhe Yuan et al. Updated 2025-10-09

Abstract unavailable in cached data. It will appear after the next refresh.

VGGT-X: When VGGT Meets Dense Novel View Synthesis Yang Liu et al. Updated 2025-10-08

Abstract unavailable in cached data. It will appear after the next refresh.

OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting Atakan Topaloglu et al. Updated 2025-10-04

Abstract unavailable in cached data. It will appear after the next refresh.

ROGR: Relightable 3D Objects using Generative Relighting Jiapeng Tang et al. Updated 2025-10-03

Abstract unavailable in cached data. It will appear after the next refresh.

StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions Bo-Hsu Ke et al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

GEM: 3D Gaussian Splatting for Efficient and Accurate Cryo-EM Reconstruction Huaizhi Qu et al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Multi-level Dynamic Style Transfer for NeRFs Zesheng Li et al. Updated 2025-10-01

Abstract unavailable in cached data. It will appear after the next refresh.

FM-SIREN & FM-FINER: Nyquist-Informed Frequency Multiplier for Implicit Neural Representation with Periodic Activation Mohammed Alsakabi et al. Updated 2025-09-30

Abstract unavailable in cached data. It will appear after the next refresh.

From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations Javed Ahmad et al. Updated 2025-09-28

Abstract unavailable in cached data. It will appear after the next refresh.

WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction Hung Nguyen et al. Updated 2025-09-23

Abstract unavailable in cached data. It will appear after the next refresh.

Seeing Through Reflections: Advancing 3D Scene Reconstruction in Mirror-Containing Environments with Gaussian Splatting Zijing Guo et al. Updated 2025-09-23

Abstract unavailable in cached data. It will appear after the next refresh.

HyRF: Hybrid Radiance Fields for Memory-efficient and High-quality Novel View Synthesis Zipeng Wang et al. Updated 2025-09-23

Abstract unavailable in cached data. It will appear after the next refresh.

From Restoration to Reconstruction: Rethinking 3D Gaussian Splatting for Underwater Scenes Guoxi Huang et al. Updated 2025-09-22

Abstract unavailable in cached data. It will appear after the next refresh.

MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild Deming Li et al. Updated 2025-09-22

Abstract unavailable in cached data. It will appear after the next refresh.

DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction Bo Liu et.al. Updated 2025-09-21

Abstract unavailable in cached data. It will appear after the next refresh.

PGSTalker: Real-Time Audio-Driven Talking Head Generation via 3D Gaussian Splatting with Pixel-Aware Density Control Tianheng Zhu et.al. Updated 2025-09-21

Abstract unavailable in cached data. It will appear after the next refresh.

RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes Fang Li et.al. Updated 2025-09-19

Abstract unavailable in cached data. It will appear after the next refresh.

NeRF-based Visualization of 3D Cues Supporting Data-Driven Spacecraft Pose Estimation Antoine Legrand et.al. Updated 2025-09-18

Abstract unavailable in cached data. It will appear after the next refresh.

ProFusion: 3D Reconstruction of Protein Complex Structures from Multi-view AFM Images Jaydeep Rade et.al. Updated 2025-09-17

Abstract unavailable in cached data. It will appear after the next refresh.

SuNeRF-CME: Physics-Informed Neural Radiance Fields for Tomographic Reconstruction of Coronal Mass Ejections Robert Jarolim et.al. Updated 2025-09-16

Abstract unavailable in cached data. It will appear after the next refresh.

Exploring Metric Fusion for Evaluation of NeRFs Shreyas Shivakumara et.al. Updated 2025-09-16

Abstract unavailable in cached data. It will appear after the next refresh.

Neural 3D Object Reconstruction with Small-Scale Unmanned Aerial Vehicles Àlmos Veres-Vitàlyos et.al. Updated 2025-09-15

Abstract unavailable in cached data. It will appear after the next refresh.

ROSGS: Relightable Outdoor Scenes With Gaussian Splatting Lianjun Liao et.al. Updated 2025-09-14

Abstract unavailable in cached data. It will appear after the next refresh.

SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion Zhiwen Yang et.al. Updated 2025-09-14

Abstract unavailable in cached data. It will appear after the next refresh.

Multispectral-NeRF: a multispectral modeling approach based on neural radiance fields Hong Zhang et.al. Updated 2025-09-14

Abstract unavailable in cached data. It will appear after the next refresh.

SplatFill: 3D Scene Inpainting via Depth-Guided Gaussian Splatting Mahtab Dahaghin et.al. Updated 2025-09-09

Abstract unavailable in cached data. It will appear after the next refresh.

DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning Wenzhi Guo et.al. Updated 2025-09-09

Abstract unavailable in cached data. It will appear after the next refresh.

GS-TG: 3D Gaussian Splatting Accelerator with Tile Grouping for Reducing Redundant Sorting while Preserving Rasterization Efficiency Joongho Jo et.al. Updated 2025-09-03

Abstract unavailable in cached data. It will appear after the next refresh.

SWAGSplatting: Semantic-guided Water-scene Augmented Gaussian Splatting Zhuodong Jiang et.al. Updated 2025-08-31

Abstract unavailable in cached data. It will appear after the next refresh.

Adam SLAM - the last mile of camera calibration with 3DGS Matthieu Gendrin et.al. Updated 2025-08-28

Abstract unavailable in cached data. It will appear after the next refresh.

Generating Human-AI Collaborative Design Sequence for 3D Assets via Differentiable Operation Graph Xiaoyang Huang et.al. Updated 2025-08-28

Abstract unavailable in cached data. It will appear after the next refresh.

Can we make NeRF-based visual localization privacy-preserving? Maxime Pietrantoni et.al. Updated 2025-08-26

Abstract unavailable in cached data. It will appear after the next refresh.

Real-time 3D Visualization of Radiance Fields on Light Field Displays Jonghyun Kim et.al. Updated 2025-08-25

Abstract unavailable in cached data. It will appear after the next refresh.

Align 3D Representation and Text Embedding for 3D Content Personalization Qi Song et.al. Updated 2025-08-23

Abstract unavailable in cached data. It will appear after the next refresh.

GOGS: High-Fidelity Geometry and Relighting for Glossy Objects via Gaussian Surfels Xingyuan Yang et.al. Updated 2025-08-20

Abstract unavailable in cached data. It will appear after the next refresh.

DoRF: Doppler Radiance Fields for Robust Human Activity Recognition Using Wi-Fi Navid Hasanzadeh et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing Tielong Wang et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling Zihang Zeng et.al. Updated 2025-07-14

Abstract unavailable in cached data. It will appear after the next refresh.

BayesSDF: Surface-Based Laplacian Uncertainty Estimation for 3D Geometry with Neural Signed Distance Fields Rushil Desai et.al. Updated 2025-07-14

Abstract unavailable in cached data. It will appear after the next refresh.

Stable Score Distillation Haiming Zhu et.al. Updated 2025-07-12

Abstract unavailable in cached data. It will appear after the next refresh.

From images to properties: a NeRF-driven framework for granular material parameter inversion Cheng-Hsi Hsiao et.al. Updated 2025-07-11

Abstract unavailable in cached data. It will appear after the next refresh.

MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation Bangning Wei et.al. Updated 2025-07-10

Abstract unavailable in cached data. It will appear after the next refresh.

Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering Jiayi Song et.al. Updated 2025-07-08

Abstract unavailable in cached data. It will appear after the next refresh.

DreamArt: Generating Interactable Articulated Objects from a Single Image Ruijie Lu et.al. Updated 2025-07-08

Abstract unavailable in cached data. It will appear after the next refresh.

A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields Aoxiang Fan et.al. Updated 2025-07-06

Abstract unavailable in cached data. It will appear after the next refresh.

Tile and Slide: A New Framework for Scaling NeRF from Local to Global 3D Earth Observation Camille Billouard et.al. Updated 2025-07-02

Abstract unavailable in cached data. It will appear after the next refresh.

Surgical Neural Radiance Fields from One Image Alberto Neri et.al. Updated 2025-07-01

Abstract unavailable in cached data. It will appear after the next refresh.

PlantSegNeRF: A few-shot, cross-dataset method for plant 3D instance point cloud reconstruction via joint-channel NeRF with multi-view image instance matching Xin Yang et.al. Updated 2025-07-01

Abstract unavailable in cached data. It will appear after the next refresh.

AttentionGS: Towards Initialization-Free 3D Gaussian Splatting via Structural Attention Ziao Liu et.al. Updated 2025-06-30

Abstract unavailable in cached data. It will appear after the next refresh.

Dynamic View Synthesis from Small Camera Motion Videos Huiqiang Sun et.al. Updated 2025-06-29

Abstract unavailable in cached data. It will appear after the next refresh.

UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields Fabian Perez et.al. Updated 2025-06-27

Abstract unavailable in cached data. It will appear after the next refresh.

PanSt3R: Multi-view Consistent Panoptic Segmentation Lojze Zust et.al. Updated 2025-06-26

Abstract unavailable in cached data. It will appear after the next refresh.

2D Triangle Splatting for Direct Differentiable Mesh Training Kaifeng Sheng et.al. Updated 2025-06-26

Abstract unavailable in cached data. It will appear after the next refresh.

Joint attitude estimation and 3D neural reconstruction of non-cooperative space objects Clément Forray et.al. Updated 2025-06-25

Abstract unavailable in cached data. It will appear after the next refresh.

Self-Supervised Multimodal NeRF for Autonomous Driving Gaurav Sharma et.al. Updated 2025-06-25

Abstract unavailable in cached data. It will appear after the next refresh.

ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes Chenhao Zhang et.al. Updated 2025-06-24

Abstract unavailable in cached data. It will appear after the next refresh.

NeRF-based CBCT Reconstruction needs Normalization and Initialization Zhuowei Xu et.al. Updated 2025-06-24

Abstract unavailable in cached data. It will appear after the next refresh.

HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis Xiaoyuan Wang et.al. Updated 2025-06-24

Abstract unavailable in cached data. It will appear after the next refresh.

RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories Qingsong Yan et.al. Updated 2025-06-24

Abstract unavailable in cached data. It will appear after the next refresh.

MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation Tianchen Deng et.al. Updated 2025-06-23

Abstract unavailable in cached data. It will appear after the next refresh.

R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision Weeyoung Kwon et.al. Updated 2025-06-23

Abstract unavailable in cached data. It will appear after the next refresh.

Limitations of NERF with pre-trained Vision Features for Few-Shot 3D Reconstruction Ankit Sanjyal et.al. Updated 2025-06-22

Abstract unavailable in cached data. It will appear after the next refresh.

3D Gaussian Splatting for Fine-Detailed Surface Reconstruction in Large-Scale Scene Shihan Chen et.al. Updated 2025-06-21

Abstract unavailable in cached data. It will appear after the next refresh.

Rasterizing Wireless Radiance Field via Deformable 2D Gaussian Splatting Mufan Liu et.al. Updated 2025-06-18

Abstract unavailable in cached data. It will appear after the next refresh.

Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction Zhengquan Zhang et.al. Updated 2025-06-17

Abstract unavailable in cached data. It will appear after the next refresh.

Efficient multi-view training for 3D Gaussian Splatting Minhyuk Choi et.al. Updated 2025-06-17

Abstract unavailable in cached data. It will appear after the next refresh.

Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency Xiangyu Guo et.al. Updated 2025-06-16

Abstract unavailable in cached data. It will appear after the next refresh.

PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting Lintao Xiang et.al. Updated 2025-06-12

Abstract unavailable in cached data. It will appear after the next refresh.

The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images without Any 3D Knowledge Haoru Wang et.al. Updated 2025-06-11

Abstract unavailable in cached data. It will appear after the next refresh.

A Probability-guided Sampler for Neural Implicit Surface Rendering Gonçalo Dias Pais et.al. Updated 2025-06-10

Abstract unavailable in cached data. It will appear after the next refresh.

Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes Allen Tu et.al. Updated 2025-06-09

Abstract unavailable in cached data. It will appear after the next refresh.

SPC to 3D: Novel View Synthesis from Binary SPC via I2I translation Sumit Sharma et.al. Updated 2025-06-07

Abstract unavailable in cached data. It will appear after the next refresh.

Splat and Replace: 3D Reconstruction with Repetitive Elements Nicolás Violante et.al. Updated 2025-06-06

Abstract unavailable in cached data. It will appear after the next refresh.

NeurNCD: Novel Class Discovery via Implicit Neural Representation Junming Wang et.al. Updated 2025-06-06

Abstract unavailable in cached data. It will appear after the next refresh.

Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments Mingrui Li et.al. Updated 2025-06-06

Abstract unavailable in cached data. It will appear after the next refresh.

ProJo4D: Progressive Joint Optimization for Sparse-View Inverse Physics Estimation Daniel Rho et.al. Updated 2025-06-06

Abstract unavailable in cached data. It will appear after the next refresh.

Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting Nan Wang et.al. Updated 2025-06-06

Abstract unavailable in cached data. It will appear after the next refresh.

Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer Filip Slezak et.al. Updated 2025-06-05

Abstract unavailable in cached data. It will appear after the next refresh.

Hi-Dyna Graph: Hierarchical Dynamic Scene Graph for Robotic Autonomy in Human-Centric Environments Jiawei Hou et.al. Updated 2025-05-30

Abstract unavailable in cached data. It will appear after the next refresh.

ErpGS: Equirectangular Image Rendering enhanced with 3D Gaussian Regularization Shintaro Ito et.al. Updated 2025-05-30

Abstract unavailable in cached data. It will appear after the next refresh.

PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views Mohamed Rayan Barhdadi et.al. Updated 2025-05-29

Abstract unavailable in cached data. It will appear after the next refresh.

LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering Jonas Kulhanek et.al. Updated 2025-05-29

Abstract unavailable in cached data. It will appear after the next refresh.

Can NeRFs See without Cameras? Chaitanya Amballa et.al. Updated 2025-05-28

Abstract unavailable in cached data. It will appear after the next refresh.

Learning Fine-Grained Geometry for Sparse-View Splatting via Cascade Depth Loss Wenjun Lu et.al. Updated 2025-05-28

Abstract unavailable in cached data. It will appear after the next refresh.

Hyperspectral Gaussian Splatting Sunil Kumar Narayanan et.al. Updated 2025-05-28

Abstract unavailable in cached data. It will appear after the next refresh.

Structure from Collision Takuhiro Kaneko et.al. Updated 2025-05-27

Abstract unavailable in cached data. It will appear after the next refresh.

OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender Shintaro Ito et.al. Updated 2025-05-26

Abstract unavailable in cached data. It will appear after the next refresh.

GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis You Wang et.al. Updated 2025-05-26

Abstract unavailable in cached data. It will appear after the next refresh.

Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction Li Fang et.al. Updated 2025-05-26

Abstract unavailable in cached data. It will appear after the next refresh.

UAV See, UGV Do: Aerial Imagery and Virtual Teach Enabling Zero-Shot Ground Vehicle Repeat Desiree Fisker et.al. Updated 2025-05-22

Abstract unavailable in cached data. It will appear after the next refresh.

IPENS: Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion Wentao Song et.al. Updated 2025-05-19

Abstract unavailable in cached data. It will appear after the next refresh.

3D Gaussian Adaptive Reconstruction for Fourier Light-Field Microscopy Chenyu Xu et.al. Updated 2025-05-19

Abstract unavailable in cached data. It will appear after the next refresh.

Is Semantic SLAM Ready for Embedded Systems? A Comparative Survey Calvin Galagain et.al. Updated 2025-05-18

Abstract unavailable in cached data. It will appear after the next refresh.

MutualNeRF: Improve the Performance of NeRF under Limited Samples with Mutual Information Theory Zifan Wang et.al. Updated 2025-05-16

Abstract unavailable in cached data. It will appear after the next refresh.

EA-3DGS: Efficient and Adaptive 3D Gaussians with Highly Enhanced Quality for outdoor scenes Jianlin Guo et.al. Updated 2025-05-16

Abstract unavailable in cached data. It will appear after the next refresh.

Large-Scale Gaussian Splatting SLAM Zhe Xin et.al. Updated 2025-05-15

Abstract unavailable in cached data. It will appear after the next refresh.

Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians Ma Changfeng et.al. Updated 2025-05-14

Abstract unavailable in cached data. It will appear after the next refresh.

FreeDriveRF: Monocular RGB Dynamic NeRF without Poses for Autonomous Driving via Point-Level Dynamic-Static Decoupling Yue Wen et.al. Updated 2025-05-14

Abstract unavailable in cached data. It will appear after the next refresh.

FOCI: Trajectory Optimization on Gaussian Splats Mario Gomez Andreu et.al. Updated 2025-05-13

Abstract unavailable in cached data. It will appear after the next refresh.

TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset Olaf Wysocki et.al. Updated 2025-05-13

Abstract unavailable in cached data. It will appear after the next refresh.

TUGS: Physics-based Compact Representation of Underwater Scenes by Tensorized Gaussian Shijie Lian et.al. Updated 2025-05-12

Abstract unavailable in cached data. It will appear after the next refresh.

Geometric Prior-Guided Neural Implicit Surface Reconstruction in the Wild Lintao Xiang et.al. Updated 2025-05-12

Abstract unavailable in cached data. It will appear after the next refresh.

NeuGen: Amplifying the 'Neural' in Neural Radiance Fields for Domain Generalization Ahmed Qazi et.al. Updated 2025-05-11

Abstract unavailable in cached data. It will appear after the next refresh.

3D Characterization of Smoke Plume Dispersion Using Multi-View Drone Swarm Nikil Krishnakumar et.al. Updated 2025-05-10

Abstract unavailable in cached data. It will appear after the next refresh.

FlexNeRFer: A Multi-Dataflow, Adaptive Sparsity-Aware Accelerator for On-Device NeRF Rendering Seock-Hwan Noh et.al. Updated 2025-05-10

Abstract unavailable in cached data. It will appear after the next refresh.

3D Scene Generation: A Survey Beichen Wen et.al. Updated 2025-05-08

Abstract unavailable in cached data. It will appear after the next refresh.

HandOcc: NeRF-based Hand Rendering with Occupancy Networks Maksym Ivashechkin et.al. Updated 2025-05-04

Abstract unavailable in cached data. It will appear after the next refresh.

Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance Fields Zhenxing Mi et.al. Updated 2025-05-04

Abstract unavailable in cached data. It will appear after the next refresh.

AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting Junhao Shi et.al. Updated 2025-05-03

Abstract unavailable in cached data. It will appear after the next refresh.

Unified Steganography via Implicit Neural Representation Qi Song et.al. Updated 2025-05-03

Abstract unavailable in cached data. It will appear after the next refresh.

Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation Feng Xue et.al. Updated 2025-05-01

Abstract unavailable in cached data. It will appear after the next refresh.

GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting Jongwon Lee et.al. Updated 2025-05-01

Abstract unavailable in cached data. It will appear after the next refresh.

A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond Jiajia Li et.al. Updated 2025-04-30

Abstract unavailable in cached data. It will appear after the next refresh.

GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction Yuhan Xie et.al. Updated 2025-04-29

Abstract unavailable in cached data. It will appear after the next refresh.

Large-scale visual SLAM for in-the-wild videos Shuo Sun et.al. Updated 2025-04-29

Abstract unavailable in cached data. It will appear after the next refresh.

Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views Jiang Wu et.al. Updated 2025-04-29

Abstract unavailable in cached data. It will appear after the next refresh.

IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos Yuan Li et.al. Updated 2025-04-29

Abstract unavailable in cached data. It will appear after the next refresh.

Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video Hoang Chuong Nguyen et.al. Updated 2025-04-28

Abstract unavailable in cached data. It will appear after the next refresh.

RGS-DR: Reflective Gaussian Surfels with Deferred Rendering for Shiny Objects Georgios Kouros et.al. Updated 2025-04-28

Abstract unavailable in cached data. It will appear after the next refresh.

Beyond Physical Reach: Comparing Head- and Cane-Mounted Cameras for Last-Mile Navigation by Blind Users Apurv Varshney et.al. Updated 2025-04-27

Abstract unavailable in cached data. It will appear after the next refresh.

CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos Shucheng Gong et.al. Updated 2025-04-24

Abstract unavailable in cached data. It will appear after the next refresh.

Dual-Camera All-in-Focus Neural Radiance Fields Xianrui Luo et.al. Updated 2025-04-23

Abstract unavailable in cached data. It will appear after the next refresh.

Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks Murat Bilgehan Ertan et.al. Updated 2025-04-23

Abstract unavailable in cached data. It will appear after the next refresh.

SaENeRF: Suppressing Artifacts in Event-based Neural Radiance Fields Yuanjian Wang et.al. Updated 2025-04-23

Abstract unavailable in cached data. It will appear after the next refresh.

Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models Quentin Herau et.al. Updated 2025-04-22

Abstract unavailable in cached data. It will appear after the next refresh.

StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians Cailin Zhuang et.al. Updated 2025-04-21

Abstract unavailable in cached data. It will appear after the next refresh.

SLAM&Render: A Benchmark for the Intersection Between Neural Rendering, Gaussian Splatting and SLAM Samuel Cerezo et.al. Updated 2025-04-21

Abstract unavailable in cached data. It will appear after the next refresh.

Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training Andrea Amaduzzi et.al. Updated 2025-04-18

Abstract unavailable in cached data. It will appear after the next refresh.

GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration Rendong Zhang et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors Haoyang Wang et.al. Updated 2025-04-16

Abstract unavailable in cached data. It will appear after the next refresh.

LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis Hao Sun et.al. Updated 2025-04-15

Abstract unavailable in cached data. It will appear after the next refresh.

MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling Yunpeng Tan et.al. Updated 2025-04-14

Abstract unavailable in cached data. It will appear after the next refresh.

NeRF-Based Transparent Object Grasping Enhanced by Shape Priors Yi Han et.al. Updated 2025-04-14

Abstract unavailable in cached data. It will appear after the next refresh.

HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields Asterios Reppas et.al. Updated 2025-04-11

Abstract unavailable in cached data. It will appear after the next refresh.

Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting Daiwei Zhang et.al. Updated 2025-04-09

Abstract unavailable in cached data. It will appear after the next refresh.

S-EO: A Large-Scale Dataset for Geometry-Aware Shadow Detection in Remote Sensing Applications Masquil Elías et.al. Updated 2025-04-09

Abstract unavailable in cached data. It will appear after the next refresh.

SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering Hanxiao Sun et.al. Updated 2025-04-09

Abstract unavailable in cached data. It will appear after the next refresh.

Meta-Continual Learning of Neural Fields Seungyoon Woo et.al. Updated 2025-04-08

Abstract unavailable in cached data. It will appear after the next refresh.

SE4Lip: Speech-Lip Encoder for Talking Head Synthesis to Solve Phoneme-Viseme Alignment Ambiguity Yihuan Huang et.al. Updated 2025-04-08

Abstract unavailable in cached data. It will appear after the next refresh.

InvNeRF-Seg: Fine-Tuning a Pre-Trained NeRF for 3D Object Segmentation Jiangsan Zhao et.al. Updated 2025-04-08

Abstract unavailable in cached data. It will appear after the next refresh.

DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal Wanzhou Liu et.al. Updated 2025-04-07

Abstract unavailable in cached data. It will appear after the next refresh.

Thermoxels: a voxel-based method to generate simulation-ready 3D thermal models Etienne Chassaing et.al. Updated 2025-04-06

Abstract unavailable in cached data. It will appear after the next refresh.

NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices Zhe Wang et.al. Updated 2025-04-04

Abstract unavailable in cached data. It will appear after the next refresh.

MultiNeRF: Multiple Watermark Embedding for Neural Radiance Fields Yash Kulthe et.al. Updated 2025-04-03

Abstract unavailable in cached data. It will appear after the next refresh.

LPA3D: 3D Room-Level Scene Generation from In-the-Wild Images Ming-Jia Yang et.al. Updated 2025-04-03

Abstract unavailable in cached data. It will appear after the next refresh.

Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis Niluthpol Chowdhury Mithun et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

BOGausS: Better Optimized Gaussian Splatting Stéphane Pateux et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking Ulas Gunes et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

RealityAvatar: Towards Realistic Loose Clothing Modeling in Animatable 3D Gaussian Avatars Yahui Li et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment Ziteng Cui et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF Jingyu Shi et.al. Updated 2025-04-01

Abstract unavailable in cached data. It will appear after the next refresh.

Neural Pruning for 3D Scene Reconstruction: Efficient NeRF Acceleration Tianqi Ding et.al. Updated 2025-04-01

Abstract unavailable in cached data. It will appear after the next refresh.

NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds Mahan Rafidashti et.al. Updated 2025-04-01

Abstract unavailable in cached data. It will appear after the next refresh.

ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting Wenjie Liu et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

LandMarkSystem Technical Report Zhenxiang Ma et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

NeRF-based Point Cloud Reconstruction using a Stationary Camera for Agricultural Applications Kibon Ku et.al. Updated 2025-03-27

Abstract unavailable in cached data. It will appear after the next refresh.

Refined Geometry-guided Head Avatar Reconstruction from Monocular RGB Video Pilseo Park et.al. Updated 2025-03-27

Abstract unavailable in cached data. It will appear after the next refresh.

HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM Ziren Gong et.al. Updated 2025-03-27

RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting Qiyu Dai et.al. Updated 2025-03-27

UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation Yehui Shen et.al. Updated 2025-03-27

AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports Xiangwen Zhang et.al. Updated 2025-03-26

EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis Sheng Miao et.al. Updated 2025-03-26

CoMapGS: Covisibility Map-based Gaussian Splatting for Sparse Novel View Synthesis Youngkyoon Jang et.al. Updated 2025-03-25

Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals Zhirui Dai et.al. Updated 2025-03-25

MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities Federico Lincetto et.al. Updated 2025-03-25

LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene Xiaoyu Zhang et.al. Updated 2025-03-25

NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting Yulong Zheng et.al. Updated 2025-03-24

NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction Wenyuan Zhang et.al. Updated 2025-03-24

End-to-End Implicit Neural Representations for Classification Alexander Gielisse et.al. Updated 2025-03-23

Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving Junhao Ge et.al. Updated 2025-03-23

PanopticSplatting: End-to-End Panoptic Gaussian Splatting Yuxuan Xie et.al. Updated 2025-03-23

Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping Emanuele Giacomini et.al. Updated 2025-03-21

FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields Kwan Yun et.al. Updated 2025-03-21

DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery Jiadong Tang et.al. Updated 2025-03-21

Digitally Prototype Your Eye Tracker: Simulating Hardware Performance using 3D Synthetic Data Esther Y. H. Lin et.al. Updated 2025-03-20

GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector Zechuan Li et.al. Updated 2025-03-19

MultiBARF: Integrating Imagery of Different Wavelength Regions by Using Neural Radiance Fields Kana Kurata et.al. Updated 2025-03-19

3D Engine-ready Photorealistic Avatars via Dynamic Textures Yifan Wang et.al. Updated 2025-03-19

ClimateGS: Real-Time Climate Simulation with 3D Gaussian Style Transfer Yuezhen Xie et.al. Updated 2025-03-19

Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis Yizhou Li et.al. Updated 2025-03-18

Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios Iryna Repinetska et.al. Updated 2025-03-17

TriDF: Triplane-Accelerated Density Fields for Few-Shot Remote Sensing Novel View Synthesis Jiaming Kang et.al. Updated 2025-03-17

DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction Rui Wang et.al. Updated 2025-03-17

DivCon-NeRF: Generating Augmented Rays with Diversity and Consistency for Few-shot View Synthesis Ingyun Lee et.al. Updated 2025-03-17

FA-BARF: Frequency Adapted Bundle-Adjusting Neural Radiance Fields Rui Qian et.al. Updated 2025-03-15

Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations Xunzhi Zheng et.al. Updated 2025-03-13

AI-assisted 3D Preservation and Reconstruction of Temple Arts Naai-Jung Shih et.al. Updated 2025-03-13

Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation Máté Tóth et.al. Updated 2025-03-12

GAS-NeRF: Geometry-Aware Stylization of Dynamic Radiance Fields Nhat Phuong Anh Vu et.al. Updated 2025-03-11

Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios Zikang Yuan et.al. Updated 2025-03-11

GigaSLAM: Large-Scale Monocular SLAM with Hierarchical Gaussian Splats Kai Deng et.al. Updated 2025-03-11

NeRF-VIO: Map-Based Visual-Inertial Odometry with Initialization Leveraging Neural Radiance Fields Yanyu Zhang et.al. Updated 2025-03-11

Neural Radiance and Gaze Fields for Visual Attention Modeling in 3D Environments Andrei Chubarau et.al. Updated 2025-03-10

CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving Ziliang Xiong et.al. Updated 2025-03-10

Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction Kai Li et.al. Updated 2025-03-08
