Topic

Gaussian Splatting

Updated 2026.03.31 · 627 papers

← Back to topics
SGS-Intrinsic: Semantic-Invariant Gaussian Splatting for Sparse-View Indoor Inverse Rendering Jiahao Niu, Rongjia Zheng, Wenju Xu, WeiShi Zheng, Qing Zhang Updated 2026-03-29

We present SGS-Intrinsic, an indoor inverse rendering framework that works well for sparse-view images. Unlike existing 3D Gaussian Splatting (3DGS) based methods that focus on object-centric reconstruction and fail to work under sparse view settings, our method allows to achieve high-quality geometry reconstruction and accurate disentanglement of material and illumination. The core idea is to construct a dense and geometry-consistent Gaussian semantic field guided by semantic and geometric priors, providing a reliable foundation for subsequent inverse rendering. Building upon this, we perform material-illumination disentanglement by combining a hybrid illumination model and material prior to effectively capture illumination-material interactions. To mitigate the impact of cast shadows and enhance the robustness of material recovery, we introduce illumination-invariant material constraint together with a deshadowing model. Extensive experiments on benchmark datasets show that our method consistently improves both reconstruction fidelity and inverse rendering quality over existing 3DGS-based inverse rendering approaches. Our code is available at https://github.com/GrumpySloths/SGS_Intrinsic.github.io.

Preview loads on expand
DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification Kenji Tojo, Bernd Bickel, Nobuyuki Umetani Updated 2026-03-28

Radiance field reconstruction aims to recover high-quality 3D representations from multi-view RGB images. Recent advances, such as 3D Gaussian splatting, enable real-time rendering with high visual fidelity on sufficiently powerful graphics hardware. However, efficient online transmission and rendering across diverse platforms requires drastic model simplification, reducing the number of primitives by several orders of magnitude. We introduce DiffSoup, a radiance field representation that employs a soup (i.e., a highly unstructured set) of a small number of triangles with neural textures and binary opacity. We show that this binary opacity representation is directly differentiable via stochastic opacity masking, enabling stable training without a mollifier (i.e., smooth rasterization). DiffSoup can be rasterized using standard depth testing, enabling seamless integration into traditional graphics pipelines and interactive rendering on consumer-grade laptops and mobile devices. Code is available at https://github.com/kenji-tojo/diffsoup.

Preview loads on expand
Detailed Geometry and Appearance from Opportunistic Motion Ryosuke Hirai, Kohei Yamashita, Antoine Guédon, Ryo Kawahara, Vincent Lepetit, Ko Nishino Updated 2026-03-27

Reconstructing 3D geometry and appearance from a sparse set of fixed cameras is a foundational task with broad applications, yet it remains fundamentally constrained by the limited viewpoints. We show that this bound can be broken by exploiting opportunistic object motion: as a person manipulates an object~(e.g., moving a chair or lifting a mug), the static cameras effectively ``orbit'' the object in its local coordinate frame, providing additional virtual viewpoints. Harnessing this object motion, however, poses two challenges: the tight coupling of object pose and geometry estimation and the complex appearance variations of a moving object under static illumination. We address these by formulating a joint pose and shape optimization using 2D Gaussian splatting with alternating minimization of 6DoF trajectories and primitive parameters, and by introducing a novel appearance model that factorizes diffuse and specular components with reflected directional probing within the spherical harmonics space. Extensive experiments on synthetic and real-world datasets with extremely sparse viewpoints demonstrate that our method recovers significantly more accurate geometry and appearance than state-of-the-art baselines.

Preview loads on expand
Drive-Through 3D Vehicle Exterior Reconstruction via Dynamic-Scene SfM and Distortion-Aware Gaussian Splatting Nitin Kulkarni, Akhil Devarashetti, Charlie Cluss, Livio Forte, Philip Schneider, Chunming Qiao, Alina Vereshchaka Updated 2026-03-27

High-fidelity 3D reconstruction of vehicle exteriors improves buyer confidence in online automotive marketplaces, but generating these models in cluttered dealership drive-throughs presents severe technical challenges. Unlike static-scene photogrammetry, this setting features a dynamic vehicle moving against heavily cluttered, static backgrounds. This problem is further compounded by wide-angle lens distortion, specular automotive paint, and non-rigid wheel rotations that violate classical epipolar constraints. We propose an end-to-end pipeline utilizing a two-pillar camera rig. First, we resolve dynamic-scene ambiguities by coupling SAM 3 for instance segmentation with motion-gating to cleanly isolate the moving vehicle, explicitly masking out non-rigid wheels to enforce strict epipolar geometry. Second, we extract robust correspondences directly on raw, distorted 4K imagery using the RoMa v2 learned matcher guided by semantic confidence masks. Third, these matches are integrated into a rig-aware SfM optimization that utilizes CAD-derived relative pose priors to eliminate scale drift. Finally, we use a distortion-aware 3D Gaussian Splatting framework (3DGUT) coupled with a stochastic Markov Chain Monte Carlo (MCMC) densification strategy to render reflective surfaces. Evaluations on 25 real-world vehicles across 10 dealerships demonstrate that our full pipeline achieves a PSNR of 28.66 dB, an SSIM of 0.89, and an LPIPS of 0.21 on held-out views, representing a 3.85 dB improvement over standard 3D-GS, delivering inspection-grade interactive 3D models without controlled studio infrastructure.

Preview loads on expand
Scene Grounding In the Wild Tamir Cohen, Leo Segre, Shay Shomer-Chai, Shai Avidan, Hadar Averbuch-Elor Updated 2026-03-27

Reconstructing accurate 3D models of large-scale real-world scenes from unstructured, in-the-wild imagery remains a core challenge in computer vision, especially when the input views have little or no overlap. In such cases, existing reconstruction pipelines often produce multiple disconnected partial reconstructions or erroneously merge non-overlapping regions into overlapping geometry. In this work, we propose a framework that grounds each partial reconstruction to a complete reference model of the scene, enabling globally consistent alignment even in the absence of visual overlap. We obtain reference models from dense, geospatially accurate pseudo-synthetic renderings derived from Google Earth Studio. These renderings provide full scene coverage but differ substantially in appearance from real-world photographs. Our key insight is that, despite this significant domain gap, both domains share the same underlying scene semantics. We represent the reference model using 3D Gaussian Splatting, augmenting each Gaussian with semantic features, and formulate alignment as an inverse feature-based optimization scheme that estimates a global 6DoF pose and scale while keeping the reference model fixed. Furthermore, we introduce the WikiEarth dataset, which registers existing partial 3D reconstructions with pseudo-synthetic reference models. We demonstrate that our approach consistently improves global alignment when initialized with various classical and learning-based pipelines, while mitigating failure modes of state-of-the-art end-to-end models. All code and data will be released.

Preview loads on expand
GLINT: Modeling Scene-Scale Transparency via Gaussian Radiance Transport Youngju Na, Jaeseong Yun, Soohyun Ryu, Hyunsu Kim, Sung-Eui Yoon, Suyong Yeon Updated 2026-03-27

While 3D Gaussian splatting has emerged as a powerful paradigm, it fundamentally fails to model transparency such as glass panels. The core challenge lies in decoupling the intertwined radiance contributions from transparent interfaces and the transmitted geometry observed through the glass. We present GLINT, a framework that models scene-scale transparency through explicit decomposed Gaussian representation. GLINT reconstructs the primary interface and models reflected and transmitted radiance separately, enabling consistent radiance transport. During optimization, GLINT bootstraps transparency localization from geometry-separation cues induced by the decomposition, together with geometry and material priors from a pre-trained video relighting model. Extensive experiments demonstrate consistent improvements over prior methods for reconstructing complex transparent scenes.

Preview loads on expand
R-PGA: Robust Physical Adversarial Camouflage Generation via Relightable 3D Gaussian Splatting Tianrui Lou, Siyuan Liang, Jiawei Liang, Yuze Gao, Xiaochun Cao Updated 2026-03-27

Physical adversarial camouflage poses a severe security threat to autonomous driving systems by mapping adversarial textures onto 3D objects. Nevertheless, current methods remain brittle in complex dynamic scenarios, failing to generalize across diverse geometric (e.g., viewing configurations) and radiometric (e.g., dynamic illumination, atmospheric scattering) variations. We attribute this deficiency to two fundamental limitations in simulation and optimization. First, the reliance on coarse, oversimplified simulations (e.g., via CARLA) induces a significant domain gap, confining optimization to a biased feature space. Second, standard strategies targeting average performance result in a rugged loss landscape, leaving the camouflage vulnerable to configuration shifts.To bridge these gaps, we propose the Relightable Physical 3D Gaussian Splatting (3DGS) based Attack framework (R-PGA). Technically, to address the simulation fidelity issue, we leverage 3DGS to ensure photo-realistic reconstruction and augment it with physically disentangled attributes to decouple intrinsic material from lighting. Furthermore, we design a hybrid rendering pipeline that leverages precise Relightable 3DGS for foreground rendering, while employing a pre-trained image translation model to synthesize plausible relighted backgrounds that align with the relighted foreground.To address the optimization robustness issue, we propose the Hard Physical Configuration Mining (HPCM) module, designed to actively mine worst-case physical configurations and suppress their corresponding loss peaks. This strategy not only diminishes the overall loss magnitude but also effectively flattens the rugged loss landscape, ensuring consistent adversarial effectiveness and robustness across varying physical configurations.

Preview loads on expand
arg-VU: Affordance Reasoning with Physics-Aware 3D Geometry for Visual Understanding in Robotic Surgery Nan Xiao, Yunxin Fan, Farong Wang, Fei Liu Updated 2026-03-26

Affordance reasoning provides a principled link between perception and action, yet remains underexplored in surgical robotics, where tissues are highly deformable, compliant, and dynamically coupled with tool motion. We present arg-VU, a physics-aware affordance reasoning framework that integrates temporally consistent geometry tracking with constraint-induced mechanical modeling for surgical visual understanding. Surgical scenes are reconstructed using 3D Gaussian Splatting (3DGS) and converted into a temporally tracked surface representation. Extended Position-Based Dynamics (XPBD) embeds local deformation constraints and produces representative geometry points (RGPs) whose constraint sensitivities define anisotropic stiffness metrics capturing the local constraint-manifold geometry. Robotic tool poses in SE(3) are incorporated to compute rigidly induced displacements at RGPs, from which we derive two complementary measures: a physics-aware compliance energy that evaluates mechanical feasibility with respect to local deformation constraints, and a positional agreement score that captures motion alignment (as kinematic motion baseline). Experiments on surgical video datasets show that arg-VU yields more stable, physically consistent, and interpretable affordance predictions than kinematic baselines. These results demonstrate that physics-aware geometric representations enable reliable affordance reasoning for deformable surgical environments and support embodied robotic interaction.

Preview loads on expand
Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting Yixing Lao, Xuyang Bai, Xiaoyang Wu, Nuoyuan Yan, Zixin Luo, Tian Fang, Jean-Daniel Nahmias, Yanghai Tsin, Shiwei Li, Hengshuang Zhao Updated 2026-03-26

Existing feed-forward 3D Gaussian Splatting methods predict pixel-aligned primitives, leading to a quadratic growth in primitive count as resolution increases. This fundamentally limits their scalability, making high-resolution synthesis such as 4K intractable. We introduce LGTM (Less Gaussians, Texture More), a feed-forward framework that overcomes this resolution scaling barrier. By predicting compact Gaussian primitives coupled with per-primitive textures, LGTM decouples geometric complexity from rendering resolution. This approach enables high-fidelity 4K novel view synthesis without per-scene optimization, a capability previously out of reach for feed-forward methods, all while using significantly fewer Gaussian primitives. Project page: https://yxlao.github.io/lgtm/

Preview loads on expand
ViewSplat: View-Adaptive Dynamic Gaussian Splatting for Feed-Forward Synthesis Moonyeon Jeong, Seunggi Min, Suhyeon Lee, Hongje Seong Updated 2026-03-26

We present ViewSplat, a view-adaptive 3D Gaussian splatting network for novel view synthesis from unposed images. While recent feed-forward 3D Gaussian splatting has significantly accelerated 3D scene reconstruction by bypassing per-scene optimization, a fundamental fidelity gap remains. We attribute this bottleneck to the limited capacity of single-step feed-forward networks to regress static Gaussian primitives that satisfy all viewpoints. To address this limitation, we shift the paradigm from static primitive regression to view-adaptive dynamic splatting. Instead of a rigid Gaussian representation, our pipeline learns a view-adaptable latent representation. Specifically, ViewSplat initially predicts base Gaussian primitives alongside the weights of dynamic MLPs. During rendering, these MLPs take target view coordinates as input and predict view-dependent residual updates for each Gaussian attribute (i.e., 3D position, scale, rotation, opacity, and color). This mechanism, which we term view-adaptive dynamic splatting, allows each primitive to rectify initial estimation errors, effectively capturing high-fidelity appearances. Extensive experiments demonstrate that ViewSplat achieves state-of-the-art fidelity while maintaining fast inference (17 FPS) and real-time rendering (154 FPS).

Preview loads on expand
SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision Avigail Cohen Rimon, Amir Mann, Mirela Ben Chen, Or Litany Updated 2026-03-25

3D Gaussian Splatting (3DGS) enables real-time, photorealistic novel view synthesis, making it a highly attractive representation for model-based video tracking. However, leveraging the differentiability of the 3DGS renderer "in the wild" remains notoriously fragile. A fundamental bottleneck lies in the compact, local support of the Gaussian primitives. Standard photometric objectives implicitly rely on spatial overlap; if severe camera misalignment places the rendered object outside the target's local footprint, gradients strictly vanish, leaving the optimizer stranded. We introduce SpectralSplats, a robust tracking framework that resolves this "vanishing gradient" problem by shifting the optimization objective from the spatial to the frequency domain. By supervising the rendered image via a set of global complex sinusoidal features (Spectral Moments), we construct a global basin of attraction, ensuring that a valid, directional gradient toward the target exists across the entire image domain, even when pixel overlap is completely nonexistent. To harness this global basin without introducing periodic local minima associated with high frequencies, we derive a principled Frequency Annealing schedule from first principles, gracefully transitioning the optimizer from global convexity to precise spatial alignment. We demonstrate that SpectralSplats acts as a seamless, drop-in replacement for spatial losses across diverse deformation parameterizations (from MLPs to sparse control points), successfully recovering complex deformations even from severely misaligned initializations where standard appearance-based tracking catastrophically fails.

Preview loads on expand
FilterGS: Traversal-Free Parallel Filtering and Adaptive Shrinking for Large-Scale LoD 3D Gaussian Splatting Yixian Wang, Haolin Yu, Jiadong Tang, Yu Gao, Xihan Wang, Yufeng Yue, Yi Yang Updated 2026-03-25

3D Gaussian Splatting has revolutionized neural rendering with real-time performance. However, scaling this approach to large scenes using Level-of-Detail methods faces critical challenges: inefficient serial traversal consuming over 60\% of rendering time, and redundant Gaussian-tile pairs that incur unnecessary processing overhead. To address these limitations, we introduce FilterGS, featuring a parallel filtering mechanism with two complementary filters that select Gaussian elements efficiently without tree traversal. Additionally, we propose a novel GTC metric that quantifies the redundancy of Gaussian-tile key-value pairs. Based on this metric, we introduce a scene-adaptive Gaussian shrinking strategy that effectively reduces redundant pairs. Extensive experiments demonstrate that FilterGS achieves state-of-the-art rendering speeds while maintaining competitive visual quality across multiple large-scale datasets. Project page: https://github.com/xenon-w/FilterGS

Preview loads on expand
Instrument-Splatting++: Towards Controllable Surgical Instrument Digital Twin Using Gaussian Splatting Shuojue Yang, Zijian Wu, Chengjiaao Liao, Qian Li, Daiyun Shen, Chang Han Low, Septimiu E. Salcudean, Yueming Jin Updated 2026-03-25

High-quality and controllable digital twins of surgical instruments are critical for Real2Sim in robot-assisted surgery, as they enable realistic simulation, synthetic data generation, and perception learning under novel poses. We present Instrument-Splatting++, a monocular 3D Gaussian Splatting (3DGS) framework that reconstructs surgical instruments as a fully controllable Gaussian asset with high fidelity. Our pipeline starts with part-wise geometry pretraining that injects CAD priors into Gaussian primitives and equips the representation with part-aware semantic rendering. Built on the pretrained model, we propose a semantics-aware pose estimation and tracking (SAPET) method to recover per-frame 6-DoF pose and joint angles from unposed endoscopic videos, where a gripper-tip network trained purely from synthetic semantics provides robust supervision and a loose regularization suppresses singular articulations. Finally, we introduce Robust Texture Learning (RTL), which alternates pose refinement and robust appearance optimization, mitigating pose noise during texture learning. The proposed framework can perform pose estimation and learn realistic texture from unposed videos. We validate our method on sequences extracted from EndoVis17/18, SAR-RARP, and an in-house dataset, showing superior photometric quality and improved geometric accuracy over state-of-the-art baselines. We further demonstrate a downstream keypoint detection task where unseen-pose data augmentation from our controllable instrument Gaussian improves performance.

Preview loads on expand
AdvSplat: Adversarial Attacks on Feed-Forward Gaussian Splatting Models Yiran Qiao, Yiren Lu, Yunlai Zhou, Rui Yang, Linlin Hou, Yu Yin, Jing Ma Updated 2026-03-24

3D Gaussian Splatting (3DGS) is increasingly recognized as a powerful paradigm for real-time, high-fidelity 3D reconstruction. However, its per-scene optimization pipeline limits scalability and generalization, and prevents efficient inference. Recently emerged feed-forward 3DGS models address these limitations by enabling fast reconstruction from a few input views after large-scale pretraining, without scene-specific optimization. Despite their advantages and strong potential for commercial deployment, the use of neural networks as the backbone also amplifies the risk of adversarial manipulation. In this paper, we introduce AdvSplat, the first systematic study of adversarial attacks on feed-forward 3DGS. We first employ white-box attacks to reveal fundamental vulnerabilities of this model family. We then develop two improved, practically relevant, query-efficient black-box algorithms that optimize pixel-space perturbations via a frequency-domain parameterization: one based on gradient estimation and the other gradient-free, without requiring any access to model internals. Extensive experiments across multiple datasets demonstrate that AdvSplat can significantly disrupt reconstruction results by injecting imperceptible perturbations into the input images. Our findings surface an overlooked yet urgent problem in this domain, and we hope to draw the community's attention to this emerging security and robustness challenge.

Preview loads on expand
Stochastic Ray Tracing for the Reconstruction of 3D Gaussian Splatting Peiyu Xu, Xin Sun, Krishna Mullia, Raymond Fei, Iliyan Georgiev, Shuang Zhao Updated 2026-03-24

Ray-tracing-based 3D Gaussian splatting (3DGS) methods overcome the limitations of rasterization -- rigid pinhole camera assumptions, inaccurate shadows, and lack of native reflection or refraction -- but remain slower due to the cost of sorting all intersecting Gaussians along every ray. Moreover, existing ray-tracing methods still rely on rasterization-style approximations such as shadow mapping for relightable scenes, undermining the generality that ray tracing promises. We present a differentiable, sorting-free stochastic formulation for ray-traced 3DGS -- the first framework that uses stochastic ray tracing to both reconstruct and render standard and relightable 3DGS scenes. At its core is an unbiased Monte Carlo estimator for pixel-color gradients that evaluates only a small sampled subset of Gaussians per ray, bypassing the need for sorting. For standard 3DGS, our method matches the reconstruction quality and speed of rasterization-based 3DGS while substantially outperforming sorting-based ray tracing. For relightable 3DGS, the same stochastic estimator drives per-Gaussian shading with fully ray-traced shadow rays, delivering notably higher reconstruction fidelity than prior work.

Preview loads on expand
Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors Chuanqing Zhuang, Xin Lu, Zehui Deng, Zhengda Lu, Yiqun Wang, Junqi Diao, Jun Xiao Updated 2026-03-24

Omnidirectional 3D Gaussian Splatting with panoramas is a key technique for 3D scene representation, and existing methods typically rely on slow SfM to provide camera poses and sparse points priors. In this work, we propose a pose-free omnidirectional 3DGS method, named PFGS360, that reconstructs 3D Gaussians from unposed omnidirectional videos. To achieve accurate camera pose estimation, we first construct a spherical consistency-aware pose estimation module, which recovers poses by establishing consistent 2D-3D correspondences between the reconstructed Gaussians and the unposed images using Gaussians' internal depth priors. Besides, to enhance the fidelity of novel view synthesis, we introduce a depth-inlier-aware densification module to extract depth inliers and Gaussian outliers with consistent monocular depth priors, enabling efficient Gaussian densification and achieving photorealistic novel view synthesis. The experiments show significant outperformance over existing pose-free and pose-aware 3DGS methods on both real-world and synthetic 360-degree videos. Code is available at https://github.com/zcq15/PFGS360.

Preview loads on expand
GTLR-GS: Geometry-Texture Aware LiDAR-Regularized 3D Gaussian Splatting for Realistic Scene Reconstruction Yan Fang, Jianfei Ge, Jiangjian Xiao Updated 2026-03-24

Recent advances in 3D Gaussian Splatting (3DGS) have enabled real-time, photorealistic scene reconstruction. However, conventional 3DGS frameworks typically rely on sparse point clouds derived from Structure-from-Motion (SfM), which inherently suffer from scale ambiguity, limited geometric consistency, and strong view dependency due to the lack of geometric priors. In this work, a LiDAR-centric 3D Gaussian Splatting framework is proposed that explicitly incorporates metric geometric priors into the entire Gaussian optimization process. Instead of treating LiDAR data as a passive initialization source, 3DGS optimization is reformulated as a geometry-conditioned allocation and refinement problem under a fixed representational budget. Specifically, this work introduces (i) a geometry-texture-aware allocation strategy that selectively assigns Gaussian primitives to regions with high structural or appearance complexity, (ii) a curvature-adaptive refinement mechanism that dynamically guides Gaussian splitting toward geometrically complex areas during training, and (iii) a confidence-aware metric depth regularization that anchors the reconstructed geometry to absolute scale using LiDAR measurements while maintaining optimization stability. Extensive experiments on the ScanNet++ dataset and a custom real-world dataset validate the proposed approach. The results demonstrate state-of-the-art performance in metric-scale reconstruction with high geometric fidelity.

Preview loads on expand
PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding Lirong Che, Zhenfeng Gan, Yanbo Chen, Junbo Tan, Xueqian Wang Updated 2026-03-24

Embodied agents for creative tasks like photography must bridge the semantic gap between high-level language commands and geometric control. We introduce PhotoAgent, an agent that achieves this by integrating Large Multimodal Models (LMMs) reasoning with a novel control paradigm. PhotoAgent first translates subjective aesthetic goals into solvable geometric constraints via LMM-driven, chain-of-thought (CoT) reasoning, allowing an analytical solver to compute a high-quality initial viewpoint. This initial pose is then iteratively refined through visual reflection within a photorealistic internal world model built with 3D Gaussian Splatting (3DGS). This ``mental simulation'' replaces costly and slow physical trial-and-error, enabling rapid convergence to aesthetically superior results. Evaluations confirm that PhotoAgent excels in spatial reasoning and achieves superior final image quality.

Preview loads on expand
Predictive Photometric Uncertainty in Gaussian Splatting for Novel View Synthesis Chamuditha Jayanga Galappaththige, Thomas Gottwald, Peter Stehr, Edgar Heinert, Niko Suenderhauf, Dimity Miller, Matthias Rottmann Updated 2026-03-24

Recent advances in 3D Gaussian Splatting have enabled impressive photorealistic novel view synthesis. However, to transition from a pure rendering engine to a reliable spatial map for autonomous agents and safety-critical applications, knowing where the representation is uncertain is as important as the rendering fidelity itself. We bridge this critical gap by introducing a lightweight, plug-and-play framework for pixel-wise, view-dependent predictive uncertainty estimation. Our post-hoc method formulates uncertainty as a Bayesian-regularized linear least-squares optimization over reconstruction residuals. This architecture-agnostic approach extracts a per-primitive uncertainty channel without modifying the underlying scene representation or degrading baseline visual fidelity. Crucially, we demonstrate that providing this actionable reliability signal successfully translates 3D Gaussian splatting into a trustworthy spatial map, further improving state-of-the-art performance across three critical downstream perception tasks: active view selection, pose-agnostic scene change detection, and pose-agnostic anomaly detection.

Preview loads on expand
Drop-In Perceptual Optimization for 3D Gaussian Splatting Ezgi Ozyilkan, Zhiqi Chen, Oren Rippel, Jona Ballé, Kedar Tatwawadi Updated 2026-03-23

Despite their output being ultimately consumed by human viewers, 3D Gaussian Splatting (3DGS) methods often rely on ad-hoc combinations of pixel-level losses, resulting in blurry renderings. To address this, we systematically explore perceptual optimization strategies for 3DGS by searching over a diverse set of distortion losses. We conduct the first-of-its-kind large-scale human subjective study on 3DGS, involving 39,320 pairwise ratings across several datasets and 3DGS frameworks. A regularized version of Wasserstein Distortion, which we call WD-R, emerges as the clear winner, excelling at recovering fine textures without incurring a higher splat count. WD-R is preferred by raters more than $2.3\times$ over the original 3DGS loss, and $1.5\times$ over current best method Perceptual-GS. WD-R also consistently achieves state-of-the-art LPIPS, DISTS, and FID scores across various datasets, and generalizes across recent frameworks, such as Mip-Splatting and Scaffold-GS, where replacing the original loss with WD-R consistently enhances perceptual quality within a similar resource budget (number of splats for Mip-Splatting, model size for Scaffold-GS), and leads to reconstructions being preferred by human raters $1.8\times$ and $3.6\times$, respectively. We also find that this carries over to the task of 3DGS scene compression, with $\approx 50\%$ bitrate savings for comparable perceptual metric performance.

Preview loads on expand
Matryoshka Gaussian Splatting Zhilin Guo, Boqiao Zhang, Hakan Aktas, Kyle Fogarty, Jeffrey Hu, Nursena Koprucu Aslan, Wenzhao Li, Canberk Baykal, Albert Miao, Josef Bengtson, Chenliang Zhou, Weihao Xia, Cristina Nader Vasconcelos. Cengiz Oztireli Updated 2026-03-19

The ability to render scenes at adjustable fidelity from a single model, known as level of detail (LoD), is crucial for practical deployment of 3D Gaussian Splatting (3DGS). Existing discrete LoD methods expose only a limited set of operating points, while concurrent continuous LoD approaches enable smoother scaling but often suffer noticeable quality degradation at full capacity, making LoD a costly design decision. We introduce Matryoshka Gaussian Splatting (MGS), a training framework that enables continuous LoD for standard 3DGS pipelines without sacrificing full-capacity rendering quality. MGS learns a single ordered set of Gaussians such that rendering any prefix, the first k splats, produces a coherent reconstruction whose fidelity improves smoothly with increasing budget. Our key idea is stochastic budget training: each iteration samples a random splat budget and optimises both the corresponding prefix and the full set. This strategy requires only two forward passes and introduces no architectural modifications. Experiments across four benchmarks and six baselines show that MGS matches the full-capacity performance of its backbone while enabling a continuous speed-quality trade-off from a single model. Extensive ablations on ordering strategies, training objectives, and model capacity further validate the designs.

Preview loads on expand
Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting Yiren Lu, Xin Ye, Burhaneddin Yaman, Jingru Luo, Zhexiao Xiong, Liu Ren, Yu Yin Updated 2026-03-19

Bird's-Eye-View (BEV) perception serves as a cornerstone for autonomous driving, offering a unified spatial representation that fuses surrounding-view images to enable reasoning for various downstream tasks, such as semantic segmentation, 3D object detection, and motion prediction. However, most existing BEV perception frameworks adopt an end-to-end training paradigm, where image features are directly transformed into the BEV space and optimized solely through downstream task supervision. This formulation treats the entire perception process as a black box, often lacking explicit 3D geometric understanding and interpretability, leading to suboptimal performance. In this paper, we claim that an explicit 3D representation matters for accurate BEV perception, and we propose Splat2BEV, a Gaussian Splatting-assisted framework for BEV tasks. Splat2BEV aims to learn BEV feature representations that are both semantically rich and geometrically precise. We first pre-train a Gaussian generator that explicitly reconstructs 3D scenes from multi-view inputs, enabling the generation of geometry-aligned feature representations. These representations are then projected into the BEV space to serve as inputs for downstream tasks. Extensive experiments on nuScenes and argoverse dataset demonstrate that Splat2BEV achieves state-of-the-art performance and validate the effectiveness of incorporating explicit 3D reconstruction into BEV perception.

Preview loads on expand
GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning Yiren Lu, Yi Du, Disheng Liu, Yunlai Zhou, Chen Wang, Yu Yin Updated 2026-03-19

Effective embodied exploration requires agents to accumulate and retain spatial knowledge over time. However, existing scene representations, such as discrete scene graphs or static view-based snapshots, lack \textit{post-hoc re-observability}. If an initial observation misses a target, the resulting memory omission is often irrecoverable. To bridge this gap, we propose \textbf{GSMem}, a zero-shot embodied exploration and reasoning framework built upon 3D Gaussian Splatting (3DGS). By explicitly parameterizing continuous geometry and dense appearance, 3DGS serves as a persistent spatial memory that endows the agent with \textit{Spatial Recollection}: the ability to render photorealistic novel views from optimal, previously unoccupied viewpoints. To operationalize this, GSMem employs a retrieval mechanism that simultaneously leverages parallel object-level scene graphs and semantic-level language fields. This complementary design robustly localizes target regions, enabling the agent to ``hallucinate'' optimal views for high-fidelity Vision-Language Model (VLM) reasoning. Furthermore, we introduce a hybrid exploration strategy that combines VLM-driven semantic scoring with a 3DGS-based coverage objective, balancing task-aware exploration with geometric coverage. Extensive experiments on embodied question answering and lifelong navigation demonstrate the robustness and effectiveness of our framework

Preview loads on expand
GHOST: Fast Category-agnostic Hand-Object Interaction Reconstruction from RGB Videos using Gaussian Splatting Ahmed Tawfik Aboukhadra, Marcel Rogge, Nadia Robertini, Abdalla Arafa, Jameel Malik, Ahmed Elhayek, Didier Stricker Updated 2026-03-19

Understanding realistic hand-object interactions from monocular RGB videos is essential for AR/VR, robotics, and embodied AI. Existing methods rely on category-specific templates or heavy computation, yet still produce physically inconsistent hand-object alignment in 3D. We introduce GHOST (Gaussian Hand-Object Splatting), a fast, category-agnostic framework for reconstructing dynamic hand-object interactions using 2D Gaussian Splatting. GHOST represents both hands and objects as dense, view-consistent Gaussian discs and introduces three key innovations: (1) a geometric-prior retrieval and consistency loss that completes occluded object regions, (2) a grasp-aware alignment that refines hand translations and object scale to ensure realistic contact, and (3) a hand-aware background loss that prevents penalizing hand-occluded object regions. GHOST achieves complete, physically consistent, and animatable reconstructions from a single RGB video while running an order of magnitude faster than prior category-agnostic methods. Extensive experiments on ARCTIC, HO3D, and in-the-wild datasets demonstrate state-of-the-art accuracy in 3D reconstruction and 2D rendering quality, establishing GHOST as an efficient and robust solution for realistic hand-object interaction modeling. Code is available at https://github.com/ATAboukhadra/GHOST.

Preview loads on expand
From ex(p) to poly: Gaussian Splatting with Polynomial Kernels Joerg H. Mueller, Martin Winter, Markus Steinberger Updated 2026-03-19

Recent advancements in Gaussian Splatting (3DGS) have introduced various modifications to the original kernel, resulting in significant performance improvements. However, many of these kernel changes are incompatible with existing datasets optimized for the original Gaussian kernel, presenting a challenge for widespread adoption. In this work, we address this challenge by proposing an alternative kernel that maintains compatibility with existing datasets while improving computational efficiency. Specifically, we replace the original exponential kernel with a polynomial approximation combined with a ReLU function. This modification allows for more aggressive culling of Gaussians, leading to enhanced performance across different 3DGS implementations. Our results show a notable performance improvement of 4 to 15% with negligible impact on image quality. We also provide a detailed mathematical analysis of the new kernel and discuss its potential benefits for 3DGS implementations on NPU hardware.

Preview loads on expand
OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting Hongjia Zhai, Qi Zhang, Xiaokun Pan, Xiyu Zhang, Yitong Dong, Huaqi Zhang, Dan Xu, Guofeng Zhang Updated 2026-03-19

Open-vocabulary scene understanding with online panoptic mapping is essential for embodied applications to perceive and interact with environments. However, existing methods are predominantly offline or lack instance-level understanding, limiting their applicability to real-world robotic tasks. In this paper, we propose OnlinePG, a novel and effective system that integrates geometric reconstruction and open-vocabulary perception using 3D Gaussian Splatting in an online setting. Technically, to achieve online panoptic mapping, we employ an efficient local-to-global paradigm with a sliding window. To build local consistency map, we construct a 3D segment clustering graph that jointly leverages geometric and semantic cues, fusing inconsistent segments within sliding window into complete instances. Subsequently, to update the global map, we construct explicit grids with spatial attributes for the local 3D Gaussian map and fuse them into the global map via robust bidirectional bipartite 3D Gaussian instance matching. Finally, we utilize the fused VLM features inside the 3D spatial attribute grids to achieve open-vocabulary scene understanding. Extensive experiments on widely used datasets demonstrate that our method achieves better performance among online approaches, while maintaining real-time efficiency.

Preview loads on expand
Inst4DGS: Instance-Decomposed 4D Gaussian Splatting with Multi-Video Label Permutation Learning Yonghan Lee, Dinesh Manocha Updated 2026-03-19

We present Inst4DGS, an instance-decomposed 4D Gaussian Splatting (4DGS) approach with long-horizon per-Gaussian trajectories. While dynamic 4DGS has advanced rapidly, instance-decomposed 4DGS remains underexplored, largely due to the difficulty of associating inconsistent instance labels across independently segmented multi-view videos. We address this challenge by introducing per-video label-permutation latents that learn cross-video instance matches through a differentiable Sinkhorn layer, enabling direct multi-view supervision with consistent identity preservation. This explicit label alignment yields sharp decision boundaries and temporally stable identities without identity drift. To further improve efficiency, we propose instance-decomposed motion scaffolds that provide low-dimensional motion bases per object for long-horizon trajectory optimization. Experiments on Panoptic Studio and Neural3DV show that Inst4DGS jointly supports tracking and instance decomposition while achieving state-of-the-art rendering and segmentation quality. On the Panoptic Studio dataset, Inst4DGS improves PSNR from 26.10 to 28.36, and instance mIoU from 0.6310 to 0.9129, over the strongest baseline.

Preview loads on expand
Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting Guillem Casadesus Vila, Adam Dai, Grace Gao Updated 2026-03-18

Navigation and mapping on the lunar surface require robust perception under challenging conditions, including poorly textured environments, high-contrast lighting, and limited computational resources. This paper presents a real-time mapping framework that integrates dense perception models with a 3D Gaussian Splatting (3DGS) representation. We first benchmark several models on synthetic datasets generated with the LuPNT simulator, selecting a stereo dense depth estimation model based on Gated Recurrent Units for its balance of speed and accuracy in depth estimation, and a convolutional neural network for its superior performance in detecting semantic segments. Using ground truth poses to decouple the local scene understanding from the global state estimation, our pipeline reconstructs a 120-meter traverse with a geometric height accuracy of approximately 3 cm, outperforming a traditional point cloud baseline without LiDAR. The resulting 3DGS map enables novel view synthesis and serves as a foundation for a full SLAM system, where its capacity for joint map and pose optimization would offer significant advantages. Our results demonstrate that combining semantic segmentation and dense depth estimation with learned map representations is an effective approach for creating detailed, large-scale maps to support future lunar surface missions.

Preview loads on expand
AHOY! Animatable Humans under Occlusion from YouTube Videos with Gaussian Splatting and Video Diffusion Priors Aymen Mir, Riza Alp Guler, Xiangjun Tang, Peter Wonka, Gerard Pons-Moll Updated 2026-03-18

We present AHOY, a method for reconstructing complete, animatable 3D Gaussian avatars from in-the-wild monocular video despite heavy occlusion. Existing methods assume unoccluded input-a fully visible subject, often in a canonical pose-excluding the vast majority of real-world footage where people are routinely occluded by furniture, objects, or other people. Reconstructing from such footage poses fundamental challenges: large body regions may never be observed, and multi-view supervision per pose is unavailable. We address these challenges with four contributions: (i) a hallucination-as-supervision pipeline that uses identity-finetuned diffusion models to generate dense supervision for previously unobserved body regions; (ii) a two-stage canonical-to-pose-dependent architecture that bootstraps from sparse observations to full pose-dependent Gaussian maps; (iii) a map-pose/LBS-pose decoupling that absorbs multi-view inconsistencies from the generated data; (iv) a head/body split supervision strategy that preserves facial identity. We evaluate on YouTube videos and on multi-view capture data with significant occlusion and demonstrate state-of-the-art reconstruction quality. We also demonstrate that the resulting avatars are robust enough to be animated with novel poses and composited into 3DGS scenes captured using cell-phone video. Our project page is available at https://miraymen.github.io/ahoy/

Preview loads on expand
CrowdGaussian: Reconstructing High-Fidelity 3D Gaussians for Human Crowd from a Single Image Yizheng Song, Yiyu Zhuang, Qipeng Xu, Haixiang Wang, Jiahe Zhu, Jing Tian, Siyu Zhu, Hao Zhu Updated 2026-03-18

Single-view 3D human reconstruction has garnered significant attention in recent years. Despite numerous advancements, prior research has concentrated on reconstructing 3D models from clear, close-up images of individual subjects, often yielding subpar results in the more prevalent multi-person scenarios. Reconstructing 3D human crowd models is a highly intricate task, laden with challenges such as: 1) extensive occlusions, 2) low clarity, and 3) numerous and various appearances. To address this task, we propose CrowdGaussian, a unified framework that directly reconstructs multi-person 3D Gaussian Splatting (3DGS) representations from single-image inputs. To handle occlusions, we devise a self-supervised adaptation pipeline that enables the pretrained large human model to reconstruct complete 3D humans with plausible geometry and appearance from heavily occluded inputs. Furthermore, we introduce Self-Calibrated Learning (SCL). This training strategy enables single-step diffusion models to adaptively refine coarse renderings to optimal quality by blending identity-preserving samples with clean/corrupted image pairs. The outputs can be distilled back to enhance the quality of multi-person 3DGS representations. Extensive experiments demonstrate that CrowdGaussian generates photorealistic, geometrically coherent reconstructions of multi-person scenes.

Preview loads on expand
Spectral Defense Against Resource-Targeting Attack in 3D Gaussian Splatting Yang Chen, Yi Yu, Jiaming He, Yueqi Duan, Zheng Zhu, Yap-Peng Tan Updated 2026-03-13

Recent advances in 3D Gaussian Splatting (3DGS) deliver high-quality rendering, yet the Gaussian representation exposes a new attack surface, the resource-targeting attack. This attack poisons training images, excessively inducing Gaussian growth to cause resource exhaustion. Although efficiency-oriented methods such as smoothing, thresholding, and pruning have been explored, these spatial-domain strategies operate on visible structures but overlook how stealthy perturbations distort the underlying spectral behaviors of training data. As a result, poisoned inputs introduce abnormal high-frequency amplifications that mislead 3DGS into interpreting noisy patterns as detailed structures, ultimately causing unstable Gaussian overgrowth and degraded scene fidelity. To address this, we propose \textbf{Spectral Defense} in Gaussian and image fields. We first design a 3D frequency filter to selectively prune Gaussians exhibiting abnormally high frequencies. Since natural scenes also contain legitimate high-frequency structures, directly suppressing high frequencies is insufficient, and we further develop a 2D spectral regularization on renderings, distinguishing naturally isotropic frequencies while penalizing anisotropic angular energy to constrain noisy patterns. Experiments show that our defense builds robust, accurate, and secure 3DGS, suppressing overgrowth by up to $5.92\times$, reducing memory by up to $3.66\times$, and improving speed by up to $4.34\times$ under attacks.

Preview loads on expand
LR-SGS: Robust LiDAR-Reflectance-Guided Salient Gaussian Splatting for Self-Driving Scene Reconstruction Ziyu Chen, Fan Zhu, Hui Zhu, Deyi Kong, Xinkai Kuang, Yujia Zhang, Chunmao Jiang Updated 2026-03-13

Recent 3D Gaussian Splatting (3DGS) methods have demonstrated the feasibility of self-driving scene reconstruction and novel view synthesis. However, most existing methods either rely solely on cameras or use LiDAR only for Gaussian initialization or depth supervision, while the rich scene information contained in point clouds, such as reflectance, and the complementarity between LiDAR and RGB have not been fully exploited, leading to degradation in challenging self-driving scenes, such as those with high ego-motion and complex lighting. To address these issues, we propose a robust and efficient LiDAR-reflectance-guided Salient Gaussian Splatting method (LR-SGS) for self-driving scenes, which introduces a structure-aware Salient Gaussian representation, initialized from geometric and reflectance feature points extracted from LiDAR and refined through a salient transform and improved density control to capture edge and planar structures. Furthermore, we calibrate LiDAR intensity into reflectance and attach it to each Gaussian as a lighting-invariant material channel, jointly aligned with RGB to enforce boundary consistency. Extensive experiments on the Waymo Open Dataset demonstrate that LR-SGS achieves superior reconstruction performance with fewer Gaussians and shorter training time. In particular, on Complex Lighting scenes, our method surpasses OmniRe by 1.18 dB PSNR.

Preview loads on expand
AstroSplat: Physics-Based Gaussian Splatting for Rendering and Reconstruction of Small Celestial Bodies Jennifer Nolan, Travis Driver, John Christian Updated 2026-03-12

Image-based surface reconstruction and characterization are crucial for missions to small celestial bodies (e.g., asteroids), as it informs mission planning, navigation, and scientific analysis. Recent advances in Gaussian splatting enable high-fidelity neural scene representations but typically rely on a spherical harmonic intensity parameterization that is strictly appearance-based and does not explicitly model material properties or light-surface interactions. We introduce AstroSplat, a physics-based Gaussian splatting framework that integrates planetary reflectance models to improve the autonomous reconstruction and photometric characterization of small-body surfaces from in-situ imagery. The proposed framework is validated on real imagery taken by NASA's Dawn mission, where we demonstrate superior rendering performance and surface reconstruction accuracy compared to the typical spherical harmonic parameterization.

Preview loads on expand
Mango-GS: Enhancing Spatio-Temporal Consistency in Dynamic Scenes Reconstruction using Multi-Frame Node-Guided 4D Gaussian Splatting Tingxuan Huang, Haowei Zhu, Jun-hai Yong, Hao Pan, Bin Wang Updated 2026-03-12

Reconstructing dynamic 3D scenes with photorealistic detail and strong temporal coherence remains a significant challenge. Existing Gaussian splatting approaches for dynamic scene modeling often rely on per-frame optimization, which can overfit to instantaneous states instead of capturing underlying motion dynamics. To address this, we present Mango-GS, a multi-frame, node-guided framework for high-fidelity 4D reconstruction. Mango-GS leverages a temporal Transformer to model motion dependencies within a short window of frames, producing temporally consistent deformations. For efficiency, temporal modeling is confined to a sparse set of control nodes. Each node is represented by a decoupled canonical position and a latent code, providing a stable semantic anchor for motion propagation and preventing correspondence drift under large motion. Our framework is trained end-to-end, enhanced by an input masking strategy and two multi-frame losses to improve robustness. Extensive experiments demonstrate that Mango-GS achieves state-of-the-art reconstruction quality and real-time rendering speed, enabling high-fidelity reconstruction and interactive rendering of dynamic scenes.

Preview loads on expand
Mobile-GS: Real-time Gaussian Splatting for Mobile Devices Xiaobiao Du, Yida Wang, Kun Zhan, Xin Yu Updated 2026-03-12

3D Gaussian Splatting (3DGS) has emerged as a powerful representation for high-quality rendering across a wide range of applications.However, its high computational demands and large storage costs pose significant challenges for deployment on mobile devices. In this work, we propose a mobile-tailored real-time Gaussian Splatting method, dubbed Mobile-GS, enabling efficient inference of Gaussian Splatting on edge devices. Specifically, we first identify alpha blending as the primary computational bottleneck, since it relies on the time-consuming Gaussian depth sorting process. To solve this issue, we propose a depth-aware order-independent rendering scheme that eliminates the need for sorting, thereby substantially accelerating rendering. Although this order-independent rendering improves rendering speed, it may introduce transparency artifacts in regions with overlapping geometry due to the scarcity of rendering order. To address this problem, we propose a neural view-dependent enhancement strategy, enabling more accurate modeling of view-dependent effects conditioned on viewing direction, 3D Gaussian geometry, and appearance attributes. In this way, Mobile-GS can achieve both high-quality and real-time rendering. Furthermore, to facilitate deployment on memory-constrained mobile platforms, we also introduce first-order spherical harmonics distillation, a neural vector quantization technique, and a contribution-based pruning strategy to reduce the number of Gaussian primitives and compress the 3D Gaussian representation with the assistance of neural networks. Extensive experiments demonstrate that our proposed Mobile-GS achieves real-time rendering and compact model size while preserving high visual quality, making it well-suited for mobile applications.

Preview loads on expand
InstantHDR: Single-forward Gaussian Splatting for High Dynamic Range 3D Reconstruction Dingqiang Ye, Jiacong Xu, Jianglu Ping, Yuxiang Guo, Chao Fan, Vishal M. Patel Updated 2026-03-11

High dynamic range (HDR) novel view synthesis (NVS) aims to reconstruct HDR scenes from multi-exposure low dynamic range (LDR) images. Existing HDR pipelines heavily rely on known camera poses, well-initialized dense point clouds, and time-consuming per-scene optimization. Current feed-forward alternatives overlook the HDR problem by assuming exposure-invariant appearance. To bridge this gap, we propose InstantHDR, a feed-forward network that reconstructs 3D HDR scenes from uncalibrated multi-exposure LDR collections in a single forward pass. Specifically, we design a geometry-guided appearance modeling for multi-exposure fusion, and a meta-network for generalizable scene-specific tone mapping. Due to the lack of HDR scene data, we build a pre-training dataset, called HDR-Pretrain, for generalizable feed-forward HDR models, featuring 168 Blender-rendered scenes, diverse lighting types, and multiple camera response functions. Comprehensive experiments show that our InstantHDR delivers comparable synthesis performance to the state-of-the-art optimization-based HDR methods while enjoying $\sim700\times$ and $\sim20\times$ reconstruction speed improvement with our single-forward and post-optimization settings. All code, models, and datasets will be released after the review process.

Preview loads on expand
S2D: Sparse to Dense Lifting for 3D Reconstruction with Minimal Inputs Yuzhou Ji, Qijian Tian, He Zhu, Xiaoqi Jiang, Guangzhi Cao, Lizhuang Ma, Yuan Xie, Xin Tan Updated 2026-03-11

Explicit 3D representations have already become an essential medium for 3D simulation and understanding. However, the most commonly used point cloud and 3D Gaussian Splatting (3DGS) each suffer from non-photorealistic rendering and significant degradation under sparse inputs. In this paper, we introduce Sparse to Dense lifting (S2D), a novel pipeline that bridges the two representations and achieves high-quality 3DGS reconstruction with minimal inputs. Specifically, the S2D lifting is two-fold. We first present an efficient one-step diffusion model that lifts sparse point cloud for high-fidelity image artifact fixing. Meanwhile, to reconstruct 3D consistent scenes, we also design a corresponding reconstruction strategy with random sample drop and weighted gradient for robust model fitting from sparse input views to dense novel views. Extensive experiments show that S2D achieves the best consistency in generating novel view guidance and first-tier sparse view reconstruction quality under different input sparsity. By reconstructing stable scenes with the least possible captures among existing methods, S2D enables minimal input requirements for 3DGS applications.

Preview loads on expand
PolGS++: Physically-Guided Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction Yufei Han, Chu Zhou, Youwei Lyu, Qi Chen, Si Li, Boxin Shi, Yunpeng Jia, Heng Guo, Zhanyu Ma Updated 2026-03-11

Accurate reconstruction of reflective surfaces remains a fundamental challenge in computer vision, with broad applications in real-time virtual reality and digital content creation. Although 3D Gaussian Splatting (3DGS) enables efficient novel-view rendering with explicit representations, its performance on reflective surfaces still lags behind implicit neural methods, especially in recovering fine geometry and surface normals. To address this gap, we propose PolGS++, a physically-guided polarimetric Gaussian Splatting framework for fast reflective surface reconstruction. Specifically, we integrate a polarized BRDF (pBRDF) model into 3DGS to explicitly decouple diffuse and specular components, providing physically grounded reflectance modeling and stronger geometric cues for reflective surface recovery. Furthermore, we introduce a depth-guided visibility mask acquisition mechanism that enables angle-of-polarization (AoP)-based tangent-space consistency constraints in Gaussian Splatting without costly ray-tracing intersections. This physically guided design improves reconstruction quality and efficiency, requiring only about 10 minutes of training. Extensive experiments on both synthetic and real-world datasets validate the effectiveness of our method.

Preview loads on expand
Splat2Real: Novel-view Scaling for Physical AI with 3D Gaussian Splatting Hansol Lim, Jongseong Brad Choi Updated 2026-03-11

Physical AI faces viewpoint shift between training and deployment, and novel-view robustness is essential for monocular RGB-to-3D perception. We cast Real2Render2Real monocular depth pretraining as imitation-learning-style supervision from a digital twin oracle: a student depth network imitates expert metric depth/visibility rendered from a scene mesh, while 3DGS supplies scalable novel-view observations. We present Splat2Real, centered on novel-view scaling: performance depends more on which views are added than on raw view count. We introduce CN-Coverage, a coverage+novelty curriculum that greedily selects views by geometry gain and an extrapolation penalty, plus a quality-aware guardrail fallback for low-reliability teachers. Across 20 TUM RGB-D sequences with step-matched budgets (N=0 to 2000 additional rendered views, with N unique <= 500 and resampling for larger budgets), naive scaling is unstable; CN-Coverage mitigates worst-case regressions relative to Robot/Coverage policies, and GOL-Gated CN-Coverage provides the strongest medium-high-budget stability with the lowest high-novelty tail error. Downstream control-proxy results versus N provides embodied-relevance evidence by shifting safety/progress trade-offs under viewpoint shift.

Preview loads on expand
P-GSVC: Layered Progressive 2D Gaussian Splatting for Scalable Image and Video Longan Wang, Yuang Shi, Wei Tsang Ooi Updated 2026-03-11

Gaussian splatting has emerged as a competitive explicit representation for image and video reconstruction. In this work, we present P-GSVC, the first layered progressive 2D Gaussian splatting framework that provides a unified solution for scalable Gaussian representation in both images and videos. P-GSVC organizes 2D Gaussian splats into a base layer and successive enhancement layers, enabling coarse-to-fine reconstructions. To effectively optimize this layered representation, we propose a joint training strategy that simultaneously updates Gaussians across layers, aligning their optimization trajectories to ensure inter-layer compatibility and a stable progressive reconstruction. P-GSVC supports scalability in terms of both quality and resolution. Our experiments show that the joint training strategy can gain up to 1.9 dB improvement in PSNR for video and 2.6 dB improvement in PSNR for image when compared to methods that perform sequential layer-wise training. Project page: https://longanwang-cs.github.io/PGSVC-webpage/

Preview loads on expand
VarSplat: Uncertainty-aware 3D Gaussian Splatting for Robust RGB-D SLAM Anh Thuan Tran, Jana Kosecka Updated 2026-03-10

Simultaneous Localization and Mapping (SLAM) with 3D Gaussian Splatting (3DGS) enables fast, differentiable rendering and high-fidelity reconstruction across diverse real-world scenes. However, existing 3DGS-SLAM approaches handle measurement reliability implicitly, making pose estimation and global alignment susceptible to drift in low-texture regions, transparent surfaces, or areas with complex reflectance properties. To this end, we introduce VarSplat, an uncertainty-aware 3DGS-SLAM system that explicitly learns per-splat appearance variance. By using the law of total variance with alpha compositing, we then render differentiable per-pixel uncertainty map via efficient, single-pass rasterization. This map guides tracking, submap registration, and loop detection toward focusing on reliable regions and contributes to more stable optimization. Experimental results on Replica (synthetic) and TUM-RGBD, ScanNet, and ScanNet++ (real-world) show that VarSplat improves robustness and achieves competitive or superior tracking, mapping, and novel view synthesis rendering compared to existing studies for dense RGB-D SLAM.

Preview loads on expand
DiffWind: Physics-Informed Differentiable Modeling of Wind-Driven Object Dynamics Yuanhang Lei, Boming Zhao, Zesong Yang, Xingxuan Li, Tao Cheng, Haocheng Peng, Ru Zhang, Yang Yang, Siyuan Huang, Yujun Shen, Ruizhen Hu, Hujun Bao, Zhaopeng Cui Updated 2026-03-10

Modeling wind-driven object dynamics from video observations is highly challenging due to the invisibility and spatio-temporal variability of wind, as well as the complex deformations of objects. We present DiffWind, a physics-informed differentiable framework that unifies wind-object interaction modeling, video-based reconstruction, and forward simulation. Specifically, we represent wind as a grid-based physical field and objects as particle systems derived from 3D Gaussian Splatting, with their interaction modeled by the Material Point Method (MPM). To recover wind-driven object dynamics, we introduce a reconstruction framework that jointly optimizes the spatio-temporal wind force field and object motion through differentiable rendering and simulation. To ensure physical validity, we incorporate the Lattice Boltzmann Method (LBM) as a physics-informed constraint, enforcing compliance with fluid dynamics laws. Beyond reconstruction, our method naturally supports forward simulation under novel wind conditions and enables new applications such as wind retargeting. We further introduce WD-Objects, a dataset of synthetic and real-world wind-driven scenes. Extensive experiments demonstrate that our method significantly outperforms prior dynamic scene modeling approaches in both reconstruction accuracy and simulation fidelity, opening a new avenue for video-based wind-object interaction modeling.

Preview loads on expand
X-GS: An Extensible Open Framework Unifying 3DGS Architectures with Downstream Multimodal Models Yueen Ma, Irwin King Updated 2026-03-10

3D Gaussian Splatting (3DGS) has emerged as a powerful technique for novel view synthesis, subsequently extending into numerous spatial AI applications. However, most existing 3DGS methods are isolated, focusing on specific domains such as online SLAM, semantic enrichment, or 3DGS for unposed images. In this paper, we introduce X-GS, an extensible open framework that unifies a broad range of techniques to enable real-time 3DGS-based online SLAM enriched with semantics, bridging the gap to downstream multimodal models. At the core of X-GS is a highly efficient pipeline called X-GS-Perceiver, capable of taking unposed RGB (or optionally RGB-D) video streams as input to co-optimize geometry and poses, and distill high-dimensional semantic features from vision foundation models into the 3D Gaussians. We achieve real-time performance through a novel online Vector Quantization (VQ) module, a GPU-accelerated grid-sampling scheme, and a highly parallelized pipeline design. The semantic 3D Gaussians can then be utilized by vision-language models within the X-GS-Thinker component, enabling downstream tasks such as object detection, zero-shot caption generation, and potentially embodied tasks. Experimental results on real-world datasets showcase the efficacy, efficiency, and newly unlocked multimodal capabilities of the X-GS framework.

Preview loads on expand
DenoiseSplat: Feed-Forward Gaussian Splatting for Noisy 3D Scene Reconstruction Fuzhen Jiang, Zhuoran Li, Yinlin Zhang Updated 2026-03-10

3D scene reconstruction and novel-view synthesis are fundamental for VR, robotics, and content creation. However, most NeRF and 3D Gaussian Splatting pipelines assume clean inputs and degrade under real noise and artifacts. We therefore propose DenoiseSplat, a feed-forward 3D Gaussian splatting method for noisy multi-view images. We build a large-scale, scene-consistent noisy--clean benchmark on RE10K by injecting Gaussian, Poisson, speckle, and salt-and-pepper noise with controlled intensities. With a lightweight MVSplat-style feed-forward backbone, we train end-to-end using only clean 2D renderings as supervision and no 3D ground truth. On noisy RE10K, DenoiseSplat outperforms vanilla MVSplat and a strong two-stage baseline (IDF + MVSplat) in PSNR/SSIM and LPIPS across noise types and levels.

Preview loads on expand
Learning Convex Decomposition via Feature Fields Yuezhi Yang, Qixing Huang, Mikaela Angelina Uy, Nicholas Sharp Updated 2026-03-10

This work proposes a new formulation to the long-standing problem of convex decomposition through learning feature fields, enabling the first feed-forward model for open-world convex decomposition. Our method produces high-quality decompositions of 3D shapes into a union of convex bodies, which are essential to accelerate collision detection in physical simulation, amongst many other applications. The key insight is to adopt a feature learning approach and learn a continuous feature field that can later be clustered to yield a good convex decomposition via our self-supervised, purely-geometric objective derived from the classical definition of convexity. Our formulation can be used for single shape optimization, but more importantly, feature prediction unlocks scalable, self-supervised learning on large datasets resulting in the first learned open-world model for convex decomposition. Experiments show that our decompositions are higher-quality than alternatives and generalize across open-world objects as well as across representations to meshes, CAD models, and even Gaussian splats. https://research.nvidia.com/labs/sil/projects/learning-convex-decomp/

Preview loads on expand
Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists Jiaqi Liu, Zhizhong Han Updated 2026-03-10

3D Gaussian splatting (3DGS) has become a vital tool for learning a radiance field from multiple posed images. Although 3DGS shows great advantages over NeRF in terms of rendering quality and efficiency, it remains a research challenge to further improve the efficiency of learning 3D Gaussians. To overcome this challenge, we propose novel training strategies and losses to shorten each Gaussian list used to render a pixel, which speeds up the splatting by involving fewer Gaussians along a ray. Specifically, we shrink the size of each Gaussian by resetting their scales regularly, encouraging smaller Gaussians to cover fewer nearby pixels, which shortens the Gaussian lists of pixels. Additionally, we introduce an entropy constraint on the alpha blending procedure to sharpen the weight distribution of Gaussians along each ray, which drives dominant weights larger while making minor weights smaller. As a result, each Gaussian becomes more focused on the pixels where it is dominant, which reduces its impact on nearby pixels, leading to even shorter Gaussian lists. Eventually, we integrate our method into a rendering resolution scheduler which further improves efficiency through progressive resolution increase. We evaluate our method by comparing it with state-of-the-art methods on widely used benchmarks. Our results show significant advantages over others in efficiency without sacrificing rendering quality.

Preview loads on expand
SkipGS: Post-Densification Backward Skipping for Efficient 3DGS Training Jingxing Li, Yongjae Leeand, Deliang Fan Updated 2026-03-09

3D Gaussian Splatting (3DGS) achieves real-time novel-view synthesis by optimizing millions of anisotropic Gaussians, yet its training remains expensive, with the backward pass dominating runtime in the post-densification refinement phase. We observe substantial update redundancy in this phase: many sampled views have near-plateaued losses and provide diminishing gradient benefits, but standard training still runs full backpropagation. We propose SkipGS with a novel view-adaptive backward gating mechanism for efficient post-densification training. SkipGS always performs the forward pass to update per-view loss statistics, and selectively skips backward passes when the sampled view's loss is consistent with its recent per-view baseline, while enforcing a minimum backward budget for stable optimization. On Mip-NeRF 360, compared to 3DGS, SkipGS reduces end-to-end training time by 23.1%, driven by a 42.0% reduction in post-densification time, with comparable reconstruction quality. Because it only changes when to backpropagate -- without modifying the renderer, representation, or loss -- SkipGS is plug-and-play and compatible with other complementary efficiency strategies for additive speedups.

Preview loads on expand
SurgCalib: Gaussian Splatting-Based Hand-Eye Calibration for Robot-Assisted Minimally Invasive Surgery Zijian Wu, Shuojue Yang, Yu Chung Lee, Eitan Prisman, Yueming Jin, Septimiu E. Salcudean Updated 2026-03-09

We present a Gaussian Splatting-based framework for hand-eye calibration of the da Vinci surgical robot. In a vision-guided robotic system, accurate estimation of the rigid transformation between the robot base and the camera frame is essential for reliable closed-loop control. For cable-driven surgical robots, this task faces unique challenges. The encoders of surgical instruments often produce inaccurate proprioceptive measurements due to cable stretch and backlash. Conventional hand-eye calibration approaches typically rely on known fiducial patterns and solve the AX = XB formulation. While effective, introducing additional markers into the operating room (OR) environment can violate sterility protocols and disrupt surgical workflows. In this study, we propose SurgCalib, an automatic, markerless framework that has the potential to be used in the OR. SurgCalib first initializes the pose of the surgical instrument using raw kinematic measurements and subsequently refines this pose through a two-phase optimization procedure under the RCM constraint within a Gaussian Splatting-based differentiable rendering pipeline. We evaluate the proposed method on the public dVRK benchmark, SurgPose. The results demonstrate average 2D tool-tip reprojection errors of 12.24 px (2.06 mm) and 11.33 px (1.9 mm), and 3D tool-tip Euclidean distance errors of 5.98 mm and 4.75 mm, for the left and right instruments, respectively.

Preview loads on expand
Where, What, Why: Toward Explainable 3D-GS Watermarking Mingshu Cai, Jiajun Li, Osamu Yoshie, Yuya Ieiri, Yixuan Li Updated 2026-03-09

As 3D Gaussian Splatting becomes the de facto representation for interactive 3D assets, robust yet imperceptible watermarking is critical. We present a representation-native framework that separates where to write from how to preserve quality. A Trio-Experts module operates directly on Gaussian primitives to derive priors for carrier selection, while a Safety and Budget Aware Gate (SBAG) allocates Gaussians to watermark carriers, optimized for bit resilience under perturbation and bitrate budgets, and to visual compensators that are insulated from watermark loss. To maintain fidelity, we introduce a channel-wise group mask that controls gradient propagation for carriers and compensators, thereby limiting Gaussian parameter updates, repairing local artifacts, and preserving high-frequency details without increasing runtime. Our design yields view-consistent watermark persistence and strong robustness against common image distortions such as compression and noise, while achieving a favorable robustness-quality trade-off compared with prior methods. In addition, decoupled finetuning provides per-Gaussian attributions that reveal where the message is carried and why those carriers are selected, enabling auditable explainability. Compared with state-of-the-art methods, our approach achieves a PSNR improvement of +0.83 dB and a bit-accuracy gain of +1.24%.

Preview loads on expand
ImprovedGS+: A High-Performance C++/CUDA Re-Implementation Strategy for 3D Gaussian Splatting Jordi Muñoz Vicente Updated 2026-03-09

Recent advancements in 3D Gaussian Splatting (3DGS) have shifted the focus toward balancing reconstruction fidelity with computational efficiency. In this work, we propose ImprovedGS+, a high-performance, low-level reinvention of the ImprovedGS strategy, implemented natively within the LichtFeld-Studio framework. By transitioning from high-level Python logic to hardware-optimized C++/CUDA kernels, we achieve a significant reduction in host-device synchronization and training latency. Our implementation introduces a Long-Axis-Split (LAS) CUDA kernel, custom Laplacian-based importance kernels with Non-Maximum Suppression (NMS) for edge scores, and an adaptive Exponential Scale Scheduler. Experimental results on the Mip-NeRF360 dataset demonstrate that ImprovedGS+ establishes a new Pareto-optimal front for scene reconstruction. Our 1M-budget variant outperforms the state-of-the-art MCMC baseline by achieving a 26.8% reduction in training time (saving 17 minutes per session) and utilizing 13.3% fewer Gaussians while maintaining superior visual quality. Furthermore, our full variant demonstrates a 1.28 dB PSNR increase over the ADC baseline with a 38.4% reduction in parametric complexity. These results validate ImprovedGS+ as a scalable, high-speed solution that upholds the core pillars of Speed, Quality, and Usability within the LichtFeld-Studio ecosystem.

Preview loads on expand
Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking Maximilian Luz, Rohit Mohan, Thomas Nürnberg, Yakov Miron, Daniele Cattaneo, Abhinav Valada Updated 2026-02-26

Capturing 4D spatiotemporal surroundings is crucial for the safe and reliable operation of robots in dynamic environments. However, most existing methods address only one side of the problem: they either provide coarse geometric tracking via bounding boxes, or detailed 3D structures like voxel-based occupancy that lack explicit temporal association. In this work, we present Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking (LaGS) that advances spatiotemporal scene understanding in a holistic direction. Our approach incorporates camera-based end-to-end tracking with mask-based multi-view panoptic occupancy prediction, and addresses the key challenge of efficiently aggregating multi-view information into 3D voxel grids via a novel latent Gaussian splatting approach. Specifically, we first fuse observations into 3D Gaussians that serve as a sparse point-centric latent representation of the 3D scene, and then splat the aggregated features onto a 3D voxel grid that is decoded by a mask-based segmentation head. We evaluate LaGS on the Occ3D nuScenes and Waymo datasets, achieving state-of-the-art performance for 4D panoptic occupancy tracking. We make our code available at https://lags.cs.uni-freiburg.de/.

Preview loads on expand
PackUV: Packed Gaussian UV Maps for 4D Volumetric Video Aashish Rai, Angela Xing, Anushka Agarwal, Xiaoyan Cong, Zekun Li, Tao Lu, Aayush Prakash, Srinath Sridhar Updated 2026-02-26

Volumetric videos offer immersive 4D experiences, but remain difficult to reconstruct, store, and stream at scale. Existing Gaussian Splatting based methods achieve high-quality reconstruction but break down on long sequences, temporal inconsistency, and fail under large motions and disocclusions. Moreover, their outputs are typically incompatible with conventional video coding pipelines, preventing practical applications. We introduce PackUV, a novel 4D Gaussian representation that maps all Gaussian attributes into a sequence of structured, multi-scale UV atlas, enabling compact, image-native storage. To fit this representation from multi-view videos, we propose PackUV-GS, a temporally consistent fitting method that directly optimizes Gaussian parameters in the UV domain. A flow-guided Gaussian labeling and video keyframing module identifies dynamic Gaussians, stabilizes static regions, and preserves temporal coherence even under large motions and disocclusions. The resulting UV atlas format is the first unified volumetric video representation compatible with standard video codecs (e.g., FFV1) without losing quality, enabling efficient streaming within existing multimedia infrastructure. To evaluate long-duration volumetric capture, we present PackUV-2B, the largest multi-view video dataset to date, featuring more than 50 synchronized cameras, substantial motion, and frequent disocclusions across 100 sequences and 2B (billion) frames. Extensive experiments demonstrate that our method surpasses existing baselines in rendering fidelity while scaling to sequences up to 30 minutes with consistent quality.

Preview loads on expand
GSTurb: Gaussian Splatting for Atmospheric Turbulence Mitigation Hanliang Du, Zhangji Lu, Zewei Cai, Qijian Tang, Qifeng Yu, Xiaoli Liu Updated 2026-02-26

Atmospheric turbulence causes significant image degradation due to pixel displacement (tilt) and blur, particularly in long-range imaging applications. In this paper, we propose a novel framework for atmospheric turbulence mitigation, GSTurb, which integrates optical flow-guided tilt correction and Gaussian splatting for modeling non-isoplanatic blur. The framework employs Gaussian parameters to represent tilt and blur, and optimizes them across multiple frames to enhance restoration. Experimental results on the ATSyn-static dataset demonstrate the effectiveness of our method, achieving a peak PSNR of 27.67 dB and SSIM of 0.8735. Compared to the state-of-the-art method, GSTurb improves PSNR by 1.3 dB (a 4.5% increase) and SSIM by 0.048 (a 5.8% increase). Additionally, on real datasets, including the TSRWGAN Real-World and CLEAR datasets, GSTurb outperforms existing methods, showing significant improvements in both qualitative and quantitative performance. These results highlight that combining optical flow-guided tilt correction with Gaussian splatting effectively enhances image restoration under both synthetic and real-world turbulence conditions. The code for this method will be available at https://github.com/DuhlLiamz/3DGS_turbulence/tree/main.

Preview loads on expand
Sapling-NeRF: Geo-Localised Sapling Reconstruction in Forests for Ecological Monitoring Miguel Ángel Muñoz-Bañón, Nived Chebrolu, Sruthi M. Krishna Moorthy, Yifu Tao, Fernando Torres, Roberto Salguero-Gómez, Maurice Fallon Updated 2026-02-26

Saplings are key indicators of forest regeneration and overall forest health. However, their fine-scale architectural traits are difficult to capture with existing 3D sensing methods, which make quantitative evaluation difficult. Terrestrial Laser Scanners (TLS), Mobile Laser Scanners (MLS), or traditional photogrammetry approaches poorly reconstruct thin branches, dense foliage, and lack the scale consistency needed for long-term monitoring. Implicit 3D reconstruction methods such as Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) are promising alternatives, but cannot recover the true scale of a scene and lack any means to be accurately geo-localised. In this paper, we present a pipeline which fuses NeRF, LiDAR SLAM, and GNSS to enable repeatable, geo-localised ecological monitoring of saplings. Our system proposes a three-level representation: (i) coarse Earth-frame localisation using GNSS, (ii) LiDAR-based SLAM for centimetre-accurate localisation and reconstruction, and (iii) NeRF-derived object-centric dense reconstruction of individual saplings. This approach enables repeatable quantitative evaluation and long-term monitoring of sapling traits. Our experiments in forest plots in Wytham Woods (Oxford, UK) and Evo (Finland) show that stem height, branching patterns, and leaf-to-wood ratios can be captured with increased accuracy as compared to TLS. We demonstrate that accurate stem skeletons and leaf distributions can be measured for saplings with heights between 0.5m and 2m in situ, giving ecologists access to richer structural and quantitative data for analysing forest dynamics.

Preview loads on expand
ArtPro: Self-Supervised Articulated Object Reconstruction with Adaptive Integration of Mobility Proposals Xuelu Li, Zhaonan Wang, Xiaogang Wang, Lei Wu, Manyi Li, Changhe Tu Updated 2026-02-26

Reconstructing articulated objects into high-fidelity digital twins is crucial for applications such as robotic manipulation and interactive simulation. Recent self-supervised methods using differentiable rendering frameworks like 3D Gaussian Splatting remain highly sensitive to the initial part segmentation. Their reliance on heuristic clustering or pre-trained models often causes optimization to converge to local minima, especially for complex multi-part objects. To address these limitations, we propose ArtPro, a novel self-supervised framework that introduces adaptive integration of mobility proposals. Our approach begins with an over-segmentation initialization guided by geometry features and motion priors, generating part proposals with plausible motion hypotheses. During optimization, we dynamically merge these proposals by analyzing motion consistency among spatial neighbors, while a collision-aware motion pruning mechanism prevents erroneous kinematic estimation. Extensive experiments on both synthetic and real-world objects demonstrate that ArtPro achieves robust reconstruction of complex multi-part objects, significantly outperforming existing methods in accuracy and stability.

Preview loads on expand
BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model Yuci Han, Charles Toth, John E. Anderson, William J. Shuart, Alper Yilmaz Updated 2026-02-26

We present BetterScene, an approach to enhance novel view synthesis (NVS) quality for diverse real-world scenes using extremely sparse, unconstrained photos. BetterScene leverages the production-ready Stable Video Diffusion (SVD) model pretrained on billions of frames as a strong backbone, aiming to mitigate artifacts and recover view-consistent details at inference time. Conventional methods have developed similar diffusion-based solutions to address these challenges of novel view synthesis. Despite significant improvements, these methods typically rely on off-the-shelf pretrained diffusion priors and fine-tune only the UNet module while keeping other components frozen, which still leads to inconsistent details and artifacts even when incorporating geometry-aware regularizations like depth or semantic conditions. To address this, we investigate the latent space of the diffusion model and introduce two components: (1) temporal equivariance regularization and (2) vision foundation model-aligned representation, both applied to the variational autoencoder (VAE) module within the SVD pipeline. BetterScene integrates a feed-forward 3D Gaussian Splatting (3DGS) model to render features as inputs for the SVD enhancer and generate continuous, artifact-free, consistent novel views. We evaluate on the challenging DL3DV-10K dataset and demonstrate superior performance compared to state-of-the-art methods.

Preview loads on expand
GIFSplat: Generative Prior-Guided Iterative Feed-Forward 3D Gaussian Splatting from Sparse Views Tianyu Chen, Wei Xiang, Kang Han, Yu Lu, Di Wu, Gaowen Liu, Ramana Rao Kompella Updated 2026-02-26

Feed-forward 3D reconstruction offers substantial runtime advantages over per-scene optimization, which remains slow at inference and often fragile under sparse views. However, existing feed-forward methods still have potential for further performance gains, especially for out-of-domain data, and struggle to retain second-level inference time once a generative prior is introduced. These limitations stem from the one-shot prediction paradigm in existing feed-forward pipeline: models are strictly bounded by capacity, lack inference-time refinement, and are ill-suited for continuously injecting generative priors. We introduce GIFSplat, a purely feed-forward iterative refinement framework for 3D Gaussian Splatting from sparse unposed views. A small number of forward-only residual updates progressively refine current 3D scene using rendering evidence, achieve favorable balance between efficiency and quality. Furthermore, we distill a frozen diffusion prior into Gaussian-level cues from enhanced novel renderings without gradient backpropagation or ever-increasing view-set expansion, thereby enabling per-scene adaptation with generative prior while preserving feed-forward efficiency. Across DL3DV, RealEstate10K, and DTU, GIFSplat consistently outperforms state-of-the-art feed-forward baselines, improving PSNR by up to +2.1 dB, and it maintains second-scale inference time without requiring camera poses or any test-time gradient optimization.

Preview loads on expand
SwiftNDC: Fast Neural Depth Correction for High-Fidelity 3D Reconstruction Kang Han, Wei Xiang, Lu Yu, Mathew Wyatt, Gaowen Liu, Ramana Rao Kompella Updated 2026-02-26

Depth-guided 3D reconstruction has gained popularity as a fast alternative to optimization-heavy approaches, yet existing methods still suffer from scale drift, multi-view inconsistencies, and the need for substantial refinement to achieve high-fidelity geometry. Here, we propose SwiftNDC, a fast and general framework built around a Neural Depth Correction field that produces cross-view consistent depth maps. From these refined depths, we generate a dense point cloud through back-projection and robust reprojection-error filtering, obtaining a clean and uniformly distributed geometric initialization for downstream reconstruction. This reliable dense geometry substantially accelerates 3D Gaussian Splatting (3DGS) for mesh reconstruction, enabling high-quality surfaces with significantly fewer optimization iterations. For novel-view synthesis, SwiftNDC can also improve 3DGS rendering quality, highlighting the benefits of strong geometric initialization. We conduct a comprehensive study across five datasets, including two for mesh reconstruction, as well as three for novel-view synthesis. SwiftNDC consistently reduces running time for accurate mesh reconstruction and boosts rendering fidelity for view synthesis, demonstrating the effectiveness of combining neural depth refinement with robust geometric initialization for high-fidelity and efficient 3D reconstruction.

Preview loads on expand
AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction Hanyang Liu, Rongjun Qin Updated 2026-02-25

Recent advances in 4D scene reconstruction have significantly improved dynamic modeling across various domains. However, existing approaches remain limited under aerial conditions with single-view capture, wide spatial range, and dynamic objects of limited spatial footprint and large motion disparity. These challenges cause severe depth ambiguity and unstable motion estimation, making monocular aerial reconstruction inherently ill-posed. To this end, we present AeroDGS, a physics-guided 4D Gaussian splatting framework for monocular UAV videos. AeroDGS introduces a Monocular Geometry Lifting module that reconstructs reliable static and dynamic geometry from a single aerial sequence, providing a robust basis for dynamic estimation. To further resolve monocular ambiguity, we propose a Physics-Guided Optimization module that incorporates differentiable ground-support, upright-stability, and trajectory-smoothness priors, transforming ambiguous image cues into physically consistent motion. The framework jointly refines static backgrounds and dynamic entities with stable geometry and coherent temporal evolution. We additionally build a real-world UAV dataset that spans various altitudes and motion conditions to evaluate dynamic aerial reconstruction. Experiments on synthetic and real UAV scenes demonstrate that AeroDGS outperforms state-of-the-art methods, achieving superior reconstruction fidelity in dynamic aerial environments.

Preview loads on expand
Interactive Augmented Reality-enabled Outdoor Scene Visualization For Enhanced Real-time Disaster Response Dimitrios Apostolakis, Georgios Angelidis, Vasileios Argyriou, Panagiotis Sarigiannidis, Georgios Th. Papadopoulos Updated 2026-02-25

A user-centered AR interface for disaster response is presented in this work that uses 3D Gaussian Splatting (3DGS) to visualize detailed scene reconstructions, while maintaining situational awareness and keeping cognitive load low. The interface relies on a lightweight interaction approach, combining World-in-Miniature (WIM) navigation with semantic Points of Interest (POIs) that can be filtered as needed, and it is supported by an architecture designed to stream updates as reconstructions evolve. User feedback from a preliminary evaluation indicates that this design is easy to use and supports real-time coordination, with participants highlighting the value of interaction and POIs for fast decision-making in context. Thorough user-centric performance evaluation demonstrates strong usability of the developed interface and high acceptance ratios.

Preview loads on expand
BrepGaussian: CAD reconstruction from Multi-View Images with Gaussian Splatting Jiaxing Yu, Dongyang Ren, Hangyu Xu, Zhouyuxiao Yang, Yuanqi Li, Jie Guo, Zhengkang Zhou, Yanwen Guo Updated 2026-02-24

The boundary representation (B-rep) models a 3D solid as its explicit boundaries: trimmed corners, edges, and faces. Recovering B-rep representation from unstructured data is a challenging and valuable task of computer vision and graphics. Recent advances in deep learning have greatly improved the recovery of 3D shape geometry, but still depend on dense and clean point clouds and struggle to generalize to novel shapes. We propose B-rep Gaussian Splatting (BrepGaussian), a novel framework that learns 3D parametric representations from 2D images. We employ a Gaussian Splatting renderer with learnable features, followed by a specific fitting strategy. To disentangle geometry reconstruction and feature learning, we introduce a two-stage learning framework that first captures geometry and edges and then refines patch features to achieve clean geometry and coherent instance representations. Extensive experiments demonstrate the superior performance of our approach to state-of-the-art methods. We will release our code and datasets upon acceptance.

Preview loads on expand
Dropping Anchor and Spherical Harmonics for Sparse-view Gaussian Splatting Shuangkang Fang, I-Chao Shen, Xuanyang Zhang, Zesheng Wang, Yufeng Wang, Wenrui Ding, Gang Yu, Takeo Igarashi Updated 2026-02-24

Recent 3D Gaussian Splatting (3DGS) Dropout methods address overfitting under sparse-view conditions by randomly nullifying Gaussian opacities. However, we identify a neighbor compensation effect in these approaches: dropped Gaussians are often compensated by their neighbors, weakening the intended regularization. Moreover, these methods overlook the contribution of high-degree spherical harmonic coefficients (SH) to overfitting. To address these issues, we propose DropAnSH-GS, a novel anchor-based Dropout strategy. Rather than dropping Gaussians independently, our method randomly selects certain Gaussians as anchors and simultaneously removes their spatial neighbors. This effectively disrupts local redundancies near anchors and encourages the model to learn more robust, globally informed representations. Furthermore, we extend the Dropout to color attributes by randomly dropping higher-degree SH to concentrate appearance information in lower-degree SH. This strategy further mitigates overfitting and enables flexible post-training model compression via SH truncation. Experimental results demonstrate that DropAnSH-GS substantially outperforms existing Dropout methods with negligible computational overhead, and can be readily integrated into various 3DGS variants to enhance their performances. Project Website: https://sk-fun.fun/DropAnSH-GS

Preview loads on expand
RU4D-SLAM: Reweighting Uncertainty in Gaussian Splatting SLAM for 4D Scene Reconstruction Yangfan Zhao, Hanwei Zhang, Ke Huang, Qiufeng Wang, Zhenzhou Shao, Dengyu Wu Updated 2026-02-24

Combining 3D Gaussian splatting with Simultaneous Localization and Mapping (SLAM) has gained popularity as it enables continuous 3D environment reconstruction during motion. However, existing methods struggle in dynamic environments, particularly moving objects complicate 3D reconstruction and, in turn, hinder reliable tracking. The emergence of 4D reconstruction, especially 4D Gaussian splatting, offers a promising direction for addressing these challenges, yet its potential for 4D-aware SLAM remains largely underexplored. Along this direction, we propose a robust and efficient framework, namely Reweighting Uncertainty in Gaussian Splatting SLAM (RU4D-SLAM) for 4D scene reconstruction, that introduces temporal factors into spatial 3D representation while incorporating uncertainty-aware perception of scene changes, blurred image synthesis, and dynamic scene reconstruction. We enhance dynamic scene representation by integrating motion blur rendering, and improve uncertainty-aware tracking by extending per-pixel uncertainty modeling, which is originally designed for static scenarios, to handle blurred images. Furthermore, we propose a semantic-guided reweighting mechanism for per-pixel uncertainty estimation in dynamic scenes, and introduce a learnable opacity weight to support adaptive 4D mapping. Extensive experiments on standard benchmarks demonstrate that our method substantially outperforms state-of-the-art approaches in both trajectory accuracy and 4D scene reconstruction, particularly in dynamic environments with moving objects and low-quality inputs. Code available: https://ru4d-slam.github.io

Preview loads on expand
Monocular Endoscopic Tissue 3D Reconstruction with Multi-Level Geometry Regularization Yangsen Chen, Hao Wang Updated 2026-02-24

Reconstructing deformable endoscopic tissues is crucial for achieving robot-assisted surgery. However, 3D Gaussian Splatting-based approaches encounter challenges in achieving consistent tissue surface reconstruction, while existing NeRF-based methods lack real-time rendering capabilities. In pursuit of both smooth deformable surfaces and real-time rendering, we introduce a novel approach based on 3D Gaussian Splatting. Specifically, we introduce surface-aware reconstruction, initially employing a Sign Distance Field-based method to construct a mesh, subsequently utilizing this mesh to constrain the Gaussian Splatting reconstruction process. Furthermore, to ensure the generation of physically plausible deformations, we incorporate local rigidity and global non-rigidity restrictions to guide Gaussian deformation, tailored for the highly deformable nature of soft endoscopic tissue. Based on 3D Gaussian Splatting, our proposed method delivers a fast rendering process and smooth surface appearances. Quantitative and qualitative analysis against alternative methodologies shows that our approach achieves solid reconstruction quality in both textures and geometries.

Preview loads on expand
WildGHand: Learning Anti-Perturbation Gaussian Hand Avatars from Monocular In-the-Wild Videos Hanhui Li, Xuan Huang, Wanquan Liu, Yuhao Cheng, Long Chen, Yiqiang Yan, Xiaodan Liang, Chenqiang Gao Updated 2026-02-24

Despite recent progress in 3D hand reconstruction from monocular videos, most existing methods rely on data captured in well-controlled environments and therefore degrade in real-world settings with severe perturbations, such as hand-object interactions, extreme poses, illumination changes, and motion blur. To tackle these issues, we introduce WildGHand, an optimization-based framework that enables self-adaptive 3D Gaussian splatting on in-the-wild videos and produces high-fidelity hand avatars. WildGHand incorporates two key components: (i) a dynamic perturbation disentanglement module that explicitly represents perturbations as time-varying biases on 3D Gaussian attributes during optimization, and (ii) a perturbation-aware optimization strategy that generates per-frame anisotropic weighted masks to guide optimization. Together, these components allow the framework to identify and suppress perturbations across both spatial and temporal dimensions. We further curate a dataset of monocular hand videos captured under diverse perturbations to benchmark in-the-wild hand avatar reconstruction. Extensive experiments on this dataset and two public datasets demonstrate that WildGHand achieves state-of-the-art performance and substantially improves over its base model across multiple metrics (e.g., up to a $15.8\%$ relative gain in PSNR and a $23.1\%$ relative reduction in LPIPS). Our implementation and dataset are available at https://github.com/XuanHuang0/WildGHand.

Preview loads on expand
Aesthetic Camera Viewpoint Suggestion with 3D Aesthetic Field Sheyang Tang, Armin Shafiee Sarvestani, Jialu Xu, Xiaoyu Xu, Zhou Wang Updated 2026-02-23

The aesthetic quality of a scene depends strongly on camera viewpoint. Existing approaches for aesthetic viewpoint suggestion are either single-view adjustments, predicting limited camera adjustments from a single image without understanding scene geometry, or 3D exploration approaches, which rely on dense captures or prebuilt 3D environments coupled with costly reinforcement learning (RL) searches. In this work, we introduce the notion of 3D aesthetic field that enables geometry-grounded aesthetic reasoning in 3D with sparse captures, allowing efficient viewpoint suggestions in contrast to costly RL searches. We opt to learn this 3D aesthetic field using a feedforward 3D Gaussian Splatting network that distills high-level aesthetic knowledge from a pretrained 2D aesthetic model into 3D space, enabling aesthetic prediction for novel viewpoints from only sparse input views. Building on this field, we propose a two-stage search pipeline that combines coarse viewpoint sampling with gradient-based refinement, efficiently identifying aesthetically appealing viewpoints without dense captures or RL exploration. Extensive experiments show that our method consistently suggests viewpoints with superior framing and composition compared to existing approaches, establishing a new direction toward 3D-aware aesthetic modeling.

Preview loads on expand
Large-scale Photorealistic Outdoor 3D Scene Reconstruction from UAV Imagery Using Gaussian Splatting Techniques Christos Maikos, Georgios Angelidis, Georgios Th. Papadopoulos Updated 2026-02-23

In this study, we present an end-to-end pipeline capable of converting drone-captured video streams into high-fidelity 3D reconstructions with minimal latency. Unmanned aerial vehicles (UAVs) are extensively used in aerial real-time perception applications. Moreover, recent advances in 3D Gaussian Splatting (3DGS) have demonstrated significant potential for real-time neural rendering. However, their integration into end-to-end UAV-based reconstruction and visualization systems remains underexplored. Our goal is to propose an efficient architecture that combines live video acquisition via RTMP streaming, synchronized sensor fusion, camera pose estimation, and 3DGS optimization, achieving continuous model updates and low-latency deployment within interactive visualization environments that supports immersive augmented and virtual reality (AR/VR) applications. Experimental results demonstrate that the proposed method achieves competitive visual fidelity, while delivering significantly higher rendering performance and substantially reduced end-to-end latency, compared to NeRF-based approaches. Reconstruction quality remains within 4-7\% of high-fidelity offline references, confirming the suitability of the proposed system for real-time, scalable augmented perception from aerial platforms.

Preview loads on expand
tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction Chen Wang, Hao Tan, Wang Yifan, Zhiqin Chen, Yuheng Liu, Kalyan Sunkavalli, Sai Bi, Lingjie Liu, Yiwei Hu Updated 2026-02-23

We propose tttLRM, a novel large 3D reconstruction model that leverages a Test-Time Training (TTT) layer to enable long-context, autoregressive 3D reconstruction with linear computational complexity, further scaling the model's capability. Our framework efficiently compresses multiple image observations into the fast weights of the TTT layer, forming an implicit 3D representation in the latent space that can be decoded into various explicit formats, such as Gaussian Splats (GS) for downstream applications. The online learning variant of our model supports progressive 3D reconstruction and refinement from streaming observations. We demonstrate that pretraining on novel view synthesis tasks effectively transfers to explicit 3D modeling, resulting in improved reconstruction quality and faster convergence. Extensive experiments show that our method achieves superior performance in feedforward 3D Gaussian reconstruction compared to state-of-the-art approaches on both objects and scenes.

Preview loads on expand
Augmented Radiance Field: A General Framework for Enhanced Gaussian Splatting Yixin Yang, Bojian Wu, Yang Zhou, Hui Huang Updated 2026-02-23

Due to the real-time rendering performance, 3D Gaussian Splatting (3DGS) has emerged as the leading method for radiance field reconstruction. However, its reliance on spherical harmonics for color encoding inherently limits its ability to separate diffuse and specular components, making it challenging to accurately represent complex reflections. To address this, we propose a novel enhanced Gaussian kernel that explicitly models specular effects through view-dependent opacity. Meanwhile, we introduce an error-driven compensation strategy to improve rendering quality in existing 3DGS scenes. Our method begins with 2D Gaussian initialization and then adaptively inserts and optimizes enhanced Gaussian kernels, ultimately producing an augmented radiance field. Experiments demonstrate that our method not only surpasses state-of-the-art NeRF methods in rendering performance but also achieves greater parameter efficiency. Project page at: https://xiaoxinyyx.github.io/augs.

Preview loads on expand
One2Scene: Geometric Consistent Explorable 3D Scene Generation from a Single Image Pengfei Wang, Liyi Chen, Zhiyuan Ma, Yanjun Guo, Guowen Zhang, Lei Zhang Updated 2026-02-23

Generating explorable 3D scenes from a single image is a highly challenging problem in 3D vision. Existing methods struggle to support free exploration, often producing severe geometric distortions and noisy artifacts when the viewpoint moves far from the original perspective. We introduce \textbf{One2Scene}, an effective framework that decomposes this ill-posed problem into three tractable sub-tasks to enable immersive explorable scene generation. We first use a panorama generator to produce anchor views from a single input image as initialization. Then, we lift these 2D anchors into an explicit 3D geometric scaffold via a generalizable, feed-forward Gaussian Splatting network. Instead of treating the panorama as a single image for reconstruction, we project it into multiple sparse anchor views and reformulate the reconstruction task as multi-view stereo matching, which allows us to leverage robust geometric priors learned from large-scale multi-view datasets. A bidirectional feature fusion module is used to enforce cross-view consistency, yielding an efficient and geometrically reliable scaffold. Finally, the scaffold serves as a strong prior for a novel view generator to produce photorealistic and geometrically accurate views at arbitrary cameras. By explicitly conditioning on a 3D-consistent scaffold to perform reconstruction, One2Scene works stably under large camera motions, supporting immersive scene exploration. Extensive experiments show that One2Scene substantially outperforms state-of-the-art methods in panorama depth estimation, feed-forward 360° reconstruction, and explorable 3D scene generation. Code and models will be released.

Preview loads on expand
3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting Wancai Zheng, Hao Chen, Xianlong Lu, Linlin Ou, Xinyi Yu Updated 2026-02-12

Object navigation is a core capability of embodied intelligence, enabling an agent to locate target objects in unknown environments. Recent advances in vision-language models (VLMs) have facilitated zero-shot object navigation (ZSON). However, existing methods often rely on scene abstractions that convert environments into semantic maps or textual representations, causing high-level decision making to be constrained by the accuracy of low-level perception. In this work, we present 3DGSNav, a novel ZSON framework that embeds 3D Gaussian Splatting (3DGS) as persistent memory for VLMs to enhance spatial reasoning. Through active perception, 3DGSNav incrementally constructs a 3DGS representation of the environment, enabling trajectory-guided free-viewpoint rendering of frontier-aware first-person views. Moreover, we design structured visual prompts and integrate them with Chain-of-Thought (CoT) prompting to further improve VLM reasoning. During navigation, a real-time object detector filters potential targets, while VLM-driven active viewpoint switching performs target re-verification, ensuring efficient and reliable recognition. Extensive evaluations across multiple benchmarks and real-world experiments on a quadruped robot demonstrate that our method achieves robust and competitive performance against state-of-the-art approaches.The Project Page:https://aczheng-cai.github.io/3dgsnav.github.io/

Preview loads on expand
GSO-SLAM: Bidirectionally Coupled Gaussian Splatting and Direct Visual Odometry Jiung Yeon, Seongbo Ha, Hyeonwoo Yu Updated 2026-02-12

We propose GSO-SLAM, a real-time monocular dense SLAM system that leverages Gaussian scene representation. Unlike existing methods that couple tracking and mapping with a unified scene, incurring computational costs, or loosely integrate them with well-structured tracking frameworks, introducing redundancies, our method bidirectionally couples Visual Odometry (VO) and Gaussian Splatting (GS). Specifically, our approach formulates joint optimization within an Expectation-Maximization (EM) framework, enabling the simultaneous refinement of VO-derived semi-dense depth estimates and the GS representation without additional computational overhead. Moreover, we present Gaussian Splat Initialization, which utilizes image information, keyframe poses, and pixel associations from VO to produce close approximations to the final Gaussian scene, thereby eliminating the need for heuristic methods. Through extensive experiments, we validate the effectiveness of our method, showing that it not only operates in real time but also achieves state-of-the-art geometric/photometric fidelity of the reconstructed scene and tracking accuracy.

Preview loads on expand
TG-Field: Geometry-Aware Radiative Gaussian Fields for Tomographic Reconstruction Yuxiang Zhong, Jun Wei, Chaoqi Chen, Senyou An, Hui Huang Updated 2026-02-12

3D Gaussian Splatting (3DGS) has revolutionized 3D scene representation with superior efficiency and quality. While recent adaptations for computed tomography (CT) show promise, they struggle with severe artifacts under highly sparse-view projections and dynamic motions. To address these challenges, we propose Tomographic Geometry Field (TG-Field), a geometry-aware Gaussian deformation framework tailored for both static and dynamic CT reconstruction. A multi-resolution hash encoder is employed to capture local spatial priors, regularizing primitive parameters under ultra-sparse settings. We further extend the framework to dynamic reconstruction by introducing time-conditioned representations and a spatiotemporal attention block to adaptively aggregate features, thereby resolving spatiotemporal ambiguities and enforcing temporal coherence. In addition, a motion-flow network models fine-grained respiratory motion to track local anatomical deformations. Extensive experiments on synthetic and real-world datasets demonstrate that TG-Field consistently outperforms existing methods, achieving state-of-the-art reconstruction accuracy under highly sparse-view conditions.

Preview loads on expand
Variation-aware Flexible 3D Gaussian Editing Hao Qin, Yukai Sun, Meng Wang, Ming Kong, Mengxu Lu, Qiang Zhu Updated 2026-02-12

Indirect editing methods for 3D Gaussian Splatting (3DGS) have recently witnessed significant advancements. These approaches operate by first applying edits in the rendered 2D space and subsequently projecting the modifications back into 3D. However, this paradigm inevitably introduces cross-view inconsistencies and constrains both the flexibility and efficiency of the editing process. To address these challenges, we present VF-Editor, which enables native editing of Gaussian primitives by predicting attribute variations in a feedforward manner. To accurately and efficiently estimate these variations, we design a novel variation predictor distilled from 2D editing knowledge. The predictor encodes the input to generate a variation field and employs two learnable, parallel decoding functions to iteratively infer attribute changes for each 3D Gaussian. Thanks to its unified design, VF-Editor can seamlessly distill editing knowledge from diverse 2D editors and strategies into a single predictor, allowing for flexible and effective knowledge transfer into the 3D domain. Extensive experiments on both public and private datasets reveal the inherent limitations of indirect editing pipelines and validate the effectiveness and flexibility of our approach.

Preview loads on expand
LeafFit: Plant Assets Creation from 3D Gaussian Splatting Chang Luo, Nobuyuki Umetani Updated 2026-02-12

We propose LeafFit, a pipeline that converts 3D Gaussian Splatting (3DGS) of individual plants into editable, instanced mesh assets. While 3DGS faithfully captures complex foliage, its high memory footprint and lack of mesh topology make it incompatible with traditional game production workflows. We address this by leveraging the repetition of leaf shapes; our method segments leaves from the unstructured 3DGS, with optional user interaction included as a fallback. A representative leaf group is selected and converted into a thin, sharp mesh to serve as a template; this template is then fitted to all other leaves via differentiable Moving Least Squares (MLS) deformation. At runtime, the deformation is evaluated efficiently on-the-fly using a vertex shader to minimize storage requirements. Experiments demonstrate that LeafFit achieves higher segmentation quality and deformation accuracy than recent baselines while significantly reducing data size and enabling parameter-level editing.

Preview loads on expand
ReaDy-Go: Real-to-Sim Dynamic 3D Gaussian Splatting Simulation for Environment-Specific Visual Navigation with Moving Obstacles Seungyeon Yoo, Youngseok Jang, Dabin Kim, Youngsoo Han, Seungwoo Jung, H. Jin Kim Updated 2026-02-12

Visual navigation models often struggle in real-world dynamic environments due to limited robustness to the sim-to-real gap and the difficulty of training policies tailored to target deployment environments (e.g., households, restaurants, and factories). Although real-to-sim navigation simulation using 3D Gaussian Splatting (GS) can mitigate this gap, prior works have assumed only static scenes or unrealistic dynamic obstacles, despite the importance of safe navigation in dynamic environments. To address these issues, we propose ReaDy-Go, a novel real-to-sim simulation pipeline that synthesizes photorealistic dynamic scenarios for target environments. ReaDy-Go generates photorealistic navigation datasets for dynamic environments by combining a reconstructed static GS scene with dynamic human GS obstacles, and trains policies robust to both the sim-to-real gap and moving obstacles. The pipeline consists of three components: (1) a dynamic GS simulator that integrates scene GS with a human animation module, enabling the insertion of animatable human GS avatars and the synthesis of plausible human motions from 2D trajectories, (2) navigation dataset generation for dynamic environments that leverages the simulator, a robot expert planner designed for dynamic GS representations, and a human planner, and (3) policy learning using the generated datasets. ReaDy-Go outperforms baselines across target environments in both simulation and real-world experiments, demonstrating improved navigation performance even after sim-to-real transfer and in the presence of moving obstacles. Moreover, zero-shot sim-to-real deployment in an unseen environment indicates its generalization potential. Project page: https://syeon-yoo.github.io/ready-go-site/.

Preview loads on expand
ERGO: Excess-Risk-Guided Optimization for High-Fidelity Monocular 3D Gaussian Splatting Zehua Ma, Hanhui Li, Zhenyu Xie, Xiaonan Luo, Michael Kampffmeyer, Feng Gao, Xiaodan Liang Updated 2026-02-10

Generating 3D content from a single image remains a fundamentally challenging and ill-posed problem due to the inherent absence of geometric and textural information in occluded regions. While state-of-the-art generative models can synthesize auxiliary views to provide additional supervision, these views inevitably contain geometric inconsistencies and textural misalignments that propagate and amplify artifacts during 3D reconstruction. To effectively harness these imperfect supervisory signals, we propose an adaptive optimization framework guided by excess risk decomposition, termed ERGO. Specifically, ERGO decomposes the optimization losses in 3D Gaussian splatting into two components, i.e., excess risk that quantifies the suboptimality gap between current and optimal parameters, and Bayes error that models the irreducible noise inherent in synthesized views. This decomposition enables ERGO to dynamically estimate the view-specific excess risk and adaptively adjust loss weights during optimization. Furthermore, we introduce geometry-aware and texture-aware objectives that complement the excess-risk-derived weighting mechanism, establishing a synergistic global-local optimization paradigm. Consequently, ERGO demonstrates robustness against supervision noise while consistently enhancing both geometric fidelity and textural quality of the reconstructed 3D content. Extensive experiments on the Google Scanned Objects dataset and the OmniObject3D dataset demonstrate the superiority of ERGO over existing state-of-the-art methods.

Preview loads on expand
XSPLAIN: XAI-enabling Splat-based Prototype Learning for Attribute-aware INterpretability Dominik Galus, Julia Farganus, Tymoteusz Zapala, Mikołaj Czachorowski, Piotr Borycki, Przemysław Spurek, Piotr Syga Updated 2026-02-10

3D Gaussian Splatting (3DGS) has rapidly become a standard for high-fidelity 3D reconstruction, yet its adoption in multiple critical domains is hindered by the lack of interpretability of the generation models as well as classification of the Splats. While explainability methods exist for other 3D representations, like point clouds, they typically rely on ambiguous saliency maps that fail to capture the volumetric coherence of Gaussian primitives. We introduce XSPLAIN, the first ante-hoc, prototype-based interpretability framework designed specifically for 3DGS classification. Our approach leverages a voxel-aggregated PointNet backbone and a novel, invertible orthogonal transformation that disentangles feature channels for interpretability while strictly preserving the original decision boundaries. Explanations are grounded in representative training examples, enabling intuitive ``this looks like that'' reasoning without any degradation in classification performance. A rigorous user study (N=51) demonstrates a decisive preference for our approach: participants selected XSPLAIN explanations 48.4\% of the time as the best, significantly outperforming baselines $(p<0.001)$, showing that XSPLAIN provides transparency and user trust. The source code for this work is available at: https://github.com/Solvro/ml-splat-xai

Preview loads on expand
ArtisanGS: Interactive Tools for Gaussian Splat Selection with AI and Human in the Loop Clement Fuji Tsang, Anita Hu, Or Perel, Carsten Kolve, Maria Shugrina Updated 2026-02-10

Representation in the family of 3D Gaussian Splats (3DGS) are growing into a viable alternative to traditional graphics for an expanding number of application, including recent techniques that facilitate physics simulation and animation. However, extracting usable objects from in-the-wild captures remains challenging and controllable editing techniques for this representation are limited. Unlike the bulk of emerging techniques, focused on automatic solutions or high-level editing, we introduce an interactive suite of tools centered around versatile Gaussian Splat selection and segmentation. We propose a fast AI-driven method to propagate user-guided 2D selection masks to 3DGS selections. This technique allows for user intervention in the case of errors and is further coupled with flexible manual selection and segmentation tools. These allow a user to achieve virtually any binary segmentation of an unstructured 3DGS scene. We evaluate our toolset against the state-of-the-art for Gaussian Splat selection and demonstrate their utility for downstream applications by developing a user-guided local editing approach, leveraging a custom Video Diffusion Model. With flexible selection tools, users have direct control over the areas that the AI can modify. Our selection and editing tools can be used for any in-the-wild capture without additional optimization.

Preview loads on expand
Faster-GS: Analyzing and Improving Gaussian Splatting Optimization Florian Hahlbohm, Linus Franke, Martin Eisemann, Marcus Magnor Updated 2026-02-10

Recent advances in 3D Gaussian Splatting (3DGS) have focused on accelerating optimization while preserving reconstruction quality. However, many proposed methods entangle implementation-level improvements with fundamental algorithmic modifications or trade performance for fidelity, leading to a fragmented research landscape that complicates fair comparison. In this work, we consolidate and evaluate the most effective and broadly applicable strategies from prior 3DGS research and augment them with several novel optimizations. We further investigate underexplored aspects of the framework, including numerical stability, Gaussian truncation, and gradient approximation. The resulting system, Faster-GS, provides a rigorously optimized algorithm that we evaluate across a comprehensive suite of benchmarks. Our experiments demonstrate that Faster-GS achieves up to 5$\times$ faster training while maintaining visual quality, establishing a new cost-effective and resource efficient baseline for 3DGS optimization. Furthermore, we demonstrate that optimizations can be applied to 4D Gaussian reconstruction, leading to efficient non-rigid scene optimization.

Preview loads on expand
Grow with the Flow: 4D Reconstruction of Growing Plants with Gaussian Flow Fields Weihan Luo, Lily Goli, Sherwin Bahmani, Felix Taubner, Andrea Tagliasacchi, David B. Lindell Updated 2026-02-09

Modeling the time-varying 3D appearance of plants during their growth poses unique challenges: unlike many dynamic scenes, plants generate new geometry over time as they expand, branch, and differentiate. Recent motion modeling techniques are ill-suited to this problem setting. For example, deformation fields cannot introduce new geometry, and 4D Gaussian splatting constrains motion to a linear trajectory in space and time and cannot track the same set of Gaussians over time. Here, we introduce a 3D Gaussian flow field representation that models plant growth as a time-varying derivative over Gaussian parameters -- position, scale, orientation, color, and opacity -- enabling nonlinear and continuous-time growth dynamics. To initialize a sufficient set of Gaussian primitives, we reconstruct the mature plant and learn a process of reverse growth, effectively simulating the plant's developmental history in reverse. Our approach achieves superior image quality and geometric accuracy compared to prior methods on multi-view timelapse datasets of plant growth, providing a new approach for appearance modeling of growing 3D structures.

Preview loads on expand
Analysis of Converged 3D Gaussian Splatting Solutions: Density Effects and Prediction Limit Zhendong Wang, Cihan Ruan, Jingchuan Xiao, Chuqing Shi, Wei Jiang, Wei Wang, Wenjie Liu, Nam Ling Updated 2026-02-09

We investigate what structure emerges in 3D Gaussian Splatting (3DGS) solutions from standard multi-view optimization. We term these Rendering-Optimal References (RORs) and analyze their statistical properties, revealing stable patterns: mixture-structured scales and bimodal radiance across diverse scenes. To understand what determines these parameters, we apply learnability probes by training predictors to reconstruct RORs from point clouds without rendering supervision. Our analysis uncovers fundamental density-stratification. Dense regions exhibit geometry-correlated parameters amenable to render-free prediction, while sparse regions show systematic failure across architectures. We formalize this through variance decomposition, demonstrating that visibility heterogeneity creates covariance-dominated coupling between geometric and appearance parameters in sparse regions. This reveals the dual character of RORs: geometric primitives where point clouds suffice, and view synthesis primitives where multi-view constraints are essential. We provide density-aware strategies that improve training robustness and discuss architectural implications for systems that adaptively balance feed-forward prediction and rendering-based refinement.

Preview loads on expand
GaussianCaR: Gaussian Splatting for Efficient Camera-Radar Fusion Santiago Montiel-Marín, Miguel Antunes-García, Fabio Sánchez-García, Angel Llamazares, Holger Caesar, Luis M. Bergasa Updated 2026-02-09

Robust and accurate perception of dynamic objects and map elements is crucial for autonomous vehicles performing safe navigation in complex traffic scenarios. While vision-only methods have become the de facto standard due to their technical advances, they can benefit from effective and cost-efficient fusion with radar measurements. In this work, we advance fusion methods by repurposing Gaussian Splatting as an efficient universal view transformer that bridges the view disparity gap, mapping both image pixels and radar points into a common Bird's-Eye View (BEV) representation. Our main contribution is GaussianCaR, an end-to-end network for BEV segmentation that, unlike prior BEV fusion methods, leverages Gaussian Splatting to map raw sensor information into latent features for efficient camera-radar fusion. Our architecture combines multi-scale fusion with a transformer decoder to efficiently extract BEV features. Experimental results demonstrate that our approach achieves performance on par with, or even surpassing, the state of the art on BEV segmentation tasks (57.3%, 82.9%, and 50.1% IoU for vehicles, roads, and lane dividers) on the nuScenes dataset, while maintaining a 3.2x faster inference runtime. Code and project page are available online.

Preview loads on expand
Rotated Lights for Consistent and Efficient 2D Gaussians Inverse Rendering Geng Lin, Matthias Zwicker Updated 2026-02-09

Inverse rendering aims to decompose a scene into its geometry, material properties and light conditions under a certain rendering model. It has wide applications like view synthesis, relighting, and scene editing. In recent years, inverse rendering methods have been inspired by view synthesis approaches like neural radiance fields and Gaussian splatting, which are capable of efficiently decomposing a scene into its geometry and radiance. They then further estimate the material and lighting that lead to the observed scene radiance. However, the latter step is highly ambiguous and prior works suffer from inaccurate color and baked shadows in their albedo estimation albeit their regularization. To this end, we propose RotLight, a simple capturing setup, to address the ambiguity. Compared to a usual capture, RotLight only requires the object to be rotated several times during the process. We show that as few as two rotations is effective in reducing artifacts. To further improve 2DGS-based inverse rendering, we additionally introduce a proxy mesh that not only allows accurate incident light tracing, but also enables a residual constraint and improves global illumination handling. We demonstrate with both synthetic and real world datasets that our method achieves superior albedo estimation while keeping efficient computation.

Preview loads on expand
Informative Object-centric Next Best View for Object-aware 3D Gaussian Splatting in Cluttered Scenes Seunghoon Jeong, Eunho Lee, Jeongyun Kim, Ayoung Kim Updated 2026-02-09

In cluttered scenes with inevitable occlusions and incomplete observations, selecting informative viewpoints is essential for building a reliable representation. In this context, 3D Gaussian Splatting (3DGS) offers a distinct advantage, as it can explicitly guide the selection of subsequent viewpoints and then refine the representation with new observations. However, existing approaches rely solely on geometric cues, neglect manipulation-relevant semantics, and tend to prioritize exploitation over exploration. To tackle these limitations, we introduce an instance-aware Next Best View (NBV) policy that prioritizes underexplored regions by leveraging object features. Specifically, our object-aware 3DGS distills instancelevel information into one-hot object vectors, which are used to compute confidence-weighted information gain that guides the identification of regions associated with erroneous and uncertain Gaussians. Furthermore, our method can be easily adapted to an object-centric NBV, which focuses view selection on a target object, thereby improving reconstruction robustness to object placement. Experiments demonstrate that our NBV policy reduces depth error by up to 77.14% on the synthetic dataset and 34.10% on the real-world GraspNet dataset compared to baselines. Moreover, compared to targeting the entire scene, performing NBV on a specific object yields an additional reduction of 25.60% in depth error for that object. We further validate the effectiveness of our approach through real-world robotic manipulation tasks.

Preview loads on expand
Thermal odometry and dense mapping using learned ddometry and Gaussian splatting Tianhao Zhou, Yujia Chen, Zhihao Zhan, Yuhang Ming, Jianzhu Huai Updated 2026-02-07

Thermal infrared sensors, with wavelengths longer than smoke particles, can capture imagery independent of darkness, dust, and smoke. This robustness has made them increasingly valuable for motion estimation and environmental perception in robotics, particularly in adverse conditions. Existing thermal odometry and mapping approaches, however, are predominantly geometric and often fail across diverse datasets while lacking the ability to produce dense maps. Motivated by the efficiency and high-quality reconstruction ability of recent Gaussian Splatting (GS) techniques, we propose TOM-GS, a thermal odometry and mapping method that integrates learning-based odometry with GS-based dense mapping. TOM-GS is among the first GS-based SLAM systems tailored for thermal cameras, featuring dedicated thermal image enhancement and monocular depth integration. Extensive experiments on motion estimation and novel-view rendering demonstrate that TOM-GS outperforms existing learning-based methods, confirming the benefits of learning-based pipelines for robust thermal odometry and dense reconstruction.

Preview loads on expand
Zero-Shot UAV Navigation in Forests via Relightable 3D Gaussian Splatting Zinan Lv, Yeqian Qian, Chen Sang, Hao Liu, Danping Zou, Ming Yang Updated 2026-02-06

UAV navigation in unstructured outdoor environments using passive monocular vision is hindered by the substantial visual domain gap between simulation and reality. While 3D Gaussian Splatting enables photorealistic scene reconstruction from real-world data, existing methods inherently couple static lighting with geometry, severely limiting policy generalization to dynamic real-world illumination. In this paper, we propose a novel end-to-end reinforcement learning framework designed for effective zero-shot transfer to unstructured outdoors. Within a high-fidelity simulation grounded in real-world data, our policy is trained to map raw monocular RGB observations directly to continuous control commands. To overcome photometric limitations, we introduce Relightable 3D Gaussian Splatting, which decomposes scene components to enable explicit, physically grounded editing of environmental lighting within the neural representation. By augmenting training with diverse synthesized lighting conditions ranging from strong directional sunlight to diffuse overcast skies, we compel the policy to learn robust, illumination-invariant visual features. Extensive real-world experiments demonstrate that a lightweight quadrotor achieves robust, collision-free navigation in complex forest environments at speeds up to 10 m/s, exhibiting significant resilience to drastic lighting variations without fine-tuning.

Preview loads on expand
DynFOA: Generating First-Order Ambisonics with Conditional Diffusion for Dynamic and Acoustically Complex 360-Degree Videos Ziyu Luo, Lin Chen, Qiang Qu, Xiaoming Chen, Yiran Shen Updated 2026-02-06

Spatial audio is crucial for creating compelling immersive 360-degree video experiences. However, generating realistic spatial audio, such as first-order ambisonics (FOA), from 360-degree videos in complex acoustic scenes remains challenging. Existing methods often overlook the dynamic nature and acoustic complexity of 360-degree scenes, fail to fully account for dynamic sound sources, and neglect complex environmental effects such as occlusion, reflections, and reverberation, which are influenced by scene geometries and materials. We propose DynFOA, a framework based on dynamic acoustic perception and conditional diffusion, for generating high-fidelity FOA from 360-degree videos. DynFOA first performs visual processing via a video encoder, which detects and localizes multiple dynamic sound sources, estimates their depth and semantics, and reconstructs the scene geometry and materials using a 3D Gaussian Splatting. This reconstruction technique accurately models occlusion, reflections, and reverberation based on the geometries and materials of the reconstructed 3D scene and the listener's viewpoint. The audio encoder then captures the spatial motion and temporal 4D sound source trajectories to fine-tune the diffusion-based FOA generator. The fine-tuned FOA generator adjusts spatial cues in real time, ensuring consistent directional fidelity during listener head rotation and complex environmental changes. Extensive evaluations demonstrate that DynFOA consistently outperforms existing methods across metrics such as spatial accuracy, acoustic fidelity, and distribution matching, while also improving the user experience. Therefore, DynFOA provides a robust and scalable approach to rendering realistic dynamic spatial audio for VR and immersive media applications.

Preview loads on expand
GaussianPOP: Principled Simplification Framework for Compact 3D Gaussian Splatting via Error Quantification Soonbin Lee, Yeong-Gyu Kim, Simon Sasse, Tomas M. Borges, Yago Sanchez, Eun-Seok Ryu, Thomas Schierl, Cornelius Hellge Updated 2026-02-06

Existing 3D Gaussian Splatting simplification methods commonly use importance scores, such as blending weights or sensitivity, to identify redundant Gaussians. However, these scores are not driven by visual error metrics, often leading to suboptimal trade-offs between compactness and rendering fidelity. We present GaussianPOP, a principled simplification framework based on analytical Gaussian error quantification. Our key contribution is a novel error criterion, derived directly from the 3DGS rendering equation, that precisely measures each Gaussian's contribution to the rendered image. By introducing a highly efficient algorithm, our framework enables practical error calculation in a single forward pass. The framework is both accurate and flexible, supporting on-training pruning as well as post-training simplification via iterative error re-quantification for improved stability. Experimental results show that our method consistently outperforms existing state-of-the-art pruning methods across both application scenarios, achieving a superior trade-off between model compactness and high rendering quality.

Preview loads on expand
Uncertainty-Aware 4D Gaussian Splatting for Monocular Occluded Human Rendering Weiquan Wang, Feifei Shao, Lin Li, Zhen Wang, Jun Xiao, Long Chen Updated 2026-02-06

High-fidelity rendering of dynamic humans from monocular videos typically degrades catastrophically under occlusions. Existing solutions incorporate external priors-either hallucinating missing content via generative models, which induces severe temporal flickering, or imposing rigid geometric heuristics that fail to capture diverse appearances. To this end, we reformulate the task as a Maximum A Posteriori estimation problem under heteroscedastic observation noise. In this paper, we propose U-4DGS, a framework integrating a Probabilistic Deformation Network and a Double Rasterization pipeline. This architecture renders pixel-aligned uncertainty maps that act as an adaptive gradient modulator, automatically attenuating artifacts from unreliable observations. Furthermore, to prevent geometric drift in regions lacking reliable visual cues, we enforce Confidence-Aware Regularizations, which leverage the learned uncertainty to selectively propagate spatial-temporal validity. Extensive experiments on ZJU-MoCap and OcMotion demonstrate that U-4DGS achieves SOTA rendering fidelity and robustness.

Preview loads on expand
Nix and Fix: Targeting 1000x Compression of 3D Gaussian Splatting with Diffusion Models Cem Eteke, Enzo Tartaglione Updated 2026-02-04

3D Gaussian Splatting (3DGS) revolutionized novel view rendering. Instead of inferring from dense spatial points, as implicit representations do, 3DGS uses sparse Gaussians. This enables real-time performance but increases space requirements, hindering applications such as immersive communication. 3DGS compression emerged as a field aimed at alleviating this issue. While impressive progress has been made, at low rates, compression introduces artifacts that degrade visual quality significantly. We introduce NiFi, a method for extreme 3DGS compression through restoration via artifact-aware, diffusion-based one-step distillation. We show that our method achieves state-of-the-art perceptual quality at extremely low rates, down to 0.1 MB, and towards 1000x rate improvement over 3DGS at comparable perceptual performance. The code will be open-sourced upon acceptance.

Preview loads on expand
VecSet-Edit: Unleashing Pre-trained LRM for Mesh Editing from Single Image Teng-Fang Hsiao, Bo-Kai Ruan, Yu-Lun Liu, Hong-Han Shuai Updated 2026-02-04

3D editing has emerged as a critical research area to provide users with flexible control over 3D assets. While current editing approaches predominantly focus on 3D Gaussian Splatting or multi-view images, the direct editing of 3D meshes remains underexplored. Prior attempts, such as VoxHammer, rely on voxel-based representations that suffer from limited resolution and necessitate labor-intensive 3D mask. To address these limitations, we propose \textbf{VecSet-Edit}, the first pipeline that leverages the high-fidelity VecSet Large Reconstruction Model (LRM) as a backbone for mesh editing. Our approach is grounded on a analysis of the spatial properties in VecSet tokens, revealing that token subsets govern distinct geometric regions. Based on this insight, we introduce Mask-guided Token Seeding and Attention-aligned Token Gating strategies to precisely localize target regions using only 2D image conditions. Also, considering the difference between VecSet diffusion process versus voxel we design a Drift-aware Token Pruning to reject geometric outliers during the denoising process. Finally, our Detail-preserving Texture Baking module ensures that we not only preserve the geometric details of original mesh but also the textural information. More details can be found in our project page: https://github.com/BlueDyee/VecSet-Edit/tree/main

Preview loads on expand
Towards Next-Generation SLAM: A Survey on 3DGS-SLAM Focusing on Performance, Robustness, and Future Directions Li Wang, Ruixuan Gong, Yumo Han, Lei Yang, Lu Yang, Ying Li, Bin Xu, Huaping Liu, Rong Fu Updated 2026-02-04

Traditional Simultaneous Localization and Mapping (SLAM) systems often face limitations including coarse rendering quality, insufficient recovery of scene details, and poor robustness in dynamic environments. 3D Gaussian Splatting (3DGS), with its efficient explicit representation and high-quality rendering capabilities, offers a new reconstruction paradigm for SLAM. This survey comprehensively reviews key technical approaches for integrating 3DGS with SLAM. We analyze performance optimization of representative methods across four critical dimensions: rendering quality, tracking accuracy, reconstruction speed, and memory consumption, delving into their design principles and breakthroughs. Furthermore, we examine methods for enhancing the robustness of 3DGS-SLAM in complex environments such as motion blur and dynamic environments. Finally, we discuss future challenges and development trends in this area. This survey aims to provide a technical reference for researchers and foster the development of next-generation SLAM systems characterized by high fidelity, efficiency, and robustness.

Preview loads on expand
AnyStyle: Single-Pass Multimodal Stylization for 3D Gaussian Splatting Joanna Kaleta, Bartosz Świrta, Kacper Kania, Przemysław Spurek, Marek Kowalski Updated 2026-02-03

The growing demand for rapid and scalable 3D asset creation has driven interest in feed-forward 3D reconstruction methods, with 3D Gaussian Splatting (3DGS) emerging as an effective scene representation. While recent approaches have demonstrated pose-free reconstruction from unposed image collections, integrating stylization or appearance control into such pipelines remains underexplored. Existing attempts largely rely on image-based conditioning, which limits both controllability and flexibility. In this work, we introduce AnyStyle, a feed-forward 3D reconstruction and stylization framework that enables pose-free, zero-shot stylization through multimodal conditioning. Our method supports both textual and visual style inputs, allowing users to control the scene appearance using natural language descriptions or reference images. We propose a modular stylization architecture that requires only minimal architectural modifications and can be integrated into existing feed-forward 3D reconstruction backbones. Experiments demonstrate that AnyStyle improves style controllability over prior feed-forward stylization methods while preserving high-quality geometric reconstruction. A user study further confirms that AnyStyle achieves superior stylization quality compared to an existing state-of-the-art approach. Repository: https://github.com/joaxkal/AnyStyle.

Preview loads on expand
Constrained Dynamic Gaussian Splatting Zihan Zheng, Zhenglong Wu, Xuanxuan Wang, Houqiang Zhong, Xiaoyun Zhang, Qiang Hu, Guangtao Zhai, Wenjun Zhang Updated 2026-02-03

While Dynamic Gaussian Splatting enables high-fidelity 4D reconstruction, its deployment is severely hindered by a fundamental dilemma: unconstrained densification leads to excessive memory consumption incompatible with edge devices, whereas heuristic pruning fails to achieve optimal rendering quality under preset Gaussian budgets. In this work, we propose Constrained Dynamic Gaussian Splatting (CDGS), a novel framework that formulates dynamic scene reconstruction as a budget-constrained optimization problem to enforce a strict, user-defined Gaussian budget during training. Our key insight is to introduce a differentiable budget controller as the core optimization driver. Guided by a multi-modal unified importance score, this controller fuses geometric, motion, and perceptual cues for precise capacity regulation. To maximize the utility of this fixed budget, we further decouple the optimization of static and dynamic elements, employing an adaptive allocation mechanism that dynamically distributes capacity based on motion complexity. Furthermore, we implement a three-phase training strategy to seamlessly integrate these constraints, ensuring precise adherence to the target count. Coupled with a dual-mode hybrid compression scheme, CDGS not only strictly adheres to hardware constraints (error < 2%}) but also pushes the Pareto frontier of rate-distortion performance. Extensive experiments demonstrate that CDGS delivers optimal rendering quality under varying capacity limits, achieving over 3x compression compared to state-of-the-art methods.

Preview loads on expand
Pi-GS: Sparse-View Gaussian Splatting with Dense π^3 Initialization Manuel Hofer, Markus Steinberger, Thomas Köhler Updated 2026-02-03

Novel view synthesis has evolved rapidly, advancing from Neural Radiance Fields to 3D Gaussian Splatting (3DGS), which offers real-time rendering and rapid training without compromising visual fidelity. However, 3DGS relies heavily on accurate camera poses and high-quality point cloud initialization, which are difficult to obtain in sparse-view scenarios. While traditional Structure from Motion (SfM) pipelines often fail in these settings, existing learning-based point estimation alternatives typically require reliable reference views and remain sensitive to pose or depth errors. In this work, we propose a robust method utilizing π^3, a reference-free point cloud estimation network. We integrate dense initialization from π^3 with a regularization scheme designed to mitigate geometric inaccuracies. Specifically, we employ uncertainty-guided depth supervision, normal consistency loss, and depth warping. Experimental results demonstrate that our approach achieves state-of-the-art performance on the Tanks and Temples, LLFF, DTU, and MipNeRF360 datasets.

Preview loads on expand
WebSplatter: Enabling Cross-Device Efficient Gaussian Splatting in Web Browsers via WebGPU Yudong Han, Chao Xu, Xiaodan Ye, Weichen Bi, Zilong Dong, Yun Ma Updated 2026-02-03

We present WebSplatter, an end-to-end GPU rendering pipeline for the heterogeneous web ecosystem. Unlike naive ports, WebSplatter introduces a wait-free hierarchical radix sort that circumvents the lack of global atomics in WebGPU, ensuring deterministic execution across diverse hardware. Furthermore, we propose an opacity-aware geometry culling stage that dynamically prunes splats before rasterization, significantly reducing overdraw and peak memory footprint. Evaluation demonstrates that WebSplatter consistently achieves 1.2$\times$ to 4.5$\times$ speedups over state-of-the-art web viewers.

Preview loads on expand
SharpTimeGS: Sharp and Stable Dynamic Gaussian Splatting via Lifespan Modulation Zhanfeng Liao, Jiajun Zhang, Hanzhang Tu, Zhixi Wang, Yunqi Gao, Hongwen Zhang, Yebin Liu Updated 2026-02-03

Novel view synthesis of dynamic scenes is fundamental to achieving photorealistic 4D reconstruction and immersive visual experiences. Recent progress in Gaussian-based representations has significantly improved real-time rendering quality, yet existing methods still struggle to maintain a balance between long-term static and short-term dynamic regions in both representation and optimization. To address this, we present SharpTimeGS, a lifespan-aware 4D Gaussian framework that achieves temporally adaptive modeling of both static and dynamic regions under a unified representation. Specifically, we introduce a learnable lifespan parameter that reformulates temporal visibility from a Gaussian-shaped decay into a flat-top profile, allowing primitives to remain consistently active over their intended duration and avoiding redundant densification. In addition, the learned lifespan modulates each primitives' motion, reducing drift in long-lived static points while retaining unrestricted motion for short-lived dynamic ones. This effectively decouples motion magnitude from temporal duration, improving long-term stability without compromising dynamic fidelity. Moreover, we design a lifespan-velocity-aware densification strategy that mitigates optimization imbalance between static and dynamic regions by allocating more capacity to regions with pronounced motion while keeping static areas compact and stable. Extensive experiments on multiple benchmarks demonstrate that our method achieves state-of-the-art performance while supporting real-time rendering up to 4K resolution at 100 FPS on one RTX 4090.

Preview loads on expand
Intellectual Property Protection for 3D Gaussian Splatting Assets: A Survey Longjie Zhao, Ziming Hong, Jiaxin Huang, Runnan Chen, Mingming Gong, Tongliang Liu Updated 2026-02-02

3D Gaussian Splatting (3DGS) has become a mainstream representation for real-time 3D scene synthesis, enabling applications in virtual and augmented reality, robotics, and 3D content creation. Its rising commercial value and explicit parametric structure raise emerging intellectual property (IP) protection concerns, prompting a surge of research on 3DGS IP protection. However, current progress remains fragmented, lacking a unified view of the underlying mechanisms, protection paradigms, and robustness challenges. To address this gap, we present the first systematic survey on 3DGS IP protection and introduce a bottom-up framework that examines (i) underlying Gaussian-based perturbation mechanisms, (ii) passive and active protection paradigms, and (iii) robustness threats under emerging generative AI era, revealing gaps in technical foundations and robustness characterization and indicating opportunities for deeper investigation. Finally, we outline six research directions across robustness, efficiency, and protection paradigms, offering a roadmap toward reliable and trustworthy IP protection for 3DGS assets.

Preview loads on expand
SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation Mu Huang, Hui Wang, Kerui Ren, Linning Xu, Yunsong Zhou, Mulin Yu, Bo Dai, Jiangmiao Pang Updated 2026-02-02

Simulating deformable objects under rich interactions remains a fundamental challenge for real-to-sim robot manipulation, with dynamics jointly driven by environmental effects and robot actions. Existing simulators rely on predefined physics or data-driven dynamics without robot-conditioned control, limiting accuracy, stability, and generalization. This paper presents SoMA, a 3D Gaussian Splat simulator for soft-body manipulation. SoMA couples deformable dynamics, environmental forces, and robot joint actions in a unified latent neural space for end-to-end real-to-sim simulation. Modeling interactions over learned Gaussian splats enables controllable, stable long-horizon manipulation and generalization beyond observed trajectories without predefined physical models. SoMA improves resimulation accuracy and generalization on real-world robot manipulation by 20%, enabling stable simulation of complex tasks such as long-horizon cloth folding.

Preview loads on expand
PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction Changjian Jiang, Kerui Ren, Xudong Li, Kaiwen Song, Linning Xu, Tao Lu, Junting Dong, Yu Zhang, Bo Dai, Mulin Yu Updated 2026-01-29

Streaming reconstruction from monocular image sequences remains challenging, as existing methods typically favor either high-quality rendering or accurate geometry, but rarely both. We present PLANING, an efficient on-the-fly reconstruction framework built on a hybrid representation that loosely couples explicit geometric primitives with neural Gaussians, enabling geometry and appearance to be modeled in a decoupled manner. This decoupling supports an online initialization and optimization strategy that separates geometry and appearance updates, yielding stable streaming reconstruction with substantially reduced structural redundancy. PLANING improves dense mesh Chamfer-L2 by 18.52% over PGSR, surpasses ARTDECO by 1.31 dB PSNR, and reconstructs ScanNetV2 scenes in under 100 seconds, over 5x faster than 2D Gaussian Splatting, while matching the quality of offline per-scene optimization. Beyond reconstruction quality, the structural clarity and computational efficiency of \modelname~make it well suited for a broad range of downstream applications, such as enabling large-scale scene modeling and simulation-ready environments for embodied AI. Project page: https://city-super.github.io/PLANING/ .

Preview loads on expand
Hybrid Foveated Path Tracing with Peripheral Gaussians for Immersive Anatomy Constantin Kleinbeck, Luisa Theelke, Hannah Schieber, Ulrich Eck, Rüdiger von Eisenhart-Rothe, Daniel Roth Updated 2026-01-29

Volumetric medical imaging offers great potential for understanding complex pathologies. Yet, traditional 2D slices provide little support for interpreting spatial relationships, forcing users to mentally reconstruct anatomy into three dimensions. Direct volumetric path tracing and VR rendering can improve perception but are computationally expensive, while precomputed representations, like Gaussian Splatting, require planning ahead. Both approaches limit interactive use. We propose a hybrid rendering approach for high-quality, interactive, and immersive anatomical visualization. Our method combines streamed foveated path tracing with a lightweight Gaussian Splatting approximation of the periphery. The peripheral model generation is optimized with volume data and continuously refined using foveal renderings, enabling interactive updates. Depth-guided reprojection further improves robustness to latency and allows users to balance fidelity with refresh rate. We compare our method against direct path tracing and Gaussian Splatting. Our results highlight how their combination can preserve strengths in visual quality while re-generating the peripheral model in under a second, eliminating extensive preprocessing and approximations. This opens new options for interactive medical visualization.

Preview loads on expand
FreeFix: Boosting 3D Gaussian Splatting via Fine-Tuning-Free Diffusion Models Hongyu Zhou, Zisen Shao, Sheng Miao, Pan Wang, Dongfeng Bai, Bingbing Liu, Yiyi Liao Updated 2026-01-28

Neural Radiance Fields and 3D Gaussian Splatting have advanced novel view synthesis, yet still rely on dense inputs and often degrade at extrapolated views. Recent approaches leverage generative models, such as diffusion models, to provide additional supervision, but face a trade-off between generalization and fidelity: fine-tuning diffusion models for artifact removal improves fidelity but risks overfitting, while fine-tuning-free methods preserve generalization but often yield lower fidelity. We introduce FreeFix, a fine-tuning-free approach that pushes the boundary of this trade-off by enhancing extrapolated rendering with pretrained image diffusion models. We present an interleaved 2D-3D refinement strategy, showing that image diffusion models can be leveraged for consistent refinement without relying on costly video diffusion models. Furthermore, we take a closer look at the guidance signal for 2D refinement and propose a per-pixel confidence mask to identify uncertain regions for targeted improvement. Experiments across multiple datasets show that FreeFix improves multi-frame consistency and achieves performance comparable to or surpassing fine-tuning-based methods, while retaining strong generalization ability.

Preview loads on expand
GRTX: Efficient Ray Tracing for 3D Gaussian-Based Rendering Junseo Lee, Sangyun Jeon, Jungi Lee, Junyong Park, Jaewoong Sim Updated 2026-01-28

3D Gaussian Splatting has gained widespread adoption across diverse applications due to its exceptional rendering performance and visual quality. While most existing methods rely on rasterization to render Gaussians, recent research has started investigating ray tracing approaches to overcome the fundamental limitations inherent in rasterization. However, current Gaussian ray tracing methods suffer from inefficiencies such as bloated acceleration structures and redundant node traversals, which greatly degrade ray tracing performance. In this work, we present GRTX, a set of software and hardware optimizations that enable efficient ray tracing for 3D Gaussian-based rendering. First, we introduce a novel approach for constructing streamlined acceleration structures for Gaussian primitives. Our key insight is that anisotropic Gaussians can be treated as unit spheres through ray space transformations, which substantially reduces BVH size and traversal overhead. Second, we propose dedicated hardware support for traversal checkpointing within ray tracing units. This eliminates redundant node visits during multi-round tracing by resuming traversal from checkpointed nodes rather than restarting from the root node in each subsequent round. Our evaluation shows that GRTX significantly improves ray tracing performance compared to the baseline ray tracing method with a negligible hardware cost.

Preview loads on expand
GVGS: Gaussian Visibility-Aware Multi-View Geometry for Accurate Surface Reconstruction Mai Su, Qihan Yu, Zhongtao Wang, Yilong Li, Chengwei Pan, Yisong Chen, Guoping Wang Updated 2026-01-28

3D Gaussian Splatting enables efficient optimization and high-quality rendering, yet accurate surface reconstruction remains challenging. Prior methods improve surface reconstruction by refining Gaussian depth estimates, either via multi-view geometric consistency or through monocular depth priors. However, multi-view constraints become unreliable under large geometric discrepancies, while monocular priors suffer from scale ambiguity and local inconsistency, ultimately leading to inaccurate Gaussian depth supervision. To address these limitations, we introduce a Gaussian visibility-aware multi-view geometric consistency constraint that aggregates the visibility of shared Gaussian primitives across views, enabling more accurate and stable geometric supervision. In addition, we propose a progressive quadtree-calibrated Monocular depth constraint that performs block-wise affine calibration from coarse to fine spatial scales, mitigating the scale ambiguity of depth priors while preserving fine-grained surface details. Extensive experiments on DTU and TNT datasets demonstrate consistent improvements in geometric accuracy over prior Gaussian-based and implicit surface reconstruction methods. Codes are available at an anonymous repository: https://github.com/GVGScode/GVGS.

Preview loads on expand
Fast Converging 3D Gaussian Splatting for 1-Minute Reconstruction Ziyu Zhang, Tianle Liu, Diantao Tu, Shuhan Shen Updated 2026-01-28

We present a fast 3DGS reconstruction pipeline designed to converge within one minute, developed for the SIGGRAPH Asia 3DGS Fast Reconstruction Challenge. The challenge consists of an initial round using SLAM-generated camera poses (with noisy trajectories) and a final round using COLMAP poses (highly accurate). To robustly handle these heterogeneous settings, we develop a two-stage solution. In the first round, we use reverse per-Gaussian parallel optimization and compact forward splatting based on Taming-GS and Speedy-splat, load-balanced tiling, an anchor-based Neural-Gaussian representation enabling rapid convergence with fewer learnable parameters, initialization from monocular depth and partially from feed-forward 3DGS models, and a global pose refinement module for noisy SLAM trajectories. In the final round, the accurate COLMAP poses change the optimization landscape; we disable pose refinement, revert from Neural-Gaussians back to standard 3DGS to eliminate MLP inference overhead, introduce multi-view consistency-guided Gaussian splitting inspired by Fast-GS, and introduce a depth estimator to supervise the rendered depth. Together, these techniques enable high-fidelity reconstruction under a strict one-minute budget. Our method achieved the top performance with a PSNR of 28.43 and ranked first in the competition.

Preview loads on expand
Graphical X Splatting (GraphiXS): A Graphical Model for 4D Gaussian Splatting under Uncertainty Doga Yilmaz, Jialin Zhu, Deshan Gong, He Wang Updated 2026-01-27

We propose a new framework to systematically incorporate data uncertainty in Gaussian Splatting. Being the new paradigm of neural rendering, Gaussian Splatting has been investigated in many applications, with the main effort in extending its representation, improving its optimization process, and accelerating its speed. However, one orthogonal, much needed, but under-explored area is data uncertainty. In standard 4D Gaussian Splatting, data uncertainty can manifest as view sparsity, missing frames, camera asynchronization, etc. So far, there has been little research to holistically incorporating various types of data uncertainty under a single framework. To this end, we propose Graphical X Splatting, or GraphiXS, a new probabilistic framework that considers multiple types of data uncertainty, aiming for a fundamental augmentation of the current 4D Gaussian Splatting paradigm into a probabilistic setting. GraphiXS is general and can be instantiated with a range of primitives, e.g. Gaussians, Student's-t. Furthermore, GraphiXS can be used to `upgrade' existing methods to accommodate data uncertainty. Through exhaustive evaluation and comparison, we demonstrate that GraphiXS can systematically model various uncertainties in data, outperform existing methods in many settings where data are missing or polluted in space and time, and therefore is a major generalization of the current 4D Gaussian Splatting research.

Preview loads on expand
WaterClear-GS: Optical-Aware Gaussian Splatting for Underwater Reconstruction and Restoration Xinrui Zhang, Yufeng Wang, Shuangkang Fang, Zesheng Wang, Dacheng Qi, Wenrui Ding Updated 2026-01-27

Underwater 3D reconstruction and appearance restoration are hindered by the complex optical properties of water, such as wavelength-dependent attenuation and scattering. Existing Neural Radiance Fields (NeRF)-based methods struggle with slow rendering speeds and suboptimal color restoration, while 3D Gaussian Splatting (3DGS) inherently lacks the capability to model complex volumetric scattering effects. To address these issues, we introduce WaterClear-GS, the first pure 3DGS-based framework that explicitly integrates underwater optical properties of local attenuation and scattering into Gaussian primitives, eliminating the need for an auxiliary medium network. Our method employs a dual-branch optimization strategy to ensure underwater photometric consistency while naturally recovering water-free appearances. This strategy is enhanced by depth-guided geometry regularization and perception-driven image loss, together with exposure constraints, spatially-adaptive regularization, and physically guided spectral regularization, which collectively enforce local 3D coherence and maintain natural visual perception. Experiments on standard benchmarks and our newly collected dataset demonstrate that WaterClear-GS achieves outstanding performance on both novel view synthesis (NVS) and underwater image restoration (UIR) tasks, while maintaining real-time rendering. The code will be available at https://buaaxrzhang.github.io/WaterClear-GS/.

Preview loads on expand
ClipGS-VR: Immersive and Interactive Cinematic Visualization of Volumetric Medical Data in Mobile Virtual Reality Yuqi Tong, Ruiyang Li, Chengkun Li, Qixuan Liu, Shi Qiu, Pheng-Ann Heng Updated 2026-01-27

High-fidelity cinematic medical visualization on mobile virtual reality (VR) remains challenging. Although ClipGS enables cross-sectional exploration via 3D Gaussian Splatting, it lacks arbitrary-angle slicing on consumer-grade VR headsets. To achieve real-time interactive performance, we introduce ClipGS-VR and restructure ClipGS's neural inference into a consolidated dataset, integrating high-fidelity layers from multiple pre-computed slicing states into a unified rendering structure. Our framework further supports arbitrary-angle slicing via gradient-based opacity modulation for smooth, visually coherent rendering. Evaluations confirm our approach maintains visual fidelity comparable to offline results while offering superior usability and interaction efficiency.

Preview loads on expand
TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment Jiarun Liu, Qifeng Chen, Yiru Zhao, Minghua Liu, Baorui Ma, Sheng Yang Updated 2026-01-27

While visual-language models have profoundly linked features between texts and images, the incorporation of 3D modality data, such as point clouds and 3D Gaussians, further enables pretraining for 3D-related tasks, e.g., cross-modal retrieval, zero-shot classification, and scene recognition. As challenges remain in extracting 3D modal features and bridging the gap between different modalities, we propose TIGaussian, a framework that harnesses 3D Gaussian Splatting (3DGS) characteristics to strengthen cross-modality alignment through multi-branch 3DGS tokenizer and modality-specific 3D feature alignment strategies. Specifically, our multi-branch 3DGS tokenizer decouples the intrinsic properties of 3DGS structures into compact latent representations, enabling more generalizable feature extraction. To further bridge the modality gap, we develop a bidirectional cross-modal alignment strategies: a multi-view feature fusion mechanism that leverages diffusion priors to resolve perspective ambiguity in image-3D alignment, while a text-3D projection module adaptively maps 3D features to text embedding space for better text-3D alignment. Extensive experiments on various datasets demonstrate the state-of-the-art performance of TIGaussian in multiple tasks.

Preview loads on expand
EVolSplat4D: Efficient Volume-based Gaussian Splatting for 4D Urban Scene Synthesis Sheng Miao, Sijin Li, Pan Wang, Dongfeng Bai, Bingbing Liu, Yue Wang, Andreas Geiger, Yiyi Liao Updated 2026-01-22

Novel view synthesis (NVS) of static and dynamic urban scenes is essential for autonomous driving simulation, yet existing methods often struggle to balance reconstruction time with quality. While state-of-the-art neural radiance fields and 3D Gaussian Splatting approaches achieve photorealism, they often rely on time-consuming per-scene optimization. Conversely, emerging feed-forward methods frequently adopt per-pixel Gaussian representations, which lead to 3D inconsistencies when aggregating multi-view predictions in complex, dynamic environments. We propose EvolSplat4D, a feed-forward framework that moves beyond existing per-pixel paradigms by unifying volume-based and pixel-based Gaussian prediction across three specialized branches. For close-range static regions, we predict consistent geometry of 3D Gaussians over multiple frames directly from a 3D feature volume, complemented by a semantically-enhanced image-based rendering module for predicting their appearance. For dynamic actors, we utilize object-centric canonical spaces and a motion-adjusted rendering module to aggregate temporal features, ensuring stable 4D reconstruction despite noisy motion priors. Far-Field scenery is handled by an efficient per-pixel Gaussian branch to ensure full-scene coverage. Experimental results on the KITTI-360, KITTI, Waymo, and PandaSet datasets show that EvolSplat4D reconstructs both static and dynamic environments with superior accuracy and consistency, outperforming both per-scene optimization and state-of-the-art feed-forward baselines.

Preview loads on expand
ThermoSplat: Cross-Modal 3D Gaussian Splatting with Feature Modulation and Geometry Decoupling Zhaoqi Su, Shihai Chen, Xinyan Lin, Liqin Huang, Zhipeng Su, Xiaoqiang Lu Updated 2026-01-22

Multi-modal scene reconstruction integrating RGB and thermal infrared data is essential for robust environmental perception across diverse lighting and weather conditions. However, extending 3D Gaussian Splatting (3DGS) to multi-spectral scenarios remains challenging. Current approaches often struggle to fully leverage the complementary information of multi-modal data, typically relying on mechanisms that either tend to neglect cross-modal correlations or leverage shared representations that fail to adaptively handle the complex structural correlations and physical discrepancies between spectrums. To address these limitations, we propose ThermoSplat, a novel framework that enables deep spectral-aware reconstruction through active feature modulation and adaptive geometry decoupling. First, we introduce a Cross-Modal FiLM Modulation mechanism that dynamically conditions shared latent features on thermal structural priors, effectively guiding visible texture synthesis with reliable cross-modal geometric cues. Second, to accommodate modality-specific geometric inconsistencies, we propose a Modality-Adaptive Geometric Decoupling scheme that learns independent opacity offsets and executes an independent rasterization pass for the thermal branch. Additionally, a hybrid rendering pipeline is employed to integrate explicit Spherical Harmonics with implicit neural decoding, ensuring both semantic consistency and high-frequency detail preservation. Extensive experiments on the RGBT-Scenes dataset demonstrate that ThermoSplat achieves state-of-the-art rendering quality across both visible and thermal spectrums.

Preview loads on expand
LL-GaussianImage: Efficient Image Representation for Zero-shot Low-Light Enhancement with 2D Gaussian Splatting Yuhan Chen, Wenxuan Yu, Guofa Li, Yijun Xu, Ying Fang, Yicui Shi, Long Cao, Wenbo Chu, Keqiang Li Updated 2026-01-22

2D Gaussian Splatting (2DGS) is an emerging explicit scene representation method with significant potential for image compression due to high fidelity and high compression ratios. However, existing low-light enhancement algorithms operate predominantly within the pixel domain. Processing 2DGS-compressed images necessitates a cumbersome decompression-enhancement-recompression pipeline, which compromises efficiency and introduces secondary degradation. To address these limitations, we propose LL-GaussianImage, the first zero-shot unsupervised framework designed for low-light enhancement directly within the 2DGS compressed representation domain. Three primary advantages are offered by this framework. First, a semantic-guided Mixture-of-Experts enhancement framework is designed. Dynamic adaptive transformations are applied to the sparse attribute space of 2DGS using rendered images as guidance to enable compression-as-enhancement without full decompression to a pixel grid. Second, a multi-objective collaborative loss function system is established to strictly constrain smoothness and fidelity during enhancement, suppressing artifacts while improving visual quality. Third, a two-stage optimization process is utilized to achieve reconstruction-as-enhancement. The accuracy of the base representation is ensured through single-scale reconstruction and network robustness is enhanced. High-quality enhancement of low-light images is achieved while high compression ratios are maintained. The feasibility and superiority of the paradigm for direct processing within the compressed representation domain are validated through experimental results.

Preview loads on expand
LL-GaussianMap: Zero-shot Low-Light Image Enhancement via 2D Gaussian Splatting Guided Gain Maps Yuhan Chen, Ying Fang, Guofa Li, Wenxuan Yu, Yicui Shi, Jingrui Zhang, Kefei Qian, Wenbo Chu, Keqiang Li Updated 2026-01-22

Significant progress has been made in low-light image enhancement with respect to visual quality. However, most existing methods primarily operate in the pixel domain or rely on implicit feature representations. As a result, the intrinsic geometric structural priors of images are often neglected. 2D Gaussian Splatting (2DGS) has emerged as a prominent explicit scene representation technique characterized by superior structural fitting capabilities and high rendering efficiency. Despite these advantages, the utilization of 2DGS in low-level vision tasks remains unexplored. To bridge this gap, LL-GaussianMap is proposed as the first unsupervised framework incorporating 2DGS into low-light image enhancement. Distinct from conventional methodologies, the enhancement task is formulated as a gain map generation process guided by 2DGS primitives. The proposed method comprises two primary stages. First, high-fidelity structural reconstruction is executed utilizing 2DGS. Then, data-driven enhancement dictionary coefficients are rendered via the rasterization mechanism of Gaussian splatting through an innovative unified enhancement module. This design effectively incorporates the structural perception capabilities of 2DGS into gain map generation, thereby preserving edges and suppressing artifacts during enhancement. Additionally, the reliance on paired data is circumvented through unsupervised learning. Experimental results demonstrate that LL-GaussianMap achieves superior enhancement performance with an extremely low storage footprint, highlighting the effectiveness of explicit Gaussian representations for image enhancement.

Preview loads on expand
Structured Image-based Coding for Efficient Gaussian Splatting Compression Pedro Martin, Antonio Rodrigues, Joao Ascenso, Maria Paula Queluz Updated 2026-01-22

Gaussian Splatting (GS) has recently emerged as a state-of-the-art representation for radiance fields, combining real-time rendering with high visual fidelity. However, GS models require storing millions of parameters, leading to large file sizes that impair their use in practical multimedia systems. To address this limitation, this paper introduces GS Image-based Compression (GSICO), a novel GS codec that efficiently compresses pre-trained GS models while preserving perceptual fidelity. The core contribution lies in a mapping procedure that arranges GS parameters into structured images, guided by a novel algorithm that enhances spatial coherence. These GS parameter images are then encoded using a conventional image codec. Experimental evaluations on Tanks and Temples, Deep Blending, and Mip-NeRF360 datasets show that GSICO achieves average compression factors of 20.2x with minimal loss in visual quality, as measured by PSNR, SSIM, and LPIPS. Compared with state-of-the-art GS compression methods, the proposed codec consistently yields superior rate-distortion (RD) trade-offs.

Preview loads on expand
SplatBus: A Gaussian Splatting Viewer Framework via GPU Interprocess Communication Yinghan Xu, Théo Morales, John Dingliana Updated 2026-01-21

Radiance field-based rendering methods have attracted significant interest from the computer vision and computer graphics communities. They enable high-fidelity rendering with complex real-world lighting effects, but at the cost of high rendering time. 3D Gaussian Splatting solves this issue with a rasterisation-based approach for real-time rendering, enabling applications such as autonomous driving, robotics, virtual reality, and extended reality. However, current 3DGS implementations are difficult to integrate into traditional mesh-based rendering pipelines, which is a common use case for interactive applications and artistic exploration. To address this limitation, this software solution uses Nvidia's interprocess communication (IPC) APIs to easily integrate into implementations and allow the results to be viewed in external clients such as Unity, Blender, Unreal Engine, and OpenGL viewers. The code is available at https://github.com/RockyXu66/splatbus.

Preview loads on expand
LuxRemix: Lighting Decomposition and Remixing for Indoor Scenes Ruofan Liang, Norman Müller, Ethan Weber, Duncan Zauss, Nandita Vijaykumar, Peter Kontschieder, Christian Richardt Updated 2026-01-21

We present a novel approach for interactive light editing in indoor scenes from a single multi-view scene capture. Our method leverages a generative image-based light decomposition model that factorizes complex indoor scene illumination into its constituent light sources. This factorization enables independent manipulation of individual light sources, specifically allowing control over their state (on/off), chromaticity, and intensity. We further introduce multi-view lighting harmonization to ensure consistent propagation of the lighting decomposition across all scene views. This is integrated into a relightable 3D Gaussian splatting representation, providing real-time interactive control over the individual light sources. Our results demonstrate highly photorealistic lighting decomposition and relighting outcomes across diverse indoor scenes. We evaluate our method on both synthetic and real-world datasets and provide a quantitative and qualitative comparison to state-of-the-art techniques. For video results and interactive demos, see https://luxremix.github.io.

Preview loads on expand
POTR: Post-Training 3DGS Compression Bert Ramlot, Martijn Courteaux, Peter Lambert, Glenn Van Wallendael Updated 2026-01-21

3D Gaussian Splatting (3DGS) has recently emerged as a promising contender to Neural Radiance Fields (NeRF) in 3D scene reconstruction and real-time novel view synthesis. 3DGS outperforms NeRF in training and inference speed but has substantially higher storage requirements. To remedy this downside, we propose POTR, a post-training 3DGS codec built on two novel techniques. First, POTR introduces a novel pruning approach that uses a modified 3DGS rasterizer to efficiently calculate every splat's individual removal effect simultaneously. This technique results in 2-4x fewer splats than other post-training pruning techniques and as a result also significantly accelerates inference with experiments demonstrating 1.5-2x faster inference than other compressed models. Second, we propose a novel method to recompute lighting coefficients, significantly reducing their entropy without using any form of training. Our fast and highly parallel approach especially increases AC lighting coefficient sparsity, with experiments demonstrating increases from 70% to 97%, with minimal loss in quality. Finally, we extend POTR with a simple fine-tuning scheme to further enhance pruning, inference, and rate-distortion performance. Experiments demonstrate that POTR, even without fine-tuning, consistently outperforms all other post-training compression techniques in both rate-distortion performance and inference speed.

Preview loads on expand
Rig-Aware 3D Reconstruction of Vehicle Undercarriages using Gaussian Splatting Nitin Kulkarni, Akhil Devarashetti, Charlie Cluss, Livio Forte, Dan Buckmaster, Philip Schneider, Chunming Qiao, Alina Vereshchaka Updated 2026-01-20

Inspecting the undercarriage of used vehicles is a labor-intensive task that requires inspectors to crouch or crawl underneath each vehicle to thoroughly examine it. Additionally, online buyers rarely see undercarriage photos. We present an end-to-end pipeline that utilizes a three-camera rig to capture videos of the undercarriage as the vehicle drives over it, and produces an interactive 3D model of the undercarriage. The 3D model enables inspectors and customers to rotate, zoom, and slice through the undercarriage, allowing them to detect rust, leaks, or impact damage in seconds, thereby improving both workplace safety and buyer confidence. Our primary contribution is a rig-aware Structure-from-Motion (SfM) pipeline specifically designed to overcome the challenges of wide-angle lens distortion and low-parallax scenes. Our method overcomes the challenges of wide-angle lens distortion and low-parallax scenes by integrating precise camera calibration, synchronized video streams, and strong geometric priors from the camera rig. We use a constrained matching strategy with learned components, the DISK feature extractor, and the attention-based LightGlue matcher to generate high-quality sparse point clouds that are often unattainable with standard SfM pipelines. These point clouds seed the Gaussian splatting process to generate photorealistic undercarriage models that render in real-time. Our experiments and ablation studies demonstrate that our design choices are essential to achieve state-of-the-art quality.

Preview loads on expand
One-Shot Refiner: Boosting Feed-forward Novel View Synthesis via One-Step Diffusion Yitong Dong, Qi Zhang, Minchao Jiang, Zhiqiang Wu, Qingnan Fan, Ying Feng, Huaqi Zhang, Hujun Bao, Guofeng Zhang Updated 2026-01-20

We present a novel framework for high-fidelity novel view synthesis (NVS) from sparse images, addressing key limitations in recent feed-forward 3D Gaussian Splatting (3DGS) methods built on Vision Transformer (ViT) backbones. While ViT-based pipelines offer strong geometric priors, they are often constrained by low-resolution inputs due to computational costs. Moreover, existing generative enhancement methods tend to be 3D-agnostic, resulting in inconsistent structures across views, especially in unseen regions. To overcome these challenges, we design a Dual-Domain Detail Perception Module, which enables handling high-resolution images without being limited by the ViT backbone, and endows Gaussians with additional features to store high-frequency details. We develop a feature-guided diffusion network, which can preserve high-frequency details during the restoration process. We introduce a unified training strategy that enables joint optimization of the ViT-based geometric backbone and the diffusion-based refinement module. Experiments demonstrate that our method can maintain superior generation quality across multiple datasets.

Preview loads on expand
RSATalker: Realistic Socially-Aware Talking Head Generation for Multi-Turn Conversation Peng Chen, Xiaobao Wei, Yi Yang, Naiming Yao, Hui Chen, Feng Tian Updated 2026-01-15

Talking head generation is increasingly important in virtual reality (VR), especially for social scenarios involving multi-turn conversation. Existing approaches face notable limitations: mesh-based 3D methods can model dual-person dialogue but lack realistic textures, while large-model-based 2D methods produce natural appearances but incur prohibitive computational costs. Recently, 3D Gaussian Splatting (3DGS) based methods achieve efficient and realistic rendering but remain speaker-only and ignore social relationships. We introduce RSATalker, the first framework that leverages 3DGS for realistic and socially-aware talking head generation with support for multi-turn conversation. Our method first drives mesh-based 3D facial motion from speech, then binds 3D Gaussians to mesh facets to render high-fidelity 2D avatar videos. To capture interpersonal dynamics, we propose a socially-aware module that encodes social relationships, including blood and non-blood as well as equal and unequal, into high-level embeddings through a learnable query mechanism. We design a three-stage training paradigm and construct the RSATalker dataset with speech-mesh-image triplets annotated with social relationships. Extensive experiments demonstrate that RSATalker achieves state-of-the-art performance in both realism and social awareness. The code and dataset will be released.

Preview loads on expand
Thinking Like Van Gogh: Structure-Aware Style Transfer via Flow-Guided 3D Gaussian Splatting Zhendong Wang, Lebin Zhou, Jingchuan Xiao, Rongduo Han, Nam Ling, Cihan Ruan Updated 2026-01-15

In 1888, Vincent van Gogh wrote, "I am seeking exaggeration in the essential." This principle, amplifying structural form while suppressing photographic detail, lies at the core of Post-Impressionist art. However, most existing 3D style transfer methods invert this philosophy, treating geometry as a rigid substrate for surface-level texture projection. To authentically reproduce Post-Impressionist stylization, geometric abstraction must be embraced as the primary vehicle of expression. We propose a flow-guided geometric advection framework for 3D Gaussian Splatting (3DGS) that operationalizes this principle in a mesh-free setting. Our method extracts directional flow fields from 2D paintings and back-propagates them into 3D space, rectifying Gaussian primitives to form flow-aligned brushstrokes that conform to scene topology without relying on explicit mesh priors. This enables expressive structural deformation driven directly by painterly motion rather than photometric constraints. Our contributions are threefold: (1) a projection-based, mesh-free flow guidance mechanism that transfers 2D artistic motion into 3D Gaussian geometry; (2) a luminance-structure decoupling strategy that isolates geometric deformation from color optimization, mitigating artifacts during aggressive structural abstraction; and (3) a VLM-as-a-Judge evaluation framework that assesses artistic authenticity through aesthetic judgment instead of conventional pixel-level metrics, explicitly addressing the subjective nature of artistic stylization.

Preview loads on expand
Variable Basis Mapping for Real-Time Volumetric Visualization Qibiao Li, Yuxuan Wang, Youcheng Cai, Huangsheng Du, Ligang Liu Updated 2026-01-14

Real-time visualization of large-scale volumetric data remains challenging, as direct volume rendering and voxel-based methods suffer from prohibitively high computational cost. We propose Variable Basis Mapping (VBM), a framework that transforms volumetric fields into 3D Gaussian Splatting (3DGS) representations through wavelet-domain analysis. First, we precompute a compact Wavelet-to-Gaussian Transition Bank that provides optimal Gaussian surrogates for canonical wavelet atoms across multiple scales. Second, we perform analytical Gaussian construction that maps discrete wavelet coefficients directly to 3DGS parameters using a closed-form, mathematically principled rule. Finally, a lightweight image-space fine-tuning stage further refines the representation to improve rendering fidelity. Experiments on diverse datasets demonstrate that VBM significantly accelerates convergence and enhances rendering quality, enabling real-time volumetric visualization.

Preview loads on expand
TIDI-GS: Floater Suppression in 3D Gaussian Splatting for Enhanced Indoor Scene Fidelity Sooyeun Yang, Cheyul Im, Jee Won Lee, Jongseong Brad Choi Updated 2026-01-14

3D Gaussian Splatting (3DGS) is a technique to create high-quality, real-time 3D scenes from images. This method often produces visual artifacts known as floaters--nearly transparent, disconnected elements that drift in space away from the actual surface. This geometric inaccuracy undermines the reliability of these models for practical applications, which is critical. To address this issue, we introduce TIDI-GS, a new training framework designed to eliminate these floaters. A key benefit of our approach is that it functions as a lightweight plugin for the standard 3DGS pipeline, requiring no major architectural changes and adding minimal overhead to the training process. The core of our method is a floater pruning algorithm--TIDI--that identifies and removes floaters based on several criteria: their consistency across multiple viewpoints, their spatial relationship to other elements, and an importance score learned during training. The framework includes a mechanism to preserve fine details, ensuring that important high-frequency elements are not mistakenly removed. This targeted cleanup is supported by a monocular depth-based loss function that helps improve the overall geometric structure of the scene. Our experiments demonstrate that TIDI-GS improves both the perceptual quality and geometric integrity of reconstructions, transforming them into robust digital assets, suitable for high-fidelity applications.

Preview loads on expand
GaussianFluent: Gaussian Simulation for Dynamic Scenes with Mixed Materials Bei Huang, Yixin Chen, Ruijie Lu, Gang Zeng, Hongbin Zha, Yuru Pei, Siyuan Huang Updated 2026-01-14

3D Gaussian Splatting (3DGS) has emerged as a prominent 3D representation for high-fidelity and real-time rendering. Prior work has coupled physics simulation with Gaussians, but predominantly targets soft, deformable materials, leaving brittle fracture largely unresolved. This stems from two key obstacles: the lack of volumetric interiors with coherent textures in GS representation, and the absence of fracture-aware simulation methods for Gaussians. To address these challenges, we introduce GaussianFluent, a unified framework for realistic simulation and rendering of dynamic object states. First, it synthesizes photorealistic interiors by densifying internal Gaussians guided by generative models. Second, it integrates an optimized Continuum Damage Material Point Method (CD-MPM) to enable brittle fracture simulation at remarkably high speed. Our approach handles complex scenarios including mixed-material objects and multi-stage fracture propagation, achieving results infeasible with previous methods. Experiments clearly demonstrate GaussianFluent's capability for photo-realistic, real-time rendering with structurally consistent interiors, highlighting its potential for downstream application, such as VR and Robotics.

Preview loads on expand
A$^2$TG: Adaptive Anisotropic Textured Gaussians for Efficient 3D Scene Representation Sheng-Chi Hsu, Ting-Yu Yen, Shih-Hsuan Hung, Hung-Kuo Chu Updated 2026-01-14

Gaussian Splatting has emerged as a powerful representation for high-quality, real-time 3D scene rendering. While recent works extend Gaussians with learnable textures to enrich visual appearance, existing approaches allocate a fixed square texture per primitive, leading to inefficient memory usage and limited adaptability to scene variability. In this paper, we introduce adaptive anisotropic textured Gaussians (A$^2$TG), a novel representation that generalizes textured Gaussians by equipping each primitive with an anisotropic texture. Our method employs a gradient-guided adaptive rule to jointly determine texture resolution and aspect ratio, enabling non-uniform, detail-aware allocation that aligns with the anisotropic nature of Gaussian splats. This design significantly improves texture efficiency, reducing memory consumption while enhancing image quality. Experiments on multiple benchmark datasets demonstrate that A TG consistently outperforms fixed-texture Gaussian Splatting methods, achieving comparable rendering fidelity with substantially lower memory requirements.

Preview loads on expand
ViewMorpher3D: A 3D-aware Diffusion Framework for Multi-Camera Novel View Synthesis in Autonomous Driving Farhad G. Zanjani, Hong Cai, Amirhossein Habibian Updated 2026-01-13

Autonomous driving systems rely heavily on multi-view images to ensure accurate perception and robust decision-making. To effectively develop and evaluate perception stacks and planning algorithms, realistic closed-loop simulators are indispensable. While 3D reconstruction techniques such as Gaussian Splatting offer promising avenues for simulator construction, the rendered novel views often exhibit artifacts, particularly in extrapolated perspectives or when available observations are sparse. We introduce ViewMorpher3D, a multi-view image enhancement framework based on image diffusion models, designed to elevate photorealism and multi-view coherence in driving scenes. Unlike single-view approaches, ViewMorpher3D jointly processes a set of rendered views conditioned on camera poses, 3D geometric priors, and temporally adjacent or spatially overlapping reference views. This enables the model to infer missing details, suppress rendering artifacts, and enforce cross-view consistency. Our framework accommodates variable numbers of cameras and flexible reference/target view configurations, making it adaptable to diverse sensor setups. Experiments on real-world driving datasets demonstrate substantial improvements in image quality metrics, effectively reducing artifacts while preserving geometric fidelity.

Preview loads on expand
3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing Jiahua Dong, Yu-Xiong Wang Updated 2026-01-12

The transformative potential of 3D content creation has been progressively unlocked through advancements in generative models. Recently, intuitive drag editing with geometric changes has attracted significant attention in 2D editing yet remains challenging for 3D scenes. In this paper, we introduce 3DGS-Drag -- a point-based 3D editing framework that provides efficient, intuitive drag manipulation of real 3D scenes. Our approach bridges the gap between deformation-based and 2D-editing-based 3D editing methods, addressing their limitations to geometry-related content editing. We leverage two key innovations: deformation guidance utilizing 3D Gaussian Splatting for consistent geometric modifications and diffusion guidance for content correction and visual quality enhancement. A progressive editing strategy further supports aggressive 3D drag edits. Our method enables a wide range of edits, including motion change, shape adjustment, inpainting, and content extension. Experimental results demonstrate the effectiveness of 3DGS-Drag in various scenes, achieving state-of-the-art performance in geometry-related 3D content editing. Notably, the editing is efficient, taking 10 to 20 minutes on a single RTX 4090 GPU.

Preview loads on expand
Mon3tr: Monocular 3D Telepresence with Pre-built Gaussian Avatars as Amortization Fangyu Lin, Yingdong Hu, Zhening Liu, Yufan Zhuang, Zehong Lin, Jun Zhang Updated 2026-01-12

Immersive telepresence aims to transform human interaction in AR/VR applications by enabling lifelike full-body holographic representations for enhanced remote collaboration. However, existing systems rely on hardware-intensive multi-camera setups and demand high bandwidth for volumetric streaming, limiting their real-time performance on mobile devices. To overcome these challenges, we propose Mon3tr, a novel Monocular 3D telepresence framework that integrates 3D Gaussian splatting (3DGS) based parametric human modeling into telepresence for the first time. Mon3tr adopts an amortized computation strategy, dividing the process into a one-time offline multi-view reconstruction phase to build a user-specific avatar and a monocular online inference phase during live telepresence sessions. A single monocular RGB camera is used to capture body motions and facial expressions in real time to drive the 3DGS-based parametric human model, significantly reducing system complexity and cost. The extracted motion and appearance features are transmitted at < 0.2 Mbps over WebRTC's data channel, allowing robust adaptation to network fluctuations. On the receiver side, e.g., Meta Quest 3, we develop a lightweight 3DGS attribute deformation network to dynamically generate corrective 3DGS attribute adjustments on the pre-built avatar, synthesizing photorealistic motion and appearance at ~ 60 FPS. Extensive experiments demonstrate the state-of-the-art performance of our method, achieving a PSNR of > 28 dB for novel poses, an end-to-end latency of ~ 80 ms, and > 1000x bandwidth reduction compared to point-cloud streaming, while supporting real-time operation from monocular inputs across diverse scenarios. Our demos can be found at https://mon3tr3d.github.io.

Preview loads on expand
R3-RECON: Radiance-Field-Free Active Reconstruction via Renderability Xiaofeng Jin, Matteo Frosi, Yiran Guo, Matteo Matteucci Updated 2026-01-12

In active reconstruction, an embodied agent must decide where to look next to efficiently acquire views that support high-quality novel-view rendering. Recent work on active view planning for neural rendering largely derives next-best-view (NBV) criteria by backpropagating through radiance fields or estimating information entropy over 3D Gaussian primitives. While effective, these strategies tightly couple view selection to heavy, representation-specific mechanisms and fail to account for the computational and resource constraints required for lightweight online deployment. In this paper, we revisit active reconstruction from a renderability-centric perspective. We propose $\mathbb{R}^{3}$-RECON, a radiance-fields-free active reconstruction framework that induces an implicit, pose-conditioned renderability field over SE(3) from a lightweight voxel map. Our formulation aggregates per-voxel online observation statistics into a unified scalar renderability score that is cheap to update and can be queried in closed form at arbitrary candidate viewpoints in milliseconds, without requiring gradients or radiance-field training. This renderability field is strongly correlated with image-space reconstruction error, naturally guiding NBV selection. We further introduce a panoramic extension that estimates omnidirectional (360$^\circ$) view utility to accelerate candidate evaluation. In the standard indoor Replica dataset, $\mathbb{R}^{3}$-RECON achieves more uniform novel-view quality and higher 3D Gaussian splatting (3DGS) reconstruction accuracy than recent active GS baselines with matched view and time budgets.

Preview loads on expand
SARA: Scene-Aware Reconstruction Accelerator Jee Won Lee, Hansol Lim, Minhyeok Im, Dohyeon Lee, Jongseong Brad Choi Updated 2026-01-11

We present SARA (Scene-Aware Reconstruction Accelerator), a geometry-driven pair selection module for Structure-from-Motion (SfM). Unlike conventional pipelines that select pairs based on visual similarity alone, SARA introduces geometry-first pair selection by scoring reconstruction informativeness - the product of overlap and parallax - before expensive matching. A lightweight pre-matching stage uses mutual nearest neighbors and RANSAC to estimate these cues, then constructs an Information-Weighted Spanning Tree (IWST) augmented with targeted edges for loop closure, long-baseline anchors, and weak-view reinforcement. Compared to exhaustive matching, SARA reduces rotation errors by 46.5+-5.5% and translation errors by 12.5+-6.5% across modern learned detectors, while achieving at most 50x speedup through 98% pair reduction (from 30,848 to 580 pairs). This reduces matching complexity from quadratic to quasi-linear, maintaining within +-3% of baseline reconstruction metrics for 3D Gaussian Splatting and SVRaster.

Preview loads on expand
SRFlow: A Dataset and Regularization Model for High-Resolution Facial Optical Flow via Splatting Rasterization JiaLin Zhang, Dong Li Updated 2026-01-10

Facial optical flow supports a wide range of tasks in facial motion analysis. However, the lack of high-resolution facial optical flow datasets has hindered progress in this area. In this paper, we introduce Splatting Rasterization Flow (SRFlow), a high-resolution facial optical flow dataset, and Splatting Rasterization Guided FlowNet (SRFlowNet), a facial optical flow model with tailored regularization losses. These losses constrain flow predictions using masks and gradients computed via difference or Sobel operator. This effectively suppresses high-frequency noise and large-scale errors in texture-less or repetitive-pattern regions, enabling SRFlowNet to be the first model explicitly capable of capturing high-resolution skin motion guided by Gaussian splatting rasterization. Experiments show that training with the SRFlow dataset improves facial optical flow estimation across various optical flow models, reducing end-point error (EPE) by up to 42% (from 0.5081 to 0.2953). Furthermore, when coupled with the SRFlow dataset, SRFlowNet achieves up to a 48% improvement in F1-score (from 0.4733 to 0.6947) on a composite of three micro-expression datasets. These results demonstrate the value of advancing both facial optical flow estimation and micro-expression recognition.

Preview loads on expand
OceanSplat: Object-aware Gaussian Splatting with Trinocular View Consistency for Underwater Scene Reconstruction Minseong Kweon, Jinsun Park Updated 2026-01-08

We introduce OceanSplat, a novel 3D Gaussian Splatting-based approach for accurately representing 3D geometry in underwater scenes. To overcome multi-view inconsistencies caused by underwater optical degradation, our method enforces trinocular view consistency by rendering horizontally and vertically translated camera views relative to each input view and aligning them via inverse warping. Furthermore, these translated camera views are used to derive a synthetic epipolar depth prior through triangulation, which serves as a self-supervised depth regularizer. These geometric constraints facilitate the spatial optimization of 3D Gaussians and preserve scene structure in underwater environments. We also propose a depth-aware alpha adjustment that modulates the opacity of 3D Gaussians during early training based on their $z$-component and viewing direction, deterring the formation of medium-induced primitives. With our contributions, 3D Gaussians are disentangled from the scattering medium, enabling robust representation of object geometry and significantly reducing floating artifacts in reconstructed underwater scenes. Experiments on real-world underwater and simulated scenes demonstrate that OceanSplat substantially outperforms existing methods for both scene reconstruction and restoration in scattering media.

Preview loads on expand
ProFuse: Efficient Cross-View Context Fusion for Open-Vocabulary 3D Gaussian Splatting Yen-Jen Chiou, Wei-Tse Cheng, Yuan-Fu Yang Updated 2026-01-08

We present ProFuse, an efficient context-aware framework for open-vocabulary 3D scene understanding with 3D Gaussian Splatting (3DGS). The pipeline enhances cross-view consistency and intra-mask cohesion within a direct registration setup, adding minimal overhead and requiring no render-supervised fine-tuning. Instead of relying on a pretrained 3DGS scene, we introduce a dense correspondence-guided pre-registration phase that initializes Gaussians with accurate geometry while jointly constructing 3D Context Proposals via cross-view clustering. Each proposal carries a global feature obtained through weighted aggregation of member embeddings, and this feature is fused onto Gaussians during direct registration to maintain per-primitive language coherence across views. With associations established in advance, semantic fusion requires no additional optimization beyond standard reconstruction, and the model retains geometric refinement without densification. ProFuse achieves strong open-vocabulary 3DGS understanding while completing semantic attachment in about five minutes per scene, which is two times faster than SOTA.

Preview loads on expand
SCAR-GS: Spatial Context Attention for Residuals in Progressive Gaussian Splatting Diego Revilla, Pooja Suresh, Anand Bhojan, Ooi Wei Tsang Updated 2026-01-07

Recent advances in 3D Gaussian Splatting have allowed for real-time, high-fidelity novel view synthesis. Nonetheless, these models have significant storage requirements for large and medium-sized scenes, hindering their deployment over cloud and streaming services. Some of the most recent progressive compression techniques for these models rely on progressive masking and scalar quantization techniques to reduce the bitrate of Gaussian attributes using spatial context models. While effective, scalar quantization may not optimally capture the correlations of high-dimensional feature vectors, which can potentially limit the rate-distortion performance. In this work, we introduce a novel progressive codec for 3D Gaussian Splatting that replaces traditional methods with a more powerful Residual Vector Quantization approach to compress the primitive features. Our key contribution is an auto-regressive entropy model, guided by a multi-resolution hash grid, that accurately predicts the conditional probability of each successive transmitted index, allowing for coarse and refinement layers to be compressed with high efficiency.

Preview loads on expand
IDESplat: Iterative Depth Probability Estimation for Generalizable 3D Gaussian Splatting Wei Long, Haifeng Wu, Shiyin Jiang, Jinhua Zhang, Xinchun Ji, Shuhang Gu Updated 2026-01-07

Generalizable 3D Gaussian Splatting aims to directly predict Gaussian parameters using a feed-forward network for scene reconstruction. Among these parameters, Gaussian means are particularly difficult to predict, so depth is usually estimated first and then unprojected to obtain the Gaussian sphere centers. Existing methods typically rely solely on a single warp to estimate depth probability, which hinders their ability to fully leverage cross-view geometric cues, resulting in unstable and coarse depth maps. To address this limitation, we propose IDESplat, which iteratively applies warp operations to boost depth probability estimation for accurate Gaussian mean prediction. First, to eliminate the inherent instability of a single warp, we introduce a Depth Probability Boosting Unit (DPBU) that integrates epipolar attention maps produced by cascading warp operations in a multiplicative manner. Next, we construct an iterative depth estimation process by stacking multiple DPBUs, progressively identifying potential depth candidates with high likelihood. As IDESplat iteratively boosts depth probability estimates and updates the depth candidates, the depth map is gradually refined, resulting in accurate Gaussian means. We conduct experiments on RealEstate10K, ACID, and DL3DV. IDESplat achieves outstanding reconstruction quality and state-of-the-art performance with real-time efficiency. On RE10K, it outperforms DepthSplat by 0.33 dB in PSNR, using only 10.7% of the parameters and 70% of the memory. Additionally, our IDESplat improves PSNR by 2.95 dB over DepthSplat on the DTU dataset in cross-dataset experiments, demonstrating its strong generalization ability.

Preview loads on expand
G2P: Gaussian-to-Point Attribute Alignment for Boundary-Aware 3D Semantic Segmentation Hojun Song, Chae-yeong Song, Jeong-hun Hong, Chaewon Moon, Dong-hwi Kim, Gahyeon Kim, Soo Ye Kim, Yiyi Liao, Jaehyup Lee, Sang-hyo Park Updated 2026-01-07

Semantic segmentation on point clouds is critical for 3D scene understanding. However, sparse and irregular point distributions provide limited appearance evidence, making geometry-only features insufficient to distinguish objects with similar shapes but distinct appearances (e.g., color, texture, material). We propose Gaussian-to-Point (G2P), which transfers appearance-aware attributes from 3D Gaussian Splatting to point clouds for more discriminative and appearance-consistent segmentation. Our G2P address the misalignment between optimized Gaussians and original point geometry by establishing point-wise correspondences. By leveraging Gaussian opacity attributes, we resolve the geometric ambiguity that limits existing models. Additionally, Gaussian scale attributes enable precise boundary localization in complex 3D scenes. Extensive experiments demonstrate that our approach achieves superior performance on standard benchmarks and shows significant improvements on geometrically challenging classes, all without any 2D or language supervision.

Preview loads on expand
RelightAnyone: A Generalized Relightable 3D Gaussian Head Model Yingyan Xu, Pramod Rao, Sebastian Weiss, Gaspard Zoss, Markus Gross, Christian Theobalt, Marc Habermann, Derek Bradley Updated 2026-01-06

3D Gaussian Splatting (3DGS) has become a standard approach to reconstruct and render photorealistic 3D head avatars. A major challenge is to relight the avatars to match any scene illumination. For high quality relighting, existing methods require subjects to be captured under complex time-multiplexed illumination, such as one-light-at-a-time (OLAT). We propose a new generalized relightable 3D Gaussian head model that can relight any subject observed in a single- or multi-view images without requiring OLAT data for that subject. Our core idea is to learn a mapping from flat-lit 3DGS avatars to corresponding relightable Gaussian parameters for that avatar. Our model consists of two stages: a first stage that models flat-lit 3DGS avatars without OLAT lighting, and a second stage that learns the mapping to physically-based reflectance parameters for high-quality relighting. This two-stage design allows us to train the first stage across diverse existing multi-view datasets without OLAT lighting ensuring cross-subject generalization, where we learn a dataset-specific lighting code for self-supervised lighting alignment. Subsequently, the second stage can be trained on a significantly smaller dataset of subjects captured under OLAT illumination. Together, this allows our method to generalize well and relight any subject from the first stage as if we had captured them under OLAT lighting. Furthermore, we can fit our model to unseen subjects from as little as a single image, allowing several applications in novel view synthesis and relighting for digital avatars.

Preview loads on expand
CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature Eldad Matmon, Amit Bracha, Noam Rotstein, Ron Kimmel Updated 2026-01-06

A photorealistic and controllable 3D caricaturization framework for faces is introduced. We start with an intrinsic Gaussian curvature-based surface exaggeration technique, which, when coupled with texture, tends to produce over-smoothed renders. To address this, we resort to 3D Gaussian Splatting (3DGS), which has recently been shown to produce realistic free-viewpoint avatars. Given a multiview sequence, we extract a FLAME mesh, solve a curvature-weighted Poisson equation, and obtain its exaggerated form. However, directly deforming the Gaussians yields poor results, necessitating the synthesis of pseudo-ground-truth caricature images by warping each frame to its exaggerated 2D representation using local affine transformations. We then devise a training scheme that alternates real and synthesized supervision, enabling a single Gaussian collection to represent both natural and exaggerated avatars. This scheme improves fidelity, supports local edits, and allows continuous control over the intensity of the caricature. In order to achieve real-time deformations, an efficient interpolation between the original and exaggerated surfaces is introduced. We further analyze and show that it has a bounded deviation from closed-form solutions. In both quantitative and qualitative evaluations, our results outperform prior work, delivering photorealistic, geometry-controlled caricature avatars.

Preview loads on expand
A High-Fidelity Digital Twin for Robotic Manipulation Based on 3D Gaussian Splatting Ziyang Sun, Lingfan Bao, Tianhu Peng, Jingcheng Sun, Chengxu Zhou Updated 2026-01-06

Developing high-fidelity, interactive digital twins is crucial for enabling closed-loop motion planning and reliable real-world robot execution, which are essential to advancing sim-to-real transfer. However, existing approaches often suffer from slow reconstruction, limited visual fidelity, and difficulties in converting photorealistic models into planning-ready collision geometry. We present a practical framework that constructs high-quality digital twins within minutes from sparse RGB inputs. Our system employs 3D Gaussian Splatting (3DGS) for fast, photorealistic reconstruction as a unified scene representation. We enhance 3DGS with visibility-aware semantic fusion for accurate 3D labelling and introduce an efficient, filter-based geometry conversion method to produce collision-ready models seamlessly integrated with a Unity-ROS2-MoveIt physics engine. In experiments with a Franka Emika Panda robot performing pick-and-place tasks, we demonstrate that this enhanced geometric accuracy effectively supports robust manipulation in real-world trials. These results demonstrate that 3DGS-based digital twins, enriched with semantic and geometric consistency, offer a fast, reliable, and scalable path from perception to manipulation in unstructured environments.

Preview loads on expand
SA-ResGS: Self-Augmented Residual 3D Gaussian Splatting for Next Best View Selection Kim Jun-Seong, Tae-Hyun Oh, Eduardo Pérez-Pellitero, Youngkyoon Jang Updated 2026-01-06

We propose Self-Augmented Residual 3D Gaussian Splatting (SA-ResGS), a novel framework to stabilize uncertainty quantification and enhancing uncertainty-aware supervision in next-best-view (NBV) selection for active scene reconstruction. SA-ResGS improves both the reliability of uncertainty estimates and their effectiveness for supervision by generating Self-Augmented point clouds (SA-Points) via triangulation between a training view and a rasterized extrapolated view, enabling efficient scene coverage estimation. While improving scene coverage through physically guided view selection, SA-ResGS also addresses the challenge of under-supervised Gaussians, exacerbated by sparse and wide-baseline views, by introducing the first residual learning strategy tailored for 3D Gaussian Splatting. This targeted supervision enhances gradient flow in high-uncertainty Gaussians by combining uncertainty-driven filtering with dropout- and hard-negative-mining-inspired sampling. Our contributions are threefold: (1) a physically grounded view selection strategy that promotes efficient and uniform scene coverage; (2) an uncertainty-aware residual supervision scheme that amplifies learning signals for weakly contributing Gaussians, improving training stability and uncertainty estimation across scenes with diverse camera distributions; (3) an implicit unbiasing of uncertainty quantification as a consequence of constrained view selection and residual supervision, which together mitigate conflicting effects of wide-baseline exploration and sparse-view ambiguity in NBV planning. Experiments on active view selection demonstrate that SA-ResGS outperforms state-of-the-art baselines in both reconstruction quality and view selection robustness.

Preview loads on expand
CAMO: Category-Agnostic 3D Motion Transfer from Monocular 2D Videos Taeyeon Kim, Youngju Na, Jumin Lee, Minhyuk Sung, Sung-Eui Yoon Updated 2026-01-06

Motion transfer from 2D videos to 3D assets is a challenging problem, due to inherent pose ambiguities and diverse object shapes, often requiring category-specific parametric templates. We propose CAMO, a category-agnostic framework that transfers motion to diverse target meshes directly from monocular 2D videos without relying on predefined templates or explicit 3D supervision. The core of CAMO is a morphology-parameterized articulated 3D Gaussian splatting model combined with dense semantic correspondences to jointly adapt shape and pose through optimization. This approach effectively alleviates shape-pose ambiguities, enabling visually faithful motion transfer for diverse categories. Experimental results demonstrate superior motion accuracy, efficiency, and visual coherence compared to existing methods, significantly advancing motion transfer in varied object categories and casual video scenarios.

Preview loads on expand
HeadLighter: Disentangling Illumination in Generative 3D Gaussian Heads via Lightstage Captures Yating Wang, Yuan Sun, Xuan Wang, Ran Yi, Boyao Zhou, Yipengjing Sun, Hongyu Liu, Yinuo Wang, Lizhuang Ma Updated 2026-01-05

Recent 3D-aware head generative models based on 3D Gaussian Splatting achieve real-time, photorealistic and view-consistent head synthesis. However, a fundamental limitation persists: the deep entanglement of illumination and intrinsic appearance prevents controllable relighting. Existing disentanglement methods rely on strong assumptions to enable weakly supervised learning, which restricts their capacity for complex illumination. To address this challenge, we introduce HeadLighter, a novel supervised framework that learns a physically plausible decomposition of appearance and illumination in head generative models. Specifically, we design a dual-branch architecture that separately models lighting-invariant head attributes and physically grounded rendering components. A progressive disentanglement training is employed to gradually inject head appearance priors into the generative architecture, supervised by multi-view images captured under controlled light conditions with a light stage setup. We further introduce a distillation strategy to generate high-quality normals for realistic rendering. Experiments demonstrate that our method preserves high-quality generation and real-time rendering, while simultaneously supporting explicit lighting and viewpoint editing. We will publicly release our code and dataset.

Preview loads on expand
360-GeoGS: Geometrically Consistent Feed-Forward 3D Gaussian Splatting Reconstruction for 360 Images Jiaqi Yao, Zhongmiao Yan, Jingyi Xu, Songpengcheng Xia, Yan Xiang, Ling Pei Updated 2026-01-05

3D scene reconstruction is fundamental for spatial intelligence applications such as AR, robotics, and digital twins. Traditional multi-view stereo struggles with sparse viewpoints or low-texture regions, while neural rendering approaches, though capable of producing high-quality results, require per-scene optimization and lack real-time efficiency. Explicit 3D Gaussian Splatting (3DGS) enables efficient rendering, but most feed-forward variants focus on visual quality rather than geometric consistency, limiting accurate surface reconstruction and overall reliability in spatial perception tasks. This paper presents a novel feed-forward 3DGS framework for 360 images, capable of generating geometrically consistent Gaussian primitives while maintaining high rendering quality. A Depth-Normal geometric regularization is introduced to couple rendered depth gradients with normal information, supervising Gaussian rotation, scale, and position to improve point cloud and surface accuracy. Experimental results show that the proposed method maintains high rendering quality while significantly improving geometric consistency, providing an effective solution for 3D reconstruction in spatial perception tasks.

Preview loads on expand
InpaintHuman: Reconstructing Occluded Humans with Multi-Scale UV Mapping and Identity-Preserving Diffusion Inpainting Jinlong Fan, Shanshan Zhao, Liang Zheng, Jing Zhang, Yuxiang Yang, Mingming Gong Updated 2026-01-05

Reconstructing complete and animatable 3D human avatars from monocular videos remains challenging, particularly under severe occlusions. While 3D Gaussian Splatting has enabled photorealistic human rendering, existing methods struggle with incomplete observations, often producing corrupted geometry and temporal inconsistencies. We present InpaintHuman, a novel method for generating high-fidelity, complete, and animatable avatars from occluded monocular videos. Our approach introduces two key innovations: (i) a multi-scale UV-parameterized representation with hierarchical coarse-to-fine feature interpolation, enabling robust reconstruction of occluded regions while preserving geometric details; and (ii) an identity-preserving diffusion inpainting module that integrates textual inversion with semantic-conditioned guidance for subject-specific, temporally coherent completion. Unlike SDS-based methods, our approach employs direct pixel-level supervision to ensure identity fidelity. Experiments on synthetic benchmarks (PeopleSnapshot, ZJU-MoCap) and real-world scenarios (OcMotion) demonstrate competitive performance with consistent improvements in reconstruction quality across diverse poses and viewpoints.

Preview loads on expand
SketchRodGS: Sketch-based Extraction of Slender Geometries for Animating Gaussian Splatting Scenes Haato Watanabe, Nobuyuki Umetani Updated 2026-01-05

Physics simulation of slender elastic objects often requires discretization as a polyline. However, constructing a polyline from Gaussian splatting is challenging as Gaussian splatting lacks connectivity information and the configuration of Gaussian primitives contains much noise. This paper presents a method to extract a polyline representation of the slender part of the objects in a Gaussian splatting scene from the user's sketching input. Our method robustly constructs a polyline mesh that represents the slender parts using the screen-space shortest path analysis that can be efficiently solved using dynamic programming. We demonstrate the effectiveness of our approach in several in-the-wild examples.

Preview loads on expand
ESGaussianFace: Emotional and Stylized Audio-Driven Facial Animation via 3D Gaussian Splatting Chuhang Ma, Shuai Tan, Ye Pan, Jiaolong Yang, Xin Tong Updated 2026-01-05

Most current audio-driven facial animation research primarily focuses on generating videos with neutral emotions. While some studies have addressed the generation of facial videos driven by emotional audio, efficiently generating high-quality talking head videos that integrate both emotional expressions and style features remains a significant challenge. In this paper, we propose ESGaussianFace, an innovative framework for emotional and stylized audio-driven facial animation. Our approach leverages 3D Gaussian Splatting to reconstruct 3D scenes and render videos, ensuring efficient generation of 3D consistent results. We propose an emotion-audio-guided spatial attention method that effectively integrates emotion features with audio content features. Through emotion-guided attention, the model is able to reconstruct facial details across different emotional states more accurately. To achieve emotional and stylized deformations of the 3D Gaussian points through emotion and style features, we introduce two 3D Gaussian deformation predictors. Futhermore, we propose a multi-stage training strategy, enabling the step-by-step learning of the character's lip movements, emotional variations, and style features. Our generated results exhibit high efficiency, high quality, and 3D consistency. Extensive experimental results demonstrate that our method outperforms existing state-of-the-art techniques in terms of lip movement accuracy, expression variation, and style feature expressiveness.

Preview loads on expand
Animated 3DGS Avatars in Diverse Scenes with Consistent Lighting and Shadows Aymen Mir, Riza Alp Guler, Jian Wang, Gerard Pons-Moll, Bing Zhou Updated 2026-01-04

We present a method for consistent lighting and shadows when animated 3D Gaussian Splatting (3DGS) avatars interact with 3DGS scenes or with dynamic objects inserted into otherwise static scenes. Our key contribution is Deep Gaussian Shadow Maps (DGSM), a modern analogue of the classical shadow mapping algorithm tailored to the volumetric 3DGS representation. Building on the classic deep shadow mapping idea, we show that 3DGS admits closed form light accumulation along light rays, enabling volumetric shadow computation without meshing. For each estimated light, we tabulate transmittance over concentric radial shells and store them in octahedral atlases, which modern GPUs can sample in real time per query to attenuate affected scene Gaussians and thus cast and receive shadows consistently. To relight moving avatars, we approximate the local environment illumination with HDRI probes represented in a spherical harmonic (SH) basis and apply a fast per Gaussian radiance transfer, avoiding explicit BRDF estimation or offline optimization. We demonstrate environment consistent lighting for avatars from AvatarX and ActorsHQ, composited into ScanNet++, DL3DV, and SuperSplat scenes, and show interactions with inserted objects. Across single and multi avatar settings, DGSM and SH relighting operate fully in the volumetric 3DGS representation, yielding coherent shadows and relighting while avoiding meshing.

Preview loads on expand
ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking Xiaobao Wei, Zhangjie Ye, Yuxiang Gu, Zunjie Zhu, Yunfei Guo, Yingying Shen, Shan Zhao, Ming Lu, Haiyang Sun, Bing Wang, Guang Chen, Rongfeng Lu, Hangjun Ye Updated 2026-01-04

Parking is a critical task for autonomous driving systems (ADS), with unique challenges in crowded parking slots and GPS-denied environments. However, existing works focus on 2D parking slot perception, mapping, and localization, 3D reconstruction remains underexplored, which is crucial for capturing complex spatial geometry in parking scenarios. Naively improving the visual quality of reconstructed parking scenes does not directly benefit autonomous parking, as the key entry point for parking is the slots perception module. To address these limitations, we curate the first benchmark named ParkRecon3D, specifically designed for parking scene reconstruction. It includes sensor data from four surround-view fisheye cameras with calibrated extrinsics and dense parking slot annotations. We then propose ParkGaussian, the first framework that integrates 3D Gaussian Splatting (3DGS) for parking scene reconstruction. To further improve the alignment between reconstruction and downstream parking slot detection, we introduce a slot-aware reconstruction strategy that leverages existing parking perception methods to enhance the synthesis quality of slot regions. Experiments on ParkRecon3D demonstrate that ParkGaussian achieves state-of-the-art reconstruction quality and better preserves perception consistency for downstream tasks. The code and dataset will be released at: https://github.com/wm-research/ParkGaussian

Preview loads on expand
ShadowGS: Shadow-Aware 3D Gaussian Splatting for Satellite Imagery Feng Luo, Hongbo Pan, Xiang Yang, Baoyu Jiang, Fengqing Liu, Tao Huang Updated 2026-01-04

3D Gaussian Splatting (3DGS) has emerged as a novel paradigm for 3D reconstruction from satellite imagery. However, in multi-temporal satellite images, prevalent shadows exhibit significant inconsistencies due to varying illumination conditions. To address this, we propose ShadowGS, a novel framework based on 3DGS. It leverages a physics-based rendering equation from remote sensing, combined with an efficient ray marching technique, to precisely model geometrically consistent shadows while maintaining efficient rendering. Additionally, it effectively disentangles different illumination components and apparent attributes in the scene. Furthermore, we introduce a shadow consistency constraint that significantly enhances the geometric accuracy of 3D reconstruction. We also incorporate a novel shadow map prior to improve performance with sparse-view inputs. Extensive experiments demonstrate that ShadowGS outperforms current state-of-the-art methods in shadow decoupling accuracy, 3D reconstruction precision, and novel view synthesis quality, with only a few minutes of training. ShadowGS exhibits robust performance across various settings, including RGB, pansharpened, and sparse-view satellite inputs.

Preview loads on expand
Clean-GS: Semantic Mask-Guided Pruning for 3D Gaussian Splatting Subhankar Mishra Updated 2026-01-01

3D Gaussian Splatting produces high-quality scene reconstructions but generates hundreds of thousands of spurious Gaussians (floaters) scattered throughout the environment. These artifacts obscure objects of interest and inflate model sizes, hindering deployment in bandwidth-constrained applications. We present Clean-GS, a method for removing background clutter and floaters from 3DGS reconstructions using sparse semantic masks. Our approach combines whitelist-based spatial filtering with color-guided validation and outlier removal to achieve 60-80\% model compression while preserving object quality. Unlike existing 3DGS pruning methods that rely on global importance metrics, Clean-GS uses semantic information from as few as 3 segmentation masks (1\% of views) to identify and remove Gaussians not belonging to the target object. Our multi-stage approach consisting of (1) whitelist filtering via projection to masked regions, (2) depth-buffered color validation, and (3) neighbor-based outlier removal isolates monuments and objects from complex outdoor scenes. Experiments on Tanks and Temples show that Clean-GS reduces file sizes from 125MB to 47MB while maintaining rendering quality, making 3DGS models practical for web deployment and AR/VR applications. Our code is available at https://github.com/smlab-niser/clean-gs

Preview loads on expand
SV-GS: Sparse View 4D Reconstruction with Skeleton-Driven Gaussian Splatting Jun-Jee Chao, Volkan Isler Updated 2026-01-01

Reconstructing a dynamic target moving over a large area is challenging. Standard approaches for dynamic object reconstruction require dense coverage in both the viewing space and the temporal dimension, typically relying on multi-view videos captured at each time step. However, such setups are only possible in constrained environments. In real-world scenarios, observations are often sparse over time and captured sparsely from diverse viewpoints (e.g., from security cameras), making dynamic reconstruction highly ill-posed. We present SV-GS, a framework that simultaneously estimates a deformation model and the object's motion over time under sparse observations. To initialize SV-GS, we leverage a rough skeleton graph and an initial static reconstruction as inputs to guide motion estimation. (Later, we show that this input requirement can be relaxed.) Our method optimizes a skeleton-driven deformation field composed of a coarse skeleton joint pose estimator and a module for fine-grained deformations. By making only the joint pose estimator time-dependent, our model enables smooth motion interpolation while preserving learned geometric details. Experiments on synthetic datasets show that our method outperforms existing approaches under sparse observations by up to 34% in PSNR, and achieves comparable performance to dense monocular video methods on real-world datasets despite using significantly fewer frames. Moreover, we demonstrate that the input initial static reconstruction can be replaced by a diffusion-based generative prior, making our method more practical for real-world scenarios.

Preview loads on expand
PhysTalk: Language-driven Real-time Physics in 3D Gaussian Scenes Luca Collorone et.al. Updated 2025-12-31

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
UniC-Lift: Unified 3D Instance Segmentation via Contrastive Learning Ankit Dhiman et.al. Updated 2025-12-31

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Splatwizard: A Benchmark Toolkit for 3D Gaussian Splatting Compression Xiang Liu et.al. Updated 2025-12-31

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Structure-Guided Allocation of 2D Gaussians for Image Representation and Compression Huanxiong Liang et.al. Updated 2025-12-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Improved 3D Gaussian Splatting of Unknown Spacecraft Structure Using Space Environment Illumination Knowledge Tae Ha Park et.al. Updated 2025-12-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Differentiable Physics-Driven Human Representation for Millimeter-Wave Based Pose Estimation Shuntian Zheng et.al. Updated 2025-12-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Contour Information Aware 2D Gaussian Splatting for Image Representation Masaya Takabe et.al. Updated 2025-12-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GVSynergy-Det: Synergistic Gaussian-Voxel Representations for Multi-View 3D Object Detection Yi Zhang et.al. Updated 2025-12-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Hash Grid Feature Pruning Yangzhi Ma et.al. Updated 2025-12-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Next Best View Selections for Semantic and Dynamic 3D Gaussian Splatting Yiqian Li et.al. Updated 2025-12-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SCPainter: A Unified Framework for Realistic 3D Asset Insertion and Novel View Synthesis Paul Dobre et.al. Updated 2025-12-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Tracking by Predicting 3-D Gaussians Over Time Tanish Baranwal et.al. Updated 2025-12-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
AirGS: Real-Time 4D Gaussian Streaming for Free-Viewpoint Video Experiences Zhe Wang et.al. Updated 2025-12-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting Yoonwoo Jeong et.al. Updated 2025-12-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Nebula: Enable City-Scale 3D Gaussian Splatting in Virtual Reality via Collaborative Rendering and Accelerated Stereo Rasterization He Zhu et.al. Updated 2025-12-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SmartSplat: Feature-Smart Gaussians for Scalable Compression of Ultra-High-Resolution Images Linfei Li et.al. Updated 2025-12-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Enhancing annotations for 5D apple pose estimation through 3D Gaussian Splatting (3DGS) Robert van de Ven et.al. Updated 2025-12-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Dreamcrafter: Immersive Editing of 3D Radiance Fields Through Flexible, Generative Inputs and Outputs Cyrus Vachha et.al. Updated 2025-12-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion Hanyang Kong et.al. Updated 2025-12-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
4D Gaussian Splatting as a Learned Dynamical System Arnold Caleb Asiimwe et.al. Updated 2025-12-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting Tiantian Li et.al. Updated 2025-12-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
EcoSplat: Efficiency-controllable Feed-forward 3D Gaussian Splatting from Multi-view Images Jongmin Park et.al. Updated 2025-12-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Instant Expressive Gaussian Head Avatar via 3D-Aware Expression Distillation Kaiwen Jiang et.al. Updated 2025-12-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SDFoam: Signed-Distance Foam for explicit surface reconstruction Antonella Rech et.al. Updated 2025-12-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Using Gaussian Splats to Create High-Fidelity Facial Geometry and Texture Haodi He et.al. Updated 2025-12-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gaussian Pixel Codec Avatars: A Hybrid Representation for Efficient Rendering Divam Gupta et.al. Updated 2025-12-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting Arthur Moreau et.al. Updated 2025-12-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments Yuze Wu et.al. Updated 2025-12-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MVGSR: Multi-View Consistent 3D Gaussian Super-Resolution via Epipolar Guidance Kaizhe Zhang et.al. Updated 2025-12-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
A Gaussian Parameterization for Direct Atomic Structure Identification in Electron Tomography Nalini M. Singh et.al. Updated 2025-12-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Broadening View Synthesis of Dynamic Scenes from Constrained Monocular Videos Le Jiang et.al. Updated 2025-12-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HGS: Hybrid Gaussian Splatting with Static-Dynamic Decomposition for Compact Dynamic View Synthesis Kaizhe Zhang et.al. Updated 2025-12-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Moment-Based 3D Gaussian Splatting: Resolving Volumetric Occlusion with Order-Independent Transmittance Jan U. Müller et.al. Updated 2025-12-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Prior-Enhanced Gaussian Splatting for Dynamic Scene Reconstruction from Casual Video Meng-Li Shih et.al. Updated 2025-12-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Lightweight 3D Gaussian Splatting Compression via Video Codec Qi Yang et.al. Updated 2025-12-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GaussianHeadTalk: Wobble-Free 3D Talking Heads with Audio Driven Gaussian Splatting Madhav Agarwal et.al. Updated 2025-12-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DeMapGS: Simultaneous Mesh Deformation and Surface Attribute Mapping via Gaussian Splatting Shuyi Zhou et.al. Updated 2025-12-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Neural Hamiltonian Deformation Fields for Dynamic Scene Rendering Hai-Long Qin et.al. Updated 2025-12-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Breaking the Vicious Cycle: Coherent 3D Gaussian Splatting from Sparse and Motion-Blurred Views Zhankuo Xu et.al. Updated 2025-12-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Physically Aware 360$^\circ$ View Generation from a Single Image using Disentangled Scene Embeddings Karthikeya KV et.al. Updated 2025-12-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Long-LRM++: Preserving Fine Details in Feed-Forward Wide-Coverage Reconstruction Chen Ziwen et.al. Updated 2025-12-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
TraceFlow: Dynamic 3D Reconstruction of Specular Scenes Driven by Ray Tracing Jiachen Tao et.al. Updated 2025-12-10

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics Jisang Yoo et.al. Updated 2025-12-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
On-the-fly Large-scale 3D Reconstruction from Multi-Camera Rigs Yijia Guo et.al. Updated 2025-12-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform Yuning Gong et.al. Updated 2025-12-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HybridSplat: Fast Reflection-baked Gaussian Tracing using Hybrid Splatting Chang Liu et.al. Updated 2025-12-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Zero-Splat TeleAssist: A Zero-Shot Pose Estimation Framework for Semantic Teleoperation Srijan Dokania et.al. Updated 2025-12-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Multi-view Pyramid Transformer: Look Coarser to See Broader Gyeongjin Kang et.al. Updated 2025-12-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Tessellation GS: Neural Mesh Gaussians for Robust Monocular Reconstruction of Dynamic Objects Shuohan Tao et.al. Updated 2025-12-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting Shilong Jin et.al. Updated 2025-12-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
AdLift: Lifting Adversarial Perturbations to Safeguard 3D Gaussian Splatting Assets Against Instruction-Driven Editing Ziming Hong et.al. Updated 2025-12-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
STRinGS: Selective Text Refinement in Gaussian Splatting Abhinav Raundhal et.al. Updated 2025-12-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting Hao-Jen Chien et.al. Updated 2025-12-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer Xianfeng Wu et.al. Updated 2025-12-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
RobustSplat++: Decoupling Densification, Dynamics, and Illumination for In-the-Wild 3DGS Chuanyu Fu et.al. Updated 2025-12-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Bridging Simulation and Reality: Cross-Domain Transfer with Semantic 2D Gaussian Splatting Jian Tang et.al. Updated 2025-12-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gaussian Entropy Fields: Driving Adaptive Sparsity in 3D Gaussian Optimization Hong Kuang et.al. Updated 2025-12-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3D Scenes Changhe Liu et.al. Updated 2025-12-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SyncTrack4D: Cross-Video Motion Alignment and Video Synchronization for Multi-Video 4D Gaussian Splatting Yonghan Lee et.al. Updated 2025-12-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Mind-to-Face: Neural-Driven Photorealistic Avatar Synthesis via EEG Decoding Haolin Xiong et.al. Updated 2025-12-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
C3G: Learning Compact 3D Representations with 2K Gaussians Honggyu An et.al. Updated 2025-12-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding Haoran Zhou et.al. Updated 2025-12-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Resolution Where It Counts: Hash-based GPU-Accelerated 3D Reconstruction via Variance-Adaptive Voxel Grids Lorenzo De Rebotti et.al. Updated 2025-11-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Endo-G$^{2}$T: Geometry-Guided & Temporally Aware Time-Embedded 4DGS For Endoscopic Scenes Yangle Liu et.al. Updated 2025-11-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Unlocking Zero-shot Potential of Semi-dense Image Matching via Gaussian Splatting Juncheng Chen et.al. Updated 2025-11-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GS-Checker: Tampering Localization for 3D Gaussian Splatting Haoliang Han et.al. Updated 2025-11-25

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Material-informed Gaussian Splatting for 3D World Reconstruction in a Digital Twin João Malheiro Silva et.al. Updated 2025-11-25

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GigaWorld-0: World Models as Data Engine to Empower Embodied AI GigaWorld Team et.al. Updated 2025-11-25

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
STAvatar: Soft Binding and Temporal Density Control for Monocular 3D Head Avatars Reconstruction Jiankuo Zhao et.al. Updated 2025-11-25

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DensifyBeforehand: LiDAR-assisted Content-aware Densification for Efficient and Quality 3D Gaussian Splatting Phurtivilai Patt et.al. Updated 2025-11-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
IDSplat: Instance-Decomposed 3D Gaussian Splatting for Driving Scenes Carl Lindström et.al. Updated 2025-11-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
NVGS: Neural Visibility for Occlusion Culling in 3D Gaussian Splatting Brent Zoomers et.al. Updated 2025-11-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes Kehua Chen et.al. Updated 2025-11-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Neural Texture Splatting: Expressive 3D Gaussian Splatting for View Synthesis, Geometry, and Dynamic Reconstruction Yiming Wang et.al. Updated 2025-11-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Splatonic: Architecture Support for 3D Gaussian Splatting SLAM via Sparse Processing Xiaotong Huang et.al. Updated 2025-11-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PhysGS: Bayesian-Inferred Gaussian Splatting for Physical Property Estimation Samarth Chopra et.al. Updated 2025-11-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Splatblox: Traversability-Aware Gaussian Splatting for Outdoor Robot Navigation Samarth Chopra et.al. Updated 2025-11-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ReCoGS: Real-time ReColoring for Gaussian Splatting scenes Lorenzo Rutayisire et.al. Updated 2025-11-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation Peter Siegel et.al. Updated 2025-11-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gaussian Blending: Rethinking Alpha Blending in 3D Gaussian Splatting Junseo Koo et.al. Updated 2025-11-19

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery Yiming Zeng et.al. Updated 2025-11-19

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video Yarin Bekor et.al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SparseSurf: Sparse-View 3D Gaussian Splatting for Surface Reconstruction Meiying Gu et.al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Interaction-Aware 4D Gaussian Splatting for Dynamic Hand-Object Interaction Reconstruction Hao Tian et.al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
2D Gaussians Spatial Transport for Point-supervised Density Regression Miao Shang et.al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
IBGS: Image-Based Gaussian Splatting Hoang Chuong Nguyen et.al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Silhouette-to-Contour Registration: Aligning Intraoral Scan Models with Cephalometric Radiographs Yiyi Miao et.al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Dental3R: Geometry-Aware Pairing for Intraoral 3D Reconstruction from Sparse-View Photographs Yiyi Miao et.al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GEN3D: Generating Domain-Free 3D Scenes from a Single Image Yuxin Zhang et.al. Updated 2025-11-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos Mengqi Guo et.al. Updated 2025-11-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Splatography: Sparse multi-view dynamic Gaussian Splatting for filmmaking challenges Adrian Azzarelli et.al. Updated 2025-11-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Efficient representation of 3D spatial data for defense-related applications Benjamin Kahl et.al. Updated 2025-11-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CLM: Removing the GPU Memory Barrier for 3D Gaussian Splatting Hexu Zhao et.al. Updated 2025-11-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Channel Knowledge Map Construction: Recent Advances and Open Challenges Zixiang Ren et.al. Updated 2025-11-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3D Gaussian Point Encoders Jim James et.al. Updated 2025-11-06

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Real-to-Sim Robot Policy Evaluation with Gaussian Splatting Simulation of Soft-Body Interactions Kaifeng Zhang et.al. Updated 2025-11-06

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
FastGS: Training 3D Gaussian Splatting in 100 Seconds Shiwei Ren et.al. Updated 2025-11-06

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CaRF: Enhancing Multi-View Consistency in Referring 3D Gaussian Splatting Segmentation Yuwen Tao et.al. Updated 2025-11-06

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DentalSplat: Dental Occlusion Novel View Synthesis from Sparse Intra-Oral Photographs Yiyi Miao et.al. Updated 2025-11-05

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing Antonio Oroz et.al. Updated 2025-11-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Object-Centric 3D Gaussian Splatting for Strawberry Plant Reconstruction and Phenotyping Jiajia Li et.al. Updated 2025-11-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
4D Neural Voxel Splatting: Dynamic Scene Rendering with Voxelized Guassian Splatting Chun-Tin Wu et.al. Updated 2025-11-01

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SAGS: Self-Adaptive Alias-Free Gaussian Splatting for Dynamic Surgical Endoscopic Reconstruction Wenfeng Huang et.al. Updated 2025-10-31

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond Zhicong Sun et.al. Updated 2025-10-31

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DC4GS: Directional Consistency-Driven Adaptive Density Control for 3D Gaussian Splatting Moonsoo Jeong et.al. Updated 2025-10-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HEIR: Learning Graph-Based Motion Hierarchies Cheng Zheng et.al. Updated 2025-10-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
The Impact and Outlook of 3D Gaussian Splatting Bernhard Kerbl et.al. Updated 2025-10-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
AgriGS-SLAM: Orchard Mapping Across Seasons via Multi-View Gaussian Splatting SLAM Mirko Usuelli et.al. Updated 2025-10-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
6D Channel Knowledge Map Construction via Bidirectional Wireless Gaussian Splatting Juncong Zhou et.al. Updated 2025-10-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting Yuxuan Li et.al. Updated 2025-10-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
$D^2GS$: Dense Depth Regularization for LiDAR-free Urban Scene Reconstruction Kejing Xia et.al. Updated 2025-10-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
AtlasGS: Atlanta-world Guided Surface Reconstruction with Implicit Structured Gaussians Xiyu Zhang et.al. Updated 2025-10-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation Mingyu Jeong et.al. Updated 2025-10-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation Haotian Zhou et.al. Updated 2025-10-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
A Survey on Collaborative SLAM with 3D Gaussian Splatting Phuc Nguyen Xuan et.al. Updated 2025-10-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PlanarGS: High-Fidelity Indoor 3D Gaussian Splatting Guided by Vision-Language Planar Priors Xirui Jin et.al. Updated 2025-10-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Explicit Memory through Online 3D Gaussian Splatting Improves Class-Agnostic Video Segmentation Anthony Opipari et.al. Updated 2025-10-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting Hoonhee Cho et.al. Updated 2025-10-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
EndoWave: Rational-Wavelet 4D Gaussian Splatting for Endoscopic Reconstruction Taoyu Wu et.al. Updated 2025-10-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method Bohan Li et.al. Updated 2025-10-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation Guangqi Jiang et.al. Updated 2025-10-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Dino-Diffusion Modular Designs Bridge the Cross-Domain Gap in Autonomous Parking Zixuan Wu et.al. Updated 2025-10-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
COS3D: Collaborative Open-Vocabulary 3D Segmentation Runsong Zhu et.al. Updated 2025-10-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Extreme Views: 3DGS Filter for Novel View Synthesis from Out-of-Distribution Camera Poses Damian Bowness et.al. Updated 2025-10-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VGD: Visual Geometry Gaussian Splatting for Feed-Forward Surround-view Driving Reconstruction Junhong Lin et.al. Updated 2025-10-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Advances in 4D Representation: Geometry, Motion, and Interaction Mingrui Zhao et.al. Updated 2025-10-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MoE-GS: Mixture of Experts for Dynamic Gaussian Splatting In-Hwan Jin et.al. Updated 2025-10-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GRASPLAT: Enabling dexterous grasping through novel view synthesis Matteo Bortolon et.al. Updated 2025-10-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Re-Activating Frozen Primitives for 3D Gaussian Splatting Yuxin Cheng et.al. Updated 2025-10-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Moving Light Adaptive Colonoscopy Reconstruction via Illumination-Attenuation-Aware 3D Gaussian Splatting Hao Wang et.al. Updated 2025-10-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PFGS: Pose-Fused 3D Gaussian Splatting for Complete Multi-Pose Object Reconstruction Ting-Yu Yen et.al. Updated 2025-10-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GaussGym: An open-source real-to-sim framework for learning locomotion from pixels Alejandro Escontrela et.al. Updated 2025-10-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Capture, Canonicalize, Splat: Zero-Shot 3D Gaussian Avatars from Unstructured Phone Images Emanuel Garbin et.al. Updated 2025-10-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images Jiaxin Guo et.al. Updated 2025-10-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Leveraging Learned Image Prior for 3D Gaussian Compression Seungjoo Shin et.al. Updated 2025-10-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
BalanceGS: Algorithm-System Co-design for Efficient 3D Gaussian Splatting Training on GPU Junyi Wu et.al. Updated 2025-10-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GauSSmart: Enhanced 3D Reconstruction through 2D Foundation Models and Geometric Filtering Alexander Valverde et.al. Updated 2025-10-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Virtually Being: Customizing Camera-Controllable Video Diffusion Models with Multi-View Performance Captures Yuancheng Xu et.al. Updated 2025-10-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Instant Skinned Gaussian Avatars for Web, Mobile and VR Applications Naruya Kondo et.al. Updated 2025-10-15

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator Hyojun Go et.al. Updated 2025-10-15

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Uncertainty Matters in Dynamic Gaussian Splatting for Monocular 4D Reconstruction Fengzhi Guo et.al. Updated 2025-10-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
BSGS: Bi-stage 3D Gaussian Splatting for Camera Motion Deblurring An Zhao et.al. Updated 2025-10-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Hybrid Gaussian Splatting for Novel Urban View Synthesis Mohamed Omran et.al. Updated 2025-10-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PAGS: Priority-Adaptive Gaussian Splatting for Dynamic Driving Scenes Ying A et.al. Updated 2025-10-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
UniGS: Unified Geometry-Aware Gaussian Splatting for Multimodal Rendering Yusen Xie et.al. Updated 2025-10-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
G4Splat: Geometry-Guided Gaussian Splatting with Generative Prior Junfeng Ni et.al. Updated 2025-10-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GS-Verse: Mesh-based Gaussian Splatting for Physics-aware Interaction in Virtual Reality Anastasiya Pechko et.al. Updated 2025-10-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Ev4DGS: Novel-view Rendering of Non-Rigid Objects from Monocular Event Streams Takuya Nakabayashi et.al. Updated 2025-10-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Phys2Real: Fusing VLM Priors with Interactive Online Adaptation for Uncertainty-Aware Sim-to-Real Manipulation Maggie Wang et.al. Updated 2025-10-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VA-GS: Enhancing the Geometric Representation of Gaussian Splatting via View Alignment Qing Li et.al. Updated 2025-10-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ReSplat: Learning Recurrent Gaussian Splats Haofei Xu et.al. Updated 2025-10-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
D$^2$GS: Depth-and-Density Guided Gaussian Splatting for Stable and Accurate Sparse-View Reconstruction Meixi Song et.al. Updated 2025-10-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Splat the Net: Radiance Fields with Splattable Neural Primitives Xilong Zhou et.al. Updated 2025-10-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Efficient Label Refinement for Face Parsing Under Extreme Poses Using 3D Gaussian Splatting Ankit Gahlawat et.al. Updated 2025-10-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving Tianrui Zhang et.al. Updated 2025-10-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PrismGS: Physically-Grounded Anti-Aliasing for High-Fidelity Large-Scale 3D Gaussian Splatting Houqiang Zhong et.al. Updated 2025-10-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DEGS: Deformable Event-based 3D Gaussian Splatting from RGB and Event Stream Junhao He et.al. Updated 2025-10-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ComGS: Efficient 3D Object-Scene Composition via Surface Octahedral Probes Jian Gao et.al. Updated 2025-10-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Generating Surface for Text-to-3D using 2D Gaussian Splatting Huanning Dong et.al. Updated 2025-10-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Capture and Interact: Rapid 3D Object Acquisition and Rendering with Gaussian Splatting in Unity Islomjon Shukhratov et.al. Updated 2025-10-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions Bo-Hsu Ke et.al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Performance-Guided Refinement for Visual Aerial Navigation using Editable Gaussian Splatting in FalconGym 2.0 Yan Miao et.al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Spec-Gloss Surfels and Normal-Diffuse Priors for Relightable Glossy Objects Georgios Kouros et.al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GaussianMorphing: Mesh-Guided 3D Gaussians for Semantic-Aware Object Morphing Mengtian Li et.al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
4DGS-Craft: Consistent and Interactive 4D Gaussian Splatting Editing Lei Liu et.al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ROI-GS: Interest-based Local Quality 3D Gaussian Splatting Quoc-Anh Bui et.al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GreenhouseSplat: A Dataset of Photorealistic Greenhouse Simulations for Mobile Robotics Diram Tabaa et.al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LOBE-GS: Load-Balanced and Efficient 3D Gaussian Splatting for Large-Scale Scene Reconstruction Sheng-Hsiang Hung et.al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics Changmin Lee et.al. Updated 2025-10-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Instant4D: 4D Gaussian Splatting in Minutes Zhanpeng Luo et.al. Updated 2025-10-01

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Triangle Splatting+: Differentiable Rendering with Opaque Triangles Jan Held et.al. Updated 2025-09-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GEM: 3D Gaussian Splatting for Efficient and Accurate Cryo-EM Reconstruction Huaizhi Qu et.al. Updated 2025-09-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LVT: Large-Scale Scene Reconstruction via Local View Transformers Tooba Imtiaz et.al. Updated 2025-09-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DWGS: Enhancing Sparse-View Gaussian Splatting with Hybrid-Loss Depth Estimation and Bidirectional Warping Yu Ma et.al. Updated 2025-09-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ExGS: Extreme 3D Gaussian Compression with Diffusion Priors Jiaqi Chen et.al. Updated 2025-09-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Proxy-GS: Efficient 3D Gaussian Splatting via Proxy Mesh Yuanyuan Gao et.al. Updated 2025-09-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
OMeGa: Joint Optimization of Explicit Meshes and Gaussian Splats for Robust Scene-Level Surface Reconstruction Yuhang Cao et.al. Updated 2025-09-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CrashSplat: 2D to 3D Vehicle Damage Segmentation in Gaussian Splatting Dragoş-Andrei Chileban et.al. Updated 2025-09-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations Javed Ahmad et.al. Updated 2025-09-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos Junyi Wu et.al. Updated 2025-09-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
4D Driving Scene Generation With Stereo Forcing Hao Lu et.al. Updated 2025-09-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes Guo Chen et.al. Updated 2025-09-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering Jiangxue Yu et.al. Updated 2025-09-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting Yixun Zhang et.al. Updated 2025-09-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction Yufei Han et.al. Updated 2025-09-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction Weijie Wang et.al. Updated 2025-09-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Sherwin Bahmani et.al. Updated 2025-09-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction Hung Nguyen et.al. Updated 2025-09-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Seeing Through Reflections: Advancing 3D Scene Reconstruction in Mirror-Containing Environments with Gaussian Splatting Zijing Guo et.al. Updated 2025-09-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring Pengteng Li et.al. Updated 2025-09-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
FMGS-Avatar: Mesh-Guided 2D Gaussian Splatting with Foundation Model Priors for 3D Monocular Avatar Reconstruction Jinlong Fan et.al. Updated 2025-09-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI Cong Tai et.al. Updated 2025-09-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Perception-Integrated Safety Critical Control via Analytic Collision Cone Barrier Functions on 3D Gaussian Splatting Dario Tscholl et.al. Updated 2025-09-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping Zhihao Cao et.al. Updated 2025-09-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction Yifan Mo et.al. Updated 2025-09-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LamiGauss: Pitching Radiative Gaussian for Sparse-View X-ray Laminography Reconstruction Chu Chen et.al. Updated 2025-09-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM Yinlong Bai et.al. Updated 2025-09-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization Hao Xu et.al. Updated 2025-09-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Dream3DAvatar: Text-Controlled 3D Avatar Reconstruction from a Single Image Gaofeng Liu et.al. Updated 2025-09-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Beyond Averages: Open-Vocabulary 3D Scene Understanding with Gaussian Splatting and Bag of Embeddings Abdalla Arafa et.al. Updated 2025-09-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
On the Geometric Accuracy of Implicit and Primitive-based Representations Derived from View Rendering Constraints Elias De Smijter et.al. Updated 2025-09-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VIM-GS: Visual-Inertial Monocular Gaussian Splatting via Object-level Guidance in Large Scenes Shengkai Zhang et.al. Updated 2025-09-10

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SplatFill: 3D Scene Inpainting via Depth-Guided Gaussian Splatting Mahtab Dahaghin et.al. Updated 2025-09-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HairGS: Hair Strand Reconstruction based on 3D Gaussian Splatting Yimin Pan et.al. Updated 2025-09-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning Wenzhi Guo et.al. Updated 2025-09-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation Ze-Xin Yin et.al. Updated 2025-09-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Real-time Photorealistic Mapping for Situational Awareness in Robot Teleoperation Ian Page et.al. Updated 2025-09-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3DOF+Quantization: 3DGS quantization for large scenes with limited Degrees of Freedom Matthieu Gendrin et.al. Updated 2025-09-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MEGS$^{2}$: Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning Jiarui Chen et.al. Updated 2025-09-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Visibility-Aware Language Aggregation for Open-Vocabulary Segmentation in 3D Gaussian Splatting Sen Wang et.al. Updated 2025-09-05

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Toward Distributed 3D Gaussian Splatting for High-Resolution Isosurface Visualization Mengjiao Han et.al. Updated 2025-09-05

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians Siyuan Yao et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting Kuangshi Ai et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition Beizhen Zhao et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving Jiawei Xu et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images Davide Di Nucci et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation Beining Xu et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark Jingqian Wu et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update Jeongyun Kim et.al. Updated 2025-07-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction Haoxuan Qu et.al. Updated 2025-07-15

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling Hayeon Kim et.al. Updated 2025-07-15

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions Shivangi Aneja et.al. Updated 2025-07-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving Yixun Zhang et.al. Updated 2025-07-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Learning human-to-robot handovers through 3D scene reconstruction Yuekun Wu et.al. Updated 2025-07-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting Ji Hyun Seo et.al. Updated 2025-07-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction Hyungjun Doh et.al. Updated 2025-07-10

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration Chong Cheng et.al. Updated 2025-07-10

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection Yongyang Zhou et.al. Updated 2025-07-10

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation Bangning Wei et.al. Updated 2025-07-10

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SD-GS: Structured Deformable 3D Gaussians for Efficient Dynamic Scene Reconstruction Wei Yao et.al. Updated 2025-07-10

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Seg-Wild: Interactive Segmentation based on 3D Gaussian Splatting for Unconstrained Image Collections Yongtang Bao et.al. Updated 2025-07-10

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Enhancing non-Rigid 3D Model Deformations Using Mesh-based Gaussian Splatting Wijayathunga W. M. R. D. B et.al. Updated 2025-07-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Photometric Stereo using Gaussian Splatting and inverse rendering Matéo Ducastel et.al. Updated 2025-07-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
FlexGaussian: Flexible and Cost-Effective Training-Free Compression for 3D Gaussian Splatting Boyuan Tian et.al. Updated 2025-07-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ClipGS: Clippable Gaussian Splatting for Interactive Cinematic Visualization of Volumetric Medical Data Chengkun Li et.al. Updated 2025-07-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures Seungoh Han et.al. Updated 2025-07-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering Jiayi Song et.al. Updated 2025-07-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis Alexandre Symeonidis-Herzig et.al. Updated 2025-07-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
D-FCGS: Feedforward Compression of Dynamic Gaussian Splatting for Free-Viewpoint Videos Wenkang Zhang et.al. Updated 2025-07-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DreamArt: Generating Interactable Articulated Objects from a Single Image Ruijie Lu et.al. Updated 2025-07-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3DGS_LSR:Large_Scale Relocation for Autonomous Driving Based on 3D Gaussian Splatting Haitao Lu et.al. Updated 2025-07-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars Gent Serifi et.al. Updated 2025-07-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ArtGS:3D Gaussian Splatting for Interactive Visual-Physical Modeling and Manipulation of Articulated Objects Qiaojun Yu et.al. Updated 2025-07-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling Jiahao Wu et.al. Updated 2025-07-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gbake: Baking 3D Gaussian Splats into Reflection Probes Stephen Pasch et.al. Updated 2025-07-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3D Gaussian Splatting Driven Multi-View Robust Physical Adversarial Camouflage Generation Tianrui Lou et.al. Updated 2025-07-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VISTA: Open-Vocabulary, Task-Relevant Robot Exploration with Online Semantic Gaussian Splatting Keiko Nagami et.al. Updated 2025-07-01

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
A LoD of Gaussians: Unified Training and Rendering for Ultra-Large Scale Reconstruction with External Memory Felix Windisch et.al. Updated 2025-07-01

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Masks make discriminative models great again! Tianshi Cao et.al. Updated 2025-07-01

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GaussianVLM: Scene-centric 3D Vision-Language Models using Language-aligned Gaussian Splats for Embodied Reasoning and Beyond Anna-Maria Halacheva et.al. Updated 2025-07-01

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LOD-GS: Level-of-Detail-Sensitive 3D Gaussian Splatting for Detail Conserved Anti-Aliasing Zhenya Yang et.al. Updated 2025-07-01

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction Zhirui Gao et.al. Updated 2025-06-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DIGS: Dynamic CBCT Reconstruction using Deformation-Informed 4D Gaussian Splatting and a Low-Rank Free-Form Deformation Model Yuliang Huang et.al. Updated 2025-06-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting Zipei Ma et.al. Updated 2025-06-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MADrive: Memory-Augmented Driving Scene Modeling Polina Karpikova et.al. Updated 2025-06-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting Taoyu Wu et.al. Updated 2025-06-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image Pufan Li et.al. Updated 2025-06-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization Jan Ackermann et.al. Updated 2025-06-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
User-in-the-Loop View Sampling with Error Peaking Visualization Ayaka Yasunaga et.al. Updated 2025-06-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DBMovi-GS: Dynamic View Synthesis from Blurry Monocular Video via Sparse-Controlled Gaussian Splatting Yeon-Ji Song et.al. Updated 2025-06-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3DGH: 3D Head Generation with Composable Hair and Face Chengan He et.al. Updated 2025-06-25

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model Tengbo Yu et.al. Updated 2025-06-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Virtual Memory for 3D Gaussian Splatting Jonathan Haberl et.al. Updated 2025-06-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis Xiaoyuan Wang et.al. Updated 2025-06-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM Annika Thomas et.al. Updated 2025-06-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs Michal Nazarczuk et.al. Updated 2025-06-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3D Arena: An Open Platform for Generative 3D Evaluation Dylan Ebert et.al. Updated 2025-06-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Reconstructing Tornadoes in 3D with Gaussian Splatting Adam Yang et.al. Updated 2025-06-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision Weeyoung Kwon et.al. Updated 2025-06-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3D Gaussian Splatting for Fine-Detailed Surface Reconstruction in Large-Scale Scene Shihan Chen et.al. Updated 2025-06-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting Tianjiao Yu et.al. Updated 2025-06-20

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos Kaifeng Zhang et.al. Updated 2025-06-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories Qingsong Yan et.al. Updated 2025-06-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction Zhengquan Zhang et.al. Updated 2025-06-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting Ziqiao Peng et.al. Updated 2025-06-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3DGS-IEval-15K: A Large-scale Image Quality Evaluation Database for 3D Gaussian-Splatting Yuke Xing et.al. Updated 2025-06-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HRGS: Hierarchical Gaussian Splatting for Memory-Efficient High-Resolution 3D Reconstruction Changbai Li et.al. Updated 2025-06-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GAF: Gaussian Action Field as a Dvnamic World Model for Robotic Mlanipulation Ying Chai et.al. Updated 2025-06-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GRaD-Nav++: Vision-Language Model Enabled Visual Drone Navigation with Gaussian Radiance Fields and Differentiable Dynamics Qianzhong Chen et.al. Updated 2025-06-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images Lingteng Qiu et.al. Updated 2025-06-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Micro-macro Gaussian Splatting with Enhanced Scalability for Unconstrained Scene Reconstruction Yihui Li et.al. Updated 2025-06-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PointGS: Point Attention-Aware Sparse View Synthesis with Gaussian Splatting Lintao Xiang et.al. Updated 2025-06-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ODG: Occupancy Prediction Using Dual Gaussians Yunxiao Shi et.al. Updated 2025-06-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos Chieh Hubert Lin et.al. Updated 2025-06-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting Ziyi Wang et.al. Updated 2025-06-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DynaSplat: Dynamic-Static Gaussian Splatting with Hierarchical Motion Decomposition for Scene Reconstruction Junli Deng et.al. Updated 2025-06-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Self-Supervised Multi-Part Articulated Objects Modeling via Deformable Gaussian Splatting and Progressive Primitive Segmentation Haowen Wang et.al. Updated 2025-06-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gaussian Herding across Pens: An Optimal Transport Perspective on Global Gaussian Reduction for 3DGS Tao Wang et.al. Updated 2025-06-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HAIF-GS: Hierarchical and Induced Flow-Guided Gaussian Splatting for Dynamic Scene Jianing Chen et.al. Updated 2025-06-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
TinySplat: Feedforward Approach for Generating Compact 3D Scene Representation Zetian Song et.al. Updated 2025-06-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images Qijian Tian et.al. Updated 2025-06-11

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes Allen Tu et.al. Updated 2025-06-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution Shuja Khalid et.al. Updated 2025-06-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation William Ljungbergh et.al. Updated 2025-06-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting Jens Piekenbrinck et.al. Updated 2025-06-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ProSplat: Improved Feed-Forward 3D Gaussian Splatting for Wide-Baseline Sparse Views Xiaohan Lu et.al. Updated 2025-06-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PIG: Physically-based Multi-Material Interaction with 3D Gaussians Zeyu Xiao et.al. Updated 2025-06-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation Yijie Deng et.al. Updated 2025-06-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Accelerating 3D Gaussian Splatting with Neural Sorting and Axis-Oriented Rasterization Zhican Wang et.al. Updated 2025-06-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Hybrid Mesh-Gaussian Representation for Efficient Indoor Scene Reconstruction Binxiao Huang et.al. Updated 2025-06-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gaussian Mapping for Evolving Scenes Vladimir Yugay et.al. Updated 2025-06-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Pseudo-Simulation for Autonomous Driving Wei Cao et.al. Updated 2025-06-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting Hengyu Liu et.al. Updated 2025-06-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data Ben Moran et.al. Updated 2025-06-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
JointSplat: Probabilistic Joint Flow-Depth Optimization for Sparse-View Gaussian Splatting Yang Xiao et.al. Updated 2025-06-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting Shengjie Lin et.al. Updated 2025-06-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Robust Neural Rendering in the Wild with Asymmetric Dual 3D Gaussian Splatting Chengqi Li et.al. Updated 2025-06-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Voyager: Real-Time Splatting City-Scale 3D Gaussians on Your Phone Zheng Liu et.al. Updated 2025-06-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Multi-Spectral Gaussian Splatting with Neural Color Representation Lukas Meyer et.al. Updated 2025-06-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM Roman Titkov et.al. Updated 2025-06-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Large Processor Chip Model Kaiyan Chang et.al. Updated 2025-06-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS Weijie Wang et.al. Updated 2025-05-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views Lihan Jiang et.al. Updated 2025-05-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Mobi-$π$: Mobilizing Your Robot Learning Policy Jingyun Yang et.al. Updated 2025-05-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Radiant Triangle Soup with Soft Connectivity Forces for 3D Reconstruction and Novel View Synthesis Nathaniel Burgdorfer et.al. Updated 2025-05-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Holistic Large-Scale Scene Reconstruction via Mixed Gaussian Splatting Chuandong Liu et.al. Updated 2025-05-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering Jonas Kulhanek et.al. Updated 2025-05-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Pose-free 3D Gaussian splatting via shape-ray estimation Youngju Na et.al. Updated 2025-05-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3DGS Compression with Sparsity-guided Hierarchical Transform Coding Hao Xu et.al. Updated 2025-05-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting Kornel Howil et.al. Updated 2025-05-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
STDR: Spatio-Temporal Decoupling for Real-Time Dynamic Scene Rendering Zehao Li et.al. Updated 2025-05-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SHaDe: Compact and Consistent Dynamic 3D Reconstruction via Tri-Plane Deformation and Latent Diffusion Asrar Alruwayqi et.al. Updated 2025-05-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction Jiacong Chen et.al. Updated 2025-05-22

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
RUSplatting: Robust 3D Gaussian Splatting for Sparse-View Underwater Scene Reconstruction Zhuodong Jiang et.al. Updated 2025-05-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PlantDreamer: Achieving Realistic 3D Plant Models with Diffusion-Guided Gaussian Splatting Zane K J Hartley et.al. Updated 2025-05-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
R3GS: Gaussian Splatting for Robust Reconstruction and Relocalization in Unconstrained Image Collections Xu yan et.al. Updated 2025-05-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation Yuchen Li et.al. Updated 2025-05-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
X-GRM: Large Gaussian Reconstruction Model for Sparse-view X-rays to Computed Tomography Yifan Liu et.al. Updated 2025-05-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GT^2-GS: Geometry-aware Texture Transfer for Gaussian Splatting Wenjie Liu et.al. Updated 2025-05-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models Yifan Liu et.al. Updated 2025-05-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning Amine Elhafsi et.al. Updated 2025-05-20

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos Ruoyu Wang et.al. Updated 2025-05-19

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation Seungjun Oh et.al. Updated 2025-05-19

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3D Gaussian Adaptive Reconstruction for Fourier Light-Field Microscopy Chenyu Xu et.al. Updated 2025-05-19

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy Luyao Lei et.al. Updated 2025-05-19

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Is Semantic SLAM Ready for Embedded Systems ? A Comparative Survey Calvin Galagain et.al. Updated 2025-05-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity Takuya Ikeda et.al. Updated 2025-05-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos Hongyi Zhou et.al. Updated 2025-05-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gaussian Splatting as a Unified Representation for Autonomy in Unstructured Environments Dexter Ong et.al. Updated 2025-05-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Exploiting Radiance Fields for Grasp Generation on Novel Synthetic Views Abhishek Kashyap et.al. Updated 2025-05-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GrowSplat: Constructing Temporal Digital Twins of Plants with Gaussian Splats Simeon Adebola et.al. Updated 2025-05-16

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware Justin Yu et.al. Updated 2025-05-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Neural Video Compression using 2D Gaussian Splatting Lakshya Gupta et.al. Updated 2025-05-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance Wenzhe Cai et.al. Updated 2025-05-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DLO-Splatting: Tracking Deformable Linear Objects Using 3D Gaussian Splatting Holly Dinkel et.al. Updated 2025-05-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
FOCI: Trajectory Optimization on Gaussian Splats Mario Gomez Andreu et.al. Updated 2025-05-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering Chuanzhi Xu et.al. Updated 2025-05-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ADC-GS: Anchor-Driven Deformable and Compressed Gaussian Splatting for Dynamic Scene Reconstruction He Huang et.al. Updated 2025-05-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset Olaf Wysocki et.al. Updated 2025-05-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SLAG: Scalable Language-Augmented Gaussian Splatting Laszlo Szilagyi et.al. Updated 2025-05-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GIFStream: 4D Gaussian-based Immersive Video with Feature Stream Hao Li et.al. Updated 2025-05-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation Yonwoo Choi et.al. Updated 2025-05-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields Runfeng Li et.al. Updated 2025-05-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction Xinran Yang et.al. Updated 2025-05-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GSsplat: Generalizable Semantic Gaussian Splatting for Novel-view Synthesis in 3D Scenes Feng Xiao et.al. Updated 2025-05-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting Feng Yang et.al. Updated 2025-05-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3D Gaussian Splatting Data Compression with Mixture of Priors Lei Liu et.al. Updated 2025-05-06

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Sparfels: Fast Reconstruction from Sparse Unposed Imagery Shubhendu Jena et.al. Updated 2025-05-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SparSplat: Fast Multi-View Reconstruction with Generalizable 2D Gaussian Splatting Shubhendu Jena et.al. Updated 2025-05-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GarmentGS: Point-Cloud Guided Gaussian Splatting for High-Fidelity Non-Watertight 3D Garment Reconstruction Zhihao Tang et.al. Updated 2025-05-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SignSplat: Rendering Sign Language via Gaussian Splatting Maksym Ivashechkin et.al. Updated 2025-05-04

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
FalconWing: An Open-Source Platform for Ultra-Light Fixed-Wing Aircraft Research Yan Miao et.al. Updated 2025-05-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Compensating Spatiotemporally Inconsistent Observations for Online Dynamic 3D Gaussian Splatting Youngsik Yun et.al. Updated 2025-05-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Real-Time Animatable 2DGS-Avatars with Detail Enhancement from Monocular Videos Xia Yuan et.al. Updated 2025-05-01

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting Jongwon Lee et.al. Updated 2025-05-01

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond Jiajia Li et.al. Updated 2025-04-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Haiyang Zhou et.al. Updated 2025-04-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
4DGS-CC: A Contextual Coding Framework for 4D Gaussian Splatting Data Compression Zicong Chen et.al. Updated 2025-04-30

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction Yuhan Xie et.al. Updated 2025-04-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GaussTrap: Stealthy Poisoning Attacks on 3D Gaussian Splatting for Targeted Scene Confusion Jiaxin Hong et.al. Updated 2025-04-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
EfficientHuman: Efficient Training and Reconstruction of Moving Human using Articulated 2D Gaussian Hao Tian et.al. Updated 2025-04-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting Hanxi Liu et.al. Updated 2025-04-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views Jiang Wu et.al. Updated 2025-04-29

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Mesh-Learner: Texturing Mesh with Spherical Harmonics Yunfei Wan et.al. Updated 2025-04-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CE-NPBG: Connectivity Enhanced Neural Point-Based Graphics for Novel View Synthesis in Autonomous Driving Scenes Mohammad Altillawi et.al. Updated 2025-04-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GSFF-SLAM: 3D Semantic Gaussian Splatting SLAM via Feature Field Zuxing Lu et.al. Updated 2025-04-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Rendering Anywhere You See: Renderability Field-guided Gaussian Splatting Xiaofeng Jin et.al. Updated 2025-04-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos Shucheng Gong et.al. Updated 2025-04-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Gaussian Splatting is an Effective Data Generator for 3D Object Detection Farhad G. Zanjani et.al. Updated 2025-04-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation Wenxuan Li et.al. Updated 2025-04-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction Zhongtao Wang et.al. Updated 2025-04-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration Andrea Conti et.al. Updated 2025-04-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians Cailin Zhuang et.al. Updated 2025-04-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Immersive Teleoperation Framework for Locomanipulation Tasks Takuya Boehringer et.al. Updated 2025-04-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video Minh-Quan Viet Bui et.al. Updated 2025-04-21

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
IXGS-Intraoperative 3D Reconstruction from Sparse, Arbitrarily Posed Real X-rays Sascha Jecklin et.al. Updated 2025-04-20

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation Junyuan Fang et.al. Updated 2025-04-20

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos Zetong Zhang et.al. Updated 2025-04-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation Sizhe Yang et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Digital Twin Generation from Visual Data: A Survey Andrew Melnik et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs Shaohui Dai et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CompGS++: Compressed Gaussian Splatting for Static and Dynamic Scene Representation Xiangrui Liu et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration Rendong Zhang et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Second-order Optimization of Gaussian Splats with Importance Sampling Hamza Pehlivan et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering Michael Steiner et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
CAGE-GS: High-fidelity Cage Based 3D Gaussian Splatting Deformation Yifei Tong et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors Mingwei Li et.al. Updated 2025-04-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis Hao Sun et.al. Updated 2025-04-15

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting Junlin Hao et.al. Updated 2025-04-15

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting Zeren Jiang et.al. Updated 2025-04-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting Huiqi Wu et.al. Updated 2025-04-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
EBAD-Gaussian: Event-driven Bundle Adjusted Deblur Gaussian Splatting Yufei Deng et.al. Updated 2025-04-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling Yunpeng Tan et.al. Updated 2025-04-14

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting Zhicong Wu et.al. Updated 2025-04-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DropoutGS: Dropping Out Gaussians for Better Sparse-view Rendering Yexing Xu et.al. Updated 2025-04-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds Jizong Peng et.al. Updated 2025-04-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting Jeongwan On et.al. Updated 2025-04-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Stochastic Ray Tracing of 3D Transparent Gaussians Xin Sun et.al. Updated 2025-04-10

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting Daiwei Zhang et.al. Updated 2025-04-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments Can Zhang et.al. Updated 2025-04-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering Hanxiao Sun et.al. Updated 2025-04-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GSta: Efficient Training Scheme with Siestaed Gaussians for Monocular 3D Scene Reconstruction Anil Armagan et.al. Updated 2025-04-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Collision avoidance from monocular vision trained with novel view synthesis Valentin Tordjman--Levavasseur et.al. Updated 2025-04-09

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Micro-splatting: Maximizing Isotropic Constraints for Refined Optimization in 3D Gaussian Splatting Jee Won Lee et.al. Updated 2025-04-08

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
View-Dependent Deformation Fields for 2D Editing of 3D Models Martin El Mqirmi et.al. Updated 2025-04-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery Yi-Zhen Tsai et.al. Updated 2025-04-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Let it Snow! Animating Static Gaussian Scenes With Dynamic Weather Effects Gal Fiebelman et.al. Updated 2025-04-07

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM Renwu Li et.al. Updated 2025-04-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation Yuan Zhou et.al. Updated 2025-04-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Digital-twin imaging based on descattering Gaussian splatting Suguru Shimomura et.al. Updated 2025-04-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting Shu-Wei Lu et.al. Updated 2025-04-03

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
UAVTwin: Neural Digital Twins for UAVs using Gaussian Splatting Jaehoon Choi et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
WorldPrompter: Traversable Text-to-Scene Generation Zhaoyang Zhang et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis Niluthpol Chowdhury Mithun et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
BOGausS: Better Optimized Gaussian Splatting Stéphane Pateux et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking Ulas Gunes et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
FlowR: Flowing from Sparse to Dense 3D Reconstructions Tobias Fischer et.al. Updated 2025-04-02

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting Boyang et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis Shuai Shen et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting Xu Wang et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
AH-GS: Augmented 3D Gaussian Splatting for High-Frequency Detail Representation Chenyang Xu et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Follow Your Motion: A Generic Temporal Consistency Portrait Editing Framework with Trajectory Guidance Haijie Yang et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting Wenjie Liu et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Segment then Splat: A Unified Approach for 3D Open-Vocabulary Segmentation based on Gaussian Splatting Yiren Lu et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Disentangled 4D Gaussian Splatting: Towards Faster and More Efficient Dynamic Scene Rendering Hao Feng et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LandMarkSystem Technical Report Zhenxiang Ma et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Shijie Zhou et.al. Updated 2025-03-28

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction Weihao Yu et.al. Updated 2025-03-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying Hairong Yin et.al. Updated 2025-03-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting Qiyu Dai et.al. Updated 2025-03-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Frequency-Aware Gaussian Splatting Decomposition Yishai Lavi et.al. Updated 2025-03-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
StyledStreets: Multi-style Street Simulator with Spatial and Temporal Consistency Yuyin Chen et.al. Updated 2025-03-27

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PGC: Physics-Based Gaussian Cloth from a Single Pose Michelle Guo et.al. Updated 2025-03-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
TC-GS: Tri-plane based compression for 3D Gaussian Splatting Taorui Wang et.al. Updated 2025-03-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis Sheng Miao et.al. Updated 2025-03-26

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
StableGS: A Floater-Free Framework for 3D Gaussian Splatting Luchao Wang et.al. Updated 2025-03-25

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting Yulong Zheng et.al. Updated 2025-03-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GS-Marker: Generalizable and Robust Watermarking for 3D Gaussian Splatting Lijiang Li et.al. Updated 2025-03-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Hardware-Rasterized Ray-Based Gaussian Splatting Samuel Rota Bulò et.al. Updated 2025-03-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LLGS: Unsupervised Gaussian Splatting for Image Enhancement and Reconstruction in Pure Dark Environment Haoran Wang et.al. Updated 2025-03-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video Qiang Hu et.al. Updated 2025-03-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds Youyu Chen et.al. Updated 2025-03-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GI-SLAM: Gaussian-Inertial SLAM Xulang Liu et.al. Updated 2025-03-24

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving Junhao Ge et.al. Updated 2025-03-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding Hongjia Zhai et.al. Updated 2025-03-23

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
HandSplat: Embedding-Driven Gaussian Splatting for High-Fidelity Hand Rendering Yilan Dong et.al. Updated 2025-03-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
SplatVoxel: History-Aware Novel View Streaming without Temporal Training Yiming Wang et.al. Updated 2025-03-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Optimized 3D Gaussian Splatting using Coarse-to-Fine Image Frequency Modulation Umar Farooq et.al. Updated 2025-03-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Improving Adaptive Density Control for 3D Gaussian Splatting Glenn Grubert et.al. Updated 2025-03-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images Junjin Xiao et.al. Updated 2025-03-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Lightweight Gradient-Aware Upscaling of 3D Gaussian Splatting Images Simon Niedermayr et.al. Updated 2025-03-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting Runsong Zhu et.al. Updated 2025-03-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Light4GS: Lightweight Compact 4D Gaussian Splatting Generation via Context Model Mufan Liu et.al. Updated 2025-03-18

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors Katja Schwarz et.al. Updated 2025-03-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
DeGauss: Dynamic-Static Decomposition with Gaussian Splatting for Distractor-free 3D Reconstruction Rui Wang et.al. Updated 2025-03-17

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds Lingteng Qiu et.al. Updated 2025-03-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction Yingshuang Zou et.al. Updated 2025-03-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Wanhua Li et.al. Updated 2025-03-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames Zhiqi Li et.al. Updated 2025-03-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
ROODI: Reconstructing Occluded Objects with Denoising Inpainters Yeonjin Chang et.al. Updated 2025-03-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction Jianheng Liu et.al. Updated 2025-03-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
3D Student Splatting and Scooping Jialin Zhu et.al. Updated 2025-03-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping Jinfeng Liu et.al. Updated 2025-03-13

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation Máté Tóth et.al. Updated 2025-03-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand
Online Language Splatting Saimouli Katragadda et.al. Updated 2025-03-12

Abstract unavailable in cached data. It will appear after the next refresh.

Preview loads on expand