1-100 of about 247 matches for site:arxiv.org high-quality
[2306.09109] NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
https://arxiv.org/abs/2306.09109
https://arxiv.org/abs/1905.10711
[1905.10711] DISN: Deep Implicit Surface Network for High-quality Single-view 3D Reconstruction
https://arxiv.org/abs/2404.12385
[2404.12385] MeshLRM: Large Reconstruction Model for High-Quality Meshes
https://arxiv.org/abs/2311.17261
[2311.17261] SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors
https://arxiv.org/abs/2409.12957
[2409.12957] 3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
[2310.01406] HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation
https://arxiv.org/abs/2310.01406
https://arxiv.org/abs/2411.07126
[2411.07126] Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
https://arxiv.org/abs/2506.07643
We introduce ROBIN: an MLM instruction-tuned with densely annotated relationships capable of constructing high-quality dense scene graphs
https://arxiv.org/abs/2310.13772
latent texture. We thoroughly validate TexFusion and show that we can efficiently generate diverse, high quality and globally
https://arxiv.org/abs/2012.05116
flash, in low-light environments. Our goal is to produce a high-quality rendering of
https://arxiv.org/abs/2010.02502
Abstract: Denoising diffusion probabilistic models (DDPMs) have achieved high quality image generation without
https://arxiv.org/abs/2410.06231
Abstract: We propose RelitLRM, a Large Reconstruction Model (LRM) for generating high-quality Gaussian splatting representations
https://arxiv.org/abs/2103.00762
that this representation can be reconstructed using only multi-view image supervision and generates high-quality rendering results. More
https://arxiv.org/abs/2412.15689
distillation and consistency distillation to achieve few-step video generation, maintaining both high quality and diversity
https://arxiv.org/abs/2311.10709
for diffusion, and multi-stage training that enable us to directly generate high quality and high
https://arxiv.org/abs/2405.10314
Abstract: Advances in 3D reconstruction have enabled high-quality 3D capture, but
https://arxiv.org/abs/2412.12463
for sampling synthetic pattern analogies, enables the creation of a large, high-quality synthetic training dataset
https://arxiv.org/abs/2406.09401
problems to be addressed in the future. Furthermore, we use this high-quality dataset to
https://arxiv.org/abs/2312.09250
the surfaces of 3D shapes, with the goal of synthesizing high-quality textures. Our approach
https://arxiv.org/abs/2303.01469
limitation, we propose consistency models, a new family of models that generate high quality samples by directly
https://arxiv.org/abs/2505.10566
information into 3D space. We design a data generation pipeline to ensure high-quality 3D guidance throughout
[2305.15347] A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence
https://arxiv.org/abs/2305.15347
Text-to-image diffusion models have made significant advances in generating and editing high-quality images. As a
https://arxiv.org/abs/2406.09417
that calibrating the text conditioning of the source distribution can produce high-quality generation and
https://arxiv.org/abs/2407.03162
suite, achieving higher success rates and reduced task completion times. Moreover, the high-quality teleoperation demonstrations improve
https://arxiv.org/abs/2305.15399
the internal patch distribution from a single 3D textured shape and generates high-quality variations with fine
https://arxiv.org/abs/2311.17061
time. In this paper, we propose an efficient yet effective framework, HumanGaussian, that generates high-quality 3D humans with
[2111.11215] Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction
https://arxiv.org/abs/2111.11215
simple yet non-trivial techniques that contribute to fast convergence speed and high-quality output. First, we
https://arxiv.org/abs/2303.04803
Text-to-image diffusion models have the remarkable ability to generate high-quality images with diverse
https://arxiv.org/abs/2303.15951
is able to use the same perspective warping to render high-quality images on two
https://arxiv.org/abs/2503.16430
the strengths of both approaches, providing a promising direction for high-quality visual generation with
https://arxiv.org/abs/2306.07200
text-to-image synthesis models have achieved an exceptional level of photorealism, generating high-quality images from arbitrary
https://arxiv.org/abs/2312.02981
excel at rendering photorealistic novel views of complex scenes. However, recovering a high-quality NeRF typically requires
https://arxiv.org/abs/2104.08418
investigate the use of Neural Radiance Fields (NeRF) to learn high quality 3D object category
https://arxiv.org/abs/2404.19702
Abstract: We propose GS-LRM, a scalable large reconstruction model that can predict high-quality 3D Gaussian primitives
https://arxiv.org/abs/2406.07520
a single image of any object and can synthesize an accurate, high-quality relit image under
[2502.09614] DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References
https://arxiv.org/abs/2502.09614
performance in dynamic environments. At the same time, to obtain high-quality tracking demonstrations, we
https://arxiv.org/abs/2406.09371
LRM-Zero, a Large Reconstruction Model (LRM) trained entirely on synthesized 3D data, achieving high-quality sparse-view 3D
https://arxiv.org/abs/2411.17249
normal training data. Instead of relying on large-scale annotated video datasets, we demonstrate high-quality video buffer estimation
https://arxiv.org/abs/2211.16677
train existing 2D diffusion models on these representations to generate 3D neural fields with high quality and diversity
[2301.07525] OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation
https://arxiv.org/abs/2301.07525
the real world, we propose OmniObject3D, a large vocabulary 3D object dataset with massive high-quality real-scanned 3D
https://arxiv.org/abs/2008.01815
360$^{\circ}$ images. MDPs are more compact than previous 3D scene representations and enable high-quality, efficient new view
https://arxiv.org/abs/2304.02744
ambiguous regions. Our optimization shares information between two poses, which allows us to produce high fidelity and realistic
https://arxiv.org/abs/2303.12074
have created a 3D GAN that is both efficient and of high quality, while allowing for
https://arxiv.org/abs/2403.17888
Abstract: 3D Gaussian Splatting (3DGS) has recently revolutionized radiance field reconstruction, achieving high quality novel view synthesis
https://arxiv.org/abs/2311.16854
1) 3D and 2D diffusion guidance to effectively learn a high-quality static 3D asset
https://arxiv.org/abs/2405.17414
Abstract: Research on video generation has recently made tremendous progress, enabling high-quality videos to
https://arxiv.org/abs/2208.01626
in the image. We present our results over diverse images and prompts, demonstrating high-quality synthesis and
https://arxiv.org/abs/2311.18828
Abstract: Diffusion models generate high-quality images but require
https://arxiv.org/abs/2405.01796
in this work, we develop a completely automated process to generate high-quality topic pages for
https://arxiv.org/abs/2412.09621
of obtaining ground truth annotations. We present a system for mining high-quality 4D reconstructions from
https://arxiv.org/abs/2406.00609
the problem of the shortage of large repositories of high-quality 3D training models
https://arxiv.org/abs/2003.12642
We introduce a novel learning-based method to reconstruct the high-quality geometry and
https://arxiv.org/abs/2008.03824
challenging effects like specularities, shadows and occlusions. This allows us to perform high-quality view synthesis and
https://arxiv.org/abs/2311.06214
data. In this paper, we propose Instant3D, a novel method that generates high-quality and diverse
https://arxiv.org/abs/2312.11461
NeRF-based representations. However, a naive application of Gaussian splatting cannot generate high-quality animatable avatars and
https://arxiv.org/abs/2010.04595
that visual occlusions are implicitly taken into account. Extensive experiments demonstrate that our method can generate high-quality and realistic
https://arxiv.org/abs/2310.15110
off-the-shelf image diffusion models such as Stable Diffusion. Zero123++ excels in producing high-quality, consistent multi-view
https://arxiv.org/abs/2311.09217
object parts is required for generating diverse reconstructions with sharp textures. We also show high-quality text-to
https://arxiv.org/abs/2312.08885
We incorporate an RGBD panorama diffusion model to mitigate it, resulting in high-quality geometry. Extensive evaluation
https://arxiv.org/abs/2311.17857
scheme bypasses the need for view-inconsistent upsamplers and achieves high-quality multi-view consistent
https://arxiv.org/abs/1809.09761
of 3D models but lack photorealistic appearance. We present an approach to automatically assign high-quality, realistic appearance models
https://arxiv.org/abs/2212.03860
Abstract: Cutting-edge diffusion models produce images with high quality and customizability
https://arxiv.org/abs/2206.14797
synthesis and editing tasks. Recent advances in this field have also enabled high-quality 3D or video
https://arxiv.org/abs/2403.12409
Abstract: Generating high-quality 3D assets from
https://arxiv.org/abs/2412.18565
advances in neural rendering, due to the scarcity of high-quality 3D datasets and
https://arxiv.org/abs/2112.07945
Abstract: Unsupervised generation of high-quality multi-view-consistent
https://arxiv.org/abs/2109.01349
the focus on dual-camera super-resolution (DCSR), which utilizes reference images for high-quality and high
https://arxiv.org/abs/2312.09168
and the initial diffusion noise map, which we utilize to consistently generate high-quality chrome balls. We
https://arxiv.org/abs/2406.07754
Comprehensive qualitative and quantitative evaluations demonstrate that HOI-Swap significantly outperforms existing methods, delivering high-quality video edits with
[2204.02232] IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images
https://arxiv.org/abs/2204.02232
a neural inverse rendering pipeline called IRON that operates on photometric images and outputs high-quality 3D content in
https://arxiv.org/abs/2212.05032
on text-to-image synthesis (T2I) tasks. Despite their ability to generate high-quality yet creative images
https://arxiv.org/abs/2210.09276
our method on numerous inputs from various domains, showcasing a plethora of high quality complex semantic image