Picture for Zixuan Huang

Zixuan Huang

Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation

Add code
Nov 26, 2024
Viaarxiv icon

MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective

Add code
Nov 21, 2024
Figure 1 for MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective
Figure 2 for MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective
Figure 3 for MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective
Figure 4 for MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective
Viaarxiv icon

Implicit Contact Diffuser: Sequential Contact Reasoning with Latent Point Cloud Diffusion

Add code
Oct 21, 2024
Figure 1 for Implicit Contact Diffuser: Sequential Contact Reasoning with Latent Point Cloud Diffusion
Figure 2 for Implicit Contact Diffuser: Sequential Contact Reasoning with Latent Point Cloud Diffusion
Figure 3 for Implicit Contact Diffuser: Sequential Contact Reasoning with Latent Point Cloud Diffusion
Figure 4 for Implicit Contact Diffuser: Sequential Contact Reasoning with Latent Point Cloud Diffusion
Viaarxiv icon

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement

Add code
Aug 01, 2024
Figure 1 for SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
Figure 2 for SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
Figure 3 for SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
Figure 4 for SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
Viaarxiv icon

Multi-beam Training for Near-field Communications in High-frequency Bands

Add code
Jun 21, 2024
Figure 1 for Multi-beam Training for Near-field Communications in High-frequency Bands
Figure 2 for Multi-beam Training for Near-field Communications in High-frequency Bands
Figure 3 for Multi-beam Training for Near-field Communications in High-frequency Bands
Figure 4 for Multi-beam Training for Near-field Communications in High-frequency Bands
Viaarxiv icon

PointInfinity: Resolution-Invariant Point Diffusion Models

Add code
Apr 04, 2024
Viaarxiv icon

Subgoal Diffuser: Coarse-to-fine Subgoal Generation to Guide Model Predictive Control for Robot Manipulation

Add code
Mar 19, 2024
Figure 1 for Subgoal Diffuser: Coarse-to-fine Subgoal Generation to Guide Model Predictive Control for Robot Manipulation
Figure 2 for Subgoal Diffuser: Coarse-to-fine Subgoal Generation to Guide Model Predictive Control for Robot Manipulation
Figure 3 for Subgoal Diffuser: Coarse-to-fine Subgoal Generation to Guide Model Predictive Control for Robot Manipulation
Figure 4 for Subgoal Diffuser: Coarse-to-fine Subgoal Generation to Guide Model Predictive Control for Robot Manipulation
Viaarxiv icon

TripoSR: Fast 3D Object Reconstruction from a Single Image

Add code
Mar 04, 2024
Figure 1 for TripoSR: Fast 3D Object Reconstruction from a Single Image
Figure 2 for TripoSR: Fast 3D Object Reconstruction from a Single Image
Figure 3 for TripoSR: Fast 3D Object Reconstruction from a Single Image
Figure 4 for TripoSR: Fast 3D Object Reconstruction from a Single Image
Viaarxiv icon

ZeroShape: Regression-based Zero-shot Shape Reconstruction

Add code
Jan 16, 2024
Viaarxiv icon

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents

Add code
Jan 08, 2024
Figure 1 for If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Figure 2 for If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Figure 3 for If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Figure 4 for If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Viaarxiv icon