Picture for Yun Fu

Yun Fu

Text-to-3D Gaussian Splatting with Physics-Grounded Motion Generation

Add code
Dec 07, 2024
Viaarxiv icon

Slicing Vision Transformer for Flexible Inference

Add code
Dec 06, 2024
Viaarxiv icon

Towards Zero-shot 3D Anomaly Localization

Add code
Dec 05, 2024
Viaarxiv icon

LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field

Add code
Sep 26, 2024
Figure 1 for LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field
Figure 2 for LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field
Figure 3 for LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field
Figure 4 for LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field
Viaarxiv icon

Accessing Vision Foundation Models at ImageNet-level Costs

Add code
Jul 15, 2024
Viaarxiv icon

SoupLM: Model Integration in Large Language and Multi-Modal Models

Add code
Jul 11, 2024
Viaarxiv icon

Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models

Add code
Jun 19, 2024
Figure 1 for Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models
Figure 2 for Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models
Figure 3 for Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models
Figure 4 for Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models
Viaarxiv icon

Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent

Add code
May 27, 2024
Figure 1 for Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent
Figure 2 for Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent
Figure 3 for Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent
Figure 4 for Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent
Viaarxiv icon

Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering

Add code
Apr 16, 2024
Figure 1 for Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
Figure 2 for Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
Figure 3 for Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
Figure 4 for Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
Viaarxiv icon

Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement

Add code
Apr 06, 2024
Figure 1 for Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
Figure 2 for Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
Figure 3 for Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
Figure 4 for Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
Viaarxiv icon