Picture for Yike Guo

Yike Guo

EVA: An Embodied World Model for Future Video Anticipation

Add code
Oct 20, 2024
Figure 1 for EVA: An Embodied World Model for Future Video Anticipation
Figure 2 for EVA: An Embodied World Model for Future Video Anticipation
Figure 3 for EVA: An Embodied World Model for Future Video Anticipation
Figure 4 for EVA: An Embodied World Model for Future Video Anticipation
Viaarxiv icon

Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation

Add code
Oct 14, 2024
Figure 1 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 2 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 3 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 4 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Viaarxiv icon

You Know What I'm Saying -- Jailbreak Attack via Implicit Reference

Add code
Oct 04, 2024
Figure 1 for You Know What I'm Saying -- Jailbreak Attack via Implicit Reference
Figure 2 for You Know What I'm Saying -- Jailbreak Attack via Implicit Reference
Figure 3 for You Know What I'm Saying -- Jailbreak Attack via Implicit Reference
Figure 4 for You Know What I'm Saying -- Jailbreak Attack via Implicit Reference
Viaarxiv icon

PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

Add code
Sep 16, 2024
Viaarxiv icon

HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

Add code
Sep 04, 2024
Viaarxiv icon

Deep learning surrogate models of JULES-INFERNO for wildfire prediction on a global scale

Add code
Aug 30, 2024
Viaarxiv icon

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Add code
Aug 30, 2024
Figure 1 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 2 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 3 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 4 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Viaarxiv icon

AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems

Add code
Aug 27, 2024
Figure 1 for AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems
Figure 2 for AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems
Figure 3 for AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems
Figure 4 for AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems
Viaarxiv icon

Importance Weighting Can Help Large Language Models Self-Improve

Add code
Aug 19, 2024
Figure 1 for Importance Weighting Can Help Large Language Models Self-Improve
Figure 2 for Importance Weighting Can Help Large Language Models Self-Improve
Figure 3 for Importance Weighting Can Help Large Language Models Self-Improve
Figure 4 for Importance Weighting Can Help Large Language Models Self-Improve
Viaarxiv icon

NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models

Add code
Aug 18, 2024
Figure 1 for NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
Figure 2 for NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
Figure 3 for NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
Figure 4 for NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
Viaarxiv icon