Yike Guo

SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model

Dec 04, 2024
Via arXiv

Fire-Image-DenseNet (FIDN) for predicting wildfire burnt area using remote sensing data

Dec 02, 2024
Via arXiv

Foundation Cures Personalization: Recovering Facial Personalized Models' Prompt Consistency

Nov 22, 2024
Via arXiv

EVA: An Embodied World Model for Future Video Anticipation

Oct 20, 2024
Via arXiv

Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation

Oct 14, 2024
Via arXiv

You Know What I'm Saying -- Jailbreak Attack via Implicit Reference

Oct 04, 2024
Via arXiv

PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

Sep 16, 2024
Via arXiv

HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

Sep 04, 2024
Via arXiv

Deep learning surrogate models of JULES-INFERNO for wildfire prediction on a global scale

Aug 30, 2024
Via arXiv

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Aug 30, 2024
Via arXiv