Wen Wang

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning

Nov 15, 2024

MagicQuill: An Intelligent Interactive Image Editing System

Nov 14, 2024

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Nov 05, 2024

Framer: Interactive Frame Interpolation

Oct 24, 2024

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

Oct 23, 2024

Two Birds With One Stone: Enhancing Communication and Sensing via Multi-Functional RIS

Oct 09, 2024

Unified Audio Event Detection

Sep 13, 2024

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Aug 29, 2024

Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts

Aug 19, 2024

Multimodal Fusion and Coherence Modeling for Video Topic Segmentation

Aug 01, 2024