Picture for Wenhui Wang

Wenhui Wang

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Add code
Sep 11, 2025
Viaarxiv icon

VibeVoice Technical Report

Add code
Aug 26, 2025
Viaarxiv icon

Maximizing Asynchronicity in Event-based Neural Networks

Add code
May 16, 2025
Viaarxiv icon

Multimodal Latent Language Modeling with Next-Token Diffusion

Add code
Dec 11, 2024
Figure 1 for Multimodal Latent Language Modeling with Next-Token Diffusion
Figure 2 for Multimodal Latent Language Modeling with Next-Token Diffusion
Figure 3 for Multimodal Latent Language Modeling with Next-Token Diffusion
Figure 4 for Multimodal Latent Language Modeling with Next-Token Diffusion
Viaarxiv icon

RedStone: Curating General, Code, Math, and QA Data for Large Language Models

Add code
Dec 04, 2024
Viaarxiv icon

LICM: Effective and Efficient Long Interest Chain Modeling for News Recommendation

Add code
Aug 01, 2024
Figure 1 for LICM: Effective and Efficient Long Interest Chain Modeling for News Recommendation
Figure 2 for LICM: Effective and Efficient Long Interest Chain Modeling for News Recommendation
Figure 3 for LICM: Effective and Efficient Long Interest Chain Modeling for News Recommendation
Figure 4 for LICM: Effective and Efficient Long Interest Chain Modeling for News Recommendation
Viaarxiv icon

You Only Cache Once: Decoder-Decoder Architectures for Language Models

Add code
May 08, 2024
Viaarxiv icon

Multi-Head Mixture-of-Experts

Add code
Apr 23, 2024
Viaarxiv icon

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Add code
Feb 27, 2024
Viaarxiv icon

When an Image is Worth 1,024 x 1,024 Words: A Case Study in Computational Pathology

Add code
Dec 06, 2023
Viaarxiv icon