Picture for Yicheng Feng

Yicheng Feng

From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities

Add code
Oct 03, 2024
Viaarxiv icon

UniCode: Learning a Unified Codebook for Multimodal Large Language Models

Add code
Mar 14, 2024
Viaarxiv icon

Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds

Add code
Oct 20, 2023
Viaarxiv icon

LLaMA Rider: Spurring Large Language Models to Explore the Open World

Add code
Oct 13, 2023
Viaarxiv icon

Learning Multi-Object Positional Relationships via Emergent Communication

Add code
Feb 16, 2023
Viaarxiv icon