Picture for Yicheng Feng

Yicheng Feng

Echo: Simulating Distributed Training At Scale

Add code
Dec 17, 2024
Viaarxiv icon

VideoOrion: Tokenizing Object Dynamics in Videos

Add code
Nov 25, 2024
Viaarxiv icon

From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities

Add code
Oct 03, 2024
Figure 1 for From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities
Figure 2 for From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities
Figure 3 for From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities
Figure 4 for From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities
Viaarxiv icon

UniCode: Learning a Unified Codebook for Multimodal Large Language Models

Add code
Mar 14, 2024
Viaarxiv icon

Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds

Add code
Oct 20, 2023
Viaarxiv icon

LLaMA Rider: Spurring Large Language Models to Explore the Open World

Add code
Oct 13, 2023
Viaarxiv icon

Learning Multi-Object Positional Relationships via Emergent Communication

Add code
Feb 16, 2023
Viaarxiv icon