Picture for Ke Li

Ke Li

Jack

Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray

Add code
Feb 07, 2025
Viaarxiv icon

Dual-BEV Nav: Dual-layer BEV-based Heuristic Path Planning for Robotic Navigation in Unstructured Outdoor Environments

Add code
Jan 30, 2025
Figure 1 for Dual-BEV Nav: Dual-layer BEV-based Heuristic Path Planning for Robotic Navigation in Unstructured Outdoor Environments
Figure 2 for Dual-BEV Nav: Dual-layer BEV-based Heuristic Path Planning for Robotic Navigation in Unstructured Outdoor Environments
Figure 3 for Dual-BEV Nav: Dual-layer BEV-based Heuristic Path Planning for Robotic Navigation in Unstructured Outdoor Environments
Figure 4 for Dual-BEV Nav: Dual-layer BEV-based Heuristic Path Planning for Robotic Navigation in Unstructured Outdoor Environments
Viaarxiv icon

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

Add code
Jan 27, 2025
Viaarxiv icon

The ICME 2025 Audio Encoder Capability Challenge

Add code
Jan 25, 2025
Viaarxiv icon

Solving the Catastrophic Forgetting Problem in Generalized Category Discovery

Add code
Jan 09, 2025
Viaarxiv icon

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Add code
Jan 03, 2025
Viaarxiv icon

Probability-density-aware Semi-supervised Learning

Add code
Dec 23, 2024
Viaarxiv icon

Rethinking Performance Analysis for Configurable Software Systems: A Case Study from a Fitness Landscape Perspective

Add code
Dec 22, 2024
Viaarxiv icon

Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition

Add code
Dec 19, 2024
Viaarxiv icon

VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis

Add code
Dec 17, 2024
Figure 1 for VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
Figure 2 for VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
Figure 3 for VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
Figure 4 for VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
Viaarxiv icon