Zhenni Bi

Eve: Efficient Multimodal Vision Language Models with Elastic Visual Experts

Jan 08, 2025
Via arXiv

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

Dec 12, 2024
Via arXiv

Large OCR Model: An Empirical Study of Scaling Law for OCR

Jan 02, 2024
Via arXiv