Yutong Zhang

UniMuMo: Unified Text, Music and Motion Generation

Oct 06, 2024

Evaluation of OpenAI o1: Opportunities and Challenges of AGI

Sep 27, 2024

A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks

Aug 02, 2024

Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports

Jul 08, 2024

JIGGLE: An Active Sensing Framework for Boundary Parameters Estimation in Deformable Surgical Environments

May 16, 2024

CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies

Apr 23, 2024

Near-Far Field Codebook Design for IOS-Aided Multi-User Communications

Jan 16, 2024

Understanding LLMs: A Comprehensive Overview from Training to Inference

Jan 06, 2024

Ophtha-LLaMA2: A Large Language Model for Ophthalmology

Dec 08, 2023

ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

Oct 10, 2023