Picture for Chong Zhang

Chong Zhang

Tony

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Add code
Jan 10, 2025
Viaarxiv icon

Bridging Adaptivity and Safety: Learning Agile Collision-Free Locomotion Across Varied Physics

Add code
Jan 08, 2025
Viaarxiv icon

OpenAI o1 System Card

Add code
Dec 21, 2024
Viaarxiv icon

DocFusion: A Unified Framework for Document Parsing Tasks

Add code
Dec 17, 2024
Figure 1 for DocFusion: A Unified Framework for Document Parsing Tasks
Figure 2 for DocFusion: A Unified Framework for Document Parsing Tasks
Figure 3 for DocFusion: A Unified Framework for Document Parsing Tasks
Figure 4 for DocFusion: A Unified Framework for Document Parsing Tasks
Viaarxiv icon

Diffusion Implicit Policy for Unpaired Scene-aware Motion Synthesis

Add code
Dec 03, 2024
Viaarxiv icon

FedAH: Aggregated Head for Personalized Federated Learning

Add code
Dec 02, 2024
Figure 1 for FedAH: Aggregated Head for Personalized Federated Learning
Figure 2 for FedAH: Aggregated Head for Personalized Federated Learning
Figure 3 for FedAH: Aggregated Head for Personalized Federated Learning
Figure 4 for FedAH: Aggregated Head for Personalized Federated Learning
Viaarxiv icon

Target-driven Attack for Large Language Models

Add code
Nov 13, 2024
Viaarxiv icon

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding

Add code
Sep 29, 2024
Viaarxiv icon

Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions

Add code
Sep 25, 2024
Viaarxiv icon