Picture for Yutong Zhang

Yutong Zhang

TrajFlow: Multi-modal Motion Prediction via Flow Matching

Add code
Jun 10, 2025
Viaarxiv icon

Unfettered Forceful Skill Acquisition with Physical Reasoning and Coordinate Frame Labeling

Add code
May 14, 2025
Viaarxiv icon

Kimi-Audio Technical Report

Add code
Apr 25, 2025
Viaarxiv icon

Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning

Add code
Jan 12, 2025
Figure 1 for Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning
Figure 2 for Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning
Figure 3 for Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning
Figure 4 for Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning
Viaarxiv icon

A Machine Learning Approach to Contact Localization in Variable Density Three-Dimensional Tactile Artificial Skin

Add code
Dec 01, 2024
Figure 1 for A Machine Learning Approach to Contact Localization in Variable Density Three-Dimensional Tactile Artificial Skin
Figure 2 for A Machine Learning Approach to Contact Localization in Variable Density Three-Dimensional Tactile Artificial Skin
Figure 3 for A Machine Learning Approach to Contact Localization in Variable Density Three-Dimensional Tactile Artificial Skin
Figure 4 for A Machine Learning Approach to Contact Localization in Variable Density Three-Dimensional Tactile Artificial Skin
Viaarxiv icon

Way to Specialist: Closing Loop Between Specialized LLM and Evolving Domain Knowledge Graph

Add code
Nov 28, 2024
Figure 1 for Way to Specialist: Closing Loop Between Specialized LLM and Evolving Domain Knowledge Graph
Figure 2 for Way to Specialist: Closing Loop Between Specialized LLM and Evolving Domain Knowledge Graph
Figure 3 for Way to Specialist: Closing Loop Between Specialized LLM and Evolving Domain Knowledge Graph
Figure 4 for Way to Specialist: Closing Loop Between Specialized LLM and Evolving Domain Knowledge Graph
Viaarxiv icon

Legal Evalutions and Challenges of Large Language Models

Add code
Nov 15, 2024
Figure 1 for Legal Evalutions and Challenges of Large Language Models
Figure 2 for Legal Evalutions and Challenges of Large Language Models
Figure 3 for Legal Evalutions and Challenges of Large Language Models
Figure 4 for Legal Evalutions and Challenges of Large Language Models
Viaarxiv icon

UniMuMo: Unified Text, Music and Motion Generation

Add code
Oct 06, 2024
Figure 1 for UniMuMo: Unified Text, Music and Motion Generation
Figure 2 for UniMuMo: Unified Text, Music and Motion Generation
Figure 3 for UniMuMo: Unified Text, Music and Motion Generation
Figure 4 for UniMuMo: Unified Text, Music and Motion Generation
Viaarxiv icon

Evaluation of OpenAI o1: Opportunities and Challenges of AGI

Add code
Sep 27, 2024
Figure 1 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 2 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 3 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 4 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Viaarxiv icon

A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks

Add code
Aug 02, 2024
Figure 1 for A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks
Viaarxiv icon