Picture for Xinyue Zhang

Xinyue Zhang

Beyond Flatlands: Unlocking Spatial Intelligence by Decoupling 3D Reasoning from Numerical Regression

Add code
Nov 18, 2025
Viaarxiv icon

Stroke Modeling Enables Vectorized Character Generation with Large Vectorized Glyph Model

Add code
Nov 14, 2025
Viaarxiv icon

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Add code
Mar 11, 2025
Viaarxiv icon

Semantic Web and Creative AI -- A Technical Report from ISWS 2023

Add code
Jan 30, 2025
Figure 1 for Semantic Web and Creative AI -- A Technical Report from ISWS 2023
Figure 2 for Semantic Web and Creative AI -- A Technical Report from ISWS 2023
Figure 3 for Semantic Web and Creative AI -- A Technical Report from ISWS 2023
Figure 4 for Semantic Web and Creative AI -- A Technical Report from ISWS 2023
Viaarxiv icon

A Machine Learning Approach for Emergency Detection in Medical Scenarios Using Large Language Models

Add code
Dec 20, 2024
Figure 1 for A Machine Learning Approach for Emergency Detection in Medical Scenarios Using Large Language Models
Figure 2 for A Machine Learning Approach for Emergency Detection in Medical Scenarios Using Large Language Models
Figure 3 for A Machine Learning Approach for Emergency Detection in Medical Scenarios Using Large Language Models
Figure 4 for A Machine Learning Approach for Emergency Detection in Medical Scenarios Using Large Language Models
Viaarxiv icon

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Add code
Dec 12, 2024
Figure 1 for InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Figure 2 for InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Figure 3 for InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Figure 4 for InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Viaarxiv icon

Homotopy Continuation Made Easy: Regression-based Online Simulation of Starting Problem-Solution Pairs

Add code
Nov 06, 2024
Viaarxiv icon

Mobility-LLM: Learning Visiting Intentions and Travel Preferences from Human Mobility Data with Large Language Models

Add code
Oct 29, 2024
Figure 1 for Mobility-LLM: Learning Visiting Intentions and Travel Preferences from Human Mobility Data with Large Language Models
Figure 2 for Mobility-LLM: Learning Visiting Intentions and Travel Preferences from Human Mobility Data with Large Language Models
Figure 3 for Mobility-LLM: Learning Visiting Intentions and Travel Preferences from Human Mobility Data with Large Language Models
Figure 4 for Mobility-LLM: Learning Visiting Intentions and Travel Preferences from Human Mobility Data with Large Language Models
Viaarxiv icon

How Initial Connectivity Shapes Biologically Plausible Learning in Recurrent Neural Networks

Add code
Oct 15, 2024
Viaarxiv icon

fCOP: Focal Length Estimation from Category-level Object Priors

Add code
Sep 29, 2024
Figure 1 for fCOP: Focal Length Estimation from Category-level Object Priors
Figure 2 for fCOP: Focal Length Estimation from Category-level Object Priors
Figure 3 for fCOP: Focal Length Estimation from Category-level Object Priors
Figure 4 for fCOP: Focal Length Estimation from Category-level Object Priors
Viaarxiv icon