Xinyue Zhang

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Dec 12, 2024

Homotopy Continuation Made Easy: Regression-based Online Simulation of Starting Problem-Solution Pairs

Nov 06, 2024

Mobility-LLM: Learning Visiting Intentions and Travel Preferences from Human Mobility Data with Large Language Models

Oct 29, 2024

How Initial Connectivity Shapes Biologically Plausible Learning in Recurrent Neural Networks

Oct 15, 2024

fCOP: Focal Length Estimation from Category-level Object Priors

Sep 29, 2024

Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation

Jul 31, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Jul 03, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

May 29, 2024

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

Apr 09, 2024

InternLM2 Technical Report

Mar 26, 2024