Xinrun Xu

High-Quality Pseudo-Label Generation Based on Visual Prompt Assisted Cloud Model Update

Apr 01, 2025

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Mar 16, 2025

Vulnerability of Text-to-Image Models to Prompt Template Stealing: A Differential Evolution Approach

Feb 20, 2025

Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset

Dec 28, 2024

SELU: Self-Learning Embodied MLLMs in Unknown Environments

Oct 04, 2024

A Clustering Method with Graph Maximum Decoding Information

Mar 18, 2024

A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges

Mar 15, 2024

A Multi-constraint and Multi-objective Allocation Model for Emergency Rescue in IoT Environment

Mar 15, 2024

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

Mar 07, 2024

Can Large Language Models Recall Reference Location Like Humans?

Feb 26, 2024