Picture for Xueyu Hu

Xueyu Hu

AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios

Add code
Jan 28, 2026
Viaarxiv icon

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Add code
Oct 21, 2025
Figure 1 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 2 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 3 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 4 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Viaarxiv icon

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

Add code
Aug 06, 2025
Figure 1 for OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Figure 2 for OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Figure 3 for OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Figure 4 for OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Viaarxiv icon

AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks

Add code
Feb 18, 2025
Figure 1 for AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks
Figure 2 for AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks
Figure 3 for AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks
Figure 4 for AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks
Viaarxiv icon

InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

Add code
Jan 08, 2025
Viaarxiv icon

Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation

Add code
Nov 06, 2024
Figure 1 for Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation
Figure 2 for Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation
Figure 3 for Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation
Figure 4 for Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation
Viaarxiv icon

Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling

Add code
Feb 22, 2024
Figure 1 for Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling
Figure 2 for Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling
Figure 3 for Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling
Figure 4 for Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling
Viaarxiv icon

Leveraging Print Debugging to Improve Code Generation in Large Language Models

Add code
Jan 10, 2024
Viaarxiv icon

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks

Add code
Jan 10, 2024
Figure 1 for InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Figure 2 for InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Figure 3 for InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Figure 4 for InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
Viaarxiv icon

One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code

Add code
May 12, 2022
Figure 1 for One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code
Figure 2 for One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code
Figure 3 for One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code
Figure 4 for One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code
Viaarxiv icon