Yang Yu

Tsinghua University

Boosting RL-Based Visual Reasoning with Selective Adversarial Entropy Intervention

Dec 11, 2025

An LLM-based Quantitative Framework for Evaluating High-Stealthy Backdoor Risks in OSS Supply Chains

Nov 17, 2025

Multi-agent In-context Coordination via Decentralized Memory Retrieval

Nov 13, 2025

ECVL-ROUTER: Scenario-Aware Routing for Vision-Language Models

Oct 31, 2025

InfiAgent: Self-Evolving Pyramid Agent Framework for Infinite Scenarios

Sep 26, 2025

ReLAM: Learning Anticipation Model for Rewarding Visual Robotic Manipulation

Sep 26, 2025

State-of-the-Art Dysarthric Speech Recognition with MetaICL for on-the-fly Personalization

Sep 19, 2025

TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation

Aug 19, 2025

CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models

Jun 09, 2025

TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine

May 29, 2025