Picture for Yang Deng

Yang Deng

DR-Arena: an Automated Evaluation Framework for Deep Research Agents

Add code
Jan 15, 2026
Viaarxiv icon

Mastering Diverse, Unknown, and Cluttered Tracks for Robust Vision-Based Drone Racing

Add code
Dec 11, 2025
Viaarxiv icon

MASim: Multilingual Agent-Based Simulation for Social Science

Add code
Dec 08, 2025
Figure 1 for MASim: Multilingual Agent-Based Simulation for Social Science
Figure 2 for MASim: Multilingual Agent-Based Simulation for Social Science
Figure 3 for MASim: Multilingual Agent-Based Simulation for Social Science
Figure 4 for MASim: Multilingual Agent-Based Simulation for Social Science
Viaarxiv icon

CaberNet: Causal Representation Learning for Cross-Domain HVAC Energy Prediction

Add code
Nov 10, 2025
Figure 1 for CaberNet: Causal Representation Learning for Cross-Domain HVAC Energy Prediction
Figure 2 for CaberNet: Causal Representation Learning for Cross-Domain HVAC Energy Prediction
Figure 3 for CaberNet: Causal Representation Learning for Cross-Domain HVAC Energy Prediction
Figure 4 for CaberNet: Causal Representation Learning for Cross-Domain HVAC Energy Prediction
Viaarxiv icon

E2Edev: Benchmarking Large Language Models in End-to-End Software Development Task

Add code
Oct 16, 2025
Viaarxiv icon

Contrastive Weak-to-strong Generalization

Add code
Oct 09, 2025
Figure 1 for Contrastive Weak-to-strong Generalization
Figure 2 for Contrastive Weak-to-strong Generalization
Figure 3 for Contrastive Weak-to-strong Generalization
Figure 4 for Contrastive Weak-to-strong Generalization
Viaarxiv icon

Exploring and Exploiting the Inherent Efficiency within Large Reasoning Models for Self-Guided Efficiency Enhancement

Add code
Jun 18, 2025
Viaarxiv icon

Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs

Add code
Jun 16, 2025
Figure 1 for Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Figure 2 for Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Figure 3 for Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Figure 4 for Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Viaarxiv icon

InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing

Add code
May 28, 2025
Viaarxiv icon

MPO: Multilingual Safety Alignment via Reward Gap Optimization

Add code
May 22, 2025
Viaarxiv icon