Yang Deng

When Does Context Help? Error Dynamics of Contextual Information in Large Language Models

Feb 09, 2026

Large Language Model Agents Are Not Always Faithful Self-Evolvers

Jan 30, 2026

DR-Arena: an Automated Evaluation Framework for Deep Research Agents

Jan 15, 2026

Mastering Diverse, Unknown, and Cluttered Tracks for Robust Vision-Based Drone Racing

Dec 11, 2025

MASim: Multilingual Agent-Based Simulation for Social Science

Dec 08, 2025

CaberNet: Causal Representation Learning for Cross-Domain HVAC Energy Prediction

Nov 10, 2025

E2Edev: Benchmarking Large Language Models in End-to-End Software Development Task

Oct 16, 2025

Contrastive Weak-to-strong Generalization

Oct 09, 2025

Exploring and Exploiting the Inherent Efficiency within Large Reasoning Models for Self-Guided Efficiency Enhancement

Jun 18, 2025

Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs

Jun 16, 2025