Picture for Tianyi Wu

Tianyi Wu

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Add code
Apr 26, 2026
Viaarxiv icon

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Add code
Feb 07, 2026
Viaarxiv icon

Autoregressive, Yet Revisable: In Decoding Revision for Secure Code Generation

Add code
Feb 01, 2026
Viaarxiv icon

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

Add code
Nov 11, 2025
Viaarxiv icon

Investigating Advanced Reasoning of Large Language Models via Black-Box Interaction

Add code
Aug 26, 2025
Viaarxiv icon

Geneshift: Impact of different scenario shift on Jailbreaking LLM

Add code
Apr 10, 2025
Viaarxiv icon

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Add code
Mar 18, 2025
Viaarxiv icon

Navigating the Helpfulness-Truthfulness Trade-Off with Uncertainty-Aware Instruction Fine-Tuning

Add code
Feb 17, 2025
Viaarxiv icon

GuardReasoner: Towards Reasoning-based LLM Safeguards

Add code
Jan 30, 2025
Figure 1 for GuardReasoner: Towards Reasoning-based LLM Safeguards
Figure 2 for GuardReasoner: Towards Reasoning-based LLM Safeguards
Figure 3 for GuardReasoner: Towards Reasoning-based LLM Safeguards
Figure 4 for GuardReasoner: Towards Reasoning-based LLM Safeguards
Viaarxiv icon

Dual-Optimized Adaptive Graph Reconstruction for Multi-View Graph Clustering

Add code
Oct 30, 2024
Viaarxiv icon