Kaiyan Zhao

When Attention Betrays: Erasing Backdoor Attacks in Robotic Policies by Reconstructing Visual Tokens

Feb 03, 2026

Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training

Jan 31, 2026

Benchmarking Machine Translation on Chinese Social Media Texts

Jan 30, 2026

NeoAMT: Neologism-Aware Agentic Machine Translation with Reinforcement Learning

Jan 07, 2026

EComStage: Stage-wise and Orientation-specific Benchmarking for Large Language Models in E-commerce

Jan 06, 2026

RGMP: Recurrent Geometric-prior Multimodal Policy for Generalizable Humanoid Robot Manipulation

Nov 12, 2025

Improving Multimodal Contrastive Learning of Sentence Embeddings with Object-Phrase Alignment

Aug 01, 2025

Direct Quantized Training of Language Models with Stochastic Rounding

Dec 06, 2024

Efficient Diversity-based Experience Replay for Deep Reinforcement Learning

Oct 27, 2024

Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay

Oct 16, 2024