Kaiyu Huang

Workflow-R1: Group Sub-sequence Policy Optimization for Multi-turn Workflow Construction

Feb 01, 2026

Language-Coupled Reinforcement Learning for Multilingual Retrieval-Augmented Generation

Jan 21, 2026

When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life

Jan 07, 2026

Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning

Oct 08, 2025

Boosting Data Utilization for Multilingual Dense Retrieval

Sep 11, 2025

Adaptive Personalized Conversational Information Retrieval

Aug 12, 2025

Multilingual Collaborative Defense for Large Language Models

May 17, 2025

Think in Safety: Unveiling and Mitigating Safety Alignment Collapse in Multimodal Large Reasoning Model

May 10, 2025

OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence

Mar 20, 2025

SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding

Mar 07, 2025