Picture for Yuhao Jiang

Yuhao Jiang

Segmental Advantage Estimation: Enhancing PPO for Long-Context LLM Training

Add code
Jan 12, 2026
Viaarxiv icon

Cumulative Path-Level Semantic Reasoning for Inductive Knowledge Graph Completion

Add code
Jan 09, 2026
Viaarxiv icon

Training Report of TeleChat3-MoE

Add code
Dec 30, 2025
Viaarxiv icon

Flexible and Foldable: Workspace Analysis and Object Manipulation Using a Soft, Interconnected, Origami-Inspired Actuator Array

Add code
Sep 17, 2025
Viaarxiv icon

Results of the NeurIPS 2023 Neural MMO Competition on Multi-task Reinforcement Learning

Add code
Aug 17, 2025
Viaarxiv icon

Technical Report of TeleChat2, TeleChat2.5 and T1

Add code
Jul 24, 2025
Figure 1 for Technical Report of TeleChat2, TeleChat2.5 and T1
Figure 2 for Technical Report of TeleChat2, TeleChat2.5 and T1
Figure 3 for Technical Report of TeleChat2, TeleChat2.5 and T1
Figure 4 for Technical Report of TeleChat2, TeleChat2.5 and T1
Viaarxiv icon

Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework

Add code
Jul 09, 2025
Viaarxiv icon

Vibration of Soft, Twisted Beams for Under-Actuated Quadrupedal Locomotion

Add code
Jul 03, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

CPG-Based Manipulation with Multi-Module Origami Robot Surface

Add code
Feb 26, 2025
Figure 1 for CPG-Based Manipulation with Multi-Module Origami Robot Surface
Figure 2 for CPG-Based Manipulation with Multi-Module Origami Robot Surface
Figure 3 for CPG-Based Manipulation with Multi-Module Origami Robot Surface
Figure 4 for CPG-Based Manipulation with Multi-Module Origami Robot Surface
Viaarxiv icon