Picture for Tong Zheng

Tong Zheng

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

Add code
Sep 11, 2025
Viaarxiv icon

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Add code
Sep 09, 2025
Viaarxiv icon

One Size Does Not Fit All: A Distribution-Aware Sparsification for More Precise Model Merging

Add code
Aug 08, 2025
Viaarxiv icon

Learning to Reason via Mixture-of-Thought for Logical Reasoning

Add code
May 21, 2025
Viaarxiv icon

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation

Add code
Mar 09, 2025
Viaarxiv icon

Towards Optimal Multi-draft Speculative Decoding

Add code
Feb 26, 2025
Viaarxiv icon

Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization

Add code
Dec 02, 2024
Figure 1 for Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization
Figure 2 for Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization
Figure 3 for Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization
Figure 4 for Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization
Viaarxiv icon

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

Add code
Nov 05, 2024
Viaarxiv icon

A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution

Add code
Oct 29, 2024
Viaarxiv icon

Exploiting Memory-aware Q-distribution Prediction for Nuclear Fusion via Modern Hopfield Network

Add code
Oct 11, 2024
Figure 1 for Exploiting Memory-aware Q-distribution Prediction for Nuclear Fusion via Modern Hopfield Network
Figure 2 for Exploiting Memory-aware Q-distribution Prediction for Nuclear Fusion via Modern Hopfield Network
Figure 3 for Exploiting Memory-aware Q-distribution Prediction for Nuclear Fusion via Modern Hopfield Network
Figure 4 for Exploiting Memory-aware Q-distribution Prediction for Nuclear Fusion via Modern Hopfield Network
Viaarxiv icon