Tianyi Tang

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models

Feb 04, 2026
Via arXiv

PLawBench: A Rubric-Based Benchmark for Evaluating LLMs in Real-World Legal Practice

Jan 23, 2026
Via arXiv

Qwen3Guard Technical Report

Oct 16, 2025
Via arXiv

RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing

Jul 27, 2025
Via arXiv

Qwen3 Technical Report

May 14, 2025
Via arXiv

A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentation

Dec 08, 2024
Via arXiv

Language Models can Self-Lengthen to Generate Long Texts

Oct 31, 2024
Via arXiv

Neuron-based Personality Trait Induction in Large Language Models

Oct 16, 2024
Via arXiv

LLMBox: A Comprehensive Library for Large Language Models

Jul 08, 2024
Via arXiv

Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models

Apr 17, 2024
Via arXiv