Picture for Tianyi Chen

Tianyi Chen

School of Civil and Environmental Engineering, Nanyang Technological University, Singapore

Fundamental Safety-Capability Trade-offs in Fine-tuning Large Language Models

Add code
Mar 24, 2025
Viaarxiv icon

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Add code
Mar 10, 2025
Viaarxiv icon

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Add code
Mar 07, 2025
Figure 1 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 2 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 3 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 4 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Viaarxiv icon

Robust Polyp Detection and Diagnosis through Compositional Prompt-Guided Diffusion Models

Add code
Feb 25, 2025
Viaarxiv icon

Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression

Add code
Feb 23, 2025
Viaarxiv icon

A First-order Generative Bilevel Optimization Framework for Diffusion Models

Add code
Feb 12, 2025
Viaarxiv icon

ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval

Add code
Feb 11, 2025
Viaarxiv icon

Analog In-memory Training on General Non-ideal Resistive Elements: The Impact of Response Functions

Add code
Feb 10, 2025
Viaarxiv icon

Bilevel Joint Unsupervised and Supervised Training for Automatic Speech Recognition

Add code
Dec 11, 2024
Viaarxiv icon

FERERO: A Flexible Framework for Preference-Guided Multi-Objective Learning

Add code
Dec 02, 2024
Viaarxiv icon