Picture for Tong Yang

Tong Yang

Michael Pokorny

MergeMoE: Efficient Compression of MoE Models via Expert Output Merging

Add code
Oct 16, 2025
Viaarxiv icon

Fast Visuomotor Policy for Robotic Manipulation

Add code
Oct 14, 2025
Viaarxiv icon

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Add code
Sep 04, 2025
Viaarxiv icon

Adapting Foundation Model for Dental Caries Detection with Dual-View Co-Training

Add code
Aug 28, 2025
Viaarxiv icon

Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent

Add code
Aug 11, 2025
Viaarxiv icon

Fairy$\pm i$: the First 2-bit Complex LLM with All Parameters in $\{\pm1, \pm i\}$

Add code
Aug 07, 2025
Viaarxiv icon

FAF: A Feature-Adaptive Framework for Few-Shot Time Series Forecasting

Add code
Jun 24, 2025
Viaarxiv icon

SciDA: Scientific Dynamic Assessor of LLMs

Add code
Jun 15, 2025
Viaarxiv icon

Continuous Semi-Implicit Models

Add code
Jun 07, 2025
Viaarxiv icon

KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference

Add code
Apr 14, 2025
Figure 1 for KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference
Figure 2 for KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference
Figure 3 for KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference
Figure 4 for KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference
Viaarxiv icon