Picture for Yu Cheng

Yu Cheng

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Add code
Feb 26, 2025
Viaarxiv icon

Multi-LLM Collaborative Search for Complex Problem Solving

Add code
Feb 26, 2025
Viaarxiv icon

Unveiling Attractor Cycles in Large Language Models: A Dynamical Systems View of Successive Paraphrasing

Add code
Feb 21, 2025
Viaarxiv icon

AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms

Add code
Feb 21, 2025
Viaarxiv icon

MoM: Linear Sequence Modeling with Mixture-of-Memories

Add code
Feb 19, 2025
Viaarxiv icon

DGSense: A Domain Generalization Framework for Wireless Sensing

Add code
Feb 12, 2025
Viaarxiv icon

LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid

Add code
Feb 11, 2025
Viaarxiv icon

UltraIF: Advancing Instruction Following from the Wild

Add code
Feb 06, 2025
Figure 1 for UltraIF: Advancing Instruction Following from the Wild
Figure 2 for UltraIF: Advancing Instruction Following from the Wild
Figure 3 for UltraIF: Advancing Instruction Following from the Wild
Figure 4 for UltraIF: Advancing Instruction Following from the Wild
Viaarxiv icon

Process Reinforcement through Implicit Rewards

Add code
Feb 03, 2025
Viaarxiv icon

Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning

Add code
Jan 25, 2025
Viaarxiv icon