Picture for Muhan Zhang

Muhan Zhang

Training Large Language Models to be Better Rule Followers

Add code
Feb 17, 2025
Viaarxiv icon

TransMLA: Multi-head Latent Attention Is All You Need

Add code
Feb 11, 2025
Viaarxiv icon

Using Random Noise Equivariantly to Boost Graph Neural Networks Universally

Add code
Feb 04, 2025
Viaarxiv icon

Do Graph Diffusion Models Accurately Capture and Generate Substructure Distributions?

Add code
Feb 04, 2025
Viaarxiv icon

RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?

Add code
Jan 20, 2025
Viaarxiv icon

Exact Acceleration of Subgraph Graph Neural Networks by Eliminating Computation Redundancy

Add code
Dec 24, 2024
Figure 1 for Exact Acceleration of Subgraph Graph Neural Networks by Eliminating Computation Redundancy
Figure 2 for Exact Acceleration of Subgraph Graph Neural Networks by Eliminating Computation Redundancy
Figure 3 for Exact Acceleration of Subgraph Graph Neural Networks by Eliminating Computation Redundancy
Figure 4 for Exact Acceleration of Subgraph Graph Neural Networks by Eliminating Computation Redundancy
Viaarxiv icon

LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning

Add code
Dec 18, 2024
Viaarxiv icon

GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model

Add code
Dec 08, 2024
Figure 1 for GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model
Figure 2 for GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model
Figure 3 for GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model
Figure 4 for GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model
Viaarxiv icon

CLOVER: Constrained Learning with Orthonormal Vectors for Eliminating Redundancy

Add code
Nov 26, 2024
Viaarxiv icon

Reconsidering the Performance of GAE in Link Prediction

Add code
Nov 06, 2024
Viaarxiv icon