Quy-Anh Dang

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Mar 20, 2025
Via arXiv

MoD: A Distribution-Based Approach for Merging Large Language Models

Nov 01, 2024
Via arXiv

wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech

Aug 08, 2024
Via arXiv