Picture for Binxuan Huang

Binxuan Huang

HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning

Add code
Jan 30, 2026
Viaarxiv icon

Teach Diffusion Language Models to Learn from Their Own Mistakes

Add code
Jan 10, 2026
Viaarxiv icon

END: Early Noise Dropping for Efficient and Effective Context Denoising

Add code
Feb 26, 2025
Viaarxiv icon

Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training

Add code
Feb 10, 2025
Figure 1 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Figure 2 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Figure 3 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Figure 4 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Viaarxiv icon

Scaling Laws for Predicting Downstream Performance in LLMs

Add code
Oct 11, 2024
Figure 1 for Scaling Laws for Predicting Downstream Performance in LLMs
Figure 2 for Scaling Laws for Predicting Downstream Performance in LLMs
Figure 3 for Scaling Laws for Predicting Downstream Performance in LLMs
Figure 4 for Scaling Laws for Predicting Downstream Performance in LLMs
Viaarxiv icon

Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs

Add code
Aug 07, 2024
Figure 1 for Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs
Figure 2 for Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs
Figure 3 for Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs
Figure 4 for Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs
Viaarxiv icon

Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs

Add code
Jul 04, 2023
Figure 1 for Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs
Figure 2 for Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs
Figure 3 for Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs
Figure 4 for Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs
Viaarxiv icon

Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents

Add code
Aug 27, 2022
Figure 1 for Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents
Figure 2 for Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents
Figure 3 for Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents
Figure 4 for Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents
Viaarxiv icon

DOM-LM: Learning Generalizable Representations for HTML Documents

Add code
Jan 25, 2022
Figure 1 for DOM-LM: Learning Generalizable Representations for HTML Documents
Figure 2 for DOM-LM: Learning Generalizable Representations for HTML Documents
Figure 3 for DOM-LM: Learning Generalizable Representations for HTML Documents
Figure 4 for DOM-LM: Learning Generalizable Representations for HTML Documents
Viaarxiv icon

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation

Add code
Nov 12, 2021
Figure 1 for RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation
Figure 2 for RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation
Figure 3 for RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation
Figure 4 for RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation
Viaarxiv icon