Picture for Binxuan Huang

Binxuan Huang

END: Early Noise Dropping for Efficient and Effective Context Denoising

Add code
Feb 26, 2025
Viaarxiv icon

Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training

Add code
Feb 10, 2025
Figure 1 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Figure 2 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Figure 3 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Figure 4 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Viaarxiv icon

Scaling Laws for Predicting Downstream Performance in LLMs

Add code
Oct 11, 2024
Figure 1 for Scaling Laws for Predicting Downstream Performance in LLMs
Figure 2 for Scaling Laws for Predicting Downstream Performance in LLMs
Figure 3 for Scaling Laws for Predicting Downstream Performance in LLMs
Figure 4 for Scaling Laws for Predicting Downstream Performance in LLMs
Viaarxiv icon

Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs

Add code
Aug 07, 2024
Figure 1 for Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs
Figure 2 for Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs
Figure 3 for Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs
Figure 4 for Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs
Viaarxiv icon

Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs

Add code
Jul 04, 2023
Viaarxiv icon

Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents

Add code
Aug 27, 2022
Figure 1 for Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents
Figure 2 for Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents
Figure 3 for Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents
Figure 4 for Label-Efficient Self-Training for Attribute Extraction from Semi-Structured Web Documents
Viaarxiv icon

DOM-LM: Learning Generalizable Representations for HTML Documents

Add code
Jan 25, 2022
Figure 1 for DOM-LM: Learning Generalizable Representations for HTML Documents
Figure 2 for DOM-LM: Learning Generalizable Representations for HTML Documents
Figure 3 for DOM-LM: Learning Generalizable Representations for HTML Documents
Figure 4 for DOM-LM: Learning Generalizable Representations for HTML Documents
Viaarxiv icon

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation

Add code
Nov 12, 2021
Figure 1 for RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation
Figure 2 for RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation
Figure 3 for RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation
Figure 4 for RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation
Viaarxiv icon

TCN: Table Convolutional Network for Web Table Interpretation

Add code
Feb 17, 2021
Figure 1 for TCN: Table Convolutional Network for Web Table Interpretation
Figure 2 for TCN: Table Convolutional Network for Web Table Interpretation
Figure 3 for TCN: Table Convolutional Network for Web Table Interpretation
Figure 4 for TCN: Table Convolutional Network for Web Table Interpretation
Viaarxiv icon

Discover Your Social Identity from What You Tweet: a Content Based Approach

Add code
Mar 03, 2020
Figure 1 for Discover Your Social Identity from What You Tweet: a Content Based Approach
Figure 2 for Discover Your Social Identity from What You Tweet: a Content Based Approach
Figure 3 for Discover Your Social Identity from What You Tweet: a Content Based Approach
Figure 4 for Discover Your Social Identity from What You Tweet: a Content Based Approach
Viaarxiv icon