Picture for Tianyu Gao

Tianyu Gao

How to Train Long-Context Language Models (Effectively)

Add code
Oct 03, 2024
Viaarxiv icon

HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly

Add code
Oct 03, 2024
Figure 1 for HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Figure 2 for HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Figure 3 for HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Figure 4 for HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Viaarxiv icon

Long-Context Language Modeling with Parallel Context Encoding

Add code
Feb 26, 2024
Figure 1 for Long-Context Language Modeling with Parallel Context Encoding
Figure 2 for Long-Context Language Modeling with Parallel Context Encoding
Figure 3 for Long-Context Language Modeling with Parallel Context Encoding
Figure 4 for Long-Context Language Modeling with Parallel Context Encoding
Viaarxiv icon

Improving Language Understanding from Screenshots

Add code
Feb 21, 2024
Figure 1 for Improving Language Understanding from Screenshots
Figure 2 for Improving Language Understanding from Screenshots
Figure 3 for Improving Language Understanding from Screenshots
Figure 4 for Improving Language Understanding from Screenshots
Viaarxiv icon

Harmonizing Covariance and Expressiveness for Deep Hamiltonian Regression in Crystalline Material Research: a Hybrid Cascaded Regression Framework

Add code
Jan 15, 2024
Figure 1 for Harmonizing Covariance and Expressiveness for Deep Hamiltonian Regression in Crystalline Material Research: a Hybrid Cascaded Regression Framework
Figure 2 for Harmonizing Covariance and Expressiveness for Deep Hamiltonian Regression in Crystalline Material Research: a Hybrid Cascaded Regression Framework
Figure 3 for Harmonizing Covariance and Expressiveness for Deep Hamiltonian Regression in Crystalline Material Research: a Hybrid Cascaded Regression Framework
Figure 4 for Harmonizing Covariance and Expressiveness for Deep Hamiltonian Regression in Crystalline Material Research: a Hybrid Cascaded Regression Framework
Viaarxiv icon

Evaluating Large Language Models at Evaluating Instruction Following

Add code
Oct 11, 2023
Viaarxiv icon

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Add code
Oct 10, 2023
Viaarxiv icon

Fine-Tuning Language Models with Just Forward Passes

Add code
May 27, 2023
Figure 1 for Fine-Tuning Language Models with Just Forward Passes
Figure 2 for Fine-Tuning Language Models with Just Forward Passes
Figure 3 for Fine-Tuning Language Models with Just Forward Passes
Figure 4 for Fine-Tuning Language Models with Just Forward Passes
Viaarxiv icon

Enabling Large Language Models to Generate Text with Citations

Add code
May 24, 2023
Viaarxiv icon

What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning

Add code
May 16, 2023
Viaarxiv icon