Picture for Danqi Chen

Danqi Chen

Shammie

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

Add code
Jan 09, 2025
Figure 1 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Figure 2 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Figure 3 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Figure 4 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Viaarxiv icon

Metadata Conditioning Accelerates Language Model Pre-training

Add code
Jan 03, 2025
Figure 1 for Metadata Conditioning Accelerates Language Model Pre-training
Figure 2 for Metadata Conditioning Accelerates Language Model Pre-training
Figure 3 for Metadata Conditioning Accelerates Language Model Pre-training
Figure 4 for Metadata Conditioning Accelerates Language Model Pre-training
Viaarxiv icon

Continual Memorization of Factoids in Large Language Models

Add code
Nov 11, 2024
Figure 1 for Continual Memorization of Factoids in Large Language Models
Figure 2 for Continual Memorization of Factoids in Large Language Models
Figure 3 for Continual Memorization of Factoids in Large Language Models
Figure 4 for Continual Memorization of Factoids in Large Language Models
Viaarxiv icon

Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

Add code
Oct 11, 2024
Viaarxiv icon

How to Train Long-Context Language Models (Effectively)

Add code
Oct 03, 2024
Viaarxiv icon

HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly

Add code
Oct 03, 2024
Figure 1 for HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Figure 2 for HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Figure 3 for HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Figure 4 for HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Viaarxiv icon

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Add code
Jul 16, 2024
Viaarxiv icon

Representing Rule-based Chatbots with Transformers

Add code
Jul 15, 2024
Figure 1 for Representing Rule-based Chatbots with Transformers
Figure 2 for Representing Rule-based Chatbots with Transformers
Figure 3 for Representing Rule-based Chatbots with Transformers
Figure 4 for Representing Rule-based Chatbots with Transformers
Viaarxiv icon

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Add code
Jun 26, 2024
Figure 1 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 2 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 3 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 4 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Viaarxiv icon

Finding Transformer Circuits with Edge Pruning

Add code
Jun 24, 2024
Figure 1 for Finding Transformer Circuits with Edge Pruning
Figure 2 for Finding Transformer Circuits with Edge Pruning
Figure 3 for Finding Transformer Circuits with Edge Pruning
Figure 4 for Finding Transformer Circuits with Edge Pruning
Viaarxiv icon