Picture for Greg Durrett

Greg Durrett

RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models

Add code
Apr 15, 2025
Viaarxiv icon

QUDsim: Quantifying Discourse Similarities in LLM-Generated Text

Add code
Apr 12, 2025
Viaarxiv icon

Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Add code
Apr 01, 2025
Viaarxiv icon

${\rm P{\small ROOF}W{\small ALA}}$: Multilingual Proof Data Synthesis and Theorem-Proving

Add code
Feb 07, 2025
Viaarxiv icon

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

Add code
Jan 09, 2025
Figure 1 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Figure 2 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Figure 3 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Figure 4 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Viaarxiv icon

Understanding Synthetic Context Extension via Retrieval Heads

Add code
Oct 29, 2024
Figure 1 for Understanding Synthetic Context Extension via Retrieval Heads
Figure 2 for Understanding Synthetic Context Extension via Retrieval Heads
Figure 3 for Understanding Synthetic Context Extension via Retrieval Heads
Figure 4 for Understanding Synthetic Context Extension via Retrieval Heads
Viaarxiv icon

Contrastive Learning to Improve Retrieval for Real-world Fact Checking

Add code
Oct 07, 2024
Figure 1 for Contrastive Learning to Improve Retrieval for Real-world Fact Checking
Figure 2 for Contrastive Learning to Improve Retrieval for Real-world Fact Checking
Figure 3 for Contrastive Learning to Improve Retrieval for Real-world Fact Checking
Figure 4 for Contrastive Learning to Improve Retrieval for Real-world Fact Checking
Viaarxiv icon

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Add code
Sep 18, 2024
Figure 1 for To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Figure 2 for To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Figure 3 for To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Figure 4 for To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Viaarxiv icon

CodeUpdateArena: Benchmarking Knowledge Editing on API Updates

Add code
Jul 08, 2024
Viaarxiv icon

Learning to Refine with Fine-Grained Natural Language Feedback

Add code
Jul 02, 2024
Viaarxiv icon