Picture for Neel Guha

Neel Guha

Smoothie: Label Free Language Model Routing

Add code
Dec 06, 2024
Viaarxiv icon

Prospector Heads: Generalized Feature Attribution for Large Models & Data

Add code
Feb 18, 2024
Viaarxiv icon

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Add code
Feb 14, 2024
Viaarxiv icon

LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

Add code
Aug 20, 2023
Figure 1 for LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Figure 2 for LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Figure 3 for LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Figure 4 for LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Viaarxiv icon

Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification

Add code
Jul 20, 2023
Viaarxiv icon

Holistic Evaluation of Language Models

Add code
Nov 16, 2022
Figure 1 for Holistic Evaluation of Language Models
Figure 2 for Holistic Evaluation of Language Models
Figure 3 for Holistic Evaluation of Language Models
Figure 4 for Holistic Evaluation of Language Models
Viaarxiv icon

Ask Me Anything: A simple strategy for prompting language models

Add code
Oct 06, 2022
Figure 1 for Ask Me Anything: A simple strategy for prompting language models
Figure 2 for Ask Me Anything: A simple strategy for prompting language models
Figure 3 for Ask Me Anything: A simple strategy for prompting language models
Figure 4 for Ask Me Anything: A simple strategy for prompting language models
Viaarxiv icon

LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning

Add code
Sep 13, 2022
Figure 1 for LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning
Figure 2 for LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning
Figure 3 for LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning
Figure 4 for LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning
Viaarxiv icon

Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset

Add code
Jul 01, 2022
Figure 1 for Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
Figure 2 for Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
Figure 3 for Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
Figure 4 for Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
Viaarxiv icon

On the Opportunities and Risks of Foundation Models

Add code
Aug 18, 2021
Figure 1 for On the Opportunities and Risks of Foundation Models
Figure 2 for On the Opportunities and Risks of Foundation Models
Figure 3 for On the Opportunities and Risks of Foundation Models
Figure 4 for On the Opportunities and Risks of Foundation Models
Viaarxiv icon