Carolyn Rose

Where is this coming from? Making groundedness count in the evaluation of Document VQA models

Mar 24, 2025
Via arXiv

RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing

Mar 10, 2025
Via arXiv

Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction

Jan 27, 2025
Via arXiv

Improving Model Factuality with Fine-grained Critique-based Evaluator

Oct 24, 2024
Via arXiv

CRScore: Grounding Automated Evaluation of Code Review Comments in Code Claims and Smells

Sep 29, 2024
Via arXiv

CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks

Mar 31, 2024
Via arXiv

Data Augmentation for Code Translation with Comparable Corpora and Multiple References

Nov 01, 2023
Via arXiv

Linguistic representations for fewer-shot relation extraction across domains

Jul 07, 2023
Via arXiv

Multi-Scale Contrastive Co-Training for Event Temporal Relation Extraction

Sep 01, 2022
Via arXiv

Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

Nov 02, 2021
Via arXiv