Yixin Cao

Multi-Programming Language Sandbox for LLMs

Oct 30, 2024
Via arXiv

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Oct 21, 2024
Via arXiv

ChartifyText: Automated Chart Generation from Data-Involved Texts via LLM

Oct 18, 2024
Via arXiv

Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

Oct 04, 2024
Via arXiv

Knowledge Graph Embedding by Normalizing Flows

Sep 30, 2024
Via arXiv

Navigating the Nuances: A Fine-grained Evaluation of Vision-Language Navigation

Sep 25, 2024
Via arXiv

S$^3$c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners

Sep 03, 2024
Via arXiv

Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning

Aug 21, 2024
Via arXiv

MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations

Jul 01, 2024
Via arXiv

LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement

Jun 29, 2024
Via arXiv