Rosanne Liu

Shammie

Logit Scaling for Out-of-Distribution Detection

Sep 02, 2024

Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

Aug 14, 2024

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Jun 05, 2024

Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation

May 31, 2024

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Dec 22, 2023

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?"

Nov 15, 2023

Character-Aware Models Improve Visual Text Rendering

Dec 20, 2022

Extremely Simple Activation Shaping for Out-of-Distribution Detection

Sep 20, 2022

What does a platypus look like? Generating customized prompts for zero-shot image classification

Sep 07, 2022