Picture for Xiangyang Liu

Xiangyang Liu

DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels

Add code
Sep 04, 2024
Viaarxiv icon

Can AI Assistants Know What They Don't Know?

Add code
Jan 28, 2024
Figure 1 for Can AI Assistants Know What They Don't Know?
Figure 2 for Can AI Assistants Know What They Don't Know?
Figure 3 for Can AI Assistants Know What They Don't Know?
Figure 4 for Can AI Assistants Know What They Don't Know?
Viaarxiv icon

Flames: Benchmarking Value Alignment of Chinese Large Language Models

Add code
Nov 12, 2023
Viaarxiv icon

Evaluating Hallucinations in Chinese Large Language Models

Add code
Oct 05, 2023
Figure 1 for Evaluating Hallucinations in Chinese Large Language Models
Figure 2 for Evaluating Hallucinations in Chinese Large Language Models
Figure 3 for Evaluating Hallucinations in Chinese Large Language Models
Figure 4 for Evaluating Hallucinations in Chinese Large Language Models
Viaarxiv icon

The Rise and Potential of Large Language Model Based Agents: A Survey

Add code
Sep 19, 2023
Figure 1 for The Rise and Potential of Large Language Model Based Agents: A Survey
Figure 2 for The Rise and Potential of Large Language Model Based Agents: A Survey
Figure 3 for The Rise and Potential of Large Language Model Based Agents: A Survey
Figure 4 for The Rise and Potential of Large Language Model Based Agents: A Survey
Viaarxiv icon

Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering

Add code
Apr 27, 2023
Figure 1 for Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering
Figure 2 for Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering
Figure 3 for Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering
Figure 4 for Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering
Viaarxiv icon

OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System

Add code
Mar 01, 2023
Figure 1 for OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Figure 2 for OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Figure 3 for OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Figure 4 for OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Viaarxiv icon

Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts

Add code
Oct 20, 2022
Figure 1 for Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts
Figure 2 for Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts
Figure 3 for Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts
Figure 4 for Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts
Viaarxiv icon

A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation

Add code
Mar 03, 2022
Figure 1 for A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation
Figure 2 for A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation
Figure 3 for A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation
Figure 4 for A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation
Viaarxiv icon

Towards Efficient NLP: A Standard Evaluation and A Strong Baseline

Add code
Oct 13, 2021
Figure 1 for Towards Efficient NLP: A Standard Evaluation and A Strong Baseline
Figure 2 for Towards Efficient NLP: A Standard Evaluation and A Strong Baseline
Figure 3 for Towards Efficient NLP: A Standard Evaluation and A Strong Baseline
Figure 4 for Towards Efficient NLP: A Standard Evaluation and A Strong Baseline
Viaarxiv icon