Haofei Yu

HEMM: Holistic Evaluation of Multimodal Foundation Models

Jul 03, 2024
Via arXiv

SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents

Mar 14, 2024
Via arXiv

MMOE: Mixture of Multimodal Interaction Experts

Nov 16, 2023
Via arXiv

TRAMS: Training-free Memory Selection for Long-range Language Modeling

Nov 05, 2023
Via arXiv

Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model

Oct 26, 2023
Via arXiv

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

Oct 18, 2023
Via arXiv

RFiD: Towards Rational Fusion-in-Decoder for Open-Domain Question Answering

May 26, 2023
Via arXiv

Global-Selector: A New Benchmark Dataset and Model Architecture for Multi-turn Response Selection

Jun 02, 2021
Via arXiv