Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shizhu Liu

Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models

Sep 13, 2023

Mohamed Elaraby, Mengyin Lu, Jacob Dunn, Xueying Zhang, Yu Wang, Shizhu Liu, Pingchuan Tian, Yuping Wang, Yuxuan Wang

Figure 1 for Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models

Figure 2 for Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models

Figure 3 for Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models

Figure 4 for Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models

Abstract:Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP). Although convenient for research and practical applications, open-source LLMs with fewer parameters often suffer from severe hallucinations compared to their larger counterparts. This paper focuses on measuring and reducing hallucinations in BLOOM 7B, a representative of such weaker open-source LLMs that are publicly available for research and commercial applications. We introduce HaloCheck, a lightweight BlackBox knowledge-free framework designed to quantify the severity of hallucinations in LLMs. Additionally, we explore techniques like knowledge injection and teacher-student approaches to alleviate hallucinations in low-parameter LLMs. Our experiments effectively demonstrate the reduction of hallucinations in challenging domains for these LLMs.

Via

Access Paper or Ask Questions

Imitation Learning for Fashion Style Based on Hierarchical Multimodal Representation

Apr 13, 2020

Shizhu Liu, Shanglin Yang, Hui Zhou

Figure 1 for Imitation Learning for Fashion Style Based on Hierarchical Multimodal Representation

Figure 2 for Imitation Learning for Fashion Style Based on Hierarchical Multimodal Representation

Figure 3 for Imitation Learning for Fashion Style Based on Hierarchical Multimodal Representation

Figure 4 for Imitation Learning for Fashion Style Based on Hierarchical Multimodal Representation

Abstract:Fashion is a complex social phenomenon. People follow fashion styles from demonstrations by experts or fashion icons. However, for machine agent, learning to imitate fashion experts from demonstrations can be challenging, especially for complex styles in environments with high-dimensional, multimodal observations. Most existing research regarding fashion outfit composition utilizes supervised learning methods to mimic the behaviors of style icons. These methods suffer from distribution shift: because the agent greedily imitates some given outfit demonstrations, it can drift away from one style to another styles given subtle differences. In this work, we propose an adversarial inverse reinforcement learning formulation to recover reward functions based on hierarchical multimodal representation (HM-AIRL) during the imitation process. The hierarchical joint representation can more comprehensively model the expert composited outfit demonstrations to recover the reward function. We demonstrate that the proposed HM-AIRL model is able to recover reward functions that are robust to changes in multimodal observations, enabling us to learn policies under significant variation between different styles.

Via

Access Paper or Ask Questions