Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yuanting Pan

Persona-SQ: A Personalized Suggested Question Generation Framework For Real-world Documents

Dec 17, 2024

Zihao Lin, Zichao Wang, Yuanting Pan, Varun Manjunatha, Ryan Rossi, Angela Lau, Lifu Huang, Tong Sun

Figure 1 for Persona-SQ: A Personalized Suggested Question Generation Framework For Real-world Documents

Figure 2 for Persona-SQ: A Personalized Suggested Question Generation Framework For Real-world Documents

Figure 3 for Persona-SQ: A Personalized Suggested Question Generation Framework For Real-world Documents

Figure 4 for Persona-SQ: A Personalized Suggested Question Generation Framework For Real-world Documents

Abstract:Suggested questions (SQs) provide an effective initial interface for users to engage with their documents in AI-powered reading applications. In practical reading sessions, users have diverse backgrounds and reading goals, yet current SQ features typically ignore such user information, resulting in homogeneous or ineffective questions. We introduce a pipeline that generates personalized SQs by incorporating reader profiles (professions and reading goals) and demonstrate its utility in two ways: 1) as an improved SQ generation pipeline that produces higher quality and more diverse questions compared to current baselines, and 2) as a data generator to fine-tune extremely small models that perform competitively with much larger models on SQ generation. Our approach can not only serve as a drop-in replacement in current SQ systems to immediately improve their performance but also help develop on-device SQ models that can run locally to deliver fast and private SQ experience.

* 38 pages, 26 figures

Via

Access Paper or Ask Questions

MMedAgent: Learning to Use Medical Tools with Multi-modal Agent

Jul 02, 2024

Binxu Li, Tiankai Yan, Yuanting Pan, Zhe Xu, Jie Luo, Ruiyang Ji, Shilong Liu, Haoyu Dong, Zihao Lin, Yixin Wang

Figure 1 for MMedAgent: Learning to Use Medical Tools with Multi-modal Agent

Figure 2 for MMedAgent: Learning to Use Medical Tools with Multi-modal Agent

Figure 3 for MMedAgent: Learning to Use Medical Tools with Multi-modal Agent

Figure 4 for MMedAgent: Learning to Use Medical Tools with Multi-modal Agent

Abstract:Multi-Modal Large Language Models (MLLMs), despite being successful, exhibit limited generality and often fall short when compared to specialized models. Recently, LLM-based agents have been developed to address these challenges by selecting appropriate specialized models as tools based on user inputs. However, such advancements have not been extensively explored within the medical domain. To bridge this gap, this paper introduces the first agent explicitly designed for the medical field, named \textbf{M}ulti-modal \textbf{Med}ical \textbf{Agent} (MMedAgent). We curate an instruction-tuning dataset comprising six medical tools solving seven tasks, enabling the agent to choose the most suitable tools for a given task. Comprehensive experiments demonstrate that MMedAgent achieves superior performance across a variety of medical tasks compared to state-of-the-art open-source methods and even the closed-source model, GPT-4o. Furthermore, MMedAgent exhibits efficiency in updating and integrating new medical tools.

Via

Access Paper or Ask Questions