Picture for Minda Hu

Minda Hu

Purple-teaming LLMs with Adversarial Defender Training

Add code
Jul 01, 2024
Viaarxiv icon

Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization

Add code
Jun 17, 2024
Viaarxiv icon

Mitigating Large Language Model Hallucination with Faithful Finetuning

Add code
Jun 17, 2024
Viaarxiv icon

The Integration of Semantic and Structural Knowledge in Knowledge Graph Entity Typing

Add code
Apr 12, 2024
Viaarxiv icon

RL-GPT: Integrating Reinforcement Learning and Code-as-policy

Add code
Feb 29, 2024
Viaarxiv icon

Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogue

Add code
Oct 13, 2023
Viaarxiv icon

TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration

Add code
Sep 28, 2023
Viaarxiv icon

Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?

Add code
Aug 29, 2023
Viaarxiv icon

Momentum Contrastive Pre-training for Question Answering

Add code
Dec 12, 2022
Viaarxiv icon