Picture for Kexin Huang

Kexin Huang

SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations

Add code
Dec 12, 2024
Viaarxiv icon

Toward Generalizing Visual Brain Decoding to Unseen Subjects

Add code
Oct 21, 2024
Viaarxiv icon

MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts

Add code
Sep 18, 2024
Viaarxiv icon

RelBench: A Benchmark for Deep Learning on Relational Databases

Add code
Jul 29, 2024
Viaarxiv icon

TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets

Add code
Jun 30, 2024
Viaarxiv icon

ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models

Add code
Jun 24, 2024
Figure 1 for ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models
Figure 2 for ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models
Figure 3 for ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models
Figure 4 for ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models
Viaarxiv icon

AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval

Add code
Jun 18, 2024
Figure 1 for AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval
Figure 2 for AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval
Figure 3 for AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval
Figure 4 for AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval
Viaarxiv icon

Optimizing Large Model Training through Overlapped Activation Recomputation

Add code
Jun 13, 2024
Viaarxiv icon

MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models

Add code
Jun 11, 2024
Figure 1 for MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models
Figure 2 for MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models
Figure 3 for MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models
Figure 4 for MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models
Viaarxiv icon

STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases

Add code
Apr 19, 2024
Viaarxiv icon