Picture for Di Jin

Di Jin

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, U.S.A

ChemSafetyBench: Benchmarking LLM Safety on Chemistry Domain

Add code
Nov 23, 2024
Viaarxiv icon

Improving Model Factuality with Fine-grained Critique-based Evaluator

Add code
Oct 24, 2024
Viaarxiv icon

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Add code
Oct 21, 2024
Viaarxiv icon

From Pixels to Personas: Investigating and Modeling Self-Anthropomorphism in Human-Robot Dialogues

Add code
Oct 04, 2024
Viaarxiv icon

The Perfect Blend: Redefining RLHF with Mixture of Judges

Add code
Sep 30, 2024
Figure 1 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 2 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 3 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 4 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Viaarxiv icon

InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting

Add code
Mar 05, 2024
Figure 1 for InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting
Figure 2 for InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting
Figure 3 for InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting
Figure 4 for InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting
Viaarxiv icon

Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning

Add code
Feb 18, 2024
Figure 1 for Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
Figure 2 for Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
Figure 3 for Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
Figure 4 for Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
Viaarxiv icon

GOODAT: Towards Test-time Graph Out-of-Distribution Detection

Add code
Jan 10, 2024
Viaarxiv icon

Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language

Add code
Nov 24, 2023
Viaarxiv icon

A Survey on Fairness-aware Recommender Systems

Add code
Jun 01, 2023
Viaarxiv icon