Picture for Xiaoran Fan

Xiaoran Fan

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Add code
Oct 24, 2024
Figure 1 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 2 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 3 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 4 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Viaarxiv icon

Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs

Add code
Oct 15, 2024
Figure 1 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Figure 2 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Figure 3 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Figure 4 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Viaarxiv icon

Leveraging Foundation Models for Zero-Shot IoT Sensing

Add code
Jul 29, 2024
Figure 1 for Leveraging Foundation Models for Zero-Shot IoT Sensing
Figure 2 for Leveraging Foundation Models for Zero-Shot IoT Sensing
Figure 3 for Leveraging Foundation Models for Zero-Shot IoT Sensing
Figure 4 for Leveraging Foundation Models for Zero-Shot IoT Sensing
Viaarxiv icon

mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar

Add code
Mar 07, 2024
Viaarxiv icon

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Add code
Feb 08, 2024
Figure 1 for Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Figure 2 for Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Figure 3 for Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Figure 4 for Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Viaarxiv icon

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

Add code
Feb 05, 2024
Viaarxiv icon

MouSi: Poly-Visual-Expert Vision-Language Models

Add code
Jan 30, 2024
Viaarxiv icon

RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning

Add code
Jan 19, 2024
Viaarxiv icon

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios

Add code
Jan 14, 2024
Viaarxiv icon

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Add code
Jan 12, 2024
Figure 1 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 2 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 3 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 4 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Viaarxiv icon