Picture for Yiwen Ding

Yiwen Ding

Vrije Universiteit Amsterdam

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

Add code
Nov 01, 2024
Viaarxiv icon

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Add code
Oct 24, 2024
Figure 1 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 2 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 3 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 4 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Viaarxiv icon

Defeasible Reasoning on Concepts

Add code
Sep 07, 2024
Viaarxiv icon

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Add code
Jun 06, 2024
Viaarxiv icon

Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models

Add code
Apr 01, 2024
Figure 1 for Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
Figure 2 for Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
Figure 3 for Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
Figure 4 for Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
Viaarxiv icon

LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

Add code
Feb 18, 2024
Viaarxiv icon

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Add code
Feb 08, 2024
Figure 1 for Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Figure 2 for Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Figure 3 for Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Figure 4 for Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Viaarxiv icon

Identifying the Defective: Detecting Damaged Grains for Cereal Appearance Inspection

Add code
Nov 20, 2023
Viaarxiv icon

The Rise and Potential of Large Language Model Based Agents: A Survey

Add code
Sep 19, 2023
Viaarxiv icon

Causal Kripke Models

Add code
Jul 11, 2023
Viaarxiv icon