Picture for Tianduo Wang

Tianduo Wang

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Add code
Jul 25, 2024
Figure 1 for Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Figure 2 for Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Figure 3 for Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Figure 4 for Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Viaarxiv icon

TinyLlama: An Open-Source Small Language Model

Add code
Jan 04, 2024
Viaarxiv icon

Learning Multi-Step Reasoning by Solving Arithmetic Tasks

Add code
Jun 07, 2023
Figure 1 for Learning Multi-Step Reasoning by Solving Arithmetic Tasks
Figure 2 for Learning Multi-Step Reasoning by Solving Arithmetic Tasks
Figure 3 for Learning Multi-Step Reasoning by Solving Arithmetic Tasks
Figure 4 for Learning Multi-Step Reasoning by Solving Arithmetic Tasks
Viaarxiv icon

Differentiable Data Augmentation for Contrastive Sentence Representation Learning

Add code
Oct 29, 2022
Viaarxiv icon