Picture for Jiyun Chun

Jiyun Chun

ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback

Add code
Jun 25, 2024
Viaarxiv icon