Xiyao Wang

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Oct 09, 2024
Via arXiv

LLaVA-Critic: Learning to Evaluate Multimodal Models

Oct 03, 2024
Via arXiv

Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation

Jun 19, 2024
Via arXiv

World Models with Hints of Large Language Models for Goal Achieving

Jun 11, 2024
Via arXiv

Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement

May 29, 2024
Via arXiv

Calibrated Self-Rewarding Vision Language Models

May 23, 2024
Via arXiv

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Feb 13, 2024
Via arXiv

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Jan 25, 2024
Via arXiv

Emojis Decoded: Leveraging ChatGPT for Enhanced Understanding in Social Media Communications

Jan 22, 2024
Via arXiv

DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization

Oct 30, 2023
Via arXiv