Picture for Dongyeop Kang

Dongyeop Kang

UC Berkeley

Learning a High-quality Robotic Wiping Policy Using Systematic Reward Analysis and Visual-Language Model Based Curriculum

Add code
Feb 18, 2025
Viaarxiv icon

RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models

Add code
Feb 13, 2025
Viaarxiv icon

ScholaWrite: A Dataset of End-to-End Scholarly Writing Process

Add code
Feb 05, 2025
Viaarxiv icon

Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations

Add code
Oct 02, 2024
Figure 1 for Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Figure 2 for Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Figure 3 for Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Figure 4 for Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
Viaarxiv icon

LearnerVoice: A Dataset of Non-Native English Learners' Spontaneous Speech

Add code
Jul 05, 2024
Viaarxiv icon

Human-AI Collaborative Taxonomy Construction: A Case Study in Profession-Specific Writing Assistants

Add code
Jun 26, 2024
Viaarxiv icon

i-SRT: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective Judgment

Add code
Jun 17, 2024
Figure 1 for i-SRT: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective Judgment
Figure 2 for i-SRT: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective Judgment
Figure 3 for i-SRT: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective Judgment
Figure 4 for i-SRT: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective Judgment
Viaarxiv icon

Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback

Add code
Jun 11, 2024
Viaarxiv icon

On the Sequence Evaluation based on Stochastic Processes

Add code
May 28, 2024
Viaarxiv icon

Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation

Add code
Apr 14, 2024
Viaarxiv icon