Zichen Zhu

MobA: A Two-Level Agent System for Efficient Mobile Task Automation

Oct 17, 2024
Via arXiv

Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback

Apr 07, 2024
Via arXiv

Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

Feb 28, 2024
Via arXiv

Multi: Multimodal Understanding Leaderboard with Text and Images

Feb 05, 2024
Via arXiv

ChemDFM: Dialogue Foundation Model for Chemistry

Jan 26, 2024
Via arXiv

META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI

May 23, 2022
Via arXiv

Rb-PaStaNet: A Few-Shot Human-Object Interaction Detection Based on Rules and Part States

Aug 14, 2020
Via arXiv

Learning Key-Value Store Design

Jul 11, 2019
Via arXiv