Picture for Handong Zhao

Handong Zhao

DynaSaur: Large Language Agents Beyond Predefined Actions

Add code
Nov 04, 2024
Viaarxiv icon

VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs

Add code
Jul 02, 2024
Viaarxiv icon

Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags

Add code
Jun 16, 2024
Viaarxiv icon

SOHES: Self-supervised Open-world Hierarchical Entity Segmentation

Add code
Apr 18, 2024
Viaarxiv icon

Fine-tuning CLIP Text Encoders with Two-step Paraphrasing

Add code
Feb 23, 2024
Viaarxiv icon

Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansion

Add code
Jan 28, 2024
Viaarxiv icon

Discovering Low-rank Subspaces for Language-agnostic Multilingual Representations

Add code
Jan 11, 2024
Viaarxiv icon

InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding

Add code
Jun 08, 2023
Figure 1 for InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding
Figure 2 for InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding
Figure 3 for InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding
Figure 4 for InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding
Viaarxiv icon

Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer

Add code
May 20, 2023
Viaarxiv icon

Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis

Add code
Apr 07, 2023
Viaarxiv icon