Picture for Duyu Tang

Duyu Tang

Ensuring Consistency for In-Image Translation

Add code
Dec 24, 2024
Viaarxiv icon

Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective

Add code
Dec 23, 2024
Viaarxiv icon

XTransplant: A Probe into the Upper Bound Performance of Multilingual Capability and Culture Adaptability in LLMs via Mutual Cross-lingual Feed-forward Transplantation

Add code
Dec 17, 2024
Viaarxiv icon

ToolACE: Winning the Points of LLM Function Calling

Add code
Sep 02, 2024
Figure 1 for ToolACE: Winning the Points of LLM Function Calling
Figure 2 for ToolACE: Winning the Points of LLM Function Calling
Figure 3 for ToolACE: Winning the Points of LLM Function Calling
Figure 4 for ToolACE: Winning the Points of LLM Function Calling
Viaarxiv icon

Learning Fine-Grained Grounded Citations for Attributed Large Language Models

Add code
Aug 08, 2024
Viaarxiv icon

Android in the Zoo: Chain-of-Action-Thought for GUI Agents

Add code
Mar 05, 2024
Viaarxiv icon

Emage: Non-Autoregressive Text-to-Image Generation

Add code
Dec 22, 2023
Viaarxiv icon

SkillNet-X: A Multilingual Multitask Model with Sparsely Activated Skills

Add code
Jun 28, 2023
Viaarxiv icon

Improved Visual Story Generation with Adaptive Context Modeling

Add code
May 26, 2023
Figure 1 for Improved Visual Story Generation with Adaptive Context Modeling
Figure 2 for Improved Visual Story Generation with Adaptive Context Modeling
Figure 3 for Improved Visual Story Generation with Adaptive Context Modeling
Figure 4 for Improved Visual Story Generation with Adaptive Context Modeling
Viaarxiv icon

STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training

Add code
Feb 20, 2023
Figure 1 for STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training
Figure 2 for STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training
Figure 3 for STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training
Figure 4 for STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training
Viaarxiv icon