Picture for Tong Sun

Tong Sun

LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding

Add code
Nov 02, 2024
Viaarxiv icon

Autonomous Driving in Unstructured Environments: How Far Have We Come?

Add code
Oct 10, 2024
Figure 1 for Autonomous Driving in Unstructured Environments: How Far Have We Come?
Figure 2 for Autonomous Driving in Unstructured Environments: How Far Have We Come?
Figure 3 for Autonomous Driving in Unstructured Environments: How Far Have We Come?
Figure 4 for Autonomous Driving in Unstructured Environments: How Far Have We Come?
Viaarxiv icon

LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models

Add code
Jul 27, 2024
Figure 1 for LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
Figure 2 for LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
Figure 3 for LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
Figure 4 for LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models
Viaarxiv icon

ARTIST: Improving the Generation of Text-rich Images by Disentanglement

Add code
Jun 17, 2024
Figure 1 for ARTIST: Improving the Generation of Text-rich Images by Disentanglement
Figure 2 for ARTIST: Improving the Generation of Text-rich Images by Disentanglement
Figure 3 for ARTIST: Improving the Generation of Text-rich Images by Disentanglement
Figure 4 for ARTIST: Improving the Generation of Text-rich Images by Disentanglement
Viaarxiv icon

Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation

Add code
Jun 13, 2024
Viaarxiv icon

DocSynthv2: A Practical Autoregressive Modeling for Document Generation

Add code
Jun 12, 2024
Viaarxiv icon

TRINS: Towards Multimodal Language Models that Can Read

Add code
Jun 10, 2024
Viaarxiv icon

Improve Temporal Awareness of LLMs for Sequential Recommendation

Add code
May 05, 2024
Figure 1 for Improve Temporal Awareness of LLMs for Sequential Recommendation
Figure 2 for Improve Temporal Awareness of LLMs for Sequential Recommendation
Figure 3 for Improve Temporal Awareness of LLMs for Sequential Recommendation
Figure 4 for Improve Temporal Awareness of LLMs for Sequential Recommendation
Viaarxiv icon

Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models

Add code
Apr 23, 2024
Viaarxiv icon

SOHES: Self-supervised Open-world Hierarchical Entity Segmentation

Add code
Apr 18, 2024
Viaarxiv icon