Picture for Feiqi Cao

Feiqi Cao

ChuLo: Chunk-Level Key Information Representation for Long Document Processing

Add code
Oct 14, 2024
Viaarxiv icon

Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond

Add code
Oct 08, 2024
Viaarxiv icon

3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection

Add code
Jun 13, 2024
Figure 1 for 3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection
Figure 2 for 3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection
Figure 3 for 3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection
Figure 4 for 3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection
Viaarxiv icon

Game-MUG: Multimodal Oriented Game Situation Understanding and Commentary Generation Dataset

Add code
Apr 30, 2024
Viaarxiv icon

PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure

Add code
Apr 21, 2024
Viaarxiv icon

SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering

Add code
Dec 16, 2022
Figure 1 for SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering
Figure 2 for SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering
Figure 3 for SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering
Figure 4 for SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering
Viaarxiv icon

In-game Toxic Language Detection: Shared Task and Attention Residuals

Add code
Nov 19, 2022
Viaarxiv icon

Understanding Attention for Vision-and-Language Tasks

Add code
Aug 17, 2022
Figure 1 for Understanding Attention for Vision-and-Language Tasks
Figure 2 for Understanding Attention for Vision-and-Language Tasks
Figure 3 for Understanding Attention for Vision-and-Language Tasks
Figure 4 for Understanding Attention for Vision-and-Language Tasks
Viaarxiv icon

Vision-and-Language Pretrained Models: A Survey

Add code
Apr 28, 2022
Figure 1 for Vision-and-Language Pretrained Models: A Survey
Figure 2 for Vision-and-Language Pretrained Models: A Survey
Viaarxiv icon