Josiah Poon

TriG-NER: Triplet-Grid Framework for Discontinuous Named Entity Recognition

Nov 04, 2024
Via arXiv

GEM-VPC: A dual Graph-Enhanced Multimodal integration for Video Paragraph Captioning

Oct 12, 2024
Via arXiv

Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond

Oct 08, 2024
Via arXiv

Do Text-to-Vis Benchmarks Test Real Use of Visualisations?

Jul 29, 2024
Via arXiv

3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health Detection

Jul 12, 2024
Via arXiv

Game-MUG: Multimodal Oriented Game Situation Understanding and Commentary Generation Dataset

Apr 30, 2024
Via arXiv

M3-VRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding

Feb 28, 2024
Via arXiv

SCO-VIST: Social Interaction Commonsense Knowledge-based Visual Storytelling

Feb 01, 2024
Via arXiv

Re-Temp: Relation-Aware Temporal Representation Learning for Temporal Knowledge Graph Completion

Oct 24, 2023
Via arXiv

MC-DRE: Multi-Aspect Cross Integration for Drug Event/Entity Extraction

Aug 15, 2023
Via arXiv