Picture for Chia-Wen Kuo

Chia-Wen Kuo

Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model

Add code
Jun 15, 2024
Viaarxiv icon

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Add code
May 09, 2024
Viaarxiv icon

HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning

Add code
May 25, 2023
Viaarxiv icon

CLIP-GCD: Simple Language Guided Generalized Category Discovery

Add code
May 17, 2023
Viaarxiv icon

Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation

Add code
Nov 20, 2022
Viaarxiv icon

Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

Add code
May 09, 2022
Figure 1 for Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Figure 2 for Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Figure 3 for Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Figure 4 for Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Viaarxiv icon

Unbiased Teacher for Semi-Supervised Object Detection

Add code
Feb 18, 2021
Figure 1 for Unbiased Teacher for Semi-Supervised Object Detection
Figure 2 for Unbiased Teacher for Semi-Supervised Object Detection
Figure 3 for Unbiased Teacher for Semi-Supervised Object Detection
Figure 4 for Unbiased Teacher for Semi-Supervised Object Detection
Viaarxiv icon

FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning

Add code
Jul 16, 2020
Figure 1 for FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning
Figure 2 for FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning
Figure 3 for FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning
Figure 4 for FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning
Viaarxiv icon

Who2com: Collaborative Perception via Learnable Handshake Communication

Add code
Mar 21, 2020
Figure 1 for Who2com: Collaborative Perception via Learnable Handshake Communication
Figure 2 for Who2com: Collaborative Perception via Learnable Handshake Communication
Figure 3 for Who2com: Collaborative Perception via Learnable Handshake Communication
Figure 4 for Who2com: Collaborative Perception via Learnable Handshake Communication
Viaarxiv icon

Manifold Graph with Learned Prototypes for Semi-Supervised Image Classification

Add code
Jun 13, 2019
Figure 1 for Manifold Graph with Learned Prototypes for Semi-Supervised Image Classification
Figure 2 for Manifold Graph with Learned Prototypes for Semi-Supervised Image Classification
Figure 3 for Manifold Graph with Learned Prototypes for Semi-Supervised Image Classification
Figure 4 for Manifold Graph with Learned Prototypes for Semi-Supervised Image Classification
Viaarxiv icon