Picture for Chong-Wah Ngo

Chong-Wah Ngo

Retrieval Augmented Recipe Generation

Add code
Nov 13, 2024
Viaarxiv icon

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Add code
Oct 16, 2024
Figure 1 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Figure 2 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Figure 3 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Figure 4 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Viaarxiv icon

Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models

Add code
Sep 11, 2024
Viaarxiv icon

RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models

Add code
Jul 17, 2024
Viaarxiv icon

PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition

Add code
Jul 03, 2024
Viaarxiv icon

Improving Interpretable Embeddings for Ad-hoc Video Search with Generative Captions and Multi-word Concept Bank

Add code
Apr 09, 2024
Viaarxiv icon

OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation

Add code
Apr 01, 2024
Viaarxiv icon

Interpretable Embedding for Ad-hoc Video Search

Add code
Feb 19, 2024
Viaarxiv icon

FoodLMM: A Versatile Food Assistant using Large Multi-modal Model

Add code
Dec 22, 2023
Viaarxiv icon

Incremental Learning on Food Instance Segmentation

Add code
Jun 28, 2023
Viaarxiv icon