Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Multi-modal Cooking Workflow Construction for Food Recipes

Aug 20, 2020

Liangming Pan, Jingjing Chen, Jianlong Wu, Shaoteng Liu, Chong-Wah Ngo, Min-Yen Kan, Yu-Gang Jiang, Tat-Seng Chua

Figure 1 for Multi-modal Cooking Workflow Construction for Food Recipes

Figure 2 for Multi-modal Cooking Workflow Construction for Food Recipes

Figure 3 for Multi-modal Cooking Workflow Construction for Food Recipes

Figure 4 for Multi-modal Cooking Workflow Construction for Food Recipes

Share this with someone who'll enjoy it:

Abstract:Understanding food recipe requires anticipating the implicit causal effects of cooking actions, such that the recipe can be converted into a graph describing the temporal workflow of the recipe. This is a non-trivial task that involves common-sense reasoning. However, existing efforts rely on hand-crafted features to extract the workflow graph from recipes due to the lack of large-scale labeled datasets. Moreover, they fail to utilize the cooking images, which constitute an important part of food recipes. In this paper, we build MM-ReS, the first large-scale dataset for cooking workflow construction, consisting of 9,850 recipes with human-labeled workflow graphs. Cooking steps are multi-modal, featuring both text instructions and cooking images. We then propose a neural encoder-decoder model that utilizes both visual and textual information to construct the cooking workflow, which achieved over 20% performance gain over existing hand-crafted baselines.

* This manuscript has been accepted at ACM MM 2020

View paper on

Share this with someone who'll enjoy it:

Title:Multi-modal Cooking Workflow Construction for Food Recipes

Paper and Code