Recipe2Vec: Multi-modal Recipe Representation Learning with Graph Neural Networks

Tian, Yijun; Zhang, Chuxu; Guo, Zhichun; Ma, Yihong; Metoyer, Ronald; Chawla, Nitesh V.

arXiv:2205.12396 (cs)

[Submitted on 24 May 2022]

Abstract: Learning effective recipe representations is essential in food studies. Compared with work on image-based recipe retrieval or on learning structural text embeddings, the combined effect of multi-modal information (i.e., recipe images, text, and relational data) has received less attention. In this paper, we formalize the problem of multi-modal recipe representation learning to integrate visual, textual, and relational information into recipe embeddings. In particular, we first present Large-RG, a new recipe graph dataset with over half a million nodes, making it the largest recipe graph to date. We then propose Recipe2Vec, a novel graph neural network based recipe embedding model that captures multi-modal information. Additionally, we introduce an adversarial attack strategy to ensure stable learning and improve performance. Finally, we design a joint objective function of node classification and adversarial learning to optimize the model. Extensive experiments demonstrate that Recipe2Vec outperforms state-of-the-art baselines on two classic food study tasks, i.e., cuisine category classification and region prediction. Dataset and code are available at this https URL.

Comments: Accepted by IJCAI 2022
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as: arXiv:2205.12396 [cs.LG] (or arXiv:2205.12396v1 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2205.12396

Submission history

From: Yijun Tian
[v1] Tue, 24 May 2022 23:04:02 UTC (2,663 KB)
