SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection

Zhou, Pengfei; Min, Weiqing; Zhang, Yang; Song, Jiajun; Jin, Ying; Jiang, Shuqiang

doi:10.1145/3581783.3612661

Computer Science > Computer Vision and Pattern Recognition

arXiv:2310.04689 (cs)

[Submitted on 7 Oct 2023]

Title:SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection

Authors:Pengfei Zhou, Weiqing Min, Yang Zhang, Jiajun Song, Ying Jin, Shuqiang Jiang

View PDF

Abstract:Food detection is becoming a fundamental task in food computing that supports various multimedia applications, including food recommendation and dietary monitoring. To deal with real-world scenarios, food detection needs to localize and recognize novel food objects that are not seen during training, demanding Zero-Shot Detection (ZSD). However, the complexity of semantic attributes and intra-class feature diversity poses challenges for ZSD methods in distinguishing fine-grained food classes. To tackle this, we propose the Semantic Separable Diffusion Synthesizer (SeeDS) framework for Zero-Shot Food Detection (ZSFD). SeeDS consists of two modules: a Semantic Separable Synthesizing Module (S$^3$M) and a Region Feature Denoising Diffusion Model (RFDDM). The S$^3$M learns the disentangled semantic representation for complex food attributes from ingredients and cuisines, and synthesizes discriminative food features via enhanced semantic information. The RFDDM utilizes a novel diffusion model to generate diversified region features and enhances ZSFD via fine-grained synthesized features. Extensive experiments show the state-of-the-art ZSFD performance of our proposed method on two food datasets, ZSFooD and UECFOOD-256. Moreover, SeeDS also maintains effectiveness on general ZSD datasets, PASCAL VOC and MS COCO. The code and dataset can be found at this https URL.

Comments:	Accepted by ACM Multimedia 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2310.04689 [cs.CV]
	(or arXiv:2310.04689v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.04689
Related DOI:	https://doi.org/10.1145/3581783.3612661

Submission history

From: Pengfei Zhou [view email]
[v1] Sat, 7 Oct 2023 05:29:18 UTC (9,294 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators