Back to Search
Start Over
Foodnet: multi-scale and label dependency learning-based multi-task network for food and ingredient recognition.
- Source :
-
Neural Computing & Applications . Mar2024, Vol. 36 Issue 9, p4485-4501. 17p. - Publication Year :
- 2024
-
Abstract
- Image-based food pattern classification poses challenges of non-fixed spatial distribution and ingredient occlusion for mainstream computer vision algorithms. However, most current approaches classify food and ingredients by directly extracting abstract features of the entire image through a convolutional neural network (CNN), ignoring the relationship between food and ingredients and ingredient occlusion problem. To address these issues mentioned, we propose a FoodNet for both food and ingredient recognition, which uses a multi-task structure with a multi-scale relationship learning module (MSRL) and a label dependency learning module (LDL). As ingredients normally co-occur in an image, we present the LDL to use the dependency of ingredient to alleviate the occlusion problem of ingredient. MSRL aggregates multi-scale information of food and ingredients, then uses two relational matrixs to model the food-ingredient matching relationship to obtain richer feature representation. The experimental results show that FoodNet can achieve good performance on the Vireo Food-172 and UEC Food-100 datasets. It is worth noting that it reaches the most state-of-the-art level in terms of ingredient recognition in the Vireo Food-172 and UECFood-100.The source code will be made available at https://github.com/visipaper/FoodNet. [ABSTRACT FROM AUTHOR]
- Subjects :
- *CONVOLUTIONAL neural networks
*ALGORITHMS
*LEARNING modules
*SOURCE code
Subjects
Details
- Language :
- English
- ISSN :
- 09410643
- Volume :
- 36
- Issue :
- 9
- Database :
- Academic Search Index
- Journal :
- Neural Computing & Applications
- Publication Type :
- Academic Journal
- Accession number :
- 175529913
- Full Text :
- https://doi.org/10.1007/s00521-023-09349-4