An effective combination of loss gradients for multi-task learning applied on instance segmentation and depth estimation. (April 2021)
- Record Type:
- Journal Article
- Title:
- An effective combination of loss gradients for multi-task learning applied on instance segmentation and depth estimation. (April 2021)
- Main Title:
- An effective combination of loss gradients for multi-task learning applied on instance segmentation and depth estimation
- Authors:
- Nakamura, Angelica Tiemi Mizuno
Grassi, Valdir
Wolf, Denis Fernando - Abstract:
- Abstract: Advanced driver assistance systems are responsible for assisting decision making and can play an important role in safety and traffic efficiency. Such systems require robust perception methods to handle complex urban scenes, and one way to achieve this is through instance segmentation. However, due to the difficulty in separating overlapping objects into different instances, this task becomes very challenging. For this, several authors proposed CNN-based methods and used depth information to enhance the instance segmentation performance. A promising way to explore this information is by adopting a multi-task learning approach, in which multiple tasks are learned simultaneously by sharing the same architecture. Usually, this combination is made by the weighted sum of loss functions, in which the weight of each task is defined manually. Nonetheless, when tasks have different natures with variation in the order of magnitude, performing this combination during training so that all tasks converge towards their optimal solution is not trivial. Aiming to get the best possible solution, we modeled the multi-task learning as a multiobjective optimization problem and, as the main contribution of this paper, we proposed a greedy approach to find the weighting coefficients for each task, performing a trade-off between tasks that allow the optimization of multiple loss functions. Experimental results showed that it is possible to enhance instance segmentation when depthAbstract: Advanced driver assistance systems are responsible for assisting decision making and can play an important role in safety and traffic efficiency. Such systems require robust perception methods to handle complex urban scenes, and one way to achieve this is through instance segmentation. However, due to the difficulty in separating overlapping objects into different instances, this task becomes very challenging. For this, several authors proposed CNN-based methods and used depth information to enhance the instance segmentation performance. A promising way to explore this information is by adopting a multi-task learning approach, in which multiple tasks are learned simultaneously by sharing the same architecture. Usually, this combination is made by the weighted sum of loss functions, in which the weight of each task is defined manually. Nonetheless, when tasks have different natures with variation in the order of magnitude, performing this combination during training so that all tasks converge towards their optimal solution is not trivial. Aiming to get the best possible solution, we modeled the multi-task learning as a multiobjective optimization problem and, as the main contribution of this paper, we proposed a greedy approach to find the weighting coefficients for each task, performing a trade-off between tasks that allow the optimization of multiple loss functions. Experimental results showed that it is possible to enhance instance segmentation when depth information is properly explored. Moreover, not only did depth information help instance segmentation, but also did the instance segmentation help the depth estimations, achieving better performance compared to single-task models. … (more)
- Is Part Of:
- Engineering applications of artificial intelligence. Volume 100(2021)
- Journal:
- Engineering applications of artificial intelligence
- Issue:
- Volume 100(2021)
- Issue Display:
- Volume 100, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 100
- Issue:
- 2021
- Issue Sort Value:
- 2021-0100-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-04
- Subjects:
- Multi-task learning -- Instance segmentation -- Depth estimation
Engineering -- Data processing -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Ingénierie -- Informatique -- Périodiques
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
Artificial intelligence
Engineering -- Data processing
Expert systems (Computer science)
Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09521976 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.engappai.2021.104205 ↗
- Languages:
- English
- ISSNs:
- 0952-1976
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3755.704500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 16719.xml