Preprint

Article

Deep ensembles based on Stochastic Activations for Semantic Segmentation

Altmetrics

Downloads

299

Views

255

Comments

A peer-reviewed article of this preprint also exists.

Alessandra Lumini

Loris Nanni^*,Gianluca Maguolo

Alessandra Lumini

Loris Nanni^*,Gianluca Maguolo

This version is not peer-reviewed

Submitted:

28 July 2021

Posted:

30 July 2021

You are already at the latest version

Alerts

Abstract

Semantic segmentation is a very popular topic in modern computer vision and it has applications to many fields. Researchers proposed a variety of architectures over time, but the most common ones exploit an encoder-decoder structure that aims to capture the semantics of the image and it low level features. The encoder uses convolutional layers, in general with a stride larger than one, to extract the features, while the decoder recreates the image by upsampling an using skip connections with the first layers. In this work, we use DeepLab as architecture to test the effectiveness of creating an ensemble of networks by randomly changing the activation functions inside the network multiple times. We also use different backbone networks in our DeepLab to validate our findings. We manage to reach a dice coefficient of 0.888, and a mean Intersection over Union (mIoU) of 0.825, in the competitive Kvasir-SEG dataset. Results in skin detection also confirm the performance of the proposed ensemble, which is ranked first with respect to other state-of-the-art approaches (including HardNet) in a large set of testing datasets. The developed code will be available at https://github.com/LorisNanni.

Keywords:

Subject: Computer Science and Mathematics - Algebra and Number Theory

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.