Attentional bottleneck: Towards an interpretable deep driving network
Proceedings of the IEEE/CVF Conference on Computer Vision and …, 2020•openaccess.thecvf.com
Deep neural networks are a key component of behavior prediction and motion generation
for self-driving cars. One of their main drawbacks is a lack of transparency: they should
provide easy to interpret rationales for what triggers certain behaviors. We propose an
architecture called Attentional Bottleneck with the goal of improving transparency. Our key
idea is to combine visual attention, which identifies what aspects of the input the model is
using, with an information bottleneck that enables the model to only use aspects of the input …
for self-driving cars. One of their main drawbacks is a lack of transparency: they should
provide easy to interpret rationales for what triggers certain behaviors. We propose an
architecture called Attentional Bottleneck with the goal of improving transparency. Our key
idea is to combine visual attention, which identifies what aspects of the input the model is
using, with an information bottleneck that enables the model to only use aspects of the input …
Abstract
Deep neural networks are a key component of behavior prediction and motion generation for self-driving cars. One of their main drawbacks is a lack of transparency: they should provide easy to interpret rationales for what triggers certain behaviors. We propose an architecture called Attentional Bottleneck with the goal of improving transparency. Our key idea is to combine visual attention, which identifies what aspects of the input the model is using, with an information bottleneck that enables the model to only use aspects of the input which are important. This not only provides sparse and interpretable attention maps (eg focusing only on specific vehicles in the scene), but it adds this transparency at no cost to model accuracy. In fact, we find slight improvements in accuracy when applying Attentional Bottleneck to the ChauffeurNet model in comparison to a traditional visual attention model that degrades accuracy.
openaccess.thecvf.com
Showing the best result for this search. See all results