Enhance Sketch Recognition’s Explainability via Semantic Component-Level Parsing

Authors

  • Guangming Zhu School of Computer Science and Technology, Xidian University, China Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province Xi'an Key Laboratory of Intelligent Software Engineering
  • Siyuan Wang School of Computer Science and Technology, Xidian University, China
  • Tianci Wu School of Computer Science and Technology, Xidian University, China
  • Liang Zhang School of Computer Science and Technology, Xidian University, China Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province Xi'an Key Laboratory of Intelligent Software Engineering

DOI:

https://doi.org/10.1609/aaai.v38i7.28607

Keywords:

CV: Applications, CV: Interpretability, Explainability, and Transparency, HAI: Interaction Techniques and Devices

Abstract

Free-hand sketches are appealing for humans as a universal tool to depict the visual world. Humans can recognize varied sketches of a category easily by identifying the concurrence and layout of the intrinsic semantic components of the category, since humans draw free-hand sketches based a common consensus that which types of semantic components constitute each sketch category. For example, an airplane should at least have a fuselage and wings. Based on this analysis, a semantic component-level memory module is constructed and embedded in the proposed structured sketch recognition network in this paper. The memory keys representing semantic components of each sketch category can be self-learned and enhance the recognition network's explainability. Our proposed networks can deal with different situations of sketch recognition, i.e., with or without semantic components labels of strokes. Experiments on the SPG and SketchIME datasets demonstrate the memory module's flexibility and the recognition network's explainability. The code and data are available at https://github.com/GuangmingZhu/SketchESC.

Published

2024-03-24

How to Cite

Zhu, G., Wang, S., Wu, T., & Zhang, L. (2024). Enhance Sketch Recognition’s Explainability via Semantic Component-Level Parsing. Proceedings of the AAAI Conference on Artificial Intelligence, 38(7), 7731-7738. https://doi.org/10.1609/aaai.v38i7.28607

Issue

Section

AAAI Technical Track on Computer Vision VI