Study on High-Level Structure of cognition control construction in Exploration and Exploitation within Multi-Armed Bandit Model of Reinforcement Learning | IEEE Conference Publication | IEEE Xplore