Reinforcement Learning-Based Bandwidth Decision in Optical Access Networks: A Study of Exploration Strategy and Time With Confidence Guarantee | IEEE Journals & Magazine | IEEE Xplore