UCB中置信区间怎么推导出来的
Upper Confidence BoundsRandom exploration gives us an opportunity to try out options that we have not known much about. However, due to the randomness, it is possible we end up exploring a bad actio...