上一条: Exploring Best Arm with Top Reward-Cost Ratio in Stochastic Bandits
下一条: A Feedback reduction algorithm for OFDM based transmit power adaptation