In the multi-armed bandit problem, the challenge is to balance between exploration of arms and ________ of the best-known arm.

  • Exploitation
  • Reward accumulation
  • Arm selection
  • Probability estimation
The multi-armed bandit problem involves the trade-off between exploration (trying new arms) and exploitation (selecting the best-known arm).
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *