In reinforcement learning, what does the term "exploration vs. exploitation" refer to?
- A) The trade-off between using a known action versus trying new actions to discover potentially better ones
- B) The trade-off between increasing the size of the state space and the action space
- C) The trade-off between the speed of learning and the accuracy of predictions
- D) The trade-off between training on historical data versus real-time data