In the context of reinforcement learning, what is the "Bellman Equation" used for?

  • A) To update the weights of a neural network
  • B) To compute the value of a state or action in terms of the expected future rewards
  • C) To initialize the parameters of a model
  • D) To encode categorical features into numerical values