Method | Description
---|---
Learn ( | Updates the Q-Learner model by reinforcing with the new state/action and transition state feedback values.
Learn ( | Updates the Q-Learner model by reinforcing with the new state/action feedback values.
Predict ( | Predicts the best action for the current state.
QLearnerModel ( ) : System | Initializes a new Q-Learner model.

public Learn (

Name | Type | Description
---|---|---
x1 | | Item features, i.e. the original State.
y | double | Action label.
x2 | | Transition state value.
r | double | Reward value.
return | void |

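The four parameters line up with the inputs of the textbook tabular Q-learning update. The sketch below is a minimal Python illustration of that rule, not this library's implementation: the `learn` function, the dictionary-backed table, the `actions` list, and the `alpha` (learning rate) and `gamma` (discount) parameters are all assumptions introduced here, since this page does not say how the model stores or weights its values.

```python
from collections import defaultdict

# Hypothetical Q-table: maps (state, action) to an estimated value, default 0.0.
q = defaultdict(float)

def learn(q, x1, y, x2, r, actions, alpha=0.5, gamma=0.9):
    """Textbook Q-learning update for one (state, action, next state, reward) step.

    x1, y, x2 and r mirror the parameter names above; actions, alpha and
    gamma are assumptions made for this sketch.
    """
    s, s_next = tuple(x1), tuple(x2)
    # Best value currently estimated for the transition state.
    best_next = max(q[(s_next, a)] for a in actions)
    # Move Q(s, y) toward the bootstrapped target r + gamma * best_next.
    q[(s, y)] += alpha * (r + gamma * best_next - q[(s, y)])
    return q[(s, y)]

actions = [0.0, 1.0]
learn(q, [0, 0], 1.0, [0, 1], 1.0, actions)  # Q((0, 0), 1.0) becomes 0.5
```

With an empty table, the first call has no look-ahead value to add, so the entry moves halfway (`alpha=0.5`) toward the reward alone.
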
public Learn (

Name | Type | Description
---|---|---
x | | State vector.
y | double | Action label.
r | double | Reward value.
return | void |

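This overload takes no transition state, so one plausible reading is an update that uses only the immediate reward. The Python sketch below shows that reading under stated assumptions: `learn_immediate`, the dictionary table, and the `alpha` learning rate are hypothetical names, and the page does not confirm that the library computes its update this way.

```python
from collections import defaultdict

def learn_immediate(q, x, y, r, alpha=0.5):
    """Nudge Q(x, y) toward the observed reward r, with no look-ahead."""
    key = (tuple(x), y)
    q[key] += alpha * (r - q[key])
    return q[key]

q = defaultdict(float)
learn_immediate(q, [0, 0], 1.0, 1.0)  # 0.0 -> 0.5
learn_immediate(q, [0, 0], 1.0, 1.0)  # 0.5 -> 0.75
```

Repeated rewards of the same size pull the estimate steadily toward that reward, which is the behaviour one would expect from a reinforcement step without a next state.
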
public Predict (

Name | Type | Description
---|---|---
y | |
return | double |
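
"Predicts the best action" is usually taken to mean a greedy lookup: return the action label with the highest learned value for the given state. The Python sketch below illustrates that idea only. The `predict` function, the plain-dictionary table, and the explicit `actions` list are assumptions for illustration, and the parameter is called `state` here rather than `y` for readability.

```python
def predict(q, state, actions):
    """Return the action label with the highest estimated value for `state`."""
    s = tuple(state)
    # Unseen (state, action) pairs count as 0.0; ties go to the first action listed.
    return max(actions, key=lambda a: q.get((s, a), 0.0))

q = {((0, 0), 0.0): 0.1, ((0, 0), 1.0): 0.7}
predict(q, [0, 0], [0.0, 1.0])  # returns 1.0
```

Returning the label as a number matches the `double` return type listed above.
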