Method | Description | |
---|---|---|
Learn ( | Reinforces the model from the new State, Action, StateP and Reward. | |
Learn ( | Reinforces the model from the new state, action and reward. | |
Learn ( object state, object stateP ) : void | Reinforces the model from the new item and reward. | |
public abstract Learn ( | ||
x1 | | Item features, i.e. the State. |
y | double | Action label. |
x2 | | State/action reward value. |
r | double | Reward value. |
Returns | void |
public abstract Learn ( | ||
x | | Item features, i.e. the State. |
y | double | Action label. |
r | double | Reward value. |
Returns | void |
public Learn ( object state, object stateP ) : void | ||
state | object | Initial State object or features with action label. |
stateP | object | New State object or features with reward label. |
Returns | void |
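
The two abstract overloads above can be illustrated with a minimal sketch of what a subclass might do. Everything in it is an assumption made for illustration. The reference does not show the parameter types of `x1`, `x2` and `x`, so plain `int` state ids stand in for them. `ReinforcementModelSketch`, `TabularQ`, `Value`, `alpha` and `gamma` are hypothetical names, not part of the documented library. `x2` is treated as the new state (StateP), and tabular Q-learning is swapped in as one possible update rule.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for the documented abstract class. The real types of
// x1, x2 and x are not shown in the reference; int state ids are an assumption.
public abstract class ReinforcementModelSketch
{
    public abstract void Learn(int x1, double y, int x2, double r);
    public abstract void Learn(int x, double y, double r);
}

// Tabular Q-learning, used here only as an illustrative update rule.
public class TabularQ : ReinforcementModelSketch
{
    private readonly Dictionary<(int, int), double> q = new Dictionary<(int, int), double>();
    private readonly int actionCount;
    private readonly double alpha; // learning rate (assumed parameter)
    private readonly double gamma; // discount factor (assumed parameter)

    public TabularQ(int actionCount, double alpha = 0.5, double gamma = 0.9)
    {
        this.actionCount = actionCount;
        this.alpha = alpha;
        this.gamma = gamma;
    }

    // Current estimate for a state/action pair; unseen pairs default to 0.
    public double Value(int state, int action)
        => q.TryGetValue((state, action), out var v) ? v : 0.0;

    // Best estimate over all actions available in a state.
    private double MaxValue(int state)
    {
        double best = Value(state, 0);
        for (int a = 1; a < actionCount; a++)
            best = Math.Max(best, Value(state, a));
        return best;
    }

    // State, Action, StateP, Reward: bootstrap from the best value in the new state.
    public override void Learn(int x1, double y, int x2, double r)
    {
        int a = (int)y; // the action label is assumed to encode an action index
        double target = r + gamma * MaxValue(x2);
        q[(x1, a)] = Value(x1, a) + alpha * (target - Value(x1, a));
    }

    // State, Action, Reward: no new state, so the target is the reward alone.
    public override void Learn(int x, double y, double r)
    {
        int a = (int)y;
        q[(x, a)] = Value(x, a) + alpha * (r - Value(x, a));
    }
}
```

With the default `alpha` of 0.5 and `gamma` of 0.9, calling `Learn(1, 0.0, 1.0)` on a fresh two-action table sets `Value(1, 0)` to 0.5. A following `Learn(0, 1.0, 1, 0.0)` then sets `Value(0, 1)` to 0.225, because the zero reward is combined with the discounted best value of state 1. The `Learn ( object state, object stateP )` overload is left out of the sketch, since the reference does not say how the action and reward labels are read from the two objects.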