메소드 | 설명 | |
---|---|---|
ChooseAction ( double actionEstimates ) : int |
Choose an action. The method chooses an action depending on the provided estimates. The estimates can be any sort of estimate, which values usefulness of the action (expected summary reward, discounted reward, etc). The action is choosed from non-tabu actions only. |
|
ResetTabuList ( ) : void |
Reset tabu list. Clears tabu list making all actions allowed. |
|
SetTabuAction ( int action, int tabuTime ) : void |
Set tabu action.
|
|
TabuSearchExploration ( int actions, IExplorationPolicy basePolicy ) : System |
Initializes a new instance of the TabuSearchExploration class.
|
public ChooseAction ( double actionEstimates ) : int | ||
actionEstimates | double | Action estimates. |
리턴 | int |
public SetTabuAction ( int action, int tabuTime ) : void | ||
action | int | Action to set tabu for. |
tabuTime | int | Tabu time in iterations. |
리턴 | void |
public TabuSearchExploration ( int actions, IExplorationPolicy basePolicy ) : System | ||
actions | int | Total actions count. |
basePolicy | IExplorationPolicy | Base exploration policy. |
리턴 | System |