C# Class Accord.MachineLearning.EpsilonGreedyExploration

Epsilon greedy exploration policy.

The class implements the epsilon-greedy exploration policy. Under this policy, the best action is chosen with probability 1-epsilon. Otherwise, with probability epsilon, one of the other actions (any action except the best one) is chosen at random.

The epsilon value is also known as the exploration rate.
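The selection rule described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the Accord.NET source: the class name `EpsilonGreedySketch`, the `seed` parameter, the first-index tie-breaking rule, and the handling of a single-action array are all hypothetical choices made for this example.

```csharp
using System;

// Hypothetical stand-in written for illustration; not the Accord.NET implementation.
public sealed class EpsilonGreedySketch
{
    private readonly double epsilon;
    private readonly Random random;

    // The seed parameter is an assumption of this sketch; it makes runs repeatable.
    public EpsilonGreedySketch(double epsilon, int seed)
    {
        if (epsilon < 0.0 || epsilon > 1.0)
            throw new ArgumentOutOfRangeException(nameof(epsilon), "Epsilon must be in [0, 1].");

        this.epsilon = epsilon;
        this.random = new Random(seed);
    }

    public int ChooseAction(double[] actionEstimates)
    {
        if (actionEstimates == null || actionEstimates.Length == 0)
            throw new ArgumentException("At least one action estimate is required.", nameof(actionEstimates));

        // Greedy action: index of the largest estimate (the first one wins ties).
        int best = 0;
        for (int i = 1; i < actionEstimates.Length; i++)
        {
            if (actionEstimates[i] > actionEstimates[best])
                best = i;
        }

        // Exploit with probability 1 - epsilon. With a single action there is nothing to explore.
        if (actionEstimates.Length == 1 || random.NextDouble() >= epsilon)
            return best;

        // Explore: draw uniformly from the other Length - 1 actions, skipping the best one.
        int other = random.Next(actionEstimates.Length - 1);
        return other >= best ? other + 1 : other;
    }
}
```

With epsilon set to 0 the sketch always returns the index of the largest estimate; with epsilon set to 1 and more than one action it never does, because every call takes the exploration branch.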

Inheritance: IExplorationPolicy

Public Methods

Method Description
ChooseAction ( double[] actionEstimates ) : int

Choose an action.

The method chooses an action based on the provided estimates. The estimates can be any measure of an action's usefulness (expected total reward, discounted reward, etc.).

EpsilonGreedyExploration ( double epsilon )

Initializes a new instance of the EpsilonGreedyExploration class.

Method Details

ChooseAction() public Method

Choose an action.
The method chooses an action based on the provided estimates. The estimates can be any measure of an action's usefulness (expected total reward, discounted reward, etc.).
public ChooseAction ( double[] actionEstimates ) : int
actionEstimates double[] Action estimates.
Returns int

EpsilonGreedyExploration() public Method

Initializes a new instance of the EpsilonGreedyExploration class.
public EpsilonGreedyExploration ( double epsilon )
epsilon double Epsilon value (exploration rate).