C# 클래스 AForge.MachineLearning.EpsilonGreedyExploration

Epsilon greedy exploration policy.

The class implements epsilon greedy exploration policy. Acording to the policy, the best action is chosen with probability 1-epsilon. Otherwise, with probability epsilon, any other action, except the best one, is chosen randomly.

According to the policy, the epsilon value is known also as exploration rate.

상속: IExplorationPolicy

파일 보기 프로젝트 열기: holisticware-admin/MonoVersal.AForgeNET

공개 메소드들

메소드	설명
ChooseAction ( double actionEstimates ) : int	Choose an action. The method chooses an action depending on the provided estimates. The estimates can be any sort of estimate, which values usefulness of the action (expected summary reward, discounted reward, etc).
EpsilonGreedyExploration ( double epsilon ) : System	Initializes a new instance of the EpsilonGreedyExploration class.

메소드 상세

ChooseAction() 공개 메소드

Choose an action.

The method chooses an action depending on the provided estimates. The estimates can be any sort of estimate, which values usefulness of the action (expected summary reward, discounted reward, etc).

public ChooseAction ( double actionEstimates ) : int
actionEstimates	double	Action estimates.
리턴	int

EpsilonGreedyExploration() 공개 메소드

Initializes a new instance of the EpsilonGreedyExploration class.

public EpsilonGreedyExploration ( double epsilon ) : System
epsilon	double	Epsilon value (exploration rate).
리턴	System