ray.rllib.utils.exploration.gaussian_noise.GaussianNoise
ray.rllib.utils.exploration.gaussian_noise.GaussianNoise#
- class ray.rllib.utils.exploration.gaussian_noise.GaussianNoise(action_space: <MagicMock name='mock.Space' id='140492834250560'>, *, framework: str, model: ray.rllib.models.modelv2.ModelV2, random_timesteps: int = 1000, stddev: float = 0.1, initial_scale: float = 1.0, final_scale: float = 0.02, scale_timesteps: int = 10000, scale_schedule: Optional[ray.rllib.utils.schedules.schedule.Schedule] = None, **kwargs)[source]#
Bases:
ray.rllib.utils.exploration.exploration.ExplorationAn exploration that adds white noise to continuous actions.
If explore=True, returns actions plus scale (annealed over time) x Gaussian noise. Also, some completely random period is possible at the beginning.
If explore=False, returns the deterministic action.
Methods
__init__(action_space, *, framework, model)Initializes a GaussianNoise instance.
before_compute_actions(*[, timestep, ...])Hook for preparations before policy.compute_actions() is called.
get_exploration_optimizer(optimizers)May add optimizer(s) to the Policy's own
optimizers.get_state([sess])Returns the current scale value.
on_episode_end(policy, *[, environment, ...])Handles necessary exploration logic at the end of an episode.
on_episode_start(policy, *[, environment, ...])Handles necessary exploration logic at the beginning of an episode.
postprocess_trajectory(policy, sample_batch)Handles post-processing of done episode trajectories.