ray.tune.syncer.Syncer
ray.tune.syncer.Syncer#
- class ray.tune.syncer.Syncer(sync_period: float = 300, sync_timeout: float = 1800)[source]#
Bases:
abc.ABCSyncer class for synchronizing data between Ray nodes and remote (cloud) storage.
This class handles data transfer for two cases:
Synchronizing data such as experiment checkpoints from the driver to cloud storage.
Synchronizing data such as trial checkpoints from remote trainables to cloud storage.
Synchronizing tasks are usually asynchronous and can be awaited using
wait(). The base class implements await_or_retry()API that will retry a failed sync command.The base class also exposes an API to only kick off syncs every
sync_periodseconds.- Parameters
sync_period – The minimum time in seconds between sync operations, as used by
sync_up/down_if_needed.sync_timeout – The maximum time to wait for a sync process to finish before issuing a new sync operation. Ex: should be used by
waitif launching asynchronous sync tasks.
DeveloperAPI: This API may change across minor Ray releases.
Methods
delete(remote_dir)Delete directory on remote storage.
retry()Retry the last sync up, sync down, or delete command.
sync_down(remote_dir, local_dir[, exclude])Synchronize remote directory to local directory.
sync_down_if_needed(remote_dir, local_dir[, ...])Syncs down if time since last sync down is greater than sync_period.
sync_up(local_dir, remote_dir[, exclude])Synchronize local directory to remote directory.
sync_up_if_needed(local_dir, remote_dir[, ...])Syncs up if time since last sync up is greater than sync_period.
validate_upload_dir(upload_dir)Checks if
upload_diris supported by the Syncer.wait()Wait for asynchronous sync command to finish.