ray.rllib.policy.sample_batch.SampleBatch.get_single_step_input_dict
ray.rllib.policy.sample_batch.SampleBatch.get_single_step_input_dict#
- SampleBatch.get_single_step_input_dict(view_requirements: Dict[str, ViewRequirement], index: Union[str, int] = 'last') SampleBatch[source]#
Creates single ts SampleBatch at given index from
self.For usage as input-dict for model (action or value function) calls.
- Parameters
view_requirements – A view requirements dict from the model for which to produce the input_dict.
index – An integer index value indicating the position in the trajectory for which to generate the compute_actions input dict. Set to “last” to generate the dict at the very end of the trajectory (e.g. for value estimation). Note that “last” is different from -1, as “last” will use the final NEXT_OBS as observation input.
- Returns
The (single-timestep) input dict for ModelV2 calls.