Experience
Experience tuple at time step \(t\). It contains the observation, action, next observation, reward, terminated. Each item shape is (num_envs, *item_shape)
. *item_shape
depends on the item.
Module: aine_drl.exp
@dataclass(frozen=True)
class Experience
Fields
obs
The observation tuple. See Observation docs.
obs: Observation
action
See Action docs.
action: Action
next_obs
The next observation tuple. See Observation docs.
next_obs: Observation
reward
The reward tensor. *item_shape
= 1
reward: Tensor
terminated
The terminated tensor. *item_shape
= 1
terminated: Tensor
Methods
transform()
Transform the experience tuple with the callable function.
def transform(self, func: Callable[[Tensor], Tensor]) -> Experience
Parameters:
Name | Description |
---|---|
func ((Tensor) -> Tensor ) | The callable function to transform each experience tensor. |
Returns:
Name | Description |
---|---|
exp (Experience ) | The transformed experience. |