Divide by zero #1

Hi,
First and foremost, thanks for sharing the code. This is greatly appreciated. 

I am currently testing ARS in other learning environments and found that, in very difficult environments, users of the code may hit a divide-by-zero error, particularly early in the learning process (i.e., when every initial rollout returns zero reward, so the standard deviation of the rewards is zero). The offending line is:

```python
# normalize rewards by their standard deviation
rollout_rewards /= np.std(rollout_rewards)
```
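One possible guard is sketched below. It is only a suggestion, not a tested patch: the helper name `normalize_rewards` and the threshold `eps` are hypothetical, and I am assuming `rollout_rewards` is a NumPy float array. When the standard deviation is (near) zero, the rewards are returned unscaled instead of being divided. Note that with NumPy arrays a 0/0 division usually shows up as a `RuntimeWarning` and `nan` values rather than a raised exception, so the failure can be silent.

```python
import numpy as np


def normalize_rewards(rollout_rewards, eps=1e-8):
    """Scale rewards by their standard deviation.

    Hypothetical helper: `eps` is an assumed threshold below which the
    standard deviation is treated as zero and the division is skipped.
    """
    rollout_rewards = np.asarray(rollout_rewards, dtype=np.float64)
    std = np.std(rollout_rewards)
    if std < eps:
        # All rollouts returned (almost) the same reward, so there is
        # nothing to scale; return the rewards unchanged.
        return rollout_rewards
    return rollout_rewards / std


# Early-training case: every rollout returns zero reward.
rewards = normalize_rewards(np.zeros((8, 2)))
print(np.isnan(rewards).any())  # False
```

Whether skipping the update step entirely would be preferable to leaving the rewards unscaled is a design choice I have not evaluated.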
Thanks,
