np.random.choice(self.action_space, p=prediction) #23

Open

opened

on Dec 21, 2022

Hi, I want to ask about this:
Why you use np.random.choice(self.action_space, p=prediction) but not np.argmax()??

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests