Distributed-RL

Distributed RL for Hide and Seek

Instruction

Algorithms

It current supports BC(Behavior Cloning), DQN(Deep Q Network), PPO(Proximal Policy Optimization). Set the hyperparameters in .yaml files in Config/Methods/...

Train

Training methods

It currently supports Self-play and Imitation Learning. DQN and PPO is available for Self-play, and BC is available for Imitation Learning.

The training config can be set in .yaml files in Config/Run/... and distributed learning hyperparameters can be set in .conf files in Config/Shell/...

Start training

Run . Pipeline/run.sh <Training methods.conf>

Name		Name	Last commit message	Last commit date
Latest commit History 246 Commits
.idea		.idea
Config		Config
Experience_Collector		Experience_Collector
Models		Models
Pipeline		Pipeline
unity_wrappers		unity_wrappers
Change_checkpoints.py		Change_checkpoints.py
Network_test.py		Network_test.py
README.md		README.md
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distributed-RL

Instruction

Algorithms

Train

Training methods

Start training

About

Uh oh!

Releases

Packages

Uh oh!

Languages

DarrellDai/Distributed-RL

Folders and files

Latest commit

History

Repository files navigation

Distributed-RL

Instruction

Algorithms

Train

Training methods

Start training

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages