An implementation of several methods for improving the performance of Q-learning, tested on the game Snake.
Developed over a few years, it achieves strong performance on 7x7 maps, averaging around 45 of the 49 possible apples.
Note that Q-learning is suboptimal for most RL scenarios; PPO is often the go-to choice.
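
At its core, tabular Q-learning repeatedly applies the update Q(s, a) ← Q(s, a) + α(r + γ max_a' Q(s', a') − Q(s, a)). The sketch below illustrates that loop; it is not this repository's code, and the environment interface (`reset`/`step`), hyperparameters, and hashable state encoding are all assumptions:

```python
# Minimal tabular Q-learning sketch (illustrative only, not the repo's code).
# Assumes an environment exposing reset() -> state and
# step(action) -> (next_state, reward, done), with hashable states.
import random
from collections import defaultdict

def train(env, n_actions, episodes=10_000,
          alpha=0.1, gamma=0.99, epsilon=0.1):
    """Learn Q(s, a) with an epsilon-greedy behavior policy."""
    Q = defaultdict(lambda: [0.0] * n_actions)
    for _ in range(episodes):
        state, done = env.reset(), False
        while not done:
            # Epsilon-greedy action selection.
            if random.random() < epsilon:
                action = random.randrange(n_actions)
            else:
                action = max(range(n_actions), key=lambda a: Q[state][a])
            next_state, reward, done = env.step(action)
            # Q-learning update: bootstrap from the greedy next action.
            target = reward + (0.0 if done else gamma * max(Q[next_state]))
            Q[state][action] += alpha * (target - Q[state][action])
            state = next_state
    return Q
```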