This is a variation of the MADDPG algorithm, enhanced with graph attention layers in the critic networks.
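The source does not include the critic code; below is a minimal NumPy sketch of a single-head graph attention layer of the kind that could aggregate per-agent features inside a centralized critic. The class name `GraphAttentionLayer` and all dimensions are illustrative assumptions, not taken from the original implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

class GraphAttentionLayer:
    """Single-head graph attention layer (hypothetical sketch).

    Treats the N agents as nodes of a graph; each node's features are
    re-weighted by learned attention over its neighbors before being
    fed to the rest of the critic.
    """
    def __init__(self, in_dim, out_dim, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.normal(scale=0.1, size=(in_dim, out_dim))
        self.a = rng.normal(scale=0.1, size=(2 * out_dim,))

    def __call__(self, h, adj):
        # h: (N, in_dim) node features; adj: (N, N) adjacency (1 = edge)
        Wh = h @ self.W                          # (N, out_dim) projected features
        d = Wh.shape[1]
        # attention logits e_ij = LeakyReLU(a^T [Wh_i || Wh_j])
        src = Wh @ self.a[:d]                    # (N,) contribution of node i
        dst = Wh @ self.a[d:]                    # (N,) contribution of node j
        e = src[:, None] + dst[None, :]
        e = np.where(e > 0, e, 0.2 * e)          # LeakyReLU
        e = np.where(adj > 0, e, -1e9)           # mask non-edges before softmax
        alpha = softmax(e, axis=1)               # attention weights, rows sum to 1
        return np.tanh(alpha @ Wh)               # attention-weighted aggregation
```

In a MADDPG-style setup, the adjacency would typically be fully connected (every critic attends over all agents' observations and actions).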
The algorithm has been used to solve the Simple Spread (cooperative navigation) environment from OpenAI's Multi-Agent Particle Environments. There are N agents and N landmarks. Agents are rewarded based on how far the closest agent is from each landmark and are penalized if they collide with other agents, so they must learn to cover all the landmarks while avoiding collisions. I also modified part of the reward function to improve training performance: agents receive an extra +10 when they are near a landmark.
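The modified reward is not shown in the source; the sketch below illustrates what a Simple Spread-style reward with the +10 shaping bonus could look like. The function name `shaped_reward` and all radii and penalty magnitudes are assumptions, not values from the original code.

```python
import numpy as np

def shaped_reward(agent_pos, other_pos, landmarks,
                  near_radius=0.1, near_bonus=10.0,
                  collision_radius=0.15, collision_penalty=1.0):
    """Simple Spread-style reward with a +10 'near a landmark' bonus.

    agent_pos: (2,) position of this agent; other_pos: (M, 2) positions
    of the other agents; landmarks: list of (2,) landmark positions.
    Radii, penalty, and bonus values are illustrative assumptions.
    """
    # base term: negative distance from the closest agent to each landmark
    agents = np.vstack([agent_pos[None, :], other_pos])
    reward = 0.0
    for lm in landmarks:
        reward -= np.min(np.linalg.norm(agents - lm, axis=1))
    # collision penalty against every other agent
    for other in other_pos:
        if np.linalg.norm(agent_pos - other) < collision_radius:
            reward -= collision_penalty
    # shaping bonus: +10 when this agent sits near any landmark
    if any(np.linalg.norm(agent_pos - lm) < near_radius for lm in landmarks):
        reward += near_bonus
    return reward
```

With a dense bonus like this, agents get an immediate positive signal for reaching a landmark instead of relying only on the negative-distance term.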
