Single agent settingยถ

TL;DR

We use Flatland as a single agent environment to make it easier to train using simple methods. This doesnโ€™t scale well but can be used to test new methods and check the efficiency of new observations!

๐Ÿ’ก The ideaยถ

Multi-agent problems are by nature more complicated than standard reinforcement learning problem.

As a first step, we

๐Ÿ“ˆ Resultsยถ


We can see that the policy can easily handle a single agent. However, as the number of agent grows, the performance quickly diminishes.

The episode length is lower with 2 agents compared to 3 agents, but stays over 200, which is excessive for such small environments.

Only in the 1-agent does the policy manage to consistently bring most of the agents to their targets.

Full W&b report

๐ŸŒŸ Creditsยถ