Single agent settingยถ
TL;DR
We use Flatland as a single agent environment to make it easier to train using simple methods. This doesnโt scale well but can be used to test new methods and check the efficiency of new observations!
๐ก The ideaยถ
Multi-agent problems are by nature more complicated than standard reinforcement learning problem.
As a first step, we
๐ Resultsยถ
We can see that the policy can easily handle a single agent. However, as the number of agent grows, the performance quickly diminishes.
The episode length is lower with 2 agents compared to 3 agents, but stays over 200, which is excessive for such small environments.
Only in the 1-agent does the policy manage to consistently bring most of the agents to their targets.
๐ Creditsยถ
Sharada Mohanty - idea
Florian Laurent - implementation