Single agent setting¶

TL;DR

We use Flatland as a single agent environment to make it easier to train using simple methods. This doesn’t scale well but can be used to test new methods and check the efficiency of new observations!

💡 The idea¶

Multi-agent problems are by nature more complicated than standard reinforcement learning problem.

As a first step, we

📈 Results¶

We can see that the policy can easily handle a single agent. However, as the number of agent grows, the performance quickly diminishes.

The episode length is lower with 2 agents compared to 3 agents, but stays over 200, which is excessive for such small environments.

Only in the 1-agent does the policy manage to consistently bring most of the agents to their targets.

Full W&b report

🔗 Links¶

🌟 Credits¶

Sharada Mohanty - idea
Florian Laurent - implementation