With the growing development of robotics services, the problem of orchestrating a fleet of robots (or autonomous agents) under various constraints has recently become a major design bottleneck, especially when seeking to optimise service operations. In the Optimization with Learning team, we are int