上一条: Sequential dynamic resource allocation in multi-beam satellite systems: A learning-based optimization method
下一条: Revising the observation satellite scheduling problem based on deep reinforcement learning