Twin Nphard

Graph-based Deterministic Policy Gradient for Repetitive Combinatorial Optimization Problems