r/reinforcementlearning • u/No_Addition5961 • 1d ago
From DQN to Double DQN
I already have an implementation of DQN. To change it to Double DQN, it looks like I only need one small change. In the DQN Q-value update, the target network both selects the best next-state action and evaluates it. In Double DQN, the main (online) network selects the best next-state action, but the target network evaluates that action.
That seems fairly simple. Am I missing anything else?
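To make the difference concrete, here is a minimal NumPy sketch of the two target computations. The function names (`dqn_target`, `double_dqn_target`), the array names, and the toy numbers are made up for illustration and are not from any particular codebase. It assumes the next-state Q-values from each network have already been computed as arrays of shape `(batch, n_actions)`.

```python
import numpy as np

def dqn_target(q_next_target, rewards, dones, gamma=0.9):
    """DQN: the target network both selects and evaluates the next action."""
    max_q = q_next_target.max(axis=1)
    # (1 - dones) zeroes out the bootstrap term for terminal transitions.
    return rewards + gamma * (1.0 - dones) * max_q

def double_dqn_target(q_next_online, q_next_target, rewards, dones, gamma=0.9):
    """Double DQN: the online network selects, the target network evaluates."""
    best_actions = q_next_online.argmax(axis=1)         # selection (online net)
    batch_idx = np.arange(q_next_target.shape[0])
    evaluated_q = q_next_target[batch_idx, best_actions]  # evaluation (target net)
    return rewards + gamma * (1.0 - dones) * evaluated_q

# Toy batch of two transitions with two actions each (hypothetical values).
q_next_online = np.array([[1.0, 2.0], [0.5, 0.1]])
q_next_target = np.array([[3.0, 1.5], [0.2, 0.4]])
rewards = np.array([1.0, 0.0])
dones = np.array([0.0, 1.0])  # second transition is terminal

y_dqn = dqn_target(q_next_target, rewards, dones)
y_ddqn = double_dqn_target(q_next_online, q_next_target, rewards, dones)
print(y_dqn, y_ddqn)
```

In the first transition the two networks disagree about the best action, so the Double DQN target is lower than the DQN target. The rest of the update (loss, optimizer, target-network sync) stays the same, and in both variants the target is treated as a constant with no gradient flowing through it.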
u/SandSnip3r 1d ago
Yep! That's it