[[target_network]] - anagora.org

📚 node [[target_network]]

📓 garden/KGBicheno/Artificial Intelligence/Introduction to AI/Week 3 - Introduction/Definitions/Target_Network.md by @KGBicheno

target network

Go back to the [[AI Glossary]]

#rl

In Deep Q-learning, a neural network that is a stable approximation of the main neural network, where the main neural network implements either a Q-function or a policy. Then, you can train the main network on the Q-values predicted by the target network. Therefore, you prevent the feedback loop that occurs when the main network trains on Q-values predicted by itself. By avoiding this feedback, training stability increases.

📖 stoas

public document at doc.anagora.org/target_network
video call at meet.jit.si/target_network

⥱ context

← back
(none)

↑ pushing here
(none)

↓ pulling this
(none)

→ forward
ai glossary

🔎 full text search for 'target_network'