📚 node [[target_network]]
📓
garden/KGBicheno/Artificial Intelligence/Introduction to AI/Week 3 - Introduction/Definitions/Target_Network.md by @KGBicheno
target network
Go back to the [[AI Glossary]]
#rl
In Deep Q-learning, a neural network that is a stable approximation of the main neural network, where the main neural network implements either a Q-function or a policy. Then, you can train the main network on the Q-values predicted by the target network. Therefore, you prevent the feedback loop that occurs when the main network trains on Q-values predicted by itself. By avoiding this feedback, training stability increases.
📖 stoas
- public document at doc.anagora.org/target_network
- video call at meet.jit.si/target_network
🔎 full text search for 'target_network'