All Questions
1 question
2votes
0answers
53views
Single-value loss/training in a CNN with a tensor output
I am playing around with an idea of using using Q-learning with a DQN (Deep Q-Network), to determine the optimal position of a number of 'units' on a grid of allowed locations, according to some ...