All Questions
1 question
0votes
0answers
38views
PPO with multiple actions per action vector
I would like to have the following vector for example [0.2,0.6,0.3,0.4,0.8] end up looking like this after training [0,1,0,0,1]. In other words , rather than choosing one action, I'm choosing more ...