0 votes, 0 answers, 38 views

PPO with multiple actions per action vector

I would like a vector such as [0.2, 0.6, 0.3, 0.4, 0.8] to end up as [0, 1, 0, 0, 1] after training. In other words, rather than choosing one action, I'm choosing more ...
Tofara Moyo
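
One common way to read this setup, offered only as a sketch since the question excerpt is cut off, is to treat each entry of the vector as an independent Bernoulli probability (a "multi-binary" action space). Each component is then switched on or off separately, and the log-probability that PPO needs for its ratio is the sum of the per-component log-probabilities. The function names `greedy_action`, `sample_action`, and `joint_log_prob` are hypothetical and only for illustration; they are not taken from the question or from any particular PPO library.

```python
import numpy as np

def greedy_action(probs):
    """Deterministic choice: switch on every component with probability > 0.5."""
    return (np.asarray(probs, dtype=float) > 0.5).astype(int)

def sample_action(probs, rng):
    """Stochastic choice: draw each component from its own Bernoulli distribution."""
    probs = np.asarray(probs, dtype=float)
    return (rng.random(probs.shape) < probs).astype(int)

def joint_log_prob(probs, action):
    """Log-probability of the whole binary vector, assuming independent components."""
    probs = np.clip(np.asarray(probs, dtype=float), 1e-8, 1 - 1e-8)  # avoid log(0)
    action = np.asarray(action)
    return float(np.sum(action * np.log(probs) + (1 - action) * np.log(1 - probs)))

probs = [0.2, 0.6, 0.3, 0.4, 0.8]      # example vector from the question
print(greedy_action(probs))            # [0 1 0 0 1]

rng = np.random.default_rng(0)
action = sample_action(probs, rng)     # a random binary vector of the same length
print(joint_log_prob(probs, action))   # value to use as log pi(a|s) in the PPO ratio
```

Under this assumption, the PPO probability ratio would be `exp(new_log_prob - old_log_prob)` computed from the joint log-probability, so the rest of the clipped objective stays unchanged. Whether independent components are appropriate depends on the task; if the sub-actions are strongly correlated, a different parameterisation may be needed.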