All Questions

1 question

0votes

0answers

38views

PPO with multiple actions per action vector

I would like to have the following vector for example [0.2,0.6,0.3,0.4,0.8] end up looking like this after training [0,1,0,0,1]. In other words , rather than choosing one action, I'm choosing more ...

Tofara Moyo

asked Aug 7, 2024 at 10:49

Featured on Meta
Evolving comments: An experiment to encourage engagement and follow-up questions
Updates to advertising guidelines
Upcoming initiatives on Stack Overflow and across the Stack Exchange network...

Hot Network Questions

Why do the infected not attack each other?
Locally free coherent module over open disc
Can multiple creatures use a Legendary Action at the end of a single turn?
How to differentiate the Chinese Translation for "Specialized High Schools" and "Special Education School"?
Which Western countries are looking to cancel procurement/collaboration programs for US weapon systems and how far has that proceeded?
Algebraic proof that the left Maurer-Cartan form is well defined
Idiomatic way of generating a unique filename?
Creating "flag" background for labels using QGIS
There are no employees at the store. Why not?
How To Handle Daughter's Bathroom (#2) Accident?
Why are US executive orders so controversial? Aren't they just the chief executive telling the executive branch what to do?
Why full-wave bridge circuit connect ground
What is this orange button on my antique Black & Decker drill?
Is there a sign-problem when the wavefunction itself has positive and negative values, or is it only when the Hamiltonian has such entries?
Reasoning about quest and story deadlocks etc
Is Sour dough starter Davar Hamaamid?
Revising part of a manuscript not covered by the referee report
What does \clist_map_inline:Nn return?
Doubt regarding center of mass of a cone
My CMOS inverter output does not go to zero
What is this 3-pole LED striplight mains connector?
Can the irrationals be partitioned into dense, disjoint subsets?
Why did Germany allow and help introduce the Deutsche Mark to Montenegro?
I'm owed money from a non-profit for services rendered, but they are unresponsive

All Questions

PPO with multiple actions per action vector

Related Tags

Hot Network Questions