V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
P1205961_A00906_writing_DraftP1205961_A03331_defaultP1205961_A103946_defaultP1205961_A126355_defaultP1205961_A132261_defaultP1205961_A14382_defaultP1205961_A29688_defaultP1205961_A34270_defaultP1205961_A50572_writing_DraftP1205961_A56332_defaultP1205961_A60293_defaultP1205961_A71149_defaultP1205961_A90839_resourceP1205961_A90839_supervisionP1205961_A90839_writing_review_editingP1205961_A92500_writing_review_editing
isContributionToPaper
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
title
V-MPO: On-Policy Maximum a Pos ...... iscrete and Continuous Control
arXivID
1909.12238