Clinician-in-the-LoopDecisionMaking:ReinforcementLearningwithNear-OptimalSet-ValuedPoliciesShengpuTang1AdityaModi1MichaelW.Sjoding23JennaWiens1Abstractrewardsignalsviarewardshaping(Lizotteetal.,201...