PID Accelerated Value Iteration Algorithm

Amir-massoud Farahmand¹ ²   Mohammad Ghavamzadeh³

Abstract

approximation of the value or action-value functions, i.e., $V_{k+1} \leftarrow T^{\pi} V_k$ or $Q_{k+1} \leftarrow T^{*} Q_k$. For discounted MDPs, the convergence...