Information-TheoreticConsiderationsinBatchReinforcementLearningJinglinChen1NanJiang1AbstractwhentheyworkiscentraltoourunderstandingofRL.Ex-istingworksthatanalyzeerrorpropagationandfinitesam-Value-f...