GT « Analyse, Algorithmique, Apprentissage »
SU - 15-16-309 4 Place Jussieu, ParisSasha Rakhlin (MIT) Title: On the Foundations of Interactive Decision Making Abstract: We present a general framework for interactive decision making that subsumes multi-armed bandits, contextual bandits, structured bandits, and reinforcement […]