GT « Analyse, Algorithmique, Apprentissage »

SU - 15-16-309 4 Place Jussieu, Paris

Sasha Rakhlin (MIT) Title: On the Foundations of Interactive Decision Making Abstract: We present a general framework for interactive decision making that subsumes multi-armed bandits, contextual bandits, structured bandits, and reinforcement […]