Bandit Algorithms - Rilegato

Lattimore, Tor; Szepesvári, Csaba

 
9781108486828: Bandit Algorithms

Sinossi

A comprehensive and rigorous introduction for graduate students and researchers, with applications in sequential decision-making problems.

Le informazioni nella sezione "Riassunto" possono far riferimento a edizioni diverse di questo titolo.

Informazioni sugli autori

Tor Lattimore is a research scientist at DeepMind. His research is focused on decision making in the face of uncertainty, including bandit algorithms and reinforcement learning. Before joining DeepMind he was an assistant professor at Indiana University and a postdoctoral fellow at the University of Alberta.

Csaba Szepesvári is a Professor in the Department of Computing Science at the University of Alberta and a Principal Investigator of the Alberta Machine Intelligence Institute. He also leads the 'Foundations' team at DeepMind. He has co-authored a book on nonlinear approximate adaptive controllers and authored a book on reinforcement learning, in addition to publishing over 200 journal and conference papers. He is an action editor of the Journal of Machine Learning Research.

Le informazioni nella sezione "Su questo libro" possono far riferimento a edizioni diverse di questo titolo.