Student Probability Seminar

New Potential-Based Bounds for Prediction with Expert Advice

Speaker: Vlad Kobzar, CIMS

Location: Warren Weaver Hall 201

Date: Thursday, December 5, 2019, 12:15 p.m.

Synopsis:

We address the classic machine learning problem of online prediction 
with expert advice: At each round, the predictor (player) uses guidance 
from a collection of experts with the goal of minimizing the difference 
(regret) between the player's loss and that of the best performing 
expert in hindsight. The experts' losses and gains are determined by the 
environment (adversary), possibly in an adversarial manner.

State-of-the art player strategies are based on potential functions that 
bound the regret above. On the other hand, state-of-the art adversary 
strategies were historically determined by random walk methods.

Although the player and the adversary may use randomization in their 
strategies, the deterministic control paradigm fully describes the 
expert framework.  Accordingly, using verification arguments from 
optimal control theory, we view the task of finding better lower and 
upper bounds on the value of the game (regret) as the problem of finding 
better sub-and supersolutions of certain partial differential equations 
(PDEs). These sub-and supersolutions serve as the potentials for player 
and adversary strategies, which lead to the corresponding bounds.

To get explicit bounds, we use closed-form solutions of specific PDEs. 
Our bounds hold for any fixed number of experts and any time-horizon; in 
certain regimes they improve upon the previous state-of-the-art.

This is joint work with Robert Kohn and Zhilei Wang at Courant. A 
preprint is available at https://arxiv.org/abs/1911.01641