Skip to main content

MIS Regression

Mean interval score (MIS) regression directly minimizes MIS, a scoring function for predictions of confidence intervals. This is done by using MIS as the loss function for deep neural networks. The formula to calculate MIS is also known as Winkler loss and is expressed as:

MIS=1h∑j=1h((ut+j−lt+j)+2α(lt+j−yt+j)1(yt+j<lt+j)+2α(yt+j−ut+j)1(yt+j>ut+j))MIS = \frac{1}{h}\sum_{j=1}^{h}\left((u_{t+j} - l_{t+j}) + \frac{2}{\alpha}(l_{t+j} - y_{t+j})\mathbb{1}(y_{t+j} < l_{t+j}) + \frac{2}{\alpha}(y_{t+j} - u_{t+j})\mathbb{1}(y_{t+j} > u_{t+j})\right)

where uu and ll are the upper and lower bounds respectively, and α\alpha is a fixed confidence level. Here α\alpha is equivalent to 1−CI1-CI, where CICI is the desired confidence interval. Therefore, α=0.05\alpha=0.05 for a 95%95\% confidence interval. There are 3 parts to this loss function, which summed together equal the total mean interval score.

  1. Penalize distance between the upper and lower bounds.
  2. Penalize distance between the lower bound and actual value by a ratio of 2/α2/\alpha when the actual value is lower than the lower bound.
  3. Penalize distance between the actual value and upper bound by a ratio of 2/α2/\alpha when the actual value is higher than the upper bound.

Since the loss function for this model jointly includes the upper and lower bounds, the result outputs both, unlike with quantile regression.