Lina Palmborg: Premium control with reinforcement learning
On 2022-10-26 kl 15.15 - 16.00
Albano, Cramer room
Lina Palmborg (Stockholm University)
Abstract We consider a premium control problem in discrete time, inspired by earlier works of Anders Martin-Löf, formulated in terms of a Markov decision process. In a simplified setting, the optimal premium rule can be derived with dynamic programming methods. However, these classical methods are not feasible in a more realistic setting due to the dimension of the state space. Hence, to combat the curse of dimensionality we explore reinforcement learning techniques, using linear function approximation. We illustrate the appropriateness of the approximate optimal premium rule compared with the true optimal premium rule in a simplified setting, and further demonstrate that the approximate optimal premium rule outperforms benchmark rules in a more realistic setting where classical approaches fail.