A practitioner is nowadays faced with an entire zoo of different datadriven optimization methods all of which are touted to work well in practice. This begs the question which – if any – method should be preferred and under what statistical conditions. In my research I investigate which datadriven optimization methods are efficient and do more with the given data than other methods. Despite the folklore believe that an era of unlimited data is upon us, many realworld decisions in fact critically depend on relatively few relevant data points. A case in point would be personalized health care where medical records are hard to obtain and the fraction of data relevant to a specific medical issue is often very small. Data famously has been claimed as the oil of the 21st century and I believe that just like oil it ought to be treated as a scarce resource and consequently used efficiently.
Efficient Prescription
I was able to show that certain robust datadriven optimization formulations as well as certain regularization techniques are indeed statistically efficient under certain common statistical assumptions on the data. These results were all established for datadriven prediction and prescription problems in which all data is given up front.
Related publications

Van Parys, B.P.G. (2023). “Efficient DataDriven Optimization with Noisy Data “. In: Operations Research Letters. Major Revision. Link

Bennouna, M.A. and B.P.G. Van Parys (2022). “Learning and DecisionMaking with Data: Optimal Formulations and Phase Transitions”. In: Mathematical Programming. Submitted. Won the ORC Best Student Paper Award 2022. Link .

Sutter, T., B.P.G. Van Parys, and D. Kuhn (2021). “A general framework for optimal datadriven optimization”. In: Operations Research. Major Revision. Link

Van Parys, B.P.G., P.M. Esfahani, and D. Kuhn (2017). “From data to decisions: Distributionally robust optimization is optimal”. In: Management Science 67.6, pp. 3387–3402. Link
Efficient Bandits
Sometimes decisions have to be made repeatedly over time instead of only once. When trying to determine which drug to prescribe for treatment of a novel decease, for instance, treatment efficacy data is only available after treatments have been prescribed. In such dynamic settings, a decision made now, and its associated response, will serve as an additional data point later on. When making repeated decisions, exploration (learning) and exploitation (decisionmaking) can no longer be done sequentially but rather must be carefully balanced against each other. Some methods make this tradeoff better than others in particular when structural information is present. For instance, a drug developer could assume that the efficacy of drugs based on chemically related active components are likely similar. Classical reinforcement learning methods such as Thomson sampling or UCB do not treat the collected data efficiently and may end up wasting most collected data even in the presence of such simple structural information. I demonstrate that certain datadriven optimization formulations can, again surprisingly, also balance such exploration and exploitation optimally even in the presence of generic convex structural information.
Related publications

Van Parys, B.P.G. and N. Golreaei (2022). “Optimal learning for structured bandits”. In: Management Science. Accepted. Link

T. Lattimore and C. Szepesvari (2017). “The end of optimism? An asymptotic analysis of finitearmed linear bandits”. In: PMLR. Vol. 54. 2017, pp. 728–737.