Optimal budget control in real-time advertising

Loading...
Thumbnail Image

URL

Journal Title

Journal ISSN

Volume Title

School of Science | Master's thesis

Department

Major/Subject

Mcode

Language

en

Pages

73

Series

Abstract

This thesis focuses on bidding strategies in Real Time Bidding (RTB) for advertisers with constrained resources. The work shows how a limited budget increases the need to optimize it and, therefore, to leverage different bidding strategies to maximize profit while satisfying the constraint. In particular, this work evaluates the use of Reinforcement Learning (RL) to solve this problem, having the agent learn to optimize the budget within the RTB environment. The proposed approach employs a Double Deep Q-Network (DDQN) agent that learns to adjust bid multipliers dynamically, using app installs as the reward signal to align with real business objectives. RL is then compared with a simple baseline and a stronger one used in production, and highlights possibilities and limitations. This thesis demonstrates that RL has potential for solving online optimization problems with performance close to industry standard, although further improving it would be at the cost of exponentially growing complexity. In conclusion, this work expands on the practical considerations related to production grade algorithms, such as explainability and maintainability, and highlight the area of improvement that future work using RL for real life optimization problem should tackle.

Description

Supervisor

Uitto, Jara

Thesis advisor

Bragard, Quentin

Other note

Citation