Abstract: The dual control problem, first introduced by Feldbaum in the 1960s, is recognized as encapsulating the "exploration versus exploitation" dilemma, central to online learning and control.