Advanced Bandit Algorithms Research

Network DLN shows cost-adjusted utility gains at large scale
Computational models of cognitive stages show that network-stage architectures outperform linear stages through estimation efficiency and explicit exposure tracking, not just parameter reduction.
Thompson sampling improved exercise recommendations for learner skill gain
Contextual Thompson sampling approach for personalizing exercise sequences in digital learning environments, optimizing skill advancement at scale using bandit-based algorithms.
NFL money line odds show pricing gaps
Study reveals NFL sportsbooks misprice money line underdog odds relative to spread odds, enabling conditional returns up to 6.55% and challenging market efficiency assumptions.