Advanced Bandit Algorithms Research
External reference: https://openalex.org/T12101
-
Network DLN shows cost-adjusted utility gains at large scale Computational models of cognitive stages show that network-stage architectures outperform linear stages through estimation efficiency and explicit exposure tracking, not just parameter reduction.
-
Thompson sampling improved exercise recommendations for learner skill gain Contextual Thompson sampling approach for personalizing exercise sequences in digital learning environments, optimizing skill advancement at scale using bandit-based algorithms.
-
NFL money line odds show pricing gaps Study reveals NFL sportsbooks misprice money line underdog odds relative to spread odds, enabling conditional returns up to 6.55% and challenging market efficiency assumptions.

