Advanced Bandit Algorithms Research

External reference: https://openalex.org/T12101

  1. Network DLN shows cost-adjusted utility gains at large scale
    Computational models of cognitive stages show that network-stage architectures outperform linear stages through estimation efficiency and explicit exposure tracking, not just parameter reduction.
  2. Thompson sampling improved exercise recommendations for learner skill gain
    Contextual Thompson sampling approach for personalizing exercise sequences in digital learning environments, optimizing skill advancement at scale using bandit-based algorithms.
  3. NFL money line odds show pricing gaps
    Study reveals NFL sportsbooks misprice money line underdog odds relative to spread odds, enabling conditional returns up to 6.55% and challenging market efficiency assumptions.