2025_12_16_rltheory

I gave a talk on off-policy contextual bandits at the RL Theory Seminar: [video], [slides].