Seminar: Graduate Seminar
Exploring in the Dark: Pure Exploration for POMDPs
Date:
September 10, 2025
Start Time:
11:30 - 12:30
Location:
506, Zisapel Building
Zoom:
Zoom link
Lecturer:
Yonatan Ashlag
Abstract:
Unsupervised pre-training has achieved remarkable success in NLP and computer vision by leveraging large-scale unlabeled data. In reinforcement learning, such pre-training, often termed pure exploration, aims to learn task-agnostic exploratory policies that can be efficiently fine-tuned for downstream tasks. However, existing pure exploration methods predominantly assume full observability, a limitation that precludes their application to most real-world scenarios. The core challenge lies in defining the exploration objective: while fully observable methods target state-space coverage, partial observability obscures the true state space. This ambiguity can lead to degenerate solutions in which agents maximize observation diversity through trivial behaviors rather than meaningful exploration. We propose a novel approach that addresses this challenge by learning latent state representations through dynamics modeling, enabling principled exploration in the latent space as a proxy for true state coverage. We demonstrate that our method enables efficient adaptation to sparse-reward POMDP tasks, significantly outperforming baselines that lack structured pre-training.
MSc student under the supervision of Professor Kfir Yehuda Levy.