ECE Women Community

Fine-tunning of RL models via local policy iteration

Date: July,25,2024 Start Time: 13:00 - 14:00

Location: 1061, Meyer Building

Zoom: Zoom link

Add to:

Lecturer: Itai Lavie

Affiliations: The Andrew and Erna Viterbi Faculty of Electrical & Computer Engineering

Research Areas:

Machine learning and intelligent systems

Inspired by recent challenges in fine-tuning reinforcement learning (RL) models, we introduce a fine-tuning method employing a local adaptation of Policy Iteration (PI). Our proposed algorithm can be deployed over tasks with large (or even infinite) state spaces, while preserving the existing knowledge of the model. We propose two local variants of the PI algorithm: One utilizing a fixed lookahead (or even no lookahead), and another utilizing an adaptive lookahead. We provide explicit iteration complexity bounds for both local PI algorithms. We empirically validate the efficacy of our local algorithms, and discuss several implications and challenges of applying local fine-tuning methods to deep RL models.

M.Sc. student under the supervision of Prof. Nir Weinberger.

Seminar: Graduate Seminar

Seminars

Fine-tunning of RL models via local policy iteration

Seminars

Fine-tunning of RL models via local policy iteration

Upcoming Seminars

Metagratings on Low-Cost Substrates for Efficient Anomalous Reflection: Addressing Dielectric Loss

Non-Adaptive Multi-Stage Algorithm for Group Testing with Prior Statistics

Unsupervised Invariant Risk Minimization