Description of the Project

Title of the Project

Extending Generalization in Reinforcement Learning

Topic of the Project

I decided to fully focus on the concept of generalization using different Atari environments since I already wrote two different CNN policies for different Atari environments.

Generalization usually means to be able to perform well on similar unseen tasks. We realize that the standard reinforcement learning only uses one environment to train the agent, which can facilite overfitting. Therefore, by extending the concept of generalization, we decide to introduce different environments (tasks) to the agent in order to learn more efficiently a new Atari environment (motivation of meta reinforcement learning). Our goal is here to let the agent train on two different games and it needs to learn the third game by itself based on its trained policy.

Figure

Illustration of Meta-Learning Approach in RL Reference: https://lilianweng.github.io/posts/2019-06-23-meta-rl/ Topic of the project illustrated.

Language and Main Libraries

Programming Language: Python 3.7 with Anaconda virtual environment

GYM: Standard API for reinforcement learning and collection of Atari environments

RLKIT: Library for Reinforcement Learning Algorithms and Experiment for Atari environments

DOODAD: Library for launching Atari experiments on different machines.

IDEs and Software Used in This Project

Pycharm: For debugging issues related to preprocessing of Atari environments and local experiments.

VSCode with ssh extension: For debugging actual Atari experiments while connecting to the lab computers.

Docker: For running experiments inside a nvidia-docker image.

Setup for Experiments

Figure

Illustration of the setup for running an Experiment Topic of the project illustrated.

Which computers are you working on?
Local: Virtual Machine Ubuntu version 22.04
Lab Computer: Glen's lab computers (Blue and Green) accessed through ssh with a GPU (NVIDIA RTX A4000)

How do you execute an experiment?
In the remote computer, I created a nvidia-docker image for my Atari experiment and I run the python file by using that image.