Alcune informazioni sono riportate in lingua inglese.
Chi sono
Reinforcement Learning projects in python.
I have experience in the following reinforcement learning algorithms:
1. Value Iteration
2. Policy Iteration
3. Q-learning
4. DQN (Deep Q Network)
5. DDPG (Deep Deterministic Policy Gradient)
6. TRPO (Trust Region Policy Optimisation)
7. PPO (Proximal Policy Optimisation)
I am very comfortable with OpenAI gym and can create any kind of custom environment.
I can work with Tensorflow and Pytorch. ... Continua a leggere