1mgofficial

A Hands-On Guide to Policy Gradient Algorithms for Beginners - REINFORCE, A2C, PPO

Image from unsplash

In the previous article, we discussed two methods under model-free RL algorithms: policy-based and value-based. This article will focus on policy-based algorithms such as REINFORCE, Actor-Critic, and PPO methods of learning, and also implement the algorithms using simple Cart-Pole example.

Policy Gradient:

(link is external)

Reinforcement Learning 101: A Quick Start Guide

In this article, we will go through the basics of reinforcement learning focusing on the framework of RL, some of the key concepts like Markov Decision Process, and Bellman Equations which forms the mathematical foundation of RL. We will also see different RL algorithms and the challenges that we face in RL.

What is Reinforcement Learning?

(link is external)

Building a Comprehensive API Testing Framework using Pytest, Sanic, and TDD

Co-author’s:

  1. Prashant Mishra (Technical Architect @ Tata 1mg)
  2. Sankar Yadalam (Technical Architect @ Tata 1mg)
  3. Aman Garg (Associate Technical Architect @ Tata 1mg)
  4. Dollar Dhingra (Associate Engineering Manager @ Tata 1mg)

Introduction

(link is external)

Elevating Developer Productivity and Delight: A Paradigm Shift in Development and Deployment

Bringing about a paradigm shift in development & deployment

Co-author’s:

  1. Prashant Mishra (Technical Architect @ Tata 1mg)
  2. Vimal Sharma (Technical Architect @ Tata 1mg)
  3. Aman Garg (Associate Technical Architect @ Tata 1mg)

Introduction

(link is external)