1mgofficial

In the previous article, we discussed two families of model-free RL algorithms: policy-based and value-based. This article focuses on policy-based algorithms such as REINFORCE, Actor-Critic, and PPO, and implements them on a simple Cart-Pole example.

Policy Gradient:

Exploring the promise, pitfalls, and practical applications of using LLMs to automate AI evaluation - from synthetic QA to clinical reasoning tasks.

Read more about LLM-as-a-Judge: Can Language Models Be Trusted to Evaluate Other Models?

In this article, we will go through the basics of reinforcement learning, focusing on the RL framework and key concepts such as the Markov Decision Process and the Bellman equations, which form the mathematical foundation of RL. We will also look at different RL algorithms and the challenges we face in RL.

What is Reinforcement Learning?

Read more about Reinforcement Learning 101: A Quick Start Guide

One Reward to Rule Them All — Inside the Engine Powering Homepage Recommendation

Read more about One Reward to Rule Them All — Inside the Engine Powering Homepage Recommendation
2 views

Co-authors:

Pankaj Pandey (Senior Technical Architect @ Tata 1mg)
Prashant Mishra (Technical Architect @ Tata 1mg)
Vimal Sharma (Technical Architect @ Tata 1mg)

Read more about Balancing Database Read and Write Queries with Replication Lag Handling
9 views

Scalable task scheduling — Server-Less

Co-Authors: Pankaj Pandey, Prashant Mishra, Aman Garg

Introduction

Read more about Scalable task scheduling — Server-Less
5 views

Dopamine — Tata 1mg’s design system

Read more about Dopamine — Tata 1mg’s design system
5 views

Co-authors:

Prashant Mishra (Technical Architect @ Tata 1mg)
Sankar Yadalam (Technical Architect @ Tata 1mg)
Aman Garg (Associate Technical Architect @ Tata 1mg)
Dollar Dhingra (Associate Engineering Manager @ Tata 1mg)

Introduction

Read more about Building a Comprehensive API Testing Framework using Pytest, Sanic, and TDD
17 views

Co-authors:

Swati Grover (Associate Technical Architect @ Tata 1mg)
Prashant Mishra (Technical Architect @ Tata 1mg)

Introduction

Read more about Achieving Seamless PostgreSQL Upgrades from 10 to 12 on AWS: Lessons from Tata 1mg
2 views

Bringing about a paradigm shift in development & deployment

Co-authors:

Prashant Mishra (Technical Architect @ Tata 1mg)
Vimal Sharma (Technical Architect @ Tata 1mg)
Aman Garg (Associate Technical Architect @ Tata 1mg)

Introduction

Read more about Elevating Developer Productivity and Delight: A Paradigm Shift in Development and Deployment
6 views
