Deep Q-Network (DQN) – CartPole-v1
This repository contains a trained Deep Q-Network (DQN) agent for the Gymnasium environment CartPole-v1.
Environment
- Environment: CartPole-v1
- State Space: 4-dimensional continuous vector
- Action Space: 2 discrete actions
- Goal: Balance the pole for as long as possible
CartPole has a continuous state space, making tabular Q-learning infeasible.
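As a quick illustration, these spaces can be inspected directly with the Gymnasium API (a minimal sketch; the seed and prints are purely for demonstration):

```python
# Minimal sketch: inspect CartPole-v1's spaces with Gymnasium.
import gymnasium as gym

env = gym.make("CartPole-v1")
print(env.observation_space)  # 4-dim Box: cart position, cart velocity, pole angle, pole angular velocity
print(env.action_space)       # Discrete(2): push cart left (0) or right (1)

state, info = env.reset(seed=0)  # seed chosen only for reproducibility of this demo
print(state.shape)               # (4,)
```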
Algorithm – Deep Q-Network (DQN)
DQN approximates Q-values using a neural network:
Q(s, a; θ)
Training target:
y = r + γ · max_a' Q(s', a'; θ⁻)
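In code, the target for a sampled batch might be computed like this (a sketch; `target_net` is a hypothetical stand-in for the frozen network θ⁻):

```python
# Sketch of the DQN training target y = r + γ · max_a' Q(s', a'; θ⁻).
import torch

def td_target(rewards, next_states, dones, target_net, gamma=0.99):
    # No gradients flow through the frozen target network θ⁻.
    with torch.no_grad():
        max_next_q = target_net(next_states).max(dim=1).values
    # Terminal transitions bootstrap nothing: y = r when done.
    return rewards + gamma * max_next_q * (1.0 - dones.float())
```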
Key components (see the sketch after this list):
- Policy Network
- Target Network
- Experience Replay Buffer
- MSE loss optimization
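A minimal sketch of how these components interact in one gradient step; every name here (`policy_net`, `target_net`, `buffer`, `optimizer`) is an illustrative stand-in, not this repository's actual code:

```python
# One DQN gradient step: sample from replay, regress Q(s, a; θ) toward the target.
import torch
import torch.nn.functional as F

def train_step(policy_net, target_net, buffer, optimizer, batch_size=64, gamma=0.99):
    states, actions, rewards, next_states, dones = buffer.sample(batch_size)
    # Q(s, a; θ) for the actions that were actually taken.
    q_pred = policy_net(states).gather(1, actions.unsqueeze(1)).squeeze(1)
    # Target y = r + γ · max_a' Q(s', a'; θ⁻), held fixed during the update.
    with torch.no_grad():
        q_target = rewards + gamma * target_net(next_states).max(dim=1).values * (1.0 - dones.float())
    loss = F.mse_loss(q_pred, q_target)  # the MSE objective listed above
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```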
Why DQN?
Because CartPole has continuous states:
s ∈ ℝ⁴
A Q-table cannot represent infinitely many distinct states.
A neural network is used for function approximation.
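A plausible architecture for such an approximator (an assumption for illustration; the actual layer sizes in cartpole_dqn.pt may differ):

```python
# Hypothetical Q-network: maps a 4-dimensional state to one Q-value per action.
import torch.nn as nn

class QNetwork(nn.Module):
    def __init__(self, state_dim=4, n_actions=2, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, x):
        return self.net(x)  # shape: (batch, n_actions)
```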
Training Details
- Learning rate: 1e-3
- Discount factor: 0.99
- Batch size: 64
- Target update frequency: 100 steps
- Episodes: 500
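Wiring these values up might look like the following sketch; the Adam optimizer and the QNetwork class from the earlier sketch are assumptions, not confirmed details of this repository:

```python
# Hyperparameters as listed above; the optimizer choice is an assumption.
import torch

LR = 1e-3
GAMMA = 0.99
BATCH_SIZE = 64
TARGET_UPDATE = 100   # hard-sync θ⁻ ← θ every 100 steps
NUM_EPISODES = 500

policy_net = QNetwork()                              # hypothetical class, sketched earlier
target_net = QNetwork()
target_net.load_state_dict(policy_net.state_dict())  # start with θ⁻ = θ
target_net.eval()
optimizer = torch.optim.Adam(policy_net.parameters(), lr=LR)
```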
The experience replay buffer improves stability and data efficiency by letting the agent learn from reused, decorrelated past transitions rather than only the most recent one.
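A uniform replay buffer along these lines (a sketch; the transition tuple layout is an assumption):

```python
# Minimal uniform-sampling replay buffer.
import random
from collections import deque

import numpy as np
import torch

class ReplayBuffer:
    def __init__(self, capacity=10_000):
        self.memory = deque(maxlen=capacity)  # oldest transitions evicted first

    def push(self, state, action, reward, next_state, done):
        self.memory.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling breaks the temporal correlation of consecutive steps.
        states, actions, rewards, next_states, dones = zip(*random.sample(self.memory, batch_size))
        return (torch.as_tensor(np.array(states), dtype=torch.float32),
                torch.as_tensor(actions, dtype=torch.int64),
                torch.as_tensor(rewards, dtype=torch.float32),
                torch.as_tensor(np.array(next_states), dtype=torch.float32),
                torch.as_tensor(dones, dtype=torch.float32))

    def __len__(self):
        return len(self.memory)
```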
Performance
The agent learns to consistently balance the pole for long durations.
Training reward improves steadily over episodes.
Visualization
Below is the trained DQN agent in action:

![Trained DQN agent on CartPole-v1](cartpole_dqn.gif)
Files
- cartpole_dqn.pt – Trained PyTorch model
- cartpole_dqn.gif – Agent demonstration
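If the .pt file stores a state_dict (an assumption, not confirmed by this README), loading it for evaluation might look like:

```python
# Sketch: load the trained weights, assuming cartpole_dqn.pt holds a state_dict
# compatible with the QNetwork class sketched earlier in this README.
import torch

policy_net = QNetwork()  # hypothetical architecture from the sketch above
policy_net.load_state_dict(torch.load("cartpole_dqn.pt", map_location="cpu"))
policy_net.eval()
```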
Summary
This project demonstrates:
- Function approximation in reinforcement learning
- Experience replay
- Target networks for stability
- Deep reinforcement learning fundamentals
It represents a transition from tabular RL to deep RL.
