Build software better, together

VachanVY / Reinforcement-Learning

PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research papers.

reinforcement-learning deep-reinforcement-learning pytorch artificial-intelligence dqn policy-gradient deep-deterministic-policy-gradient ddpg-algorithm proximal-policy-optimization actor-critic-algorithm dqn-pytorch rl-book sutton-barto-book policy-gradient-with-baseline actor-critic-pytorch soft-actor-critic-continuous ppo-algorithm reinforcement-learning-an-introduction

Updated Aug 14, 2025
Python

amin-sharifi-github / quant-rl-trading-agent

Star

End-to-end RL trading framework with PPO agent, self-attention neural network, custom Gym environment, and advanced backtesting.

reinforcement-learning ai algotrading reinforcement-learning-algorithms trading-algorithms quantitative-finance attention-mechanism quantitative-trading backtesting trading-systems gym-environment reinforcement-learning-agent financial-machine-learning quantitative-research market-simulation stable-baselines3 ppo-algorithm

Updated Aug 6, 2025
Python

negarhonarvar / DeepReinforcementLearning

Star

A Complete Collection of Deep RL Famous Algorithms implemented in Gymnasium most Popular environments

dqn boltzmann-exploration sarsa lunar-lander cartpole-v1 d3qn swimmer softmax-exploration drl-algorithms ppo-algorithm gymnasium-environment

Updated Apr 13, 2025
Python

Ruchit-Gaurh / AI-Traffic-Management-System

Star

🚦 Next-generation AI Traffic Management System with real-time computer vision, reinforcement learning optimization, emergency vehicle detection, and immersive 3D visualization

Updated Oct 14, 2025
Python

RongzheZhao2R2-lab / Implementing-Core-LLM-Algorithms-from-Scratch

Star

This repository is dedicated to implementing algorithms "From Scratch". It goes beyond simple API calls, diving deep into the underlying logic of everything from basic training to cutting-edge techniques like DeepSeek-R1.

moe knowledge-distillation multimodal-learning alignment-algorithm rag mixture-of-experts rlhf ppo-algorithm grpo

Updated Nov 26, 2025
Python

zxy-tech / ppo-for-S-P-500-trading-strategy

Star

This is a project for PPO S&P 500 trading

time-series-forecasting stockprediction stocktrader ppo-algorithm

Updated Mar 10, 2025
Python

green-hat-001 / NASA-Space-Apps-Commercialising-LEO-by-OptimAI

Star

2D orbital rocket sim with PPO in PyTorch. Models thrust, drag, gravity, fuel; agent learns efficient ascent. Includes telemetry & visualization

ai python3 rocketry ppo-algorithm

Updated Dec 23, 2025
Python

unaizaahmedk / Balancing-Inverted-Pendulum-using-RL

Star

Reinforcement learning–based controller for balancing an inverted pendulum using Proximal Policy Optimization (PPO). Supports configurable mass, length, and gravity settings (Earth, lunar, microgravity) with automated training logs, reward visualization, and performance analysis.

reinforcement-learning openai-gym reinforcement-learning-algorithms inverted-pendulum ppo-algorithm

Updated Mar 3, 2026
Python

omerjakoby / MARIO-RL-PPO

Star

This repository implements a Proximal Policy Optimization (PPO) agent that learns to play Super Mario Bros using TensorFlow/Keras and OpenAI Gym. Features CNNs for vision, Actor-Critic architecture, and parallel environments. Train your own Mario master or run a pre-trained one!

machine-learning tensorflow keras openai-gym cnn actor-critic mario-game proximal-policy-optimization ppo reinforcement-learning-agent ppo-algorithm

Updated Dec 12, 2025
Python

MarGo-20 / isaaclab-anymal-locomotion

Star

🐾 Implement Proximal Policy Optimization (PPO) for quadruped locomotion, achieving 96% performance of RSL-RL with a custom solution for enhanced robot control.

ppo anymal isaacsim isaac-sim locomation legged-locomotion ppo-algorithm isaac-lab isaaclab

Updated Mar 18, 2026
Python

mturan33 / isaaclab-anymal-locomotion

Star

A legged locomotion project

ppo anymal isaacsim isaac-sim locomation legged-locomotion ppo-algorithm isaac-lab isaaclab anymal-c

Updated Nov 29, 2025
Python

Devanik21 / general-gamer-ai-lite

Star

A specialized Reinforcement Learning (RL) project focused on multi-task mastery across 10 distinct gaming environments. General-Gamer-AI-Lite implements a lightweight multi-task agent designed to learn shared representations and transfer knowledge between varied game mechanics, from classic arcade challenges to strategic grid worlds.

reinforcement-learning deep-reinforcement-learning game-theory multi-task-learning multi-agent-rl curiosity-driven-exploration ppo-algorithm hierarchical-rl self-play-rl game-environment-simulation

Updated Jan 26, 2026
Python

mafaldaaires / Reinforcement-Learning

Star

Stable Baselines3

gymnasium a2c-algorithm car-racing-environment ppo-algorithm

Updated Dec 26, 2023
Python

Anca-Mt / TrackmaniaRL-AI

Star

AI agents for Trackmania using the TMRL package. Implemented DDPG, PPO, and used two SAC algorithms (with one or two critics) to train cars to navigate custom-built tracks.

python ai reinforcement-learning-algorithms game-ai ddpg-algorithm ppo-algorithm sac-algorithm tmrl tmrl-package modern-game-ai