معرفی شرکت ها

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

توضیحات

Modular Single-file Reinfocement Learning Algorithms Library

ویژگی	مقدار
سیستم عامل	-
نام فایل	abcdrl-0.2.0a4
نام	abcdrl
نسخه کتابخانه	0.2.0a4
نگهدارنده	['Adam Zhao']
ایمیل نگهدارنده	['pazyx728@gmail.com']
نویسنده	Adam Zhao
ایمیل نویسنده	pazyx728@gmail.com
آدرس صفحه اصلی	https://abcdrl.xyz/
آدرس اینترنتی	https://pypi.org/project/abcdrl/
مجوز	MIT

# **abcdRL** (Implement a RL algorithm in four simple steps) English | [简体中文](./README.cn.md) [![license](https://img.shields.io/pypi/l/abcdrl)](https://github.com/sdpkjc/abcdrl) [![pytest](https://github.com/sdpkjc/abcdrl/actions/workflows/test.yml/badge.svg)](https://github.com/sdpkjc/abcdrl/actions/workflows/test.yml) [![pre-commit](https://github.com/sdpkjc/abcdrl/actions/workflows/pre-commit.yml/badge.svg)](https://github.com/sdpkjc/abcdrl/actions/workflows/pre-commit.yml) [![pypi](https://img.shields.io/pypi/v/abcdrl)](https://pypi.org/project/abcdrl) [![docker autobuild](https://img.shields.io/docker/cloud/build/sdpkjc/abcdrl)](https://hub.docker.com/r/sdpkjc/abcdrl/) [![docs](https://img.shields.io/github/deployments/sdpkjc/abcdrl/Production?label=docs&logo=vercel)](https://docs.abcdrl.xyz/) [![Gitpod ready-to-code](https://img.shields.io/badge/Gitpod-ready--to--code-908a85?logo=gitpod)](https://gitpod.io/#https://github.com/sdpkjc/abcdrl) [![benchmark](https://img.shields.io/badge/Weights%20&%20Biases-benchmark-FFBE00?logo=weightsandbiases)](https://report.abcdrl.xyz/) [![mirror repo](https://img.shields.io/badge/Gitee-mirror%20repo-black?style=flat&labelColor=C71D23&logo=gitee)](https://gitee.com/sdpkjc/abcdrl/) [![Checked with mypy](https://img.shields.io/badge/mypy-checked-blue)](http://mypy-lang.org/) [![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black) [![Imports: isort](https://img.shields.io/badge/%20imports-isort-%231674b1?style=flat&labelColor=ef8336)](https://pycqa.github.io/isort/) [![python versions](https://img.shields.io/pypi/pyversions/abcdrl)](https://pypi.org/project/abcdrl) abcdRL is a **Modular Single-file Reinforcement Learning Algorithms Library** that provides modular design without strict and clean single-file implementation. <img src="https://abcdrl.xyz/logo/adam.svg" width="300"/> *When reading the code, understand the full implementation details of the algorithm in the single file quickly; When modifying the algorithm, benefiting from a lightweight modular design, only need to focus on a small number of modules.* > abcdRL mainly references the single-file design philosophy of [vwxyzjn/cleanrl](https://github.com/vwxyzjn/cleanrl/) and the module design of [PaddlePaddle/PARL](https://github.com/PaddlePaddle/PARL/). ***Documentation ➡️ [docs.abcdrl.xyz](https://abcdrl.xyz)*** ***Roadmap🗺️ [#57](https://github.com/sdpkjc/abcdrl/issues/57)*** ## 🚀 Quickstart Open the project in Gitpod🌐 and start coding immediately. [![Open in Gitpod](https://gitpod.io/button/open-in-gitpod.svg)](https://gitpod.io/#https://github.com/sdpkjc/abcdrl) Using Docker📦: ```bash # 0. Prerequisites: Docker & Nvidia Drive & NVIDIA Container Toolkit # 1. Run DQN algorithm docker run --rm --gpus all sdpkjc/abcdrl python abcdrl/dqn.py ``` ***[For detailed installation instructions 👀](https://docs.abcdrl.xyz/install/)*** ## 🐼 Features - 👨‍👩‍👧‍👦 Unified code structure - 📄 Single-file implementation - 🐷 Low code reuse - 📐 Minimizing code differences - 📈 Tensorboard & Wandb support - 🛤 PEP8(code style) & PEP526(type hint) compliant ## 🗽 Design Philosophy - "Copy📋", ~~not "Inheritance🧬"~~ - "Single-file📜", ~~not "Multi-file📚"~~ - "Features reuse🛠", ~~not "Algorithms reuse🖨"~~ - "Unified logic🤖", ~~not "Unified interface🔌"~~ ## ✅ Implemented Algorithms ***Weights & Biases Benchmark Report ➡️ [report.abcdrl.xyz](https://report.abcdrl.xyz)*** - [Deep Q Network (DQN)](https://doi.org/10.1038/nature14236) - [Deep Deterministic Policy Gradient (DDPG)](http://arxiv.org/abs/1509.02971) - [Twin Delayed Deep Deterministic Policy Gradient (TD3)](http://arxiv.org/abs/1802.09477) - [Soft Actor-Critic (SAC)](http://arxiv.org/abs/1801.01290) - [Proximal Policy Optimization (PPO)](http://arxiv.org/abs/1802.09477) --- - [Double Deep Q Network (DDQN)](http://arxiv.org/abs/1509.06461) - [Prioritized Deep Q Network (PDQN)](http://arxiv.org/abs/1511.05952) ## Citing abcdRL ```bibtex @misc{zhao_abcdrl_2022, author = {Yanxiao, Zhao}, month = {12}, title = {{abcdRL: Modular Single-file Reinforcement Learning Algorithms Library}}, url = {https://github.com/sdpkjc/abcdrl}, year = {2022} } ```

نیازمندی

مقدار	نام
>=2.10.0,<3.0.0	tensorboard
>=3.0	opencv-python
>=1.0.0	moviepy
==2.1.3.dev8	pygame
>=0.27.0,<0.28.0	gymnasium
>=0.13.7,<0.14.0	wandb
>=0.1.0,<0.2.0	combine_signatures
>=0.3.6,<0.4.0	dill
>=0.5.0,<0.6.0	fire
>=1.12.1,<2.0.0	torch
>=0.1.0,<1.0	shimmy[atari]
>=0.4.2,<0.5.0	autorom[accept-rom-license]
>=2.3.1.post1	mujoco
>=2.1,<2.2	mujoco-py
>=2.14.1	imageio
==2.1.6	free-mujoco-py

زبان مورد نیاز

مقدار	نام
>=3.8,<3.11	Python

نحوه نصب

نصب پکیج whl abcdrl-0.2.0a4:

pip install abcdrl-0.2.0a4.whl

نصب پکیج tar.gz abcdrl-0.2.0a4:

pip install abcdrl-0.2.0a4.tar.gz