معرفی شرکت ها

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

توضیحات

Tools for single-embodiment, multiple-task, Reinforcement Learning

ویژگی	مقدار
سیستم عامل	-
نام فایل	dm-robotics-agentflow-0.5.0
نام	dm-robotics-agentflow
نسخه کتابخانه	0.5.0
نگهدارنده	[]
ایمیل نگهدارنده	[]
نویسنده	DeepMind
ایمیل نویسنده	-
آدرس صفحه اصلی	https://github.com/deepmind/dm_robotics/tree/main/py/agentflow
آدرس اینترنتی	https://pypi.org/project/dm-robotics-agentflow/
مجوز	Apache 2.0

# AgentFlow: A Modular Toolkit for Scalable RL Research  ## Overview `AgentFlow` is a library for composing Reinforcement-Learning agents. The core features that AgentFlow provides are: 1. tools for slicing, transforming, and composing *specs* 2. tools for encapsulating and composing RL-tasks. Unlike the standard RL setup, which assumes a single environment and an agent, `AgentFlow` is designed for the single-embodiment, multiple-task regime. This was motivated by the robotics use-case, which frequently requires training RL modules for various skills, and then composing them (possibly with non-learned controllers too). Instead of having to implement a separate RL environment for each skill and combine them ad hoc, with `AgentFlow` you can define one or more `SubTasks` which *modify* a timestep from a single top-level environment, e.g. adding observations and defining rewards, or isolating a particular sub-system of the environment, such as a robot arm. You then *compose* SubTasks with regular RL-agents to form modules, and use a set of graph-building operators to define the flow of these modules over time (hence the name `AgentFlow`). The graph-building step is entirely optional, and is intended only for use-cases that require something like a (possibly learnable, possibly stochastic) state-machine.  ### [Components](docs/components.md) ### [Control Flow](docs/control_flow.md) ### [Examples](docs/examples.md)

نیازمندی

مقدار	نام
-	dm-robotics-transformations
-	dm-robotics-geometry
>=1.16.0	numpy
==1.0.9	dm-control
==2.3.1.post1	mujoco
<=4.6.0.66,>=3.4.0	opencv-python
>=20.3.0	attrs
>=1.2.4	pydot
>=3.7.4	typing-extensions

زبان مورد نیاز

مقدار	نام
>=3.7, <3.11	Python

نحوه نصب

نصب پکیج whl dm-robotics-agentflow-0.5.0:

pip install dm-robotics-agentflow-0.5.0.whl

نصب پکیج tar.gz dm-robotics-agentflow-0.5.0:

pip install dm-robotics-agentflow-0.5.0.tar.gz