• windows系统下安装最新版gym的安装方法(此时最新版的gym为0.24.0,gym==0.24.0)


    当前gym的最新版本为0.24.0,本篇介绍对gym[atari]==0.24.0进行安装。

    使用pip安装:

    pip install gym[atari]

    可以看到此时安装的是ale_py而不是atari_py库。

    运行测试代码:

    import gym
    env=gym.make("Pong-v0")
    print(env)
    env.reset()

    结果报错:

    f'We\'re Unable to find the game "{self._game}". Note: Gym no longer distributes ROMs. '
    gym.error.Error: We're Unable to find the game "Pong". Note: Gym no longer distributes ROMs. If you own a license to use the necessary ROMs for research purposes you can download them via `pip install gym[accept-rom-license]`. Otherwise, you should try importing "Pong" via the command `ale-import-roms`. If you believe this is a mistake perhaps your copy of "Pong" is unsupported. To check if this is the case try providing the environment variable `PYTHONWARNINGS=default::ImportWarning:ale_py.roms`. For more information see: https://github.com/mgbellemare/Arcade-Learning-Environment#rom-management

    根据报错的信息我们知道是因为没有安装atari游戏对应的ROMs的bin文件,我们可以选择手动下载后添加到ale_py库中,但是根据报错信息的提醒我们知道可以使用命令:pip install gym[accept-rom-license] 来进行安装。

    执行:

    pip install gym[accept-rom-license]

    此时再运行,结果:

     证明问题已解决,gym[atari]已经成功安装。

    测试代码:

    import time
    import gym
    
    env = gym.make('BreakoutNoFrameskip-v4')
    print("Observation Space: ", env.observation_space)
    print("Action Space       ", env.action_space)
    
    obs = env.reset()
    for i in range(1000):
        env.render()
        action = env.action_space.sample()
        obs, reward, done, info = env.step(action)
        time.sleep(0.01)
    env.close()

    发现报错:

     

    查看报错的代码地址:lib\site-packages\gym\core.py

        def render(self, mode="human"):
            """Renders the environment.
    
            A set of supported modes varies per environment. (And some
            third-party environments may not support rendering at all.)
            By convention, if mode is:
    
            - human: render to the current display or terminal and
              return nothing. Usually for human consumption.
            - rgb_array: Return a numpy.ndarray with shape (x, y, 3),
              representing RGB values for an x-by-y pixel image, suitable
              for turning into a video.
            - ansi: Return a string (str) or StringIO.StringIO containing a
              terminal-style text representation. The text can include newlines
              and ANSI escape sequences (e.g. for colors).
    
            Note:
                Make sure that your class's metadata 'render_modes' key includes
                the list of supported modes. It's recommended to call super()
                in implementations to use the functionality of this method.
    
            Example:
                >>> import numpy as np
                >>> class MyEnv(Env):
                ...    metadata = {'render_modes': ['human', 'rgb_array']}
                ...
                ...    def render(self, mode='human'):
                ...        if mode == 'rgb_array':
                ...            return np.array(...) # return RGB frame suitable for video
                ...        elif mode == 'human':
                ...            ... # pop up a window and render
                ...        else:
                ...            super().render(mode=mode) # just raise an exception
    
            Args:
                mode: the mode to render with, valid modes are `env.metadata["render_modes"]`
            """
            raise NotImplementedError
    View Code

    可以看到由于gym的版本升级曾经多个版本的gym的测试代码已经不能继续使用了,于是我们进行修改:

     再次报错:

     

     报错信息的全部:

    "render(mode='human') is deprecated. Please supply `render_mode` when "
    gym.error.Error: render(mode='human') is deprecated. Please supply `render_mode` when constructing your environment, e.g., gym.make(ID, render_mode='human'). The new `render_mode` keyword argument supports DPI scaling, audio, and native framerates.

    从上面的报错信息可以知道gym设置了一个绘图显示模式`render_mode,如果想使用绘图显示的功能就需要在创建环境的时候进行设置,形式如gym.make(ID, render_mode='human')。

    修改后的代码:

    import time
    import gym
    
    env = gym.make('BreakoutNoFrameskip-v4', render_mode='human')
    print("Observation Space: ", env.observation_space)
    print("Action Space       ", env.action_space)
    
    obs = env.reset()
    for i in range(1000):
        action = env.action_space.sample()
        obs, reward, done, info = env.step(action)
        time.sleep(0.01)
    env.close()

     

    可以正常运行并绘制显示图像:

    通过对上面的代码测试,可以知道在设置绘图显示模式`render_mode时进行绘图的步骤为 env.reset() 和 env.step(action),在执行这两步操作时会自动进行绘图显示。

     

     

    查看位于lib\site-packages\gym\envs\atari\environment.py中的代码:

    def render(self, mode: str) -> Any:
    """
    Render is not supported by ALE. We use a paradigm similar to
    Gym3 which allows you to specify `render_mode` during construction.

    For example,
    gym.make("ale-py:Pong-v0", render_mode="human")
    will display the ALE and maintain the proper interval to match the
    FPS target set by the ROM.
    """
    if mode == "rgb_array":
    return self.ale.getScreenRGB()
    elif mode == "human":
    raise error.Error(
    (
    "render(mode='human') is deprecated. Please supply `render_mode` when "
    "constructing your environment, e.g., gym.make(ID, render_mode='human'). "
    "The new `render_mode` keyword argument supports DPI scaling, "
    "audio, and native framerates."
    )
    )
    else:
    raise error.Error(
    f"Invalid render mode `{mode}`. Supported modes: `rgb_array`."
    )

    可以知道在新版本的gym中一共存在两个环境模式,一个是'human'模式,一个是`rgb_array`模式。如果不在创建环境时通过gym.make(ID, render_mode='human')设置绘图显示模式的话默认就是`rgb_array`模式。

    虽然新版的gym的改动不大但是对于习惯使用旧版本的gym的人来说还是有些不方便的,不过说一个敲黑板的事情,那就是如果你要发布你的依赖gym环境的强化学习代码一定要把gym的版本号和ale_py版本号或atari_py版本号给标注出来,否则不一定gym未来版本一升级你以前的代码就会运行报错,挨个试验旧版本的gym是个很困难的工作,所以发布代码时标注好你所使用的gym版本号是很有必要的。

    =================================================

  • 相关阅读:
    2020.12.7
    IDEA修改代码后不用重新启动项目即可刷新
    期中测试人口普查登记题目
    Android去掉标题头
    Android限制输入框内容
    Android:setOnItemClickListener cannot be used with a spinner报错
    Android修改app图标
    将外部sqlite3数据库导入到Android项目中
    IDEA个人常用快捷键
    css选择器
  • 原文地址:https://www.cnblogs.com/devilmaycry812839668/p/16328596.html
Copyright © 2020-2023  润新知