ML-Agents + PyTorch遇到的问题

参考：https://www.cnblogs.com/gentlesunshine/p/12452360.html

虽然是之前的 ML-Agents v0.15.0，基于TensorFlow的，但是安装环境的道理都差不多

一、PyTorch、CUDA、cuDNN的版本问题

按着教程装了一遍，训练的时候出现这个：

意思是PyTorch要1.6.0以上的版本，但是CUDA10.0最高版本也只是支持到PyTorch1.1.0，所以要重新安装CUDA和cuDNN。

参考：https://blog.csdn.net/HaoZiHuang/article/details/107878351

到这里我眉头一皱，感觉问题大了呀，于是从来不看官方文档的我去瞄了一喵

官方文档：https://github.com/Unity-Technologies/ml-agents/blob/release_12_docs/docs/localized/zh-CN/docs/Readme.md

还是官方文档好

下了一手CUDA11.2

再来一手最新的cuDNN（没办法，虽然它是写的支持CUDA11.1，但我没得选）

使用PyCharm安装PyTorch（失败了）

出错了

去官网看看

这下总算好了吧

pip install C:\Users\liyuanhang\Desktop\torch-1.7.1+cu110-cp38-cp38-win_amd64.whl

验证一下Pytorch是否可以使用GPU和CUDA

意思是我电脑没得NVIDIA GPU?

吓得我赶紧看了看

真没有

参考：https://blog.csdn.net/weixin_41194129/article/details/107475509

那算了，换台电脑吧

家里有台旧电脑，配置很辣鸡，但是显卡是NVIDIA，所以装一个试试

老样子python，cuda，cudnn，Anaconda，pytorch

然后报错

参考：https://blog.csdn.net/weixin_42868552/article/details/107990522

参考：https://blog.csdn.net/hinson0710/article/details/107656971

但是vc的库我装好了也没用，把cafffe2_dectron_ops_gpu删了也没用，其他文件还是会报错

难道版本还是不对？

看了所有的地方都没办法

然后想一想，ProgramData这个文件夹好像是默认的“只读”和“隐藏”状态

改了之后还是没好

然后回到自己的电脑，装了没有cuda的pytorch

跟着官方文档走着

运行mlagents-learn出现这个

mlagents.trainers.exception.UnityTrainerException: Previous data from this run ID was found. Either specify a new run ID, use --resume to resume this run, or use the --force parameter to overwrite existing data.

引发UnityTraineException(Mlagents.trainers.exception.UnityTrainerException：找到此运行ID中的以前数据。指定新的运行ID，使用--Resume恢复此运行，或者使用--force参数覆盖现有数据。

运行mlagents-learn --resume

出现这个

mlagents_envs.exception.UnityTimeOutException: The Unity environment took too long to respond. Make sure that :
The environment does not need user interaction to launch
The Agents' Behavior Parameters > Behavior Type is set to "Default"
The environment and the Python interface have compatible versions.