code_1.py 192 bytes
Committed by ToTensor
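alpha = 0.6  # learning rate
gamma = 0.75  # reward discount factor
episodes = 500  # number of game rounds (episodes)
r_history = []  # history of reward values, one entry per episode
j_history = []  # history of step counts, one entry per episode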
for i in range(episodes):
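The listing stops at the loop header, so the body of the training loop is not shown here. As a rough illustration of how a learning rate, a discount factor, and per-episode reward and step histories are commonly used together, the sketch below runs tabular Q-learning on a made-up five-state corridor. Everything beyond the five names in the listing is an assumption for illustration: the function `run_q_learning`, the corridor environment, the `epsilon` exploration rate, the 100-step cap, and the choice of Q-learning itself are not taken from the original file.

```python
import random


def run_q_learning(alpha=0.6, gamma=0.75, episodes=500, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 5-state corridor (illustration only)."""
    rng = random.Random(seed)
    n_states, n_actions, goal = 5, 2, 4  # actions: 0 = left, 1 = right
    Q = [[0.0] * n_actions for _ in range(n_states)]
    r_history = []  # total reward collected in each episode
    j_history = []  # number of steps taken in each episode

    for i in range(episodes):
        s, total_r, j = 0, 0.0, 0
        while s != goal and j < 100:  # step cap is an assumption
            # Epsilon-greedy action choice; ties are broken at random.
            if rng.random() < epsilon or Q[s][0] == Q[s][1]:
                a = rng.randrange(n_actions)
            else:
                a = 0 if Q[s][0] > Q[s][1] else 1

            # Move left (bounded at state 0) or right; reward 1 on reaching the goal.
            s_next = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s_next == goal else 0.0

            # Q-learning update: alpha scales the step, gamma discounts the future.
            Q[s][a] += alpha * (r + gamma * max(Q[s_next]) - Q[s][a])

            s, total_r, j = s_next, total_r + r, j + 1

        r_history.append(total_r)
        j_history.append(j)

    return Q, r_history, j_history


if __name__ == "__main__":
    Q, r_history, j_history = run_q_learning()
    print("steps in last 5 episodes:", j_history[-5:])
```

In this sketch, a larger `alpha` makes each update overwrite more of the old estimate, while `gamma` controls how much the value of the next state counts relative to the immediate reward. Plotting `j_history` against the episode index is a simple way to check whether the number of steps per episode falls as training proceeds.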