Commit a06be6de, authored by u010280923

add ppo model

Parent 765f307b
......@@ -263,6 +263,8 @@ if __name__ == "__main__":
trainer.strategy.config["zero_optimization"]["allgather_bucket_size"] = args.ds_bucket_mb * 1000 * 1000
trainer.strategy.config["zero_optimization"]["reduce_bucket_size"] = args.ds_bucket_mb * 1000 * 1000
model.deepspeed_offload()
seq = torch.randint(0, 50277, (1, 100))
model(seq)
......
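
Note on the bucket-size lines in the hunk above: both ZeRO bucket settings are derived from `args.ds_bucket_mb` with a decimal factor of 1000 * 1000 (not 1024 * 1024). The sketch below reproduces that arithmetic on a plain dictionary so the resulting values can be checked without a trainer. The helper name `apply_bucket_size` and the sample config are hypothetical and not part of this commit; only the two config keys and the multiplication come from the diff.

```python
def apply_bucket_size(config: dict, bucket_mb: int) -> dict:
    """Set both ZeRO bucket sizes from a value given in decimal 'megabytes'.

    Hypothetical helper: mirrors the two assignments in the diff, but
    operates on a plain dict instead of trainer.strategy.config.
    """
    bucket_size = bucket_mb * 1000 * 1000  # same factor as in the diff
    zero = config.setdefault("zero_optimization", {})
    zero["allgather_bucket_size"] = bucket_size
    zero["reduce_bucket_size"] = bucket_size
    return config


# Sample config (assumed for illustration); existing keys are left untouched.
config = apply_bucket_size({"zero_optimization": {"stage": 2}}, bucket_mb=200)
print(config["zero_optimization"])
```

With `bucket_mb=200`, both entries become 200,000,000. Keeping the conversion in one place avoids the two settings drifting apart if the factor is ever changed.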