提交 7f22d0d0 编写于 作者: F Flowingsun007

refine readme of resnet50

上级 d5b7f9d7
......@@ -255,20 +255,20 @@ Oneflow保持了和Mxnet一致的初始学习率以及衰减方式。具体来
#### Optimizer
| oneflow | nvidia | tricks |
| -------- | -------- | ----------------------------- |
| momentum | momentum | Nesterov Accelerated Gradient |
| oneflow | nvidia |
| -------- | -------- |
| momentum | momentum |
#### Weight Initializer
| variable | oneflow | nvidia |
| ----------- | -------------------- | ---------------------------- |
| conv weight | variance_scaling[^1] | Xavier( 'gaussian', 'in', 2) |
| conv bias | NA | NA |
| fc weight | random_normal | Xavier( 'gaussian', 'in', 2) |
| fc bias | 0 | NA |
| bn gamma | 1 | 1 |
| bn beta | 0 | 0 |
| variable | oneflow | nvidia |
| ----------- | ------------- | ---------------------------- |
| conv weight | random_normal | Xavier( 'gaussian', 'in', 2) |
| conv bias | NA | NA |
| fc weight | random_normal | Xavier( 'gaussian', 'in', 2) |
| fc bias | 0 | NA |
| bn gamma | 1 | 1 |
| bn beta | 0 | 0 |
#### Weight Decay
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册