Created by: chajchaj
The dropout_implementation parameter accepts either 'downgrade_in_infer' or 'upscale_in_train', and the default is 'downgrade_in_infer'. With the default, the API does not behave the same way as PyTorch; set dropout_implementation to 'upscale_in_train' to match PyTorch.
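To make the difference concrete, here is a minimal pure-Python sketch of the arithmetic behind the two modes. It is an illustration, not the framework's implementation: the function name `dropout_sketch` and its `p`, `is_test`, and `rng` arguments are hypothetical, and inputs are assumed to be flat lists of floats.

```python
import random


def dropout_sketch(x, p, is_test, dropout_implementation="downgrade_in_infer", rng=None):
    """Illustrative dropout on a flat list of floats (hypothetical helper)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    if dropout_implementation not in ("downgrade_in_infer", "upscale_in_train"):
        raise ValueError("unknown dropout_implementation")
    rng = rng or random.Random()

    if is_test:
        if dropout_implementation == "downgrade_in_infer":
            # Inference scales activations down by the keep probability.
            return [v * (1.0 - p) for v in x]
        # upscale_in_train: inference is the identity, as in PyTorch eval mode.
        return list(x)

    # Training: each element is dropped independently with probability p.
    mask = [0.0 if rng.random() < p else 1.0 for _ in x]
    if dropout_implementation == "upscale_in_train":
        # Kept elements are scaled up by 1 / (1 - p), as in PyTorch train mode.
        return [v * m / (1.0 - p) for v, m in zip(x, mask)]
    # downgrade_in_infer: kept elements pass through unscaled during training.
    return [v * m for v, m in zip(x, mask)]


if __name__ == "__main__":
    x = [1.0, 2.0]
    print(dropout_sketch(x, 0.5, True, "downgrade_in_infer"))  # [0.5, 1.0]
    print(dropout_sketch(x, 0.5, True, "upscale_in_train"))    # [1.0, 2.0]
```

Under these assumptions, the two modes differ only in where the (1 - p) factor is applied: at inference time for 'downgrade_in_infer', and at training time for 'upscale_in_train'. A model ported from PyTorch therefore produces outputs at a different scale unless the second mode is selected.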