Created by: zhiqiu
Related #21569 This PR changes the auto-generated pybind functions for dygraph operators, remove maps in function args to improve performance.
Train ptb for one epoch on v100 GPU place, before: 112.269 after: 107.502 time reduced: -4%
The generated pybind functions can be see here.