- 01 8月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "add gradient register" * "make some enhance" * "better format" * "fix typo" * "fix reuse" * "fix get expected kernel" * "change the mkldnn code" * "fix mkldnn" * "fix mkldnn failed test" * "add comment"
-
- 03 5月, 2018 1 次提交
-
-
由 chengduo 提交于
* fix __shfl_down_sync_ of cross_entropy * use reduceSum * "fix ci"
-
- 30 4月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "re-commit " * "picked up" * "fix ci" * "fix pdb hang up issue in cuda 9"
-
- 24 4月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 10 4月, 2018 1 次提交
-
-
由 chengduo 提交于
* add cuda_device_functions.h * move reduceSum to elementwise_op_function.h
-
- 06 3月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 28 2月, 2018 1 次提交
-
-
由 xuwei06 提交于
When the second argument contains batch dimension, the axis should be 0. Also makes elementwise ops more tolerant at handling tensors with trailing singular dimensions.
-
- 26 2月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 24 2月, 2018 2 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 23 2月, 2018 2 次提交
- 13 2月, 2018 1 次提交
-
-
由 xuwei06 提交于
And some minor fixes on comments.
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 03 2月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 02 2月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 22 1月, 2018 1 次提交
-
-
由 Yang Yu 提交于
-
- 19 1月, 2018 1 次提交
-
-
由 Yang Yu 提交于
-
- 17 1月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 16 1月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 15 1月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 26 12月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 25 12月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 19 12月, 2017 2 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 16 12月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 06 12月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 05 12月, 2017 4 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 04 12月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 01 12月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Fix Proformance problem of enforce * Fix missing `;` in code * Fix CI
-
- 20 10月, 2017 1 次提交
-
-
由 qijun 提交于
-
- 27 9月, 2017 3 次提交
-
-
由 qiaolongfei 提交于
-
由 qiaolongfei 提交于
-
由 Qiao Longfei 提交于
* init Infershape * add static InferShape interface * refactor add-op infershape * add AttrReader * add all maker's infershape * add all InferShape * add python infer api * add VarDesc interface * add python VarDesc and OpDesc interface * update python code * use infershape function to do shape inference * clean code * do not use pointer * refine code of op_proto_maker * add get_dims to VarDesc * refine the code * remove the dependency from operator to op registry * remove OpProtoAndCheckerMaker from operator * restore complete_add_op * add shape_infer_impl.h * code optimization * remove const return value * add fake BlockDesc class * optimize code * remove infer function in op_info * move InferShapeContextImpl to operator.h * optimize the interface of InferShapeContextBase * add temperary interface of new infershape * change add_op, clip_op, conv2d_op and activation_op * change all operators InferShape * fix SetDim * update cos_sim_op * update crop_op * update lookup_table_op * allocate tensor when call GetDim in InferShapeContext * update modified_huber_loss_op * update rowwise_add_op * update mean_op * update sequence_avg_pool_op * typo * remove old InferShape interface * can compile * fix or unit test * clean code * clean code * remove const before InferShapeContext * change InferenceContextBase to pointer * rename RunTime to Runtime, code clean
-