- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 03 12月, 2020 1 次提交
-
-
由 tangwei12 提交于
* fix gpu emb out of range Change-Id: I5794ac73bd634d5ea069a6fbbd914274b6d6b7bf * fix doc Change-Id: I5a3350b2930a9ab2f52116c192b087307faf8fdf
-
- 28 9月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
* enhance error messages of lookup_tale, merge_ids, data_norm * fix * fix error msg in .cu
-
- 01 9月, 2020 1 次提交
-
-
由 tangwei12 提交于
* add embedding 2.0 * add embedding support input int32
-
- 22 7月, 2020 1 次提交
-
-
由 donproc 提交于
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 29 11月, 2019 1 次提交
-
-
由 hong 提交于
* add_dygraph_execution_context * add dygraph infershape context and execution context; test=develop * fix imperative bug; test=develop * remove inputs outputs interface from execution context, because it have same function with inputNames; test=develop * remove tracer_test ctest; test=develop * fix split op bug; test=develop * fix unitests bug; test=develop * fix distribute test bug; test=develop * fix ngraph compile bug; test=develop * fix grad maker bug; test=develop * fix load op bugs; test=develop * fix operator.cc construct bug; test=develop * remove useless name find in operator; test=develop * add tracer_test; test=develop * fix concat, split bug; test=develop * remove tracer_test unitest; test=develop * fix attribute check bug; test=develop * add test code to fix converage; test=develop * remove useless code, change check backward input in engin; test=develop * unlock var type infer shape;test=develop * add ShareAllLoD api; test=develop * add dygraph infershape context unitest; test=develop * remove increase and decrease lod in dygraph; test=develop * addd override; test=develop * fix increase descrease lod; test=develop * fix paddle_enforce; test=develop * disable lod op dygraph check; test=develop * fix paddle enforce error; test=develop * add comment for op_registry and OperatorBase; test=develop * optimize the comment of op_registry; test=develop * fix format of comment; test=develop * fix format of comment; test=develop * optimize the format of comment; test=develop * optimize the format of the comment; test=develop * optimize comment of op_registry; test=develop
-
- 12 10月, 2019 1 次提交
-
-
由 Aurelius84 提交于
* enhance embedding error message test=develop * enforce .h error test=develop * fix unittest code test=develop * Fix fp16 dtype in embedding test=develop * add import warnings test=develop
-
- 24 9月, 2019 1 次提交
-
-
由 Aurelius84 提交于
* Remove constraint that last dimension is forced to be 1 by add lookup_table_v2 test=develop * modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop * Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop" This reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9. * move api into fluid.embedding test=develop * fix example code test=develop * move one_hot into fluid.one_hot * modify api.spec test=develop * fix loss shape test=develop
-
- 05 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* remove assert.h * change PADDLE_ASSERT_MSG to PADDLE_ENFORCE test=develop * fix tensorrt paddle_enforce test=develop
-
- 28 8月, 2019 1 次提交
-
-
由 tangwei12 提交于
* fix correctness of the communicator * fix a bug in send thread when sending var context is empty, test=develop * add lookup_table_prefetch_op and prefetch optimize, test=develop * remove remote prefetch GPU supported * word2vec force with CPU, test=develop * test dist remote lookup table force with CPU, test=develop
-
- 09 8月, 2019 1 次提交
-
-
由 Zhang Ting 提交于
* optimize error message, test=develop * optimize error message, test=develop
-
- 08 5月, 2019 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 28 3月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 30 1月, 2019 1 次提交
-
-
由 Yibing Liu 提交于
* Some improvements to support bert mixed precision training test=develop * Revert the cast in layer_norm test=develop
-
- 19 12月, 2018 1 次提交
-
-
由 JiabinYang 提交于
-
- 03 12月, 2018 1 次提交
-
-
由 Yibing Liu 提交于
-
- 29 11月, 2018 1 次提交
-
-
由 Qiao Longfei 提交于
test=develop
-
- 21 9月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 31 7月, 2018 2 次提交
- 27 7月, 2018 1 次提交
-
-
由 chengduo 提交于
* refine regularization for selected_rows * clean lookup_table * refine rpc_server_test * temporally disable rpc_server_test * fix rpc_server_test * add unit test
-
- 30 4月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "re-commit " * "picked up" * "fix ci" * "fix pdb hang up issue in cuda 9"
-
- 13 3月, 2018 2 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 12 3月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 09 3月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 08 2月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 31 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "Need to re-design LoD " * "add lod design" * "fix lod gpu ptr pointer" * "removed commented code" * "fix CI" * "remove set lod in pybind" * "fix style check" * "fix CI" * "fix long type template error" * "pybind reorder to use Place" * "fix ci" * "fix ci" * fix ci * "sperate as a new file" * "fix CI" * "fix ci" * small fix * "add test" * "fix adam op" * "fix lstmp op" * "fix adam op" * "follow comments" * "fix ci"
-
- 22 1月, 2018 2 次提交
- 26 12月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 25 12月, 2017 1 次提交
-
-
由 dzhwinter 提交于
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 07 11月, 2017 1 次提交
-
-
由 typhoonzero 提交于
-
- 06 11月, 2017 1 次提交
-
-
由 typhoonzero 提交于
-