- 25 2月, 2021 1 次提交
-
-
由 Leo Chen 提交于
refactor npu device manager (#31154)
-
- 23 2月, 2021 1 次提交
-
-
由 liym27 提交于
* [NPU] Support executor with NPU * Fix code according to reviews * Fix code * Add unittest for sub op npu
-
- 09 2月, 2021 1 次提交
-
-
由 Leo Chen 提交于
[feature] support npu allocator
-
- 08 2月, 2021 1 次提交
-
-
由 gongweibao 提交于
Destroy session first.
-
- 28 1月, 2021 1 次提交
-
-
由 Leo Chen 提交于
Dev/fix ascend string
-
- 27 1月, 2021 1 次提交
-
-
由 Leo Chen 提交于
fix compilation on ascend-20.1
-
- 21 1月, 2021 1 次提交
-
-
由 gongweibao 提交于
Add distribution supported
-
- 15 1月, 2021 2 次提交
-
-
由 gongweibao 提交于
Fix compilcation on CANN20.1 and older
-
由 hutuxian 提交于
-
- 14 1月, 2021 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 13 1月, 2021 2 次提交
- 12 1月, 2021 2 次提交
-
-
由 tangwei12 提交于
* rename sendrecv.proto to namespace paddle.distributed * split ps with distributed
-
由 Chengmo 提交于
* add save tensor support Co-authored-by: NseiriosPlus <tangwei12@baidu.com>
-
- 11 1月, 2021 1 次提交
-
-
由 AshburnLee 提交于
-
- 09 1月, 2021 1 次提交
-
-
由 pangyoki 提交于
* add view strategy on squeeze,unsqueeze,reshape,flatten * add squeeze unittest * add unittests * use View strategy as name rather than Reuse Allacation * fix view api doc * fix format * use core.ops when input of reshape2 is Tensor * fix test_cross_entropy_loss error because of reshape2 * delete selected_rows * change op_function * little change * solve HandleViewBetweenInputAndOutput
-
- 08 1月, 2021 3 次提交
-
-
由 Leo Chen 提交于
* fix dtype of ungenerated grad var * update ut * refine code * set default dtype * fix could_use_cudnn bug * remove debug code * re-implement * fix bug
-
由 Leo Chen 提交于
* change to tensor copy sync * change to tensor copy sync * make copy_to safe when use TensorCopy * refine code * add ut * add cudapinned garbagecollector * add testcase: cpu place -> cuda pinned place
-
由 Chengmo 提交于
* add tensor table
-
- 07 1月, 2021 1 次提交
-
-
由 123malin 提交于
* test=develop, add model_average and lookahead
-
- 06 1月, 2021 2 次提交
-
-
由 Leo Chen 提交于
* add dispenable input 'shape' for core.ops.reshape2 * add dispenable inputs for core.ops.reshape2/expand/slice * add ut
-
由 liym27 提交于
1. when slice_item is a slice: 1) the start of __getitem__ should be std::max(start, 0) if slice 2) the start of __getitem__ should be std::min(end, dim) 2. when slice_item is an integer, it should be in [-dim_len, dim_len) 3. Fix error message to use accurate data
-
- 05 1月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* add topo aware * resource.h * topo aware * format
-
- 04 1月, 2021 1 次提交
-
-
由 cc 提交于
* zero_copy_tensor supports int8_t
-
- 27 12月, 2020 1 次提交
-
- 26 12月, 2020 1 次提交
-
-
由 liuyuhui 提交于
-
- 24 12月, 2020 1 次提交
-
-
由 tangwei12 提交于
* oneps (3/4) Co-authored-by: NMrChengmo <cmchengmo@163.com> Co-authored-by: Nmalin10 <malin10@baidu.com> Co-authored-by: Nchengmo <chengmo@baidu.com>
-
- 23 12月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
* add heter box * add trainer, worker, wrapper... * format * for ci * format * remove boost get * boost & copyright * rename * rename * format * format * format Co-authored-by: Nyaoxuefeng6 <yaoxuefeng@baidu.com>
-
- 22 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* fix fleet for multi-stream * fix memcpy for ncclid * use sync to solve move operation
-
- 16 12月, 2020 2 次提交
- 15 12月, 2020 2 次提交
-
-
由 AshburnLee 提交于
-
由 Wilber 提交于
-
- 09 12月, 2020 2 次提交
- 05 12月, 2020 1 次提交
-
-
由 myq406450149 提交于
* enhance array_to_lod_tensor_op lod_tensor_to_array_op errors information. test=develop * fix format. test=develop * format fix. test=develop * add lod_rank_table. test=develop * fix format. test=develop * fix doc info. test=develop * fix np error * add unbind dygraph api. test=develop * fix unbind doc.test=develop
-
- 04 12月, 2020 2 次提交
-
-
由 liym27 提交于
[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267)
-
由 Chen Weihang 提交于
* basic impl of type promote * add comment & another testcase * fix complex bugs & support python op promote type * fix failed unittests & polish code * add unittest for coverage * change to only promote complex type * polish code details * polish several comments
-
- 02 12月, 2020 1 次提交
-
-
由 Zhen Wang 提交于
* add the weight decay func for the momentum op * Add the multi_precision function in Momentum Optimizer. * Make sure that the initial value of master weights are same with the fp16 weights. * add static loss scaling. * add the rescale_grad function in the pure fp16 training. * use the original momentum updating method. * Polish some codes, such as variable names. * add docstring for apis. * update the var creation details of _create_master_weight. * not modify codes about imperative momentum updating. * Fix the error of test_dist_sparse_tensor_load_momentum UT. * add unit test for multi precision fp16 training. * add more unit tests for CI. * Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT. * For CI Coverage Checking.
-
- 01 12月, 2020 1 次提交
-
-
由 chentianyu03 提交于
* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types * add test cases for complex elementwise, matmul and getitem unittest * add test cases for complex types * add test cases for complex matmul unittest
-