- 19 1月, 2021 3 次提交
-
-
由 pangyoki 提交于
Cherry pick PR #30520 . Fix error message of Inplace strategy.
-
由 Leo Chen 提交于
[cherry-pick] support layer_norm fp16 in dygraph amp (#30430)
-
由 Zhou Wei 提交于
cherry-pick #30553 fix bug of multicard grad ncclAllReduce, the gradient accumulater of parameters should be keep order, otherwsie, it will influence multicard ncclAllReduce of grad.
-
- 18 1月, 2021 1 次提交
-
-
由 pangyoki 提交于
Cherry-pick PR 30103. Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) (#30496) * add view strategy on squeeze,unsqueeze,reshape,flatten * add squeeze unittest * add unittests * use View strategy as name rather than Reuse Allacation * fix view api doc * fix format * use core.ops when input of reshape2 is Tensor * fix test_cross_entropy_loss error because of reshape2 * fix test_cross_entropy_loss error because of reshape2 * add inplace strategy * add elementwise_add sub * let backward op not use inplace * grad op do not use inplace * fix memory increase error and add leaf error message * delete selected_rows * change op_function * little change * solve HandleViewBetweenInputAndOutput * add unittest and leaf error message * merge view error * optimize op_function_generator format and support sum inplace op * fix format of basic_engine * fix format for framework * little change of variable wrapper * add reshape, squeeze, unsqueeze, scatter api * add relu elu tanh softmax inplace api * fix test_squeeze_op unittest * fix test_relu_op unittest * fix comment problems * delete sample code of inplace api * add reference of grad_pending_nodes in basic_engine * fix unittest name * add inplace apis into wlist * fix error message * add PADDLE_ENFORCE for set grad op twice * fix head file error
-
- 13 1月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 12 1月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* change to tensor copy sync * change to tensor copy sync * make copy_to safe when use TensorCopy * refine code * add ut * add cudapinned garbagecollector * add testcase: cpu place -> cuda pinned place
-
- 08 1月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* simplify prepared op impl to improve performance * fix kunlun compile error * continue fix kunlun compile error * only transform diff place when dtype diff * fix failed unittests * remove useless file * polish impl by review comment
-
- 06 1月, 2021 1 次提交
-
-
由 hong 提交于
* support dygraph in xpu place; test=develop * fix cpu/gpu compile error; test=develop * fix compile error; test=develop * fix xpu compile error; testd=develop
-
- 04 1月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
* support deepcopy for Layer/Tensor/Paramerbase * fix some code
-
- 29 12月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* [Complex] Add support for complex grad accumulated (#29889) * add support for complex grad accumulated * add unittest for coverage * update test dtype * remove useless blank line * [Complex] Handle complex to real after type promotion (#29855) * try to add fwd op input dtypes * refactor base impl * return tmp_ins after dygraph prepare data * fix typo found in debug * polish comment & add complex net test * revert detail change * fix unittest failed * add complex kernel condition control * fix xpu test failed & polish comment * polish details by review comments * Complex op test (#29753) * delete no need to calculate inputs in dygraph op_test * delete no need to calculate inputs in dygraph op_test * change grad elementwise_mul for complex types (#29757) * add conj op for complex types * add conj for complex types * add more test case * add conj_op test * modify conj api and impl * add complex type for fill_constant_op xpu * add setConstant for complex type * remove complex conj test file * user define grad for test_conj_op * add test case for static mode of conj api * modify conj doc * change input args name to x * remove useless codes * conj support real types * add conj test case for real number * delete no need to calculate inputs in dygraph op_test * delete no need to calculate inputs in dygraph op_test * modify grad of mul for complex types * fix the grads of inputs args order not match bug * change the grad of div when complex types (#29804) * change the grad of div when complex types * fix the grads of inputs args order not match bug Co-authored-by: Nchentianyu03 <chentianyu03@baidu.com>
-
- 23 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
-
- 22 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* fix fleet for multi-stream * fix memcpy for ncclid * use sync to solve move operation
-
- 17 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* Fix the dowanload bug in the case of multiple machines (#29551) * fix the dowanload bug * add sort for ips * Fix bug of matmul_v2 for broadcast case (#29599) * fix bug of matmul_v2 for broadcast * Rebuild group automatically in dynamic graph distributed (#29255) * add tensor_indices in AssignGroupBySize * add rebuild group in reducer * fix error message of gather nd (#29521)
-
- 07 12月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 04 12月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* use has_grad instead of train_mode * add vlog for debug * fix ut * fix ut
-
- 03 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* fix the warning of reducer (#29323) * fix warning of fleet (#29317) * Fix doc of fleet api (#29282)
-
- 01 12月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor * The leaf tensor concept is exposed and the gradient accumulation of leaf tensor * fix coverage * fix api doc * fix CI unittest * fix CI unittest * fix unitest * empty tensor does’t need inner_var_ * fix some error message
-
- 30 11月, 2020 1 次提交
-
-
由 liym27 提交于
* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable. * Add a new attribute `_inplace_version` for VarBase. * Raise exception if an inplace operation can result in incorrect gradient computation. * Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation. * For api assign, call _bump_inplace_version() when it's an inplace operation inn dynamic mode. * Use original var_wrapper if the inplace_version is not changed. * Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performane.
-
- 27 11月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* add reducer * refine envent for memorycopy * add concat&split for allreduce * apply concat & split for fuse tensor * fix nccl dep * fix the untest, compile problem and ddp initialize problem * fix untest for mac & add some comments & solve the repeated param in sublayers * fix untest for windows & fix document
-
- 26 11月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* split train_mode and has_grad * fix format * fix ci problems * fix sample code
-
- 18 11月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add base hook classes and reduce hook impl * fix constructor typo * polish comment format * refactor baisc hook class design * polish design details
-
- 16 11月, 2020 1 次提交
-
-
由 danleifeng 提交于
-
- 06 11月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* remove slelected rows all reduce over height check * polish unittest
-
- 05 11月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add retry for dygraph parallel socket bind * change to loop always * fix writing error
-
- 04 11月, 2020 1 次提交
-
-
由 Leo Chen 提交于
-
- 23 10月, 2020 1 次提交
-
-
由 lidanqing 提交于
-
- 21 10月, 2020 1 次提交
-
-
由 danleifeng 提交于
* nccl init support hostname and ip; test=develop
-
- 13 10月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* add sumary feature * refine printting tensor * add sci_mode * add sample code * fix indent error * fix _format_item * polish code * support item indent * add ut * set place for ut * fix py2 issue * fix ut
-
- 28 9月, 2020 1 次提交
-
-
由 arlesniak 提交于
* Add support for mkldnn ops types selection with FLAGS in dygraph * use regex to match DNNL verbose * python3 encoding fix
-
- 25 9月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* refine err msg * follow comments
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 31 8月, 2020 1 次提交
-
-
由 arlesniak 提交于
* get use of global 'use_mkldnn' in layer_helper * update for CI * update for CI, relu test * update for CI, relu test added, make FLAGS_use_mkldnn a public flag * added more strict tests, fixes after review * fixes after review * fixes after review, CI stuff
-
- 28 8月, 2020 2 次提交
- 21 8月, 2020 1 次提交
-
-
由 QingshuChen 提交于
* support Baidu AI Accelerator * test=kunlun * minor * test=kunlun * support xpu op in separate file * test=kunlun * update XPU error message and remove duplicated code * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
- 18 8月, 2020 1 次提交
-
-
由 Sylwester Fraczek 提交于
* enable mkldnn layout conversion * review fix: remove tmp_place * fix test mkldnn swish * add UT for PrepareData CPU->MKLDNN * add #ifdef PADDLE_WITH_MKLDNN * Force-push commit Co-authored-by: Ngrygielski <adam.grygielski@gmail.com>
-
- 13 8月, 2020 2 次提交
-
-
由 Leo Chen 提交于
* add unchaged infershape function * add broadcast infershape function * fix bug * rename infershape functions * add UnaryOpUnchangedInferShapeCheckAxis * add error message * add test for common infer shape functions * dont update existed ops * dont update op_desc.h * add more test * add error check, refine error message
-
由 Leo Chen 提交于
* add auto_cast, test=develop * add loss scaler, test=develop * add comments, test=develop * refine code, test=develop * refine code, test=develop * do not set flags automatically, test=develop * fix custom op bug, test=develop * add more test, test=develop * refine enable logic, test=develop * enable amp test with GPU, test=develop * add unittest * add test for found_inf * follow comments * follow comments * remove global variable, use singleton * add some notes * update comments * update comments * update comments * add use_dynamic_loss_scaling argument * refine found_inf * refine found_inf
-
- 11 8月, 2020 1 次提交
-
-
由 Zhen Wang 提交于
-
- 05 8月, 2020 1 次提交
-
-
由 Zhen Wang 提交于
* Add some error messages for the op without double grads. * fix the test_imperative_double_grad UT.
-