- 16 Aug, 2021 1 commit
-
-
Authored by Guoxia Wang
* support margin loss (arcface, cosface, sphereface)
-
- 07 Jul, 2021 1 commit
-
-
Authored by jakpiase
* added prelu bf16/fp32 fwd/bwd kernel
-
- 30 Jun, 2021 1 commit
-
-
Authored by jakpiase
* added matmul_v2 bf16/fp32 FWD kernel
* added formatting
* removed some tests due to timeout in CI
* refactored tests
* merged tests classes into one file
* minor change
* removed test guard for CUDA
* remove skipIf
* changes after review
* formatted one file
* minor change
* added skipping UT in CUDA place
-
- 24 Jun, 2021 1 commit
-
-
Authored by CtfGo
As the title
-
- 23 Jun, 2021 1 commit
-
-
Authored by jakpiase
* base changes for split op
* 90% of split functionality added
* full fp32 functionality
* added bf16 test
* added submemory caching
* added bf test to static mode whitelist
* minor change
* enabled split op for inference
* minor fix
* minor fix
-
- 17 Jun, 2021 2 commits
-
-
Authored by joanna.wozna.intel
* Add bf16 support for save and load ops
* Add bf16 test condition
* Add matmul and change fluid.io to paddle.static
* Reduce the test duration
-
Authored by joanna.wozna.intel
* Add lookup_table_v2 BF16
* Reuse lookup table UT
* Change op_type to op_version
* Remove check_dygraph
* Remove skip_check_grad_ci
-
- 07 Jun, 2021 1 commit
-
-
Authored by ShenLiang
* fix c_split bug
* fix utest
* add c_embedding for tensorparallel
-
- 26 May, 2021 2 commits
- 25 May, 2021 1 commit
-
-
Authored by jakpiase
-
- 29 Apr, 2021 1 commit
-
-
Authored by joanna.wozna.intel
* Add bf16 uniform random initializer
* Remove duplicated section
* Change UT to CPU place only
* Put detail functions into anonymous namespace
-
- 21 Apr, 2021 1 commit
-
-
Authored by jakpiase
-
- 14 Apr, 2021 2 commits
-
-
Authored by jakpiase
-
Authored by Adam Osewski
* Initial draft for SGD BF16 kernel.
* Unit tests for SGD with BF16 data type.
* Add VLOG message to SGD BF16 op CPU kernel.
* Enhance error messages and error types.
* Refactor SGD op kernels to leverage some common code.
* Make it easier to add new kernel invoke code.
* Fix SGD op kernel for sparse grad.
* Unify quotes style.
* Fix error for ROCM compilation.
* Use specialized PADDLE_ENFORCE_xx functions.
-
- 30 Mar, 2021 1 commit
-
-
Authored by jakpiase
-
- 22 Mar, 2021 1 commit
-
-
Authored by arlesniak
-
- 19 Mar, 2021 1 commit
-
-
Authored by Adam Osewski
-
- 04 Mar, 2021 1 commit
-
-
Authored by jakpiase
-
- 02 Mar, 2021 1 commit
-
-
Authored by Gradie
* lamb_op_xpu;test=kunlun * modify lamb_op_xpu.cc;test=kunlun * delete atol lamb_op_xpu; test=kunlun * update xpu.cmake;test=kunlun * test_error 1e-5,lamb_op_xpu;test=kunlun * error1e-5,lamb_op_xpu,test=kunlun * delete atol lamb_xpu;test=kunlun * modify atol,lamb_op_xpy;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu, XPUOptest;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu,modify xpu_cmake; test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu,modify xpucmake;test=kunlun
-
- 18 Feb, 2021 1 commit
-
-
Authored by joanna.wozna.intel
* Add conv transpose BF16
* Share function GetWeightsTz
* Adjust to review and fix op compatibility
* Add bias to unique handler name
* Remove errors related to paddle enforce
* Add conv2d_transpose to bf16 list and kernel refactor
-
- 03 Feb, 2021 1 commit
-
-
Authored by Adam Osewski
-
- 27 Jan, 2021 1 commit
-
-
Authored by jakpiase
* added external reorder to profiler
* resolved conflict
* added enable_static
* initial version of lstm, not working yet
* added lstm to operators.cmake
* added vanilla lstm mkldnn op
* added peephole weights integration
* minor changes
* added formatting
* added fusion_lstm_mkldnn to static_whitelist
* added formatting
* removed comment
* moved use_peepholes attribute inside is_cached block
* reverted wrong changes
* minor formatting change
* minor changes
* changed stream handling
* minor change
* added datatype to GetExpectedKernelType()
* added reading stream from TLS
-
- 26 Jan, 2021 2 commits
-
-
Authored by jakpiase
* added external reorder to profiler
* resolved conflict
* added enable_static
* initial version of lstm, not working yet
* added lstm to operators.cmake
* added vanilla lstm mkldnn op
* added peephole weights integration
* minor changes
* added formatting
* added fusion_lstm_mkldnn to static_whitelist
* added formatting
* removed comment
* moved use_peepholes attribute inside is_cached block
* reverted wrong changes
* minor formatting change
* minor changes
- 31 Dec, 2020 1 commit
-
-
Authored by cc
* Add mkldnn nearest_interp and bilinear_interp op
* don't run mkldnn interpolate by default
* add interpolate_mkldnn_pass
-
- 21 Dec, 2020 1 commit
-
-
Authored by LielinJiang
* add double grad for conv_transpose
-
- 09 Dec, 2020 1 commit
-
-
Authored by Wei Shengyu
* remove addcmul
* remove unittest and other related code of addcmul
* fix bug
* fix merge conflict
-
- 26 Nov, 2020 1 commit
-
-
Authored by joanna.wozna.intel
* Add bf16 pool2d and unify bf16 unit tests
* Add change default ops test
-
- 25 Nov, 2020 1 commit
-
-
Authored by Wojciech Uss
* Add multi_gru_fuse_pass and tests
* fix date
* cleaned up headers
-
- 24 Nov, 2020 1 commit
-
-
Authored by Wojciech Uss
* Add multi_gru_seq_fuse_pass and tests
* fix date
* removed unused functions
-
- 20 Nov, 2020 1 commit
-
-
Authored by joanna.wozna.intel
* Add bf16 matmul, fc, elementwise add and mul
* Correct unit test
-
- 19 Nov, 2020 1 commit
-
-
Authored by Wojciech Uss
* Add multi_gru op and tests
* removed redundant disable_dygraph()
-
- 17 Nov, 2020 1 commit
-
-
Authored by Jacek Czaja
-
- 16 Nov, 2020 1 commit
-
-
Authored by lidanqing
* add mkldnn inplace op version test
* update mkldnn_inplace fuse pass
* update the inplace test
-
- 06 Nov, 2020 1 commit
-
-
Authored by joanna.wozna.intel
* Add bfloat16 softmax and gelu
* Add pass attr bfloat16_enabled_op_types
* Changes from review
-
- 05 Nov, 2020 1 commit
-
-
Authored by Jacek Czaja
* Added sum bf16 oneDNN test=develop
* Fix to UT of sum bf16 test=develop
-
- 02 Nov, 2020 1 commit
-
-
Authored by wangguanzhong
* add generate_proposals_v2 op
-
- 22 Oct, 2020 1 commit
-
-
Authored by pangyoki
-
- 21 Oct, 2020 1 commit
-
-
Authored by pangyoki
* add static_mode_white_list
* add Mac CI static list
* add Win CI white_list
* add Coverage and Py3 CI white_list, add test_unittest
-