- 24 Dec, 2021 1 commit
-
-
Committed by yaoxuefeng
add pull gpups sparse op
-
- 16 Dec, 2021 1 commit
-
-
Committed by yeliang2258
* add test for conv_elementwise_add2_act_fuse_pass and conv_elementwise_add_act_fuse_pass
* Add conv_eltwiseadd_bn_fuse_pass test and fix test_conv_elementwise_addX_act_fuse_pass
* add tests for conv_act_mkldnn_fuse_pass
* add test for conv_bias_mkldnn_fuse_pass
* update code
* add conv_act_mkldnn_fuse_pass for relu, relu6, swish, leaky_relu
* update test
* update
* update bug
* update
* update pattern_detector
* fix test_conv_eltwiseadd_bn_fuse_pass
* add diff display notest;test=windows_ci_inference
* fix
* remove test_conv_act_mkldnn_fuse_pass.py
* fix
-
- 14 Dec, 2021 1 commit
-
-
Committed by heliqi
* add layer_norm_fuse_pass test case
* restore cmakelist code
* Merge branch 'develop' into layer_norm_fuse_pass
* Merge branch 'develop' into layer_norm_fuse_pass
* add bad case test
-
- 23 Nov, 2021 1 commit
-
-
Committed by Aurelius84
* Add transfer_layout/dtype op
* clean useless codes
* fix unused var
* add optest in white.txt
* split into data_transfer.cc
* fix cmake
* modify according to reviewer comment
* replace cast_op with transfer_dtype_op
-
- 27 Oct, 2021 1 commit
-
-
Committed by piotrekobiIntel
* Add WIP version of elementwise_div_mkldnn without working dy grad
* Add dy gradient calculation implementation, disable broadcast tests
* Readd removed tests from static_mode_white_list
* Add bfloat16 gradient tests, remove int8 and uint8 support
* Change the way dy grad is calculated to improve performance
* Refactor BinaryMKLDNNHandler to use a default parameter
* Change copyright year
* Refactor as suggested
* Attempt to bypass CI Approval not accepting max_relative_error
* Fix formatting issue
-
- 24 Sep, 2021 1 commit
-
-
Committed by piotrekobiIntel
* Add elementwise_sub_mkldnn_op without grad
* Add test to static_mode_white_list
* Refactor code, change license years
* Remove invalid grad implementation
* Fix element_wise_sub_op test
* Fix CI Approval error
* Remove unnecessary EltwiseSubMKLDNNGradKernel class
* Fix CI Approval 2
* Fix CI Approval 3
* Fix CI Approval Attempt #4
* Fix CI Approve Attempt #5
* Fix CI Approval Attempt #6
* Fix CI Approval Attempt #7
* Change test names containing add to sub
* Fix old tests testing add instead of sub
* Copy grad implementation from elementwise_add_mkldnn
* CI test fix attempt
* Revert "CI test fix attempt" This reverts commit c647cacf41e6a87c715385a185de5cbf65fc8900.
* Fix CI attempt 2
* Fix elementwise_sub tests, temporary mkldnn broadcast test disable
* Add working implementation of elementwise_sub grad
* Fix build errors caused by pull
* Fix format error
* Fix format error 2
* Disable elementwise_sub_mkldnn test on GPU
* Apply fix for paddle.fluid import
* Revert changes of test_elementwise_sub and Fix mkldnn test
* Revert "Apply fix for paddle.fluid import" This reverts commit fc3b122fec8e12f2bcb32928a2685ba4d20fd742.
* fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862)
* Add changes suggested by reviewers
* Change @unittest.skipIf... to @OpTestTool.skip_if_not_cpu_bf16() to satisfy Approval CI
* Remove check_dygraph=False to satisfy CI Approval
Co-authored-by: zhangbo9674 <82555433+zhangbo9674@users.noreply.github.com>
-
- 17 Sep, 2021 1 commit
-
-
Committed by Haohongxiang
* Support EMA in Paddle2.x and Fleet
* update
* update
* update
* modify ut of ema
* modify docs
* modify bugs
* update
* update
* update
* modify ut
-
- 14 Sep, 2021 1 commit
-
-
Committed by zhiboniu
add fill_ backward
-
- 10 Sep, 2021 1 commit
-
-
Committed by zhiboniu
-
- 27 Aug, 2021 1 commit
-
-
Committed by Guoxia Wang
* sparse_momentum_op is used to save w@GRAD memory for gather_op when gathering from a large parameter
-
- 18 Aug, 2021 1 commit
-
-
Committed by Guoxia Wang
* support class center sample of PartialFC
-
- 16 Aug, 2021 1 commit
-
-
Committed by Guoxia Wang
* support margin loss (arcface, cosface, sphereface)
-
- 07 Jul, 2021 1 commit
-
-
Committed by jakpiase
* added prelu bf16/fp32 fwd/bwd kernel
-
- 30 Jun, 2021 1 commit
-
-
Committed by jakpiase
* added matmul_v2 bf16/fp32 FWD kernel
* added formatting
* removed some tests due to timeout in CI
* refactored tests
* merged tests classes into one file
* minor change
* removed test guard for CUDA
* remove skipIf
* changes after review
* formatted one file
* minor change
* added skipping UT in CUDA place
-
- 24 Jun, 2021 1 commit
-
-
Committed by CtfGo
As the title
-
- 23 Jun, 2021 1 commit
-
-
Committed by jakpiase
* base changes for split op
* 90% of split functionality added
* full fp32 functionality
* added bf16 test
* added submemory caching
* added bf test to static mode whitelist
* minor change
* enabled split op for inference
* minor fix
* minor fix
-
- 17 Jun, 2021 2 commits
-
-
Committed by joanna.wozna.intel
* Add bf16 support for save and load ops
* Add bf16 test condition
* Add matmul and change fluid.io to paddle.static
* Reduce the test duration
-
Committed by joanna.wozna.intel
* Add lookup_table_v2 BF16
* Reuse lookup table UT
* Change op_type to op_version
* Remove check_dygraph
* Remove skip_check_grad_ci
-
- 07 Jun, 2021 1 commit
-
-
Committed by ShenLiang
* fix c_split bug
* fix utest
* add c_embedding for tensorparallel
-
- 26 May, 2021 2 commits
- 25 May, 2021 1 commit
-
-
Committed by jakpiase
-
- 29 Apr, 2021 1 commit
-
-
Committed by joanna.wozna.intel
* Add bf16 uniform random initializer
* Remove duplicated section
* Change UT to CPU place only
* Put detail functions into anonymous namespace
-
- 21 Apr, 2021 1 commit
-
-
Committed by jakpiase
-
- 14 Apr, 2021 2 commits
-
-
Committed by jakpiase
-
Committed by Adam Osewski
* Initial draft for SGD BF16 kernel.
* Unit tests for SGD with BF16 data type.
* Add VLOG message to SGD BF16 op CPU kernel.
* Enhance error messages and error types.
* Refactor SGD op kernels to leverage some common code.
* Make it easier to add new kernel invoke code.
* Fix SGD op kernel for sparse grad.
* Unify quotes style.
* Fix error for ROCM compilation.
* Use specialized PADDLE_ENFORCE_xx functions.
-
- 30 Mar, 2021 1 commit
-
-
Committed by jakpiase
-
- 22 Mar, 2021 1 commit
-
-
Committed by arlesniak
-
- 19 Mar, 2021 1 commit
-
-
Committed by Adam Osewski
-
- 04 Mar, 2021 1 commit
-
-
Committed by jakpiase
-
- 02 Mar, 2021 1 commit
-
-
Committed by Gradie
* lamb_op_xpu;test=kunlun
* modify lamb_op_xpu.cc;test=kunlun
* delete atol lamb_op_xpu; test=kunlun
* update xpu.cmake;test=kunlun
* test_error 1e-5,lamb_op_xpu;test=kunlun
* error1e-5,lamb_op_xpu,test=kunlun
* delete atol lamb_xpu;test=kunlun
* modify atol,lamb_op_xpy;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu, XPUOptest;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu,modify xpu_cmake; test=kunlun
* lamb_op_xpu;test=kunlun
* lamb_op_xpu,modify xpucmake;test=kunlun
-
- 18 Feb, 2021 1 commit
-
-
Committed by joanna.wozna.intel
* Add conv transpose BF16
* Share function GetWeightsTz
* Adjust to review and fix op compatibility
* Add bias to unique handler name
* Remove errors related to paddle enforce
* Add conv2d_transpose to bf16 list and kernel refactor
-
- 03 Feb, 2021 1 commit
-
-
Committed by Adam Osewski
-
- 27 Jan, 2021 1 commit
-
-
Committed by jakpiase
* added external reorder to profiler
* resolved conflict
* added enable_static
* initial version of lstm, not working yet
* added lstm to operators.cmake
* added vanilla lstm mkldnn op
* added peephole weights integration
* minor changes
* added formatting
* added fusion_lstm_mkldnn to static_whitelist
* added formatting
* removed comment
* moved use_peepholes attribute inside is_cached block
* reverted wrong changes
* minor formatting change
* minor changes
* changed stream handling
* minor change
* added datatype to GetExpectedKernelType()
* added reading stream from TLS
-
- 26 Jan, 2021 2 commits
-
-
Committed by jakpiase
* added external reorder to profiler
* resolved conflict
* added enable_static
* initial version of lstm, not working yet
* added lstm to operators.cmake
* added vanilla lstm mkldnn op
* added peephole weights integration
* minor changes
* added formatting
* added fusion_lstm_mkldnn to static_whitelist
* added formatting
* removed comment
* moved use_peepholes attribute inside is_cached block
* reverted wrong changes
* minor formatting change
* minor changes
- 31 Dec, 2020 1 commit
-
-
Committed by cc
* Add mkldnn nearest_interp and bilinear_interp op
* don't run mkldnn interpolate in default
* add interpolate_mkldnn_pass
-
- 21 Dec, 2020 1 commit
-
-
Committed by LielinJiang
* add double grad for conv_transpose
-
- 09 Dec, 2020 1 commit
-
-
Committed by Wei Shengyu
* remove addcmul
* remove unittest and other related code of addcmul
* fix bug
* fix merge conflict
-
- 26 Nov, 2020 1 commit
-
-
Committed by joanna.wozna.intel
* Add bf16 pool2d and unify bf16 unit tests
* Add change default ops test
-