- 06 1月, 2022 2 次提交
-
-
由 limingshu 提交于
* fix the wrong filename * first commit
- 05 1月, 2022 1 次提交
-
-
由 Lijunhui 提交于
* init commit: new elem_mul_grad * add template speciallization for complex in multiply * reply review comments * correct dx and dy computation when T is complex * reply review comments * update to new ReduceRunctor * mul-output broadcast * call functions * call functions with comments * remove comments
-
- 31 12月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs
-
- 16 12月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* Reduce reshape kernel functions in pten * delete notes * fix bugs when compile * modify register name * fix compile bugs
-
- 24 11月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* elementwise_mul refactor * perfect code in test * delete redundant code * fix bugs when run test_multiply * adjust the location of macro * fix bugs when run ci
-
- 15 11月, 2021 1 次提交
-
-
由 Weilong Wu 提交于
* Add elementwise_mul triple grad kernel * Removed InplaceInferer and polished code
-
- 15 9月, 2021 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 02 6月, 2021 1 次提交
-
-
由 limingshu 提交于
-
- 25 5月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* modify complex template for elementwise ops * modify mul, div grad struct * add complex template for CudaShuffleDownSync CudaShuffleXorSync funcs and fix the bug when delete cuda<9000 * fix shuffle func args bug * fix shuffle func args bug * fix shuffle func args bug
-
- 03 3月, 2021 1 次提交
-
-
由 Qi Li 提交于
* [ROCM] update fluid elementwise op for rocm (part10), test=develop * update, test=develop * address review comments, test=develop
-
- 25 1月, 2021 1 次提交
-
-
由 arlesniak 提交于
* More precise mkldnn kernel choice in GetExpectedKernelType * Fixes after review * Refresh develop for CI * CI experiment * get back from CI exper
-
- 22 12月, 2020 1 次提交
-
-
由 chentianyu03 提交于
* add conj op for complex types * add conj for complex types * add more test case * add conj_op test * modify conj api and impl * add complex type for fill_constant_op xpu * add setConstant for complex type * remove complex conj test file * user define grad for test_conj_op * add test case for static mode of conj api * modify conj doc * change input args name to x * remove useless codes * conj support real types * add conj test case for real number * delete no need to calculate inputs in dygraph op_test * delete no need to calculate inputs in dygraph op_test * modify grad of mul for complex types * fix the grads of inputs args order not match bug
-
- 04 12月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* basic impl of type promote * add comment & another testcase * fix complex bugs & support python op promote type * fix failed unittests & polish code * add unittest for coverage * change to only promote complex type * polish code details * polish several comments
-
- 27 11月, 2020 1 次提交
-
-
由 arlesniak 提交于
-
- 10 9月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
* - introducing oneDNN 1.6 test=develop * - Removed redundant code test=develop
-
- 18 6月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
* Start implementing int8 eltwise add test=develop * - Fix to Michal PR * - Fix test=develop * - Lint fixes test=develop * - Added checking if elementwise_mul can be used test=develop * - Added attribs to skip_attrs_set test=develop * - Improved broadcasting test=develop - fixes to compilation - fix - fix - Lint fixes test=develop * - removed redundant condition test=develop Co-authored-by: NMichal Gallus <michal.gallus@intel.com>
-
- 13 4月, 2020 1 次提交
-
-
由 LutaoChu 提交于
Those ops add the kernel message enhancement, as follows paddle.fluid.layers.elementwise_add paddle.fluid.layers.elementwise_div paddle.fluid.layers.elementwise_floordiv paddle.fluid.layers.elementwise_max paddle.fluid.layers.elementwise_min paddle.fluid.layers.elementwise_mod paddle.fluid.layers.elementwise_mul paddle.fluid.layers.elementwise_pow paddle.fluid.layers.elementwise_sub
-
- 30 12月, 2019 1 次提交
-
-
由 danleifeng 提交于
-
- 10 12月, 2019 1 次提交
-
-
由 Adam 提交于
* MKLDNN v1.0 rebase to Paddle 1.6 test=develop * Add hacky paddle::string::to_string() implementation * vectorize<int64-t>() -> vectorize() cleanup test=develop * PADDLE_ENFORCE and void_cast fixes test=develop * Rebase changes test=develop * Cosmetics test=develop * Delete MKL from mkldnn.cmake test=develop * CMake debug commands test=develop * Delete MKLDNN_VERBOSE and rebase fixes test=develop * Rebase fixes test=develop * Temporarily disable int8 resnet101 vgg16 and vgg19 tests test=develop * Add libmkldnn.so.1 to python setup test=develop * Add libmkldnn.so.1 to inference_lib cmake after rebase test=develop * Post rebase fixes + FC int8 changes test=develop * Fix LRN NHWC test=develop * Fix NHWC conv3d test=develop * Windows build fix + next conv3d fix test=develop * Fix conv2d on AVX2 machines test=develop
-
- 29 11月, 2019 1 次提交
-
-
由 hong 提交于
* add_dygraph_execution_context * add dygraph infershape context and execution context; test=develop * fix imperative bug; test=develop * remove inputs outputs interface from execution context, because it have same function with inputNames; test=develop * remove tracer_test ctest; test=develop * fix split op bug; test=develop * fix unitests bug; test=develop * fix distribute test bug; test=develop * fix ngraph compile bug; test=develop * fix grad maker bug; test=develop * fix load op bugs; test=develop * fix operator.cc construct bug; test=develop * remove useless name find in operator; test=develop * add tracer_test; test=develop * fix concat, split bug; test=develop * remove tracer_test unitest; test=develop * fix attribute check bug; test=develop * add test code to fix converage; test=develop * remove useless code, change check backward input in engin; test=develop * unlock var type infer shape;test=develop * add ShareAllLoD api; test=develop * add dygraph infershape context unitest; test=develop * remove increase and decrease lod in dygraph; test=develop * addd override; test=develop * fix increase descrease lod; test=develop * fix paddle_enforce; test=develop * disable lod op dygraph check; test=develop * fix paddle enforce error; test=develop * add comment for op_registry and OperatorBase; test=develop * optimize the comment of op_registry; test=develop * fix format of comment; test=develop * fix format of comment; test=develop * optimize the format of comment; test=develop * optimize the format of the comment; test=develop * optimize comment of op_registry; test=develop
-
- 26 11月, 2019 1 次提交
-
-
由 Michał Gallus 提交于
* Refactor MKL-DNN ElementwiseMul remove manual fallback, remove format attrs test=develop * Refine PADDLE_ENFORCEs in eltwise_mul_op.h test=develop * Make ElementwiseMulOp inherit from ElementwiseOp * Change type of simd_width to int test=develop * Remove Constructor extensions in ElementwiseOp and ElementwiseMulOp test=develop * Restore attributes test=develop * Fix test coverage for mkldnn eltwise mul test=develop * Conform to new is_run_common_broadcast API test=develop * Add UT for AreDimsAndFormatCorrect test=develop
-
- 20 11月, 2019 1 次提交
-
-
由 danleifeng 提交于
-
- 19 11月, 2019 1 次提交
-
-
由 danleifeng 提交于
-
- 30 9月, 2019 1 次提交
-
-
由 danleifeng 提交于
Improve elementwise operators performance in same dimensions
-
- 24 9月, 2019 1 次提交
-
-
由 Leo Chen 提交于
* make OpTest check grad inplace even if forward has no inplace, test=develop * do not run PE when enable_inplace is False, test=develop * add conv3d cuda kernel for float16 type, test=develop * refactor OpTest for inplace, test=develop * add comments, test=develop
-
- 18 9月, 2019 1 次提交
-
-
由 Leo Chen 提交于
* update elementwise double grad to save gpu memory, test=develop * update elementwise_mul/div_grad_grad to save memory, test=develop * remove eval function in eigen statement to save memory, test=develop * add unittest for elementwise_div_grad_grad without dout, test=develop * add unittest for elementwise_add_grad_grad without ddx, test=develop * add float16 cuda kernel for elementwise double grad op, test=develop
-
- 13 5月, 2019 2 次提交
-
-
由 Yiqun Liu 提交于
* Optimize the elementwise op with CUDA kernels. test=develop * Support setting of attr in op config file. test=develop * Add the support the setting dtype and initializer in config. test=develop * Save workspace. * Add initializer "zeros". test=develop * Fix compiling error. * Support the use of existed file to initailize tensor in op_tester. * Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims. test=develop
-
由 Kaipeng Deng 提交于
* add double grad for elementwise_mul. test=develop * remove comment. test=develop * fix grad sum. test=develop * fix for axis expand. test=develop * add test for axis expand. test=develop
-
- 19 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 05 12月, 2018 1 次提交
-
-
由 chengduo 提交于
* expose square test=develop * fix activation test=develop * Add square API test=develop * add necessary op * code refine * fix API.spec test=develop * fix unit test test=develop * add unit test sparse_grad_clip test=develop * fix API.spec test=develop * remove mac test for test_gradient_clip test=develop * remove selectedrows_mul_tensor test=develop
-
- 16 11月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* wip simplify operator framework * wip * wip * done test=develop * clean test=develop * fix test=develop * fix deps test=develop * fix cpu build test=develop * fix tensorrt build test=develop * fix tests test=develop * fix test=develop * fix cpu build test=develop
-
- 08 11月, 2018 1 次提交
-
-
由 chengduo 提交于
* fix input<tensor> test=develop * fix split_ids test=develop * ElementwiseMul should not support SelectedRows * fix scale op test=develop * change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar() * fix operator * refine MultiOutput * fix MultiOutput test=develop * disable test_dist_save_load test=develop * fix elementwise_op test=develop * add get_sparse_as_op test=develop * add info for check test=develop * rename get_sparse_as_op with extract_rows_as_op. test=develop * elementwise doesn't support selected_rows * fix regularizer * remove extract_rows_as test=develop * fix ci test=develop * add test for sum_op * fix regularizer test=develop * test=develop * fix pserver weight decay multi inputs test=develop
-
- 18 9月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 17 9月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 22 8月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 16 8月, 2018 1 次提交
-
-
由 tensor-tang 提交于
-
- 07 3月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 06 3月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-