- 02 Mar, 2022, 1 commit
  - Committed by Feiyu Chan
    * move sequence2batch
    * move lstm and gru
    * Add phi/kernels directory into exclusion to stop using hipcc to compile non .cu files in it.
- 20 Feb, 2022, 1 commit
  - Committed by Chen Weihang
    * rename pten dir to phi
    * rename namespace to phi
    * rename infrt pten dir to phi
    * resolve conflict
    * rename pten to phi in cmake
    * revert all infrt change
    * change needed files
    * fix infrt failed
    * fix inference failed
- 18 Feb, 2022, 1 commit
  - Committed by Feiyu Chan
    * move blas related files
    * move lapack related files
- 12 Jun, 2021, 1 commit
  - Committed by joanna.wozna.intel
    * Small changes related to BF16 fusion_gru and fusion_lstm
    * Correct to pass arg by value
    * Add conditions to rnn op
    * Correct the spelling mistake
    * Improving the test with checking activation
    * Trigger CI
- 30 Mar, 2021, 1 commit
  - Committed by jakpiase
- 04 Mar, 2021, 1 commit
  - Committed by jakpiase
- 27 Jan, 2021, 1 commit
  - Committed by jakpiase
    * added external reorder to profiler
    * resolved conflict
    * added enable_static
    * initial version of lstm, not working yet
    * added lstm to operators.cmake
    * added vanilla lstm mkldnn op
    * added peephole weights integration
    * minor changes
    * added formatting
    * added fusion_lstm_mkldnn to static_whitelist
    * added formatting
    * removed comment
    * moved use_peepholes attribute inside is_cached block
    * reverted wrong changes
    * minor formatting change
    * minor changes
    * changed stream handling
    * minor change
    * added datatype to GetExpectedKernelType()
    * added reading stream from TLS
- 26 Jan, 2021, 2 commits
  - Committed by jakpiase
    * added external reorder to profiler
    * resolved conflict
    * added enable_static
    * initial version of lstm, not working yet
    * added lstm to operators.cmake
    * added vanilla lstm mkldnn op
    * added peephole weights integration
    * minor changes
    * added formatting
    * added fusion_lstm_mkldnn to static_whitelist
    * added formatting
    * removed comment
    * moved use_peepholes attribute inside is_cached block
    * reverted wrong changes
    * minor formatting change
    * minor changes
- 10 Apr, 2020, 1 commit
  - Committed by zhaoyuchen2018
    C++ OP enhancement.
- 28 Nov, 2019, 1 commit
  - Committed by Zeng Jinle
    * use system allocator in unittests, test=develop
    * fix op bugs, test=develop
    * fix tensor copy bug when src and dst are the same, test=develop
- 28 Oct, 2019, 1 commit
  - Committed by Chen Weihang
    * replace part of the old implementation, test=develop
    * restore concat op, test=develop
    * update all ops implementation & delete GetDataTypeOfVar func, test=develop
- 11 Sep, 2019, 1 commit
  - Committed by Yiqun Liu
    * Refine the codes related to fc op.
    * Add GPU implementation for fc functor.
    * Apply fc_fuse_pass in GPU inference. test=develop
    * Change the cmake for fc op.
    * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.
    * Add an attribute to set the activation type in fc_op.
    * Enhance the unittest of fc_op. test=develop
    * Remove the declaration of FCOpGrad back to the header file. test=develop
    * Set default value for newly added arguments in test_fc_op. test=develop
- 14 Aug, 2019, 1 commit
  - Committed by Leo Chen
    * remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR(), test=develop
    * remove SplitIdsOpGradMaker since it is buggy and not tested, update spec file, test=develop
- 19 Jun, 2019, 1 commit
  - Committed by 翟飞跃
    * fix spelling errors; test=develop
    * Update API.spec update md5
    * Update API.spec
    * change the order of api; test=develop
- 08 Mar, 2019, 1 commit
  - Committed by tensor-tang
    test=develop
- 07 Mar, 2019, 1 commit
  - Committed by tensor-tang
    test=develop
- 20 Dec, 2018, 1 commit
  - Committed by tensor-tang
    test=develop
- 18 Dec, 2018, 1 commit
  - Committed by tensor-tang
- 17 Dec, 2018, 1 commit
  - Committed by tensor-tang
- 12 Dec, 2018, 1 commit
  - Committed by Yu Yang
    test=develop
- 21 Nov, 2018, 1 commit
  - Committed by tensor-tang
    test=develop
- 16 Nov, 2018, 1 commit
  - Committed by Wu Yi
    * wip simplify operator framework
    * wip
    * wip
    * done test=develop
    * clean test=develop
    * fix test=develop
    * fix deps test=develop
    * fix cpu build test=develop
    * fix tensorrt build test=develop
    * fix tests test=develop
    * fix test=develop
    * fix cpu build test=develop
- 17 Oct, 2018, 1 commit
  - Committed by nhzlx
    test=develop
- 12 Oct, 2018, 1 commit
  - Committed by tensor-tang
- 11 Oct, 2018, 1 commit
  - Committed by tensor-tang
- 09 Oct, 2018, 1 commit
  - Committed by tensor-tang
    test=develop
- 30 Sep, 2018, 1 commit
  - Committed by tensor-tang
    test=develop
- 26 Sep, 2018, 1 commit
  - Committed by tensor-tang
    test=develop
- 14 Sep, 2018, 3 commits
  - Committed by tensor-tang
  - Committed by tensor-tang
  - Committed by tensor-tang
- 12 Sep, 2018, 1 commit
  - Committed by tensor-tang
- 11 Sep, 2018, 2 commits
  - Committed by tensor-tang
  - Committed by tensor-tang
- 05 Sep, 2018, 5 commits
  - Committed by tensor-tang
  - Committed by tensor-tang
  - Committed by tensor-tang
  - Committed by tensor-tang
  - Committed by Brian Liu
    * Refine fusion lstm op code for better readability
    * Enable peephole in fusion lstm op (seq_mode part) and add unit test
    * Enable peephole in fused lstm op (batch_mode part); set batch_mode as default as well
    * Use pre-commit to clean format
    * Follow up review comments as well as adding more unit tests for seq mode