- 10 Apr, 2023 1 commit
-
-
Committed by HongyuJia
* [Opt Performance] Optimize custom operator performance, reconstruct python API auto-gen, add cache and use const inference * opt AutoGradMeta implementation * remove profiler codes * fix unit test * change year, 2021->2023 * fix int64_t parse bug
-
- 03 Apr, 2023 1 commit
-
-
Committed by HongyuJia
* [CustomOP Optional Inplace] Custom operator supports inplace optional vector Tensor input * uncomment unittest codes
-
- 31 Mar, 2023 1 commit
-
-
Committed by HongyuJia
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output * delete custom_inplace_setup.py * [CustomOP Optional Inplace] Custom operator supports inplace optional Tensor input * fix bug for vector<Tensor> inplace test
-
- 27 Mar, 2023 1 commit
-
-
Committed by HongyuJia
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output * delete dtype,shape func of multi_inplace op * [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output
-
- 23 Mar, 2023 1 commit
-
-
Committed by HongyuJia
-
- 22 Mar, 2023 1 commit
-
-
Committed by HongyuJia
* [CustomOP Optional] CustomOP supports optional Tensor * fix test_custom_concat, add pytest to CMakeLists
-
- 21 Mar, 2023 1 commit
-
-
Committed by HongyuJia
-
- 20 Mar, 2023 1 commit
-
-
Committed by HongyuJia
-
- 16 Mar, 2023 1 commit
-
-
Committed by HongyuJia
* init unit test commit, contains register thinking * support inplace * get inplaced x.grad * Try support inplace and hook at the same time * Support inplace, need debug * Support inplace successfully * Inplace use Tensor&, consistent with Tensor* * fix MapPlainOutputs bug * fix double grad inplace error
-
- 07 Mar, 2023 1 commit
-
-
Committed by Chen Weihang
-
- 31 Jan, 2023 1 commit
-
-
Committed by HongyuJia
* decouple phi custom_op * decouple phi custom_op, remove codes * delete custom symbol of inference
-
- 09 Jan, 2023 1 commit
-
-
Committed by HongyuJia
-
- 30 Dec, 2022 1 commit
-
-
Committed by HongyuJia
* add custom_cpu testcase * update test_custom_device_setup * update path to custom_runtime * fix cmd wait * test Linux only * setup once * integrate to one run_cmd * add pip install * change timeout * add debug string * add debug string * add debug string * use os.system and change module name * add runtime * add more debug message * continue debug * timestamp * fix testcase import bug * remove error message * set TIMEOUT property
-
- 28 Nov, 2022 1 commit
-
-
Committed by Qi Li
* [NPU] apply npu_identity to conv bn and copy2cpu, test=develop * update npu identity to share data with x, test=develop * address review comments, test=develop
-
- 22 Nov, 2022 1 commit
-
-
Committed by HongyuJia
-
- 25 Aug, 2022 1 commit
-
-
Committed by hong
* add python final state * fix bug * fix bugs * fix bug * fix bug * revert impl, final state mul not support selected rows * fix softmax use cudnn error * add softlabel false unittest * revert loss.py
-
- 04 Aug, 2022 1 commit
-
-
Committed by Aurelius84
-
- 04 Jul, 2022 1 commit
-
-
Committed by Ruibiao Chen
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 14 Apr, 2022 2 commits
-
-
Committed by zyfncg
* support some c++ api in paddle namespace * change c++ api namespace in custom op
-
Committed by Chen Weihang
* change dispatch to visit * resolve conflict
-
- 13 Apr, 2022 1 commit
-
-
Committed by Chen Weihang
* remove old custom op placetype * replace dist placetype using * add with gpu macro * fix mutable_data error * fix set value error * add comment
-
- 12 Apr, 2022 2 commits
-
-
Committed by Chen Weihang
* add context pool unittests * fix timeout * polish details * change option pos * add dll decl for windows * fix pre-commit error * move dll_decl and export DeviceContext * replace lost dll_decl.h
-
Committed by Chen Weihang
* add new method for custom double grad * add tanh double grad unittest * change year * revert tensor init method
-
- 14 Mar, 2022 1 commit
-
-
Committed by Jiabin Yang
* eager, test=develop * fix bug, test=develop * eager, test=develop * merge legacy to fluid * eager, test=develop * eager, test=develop * Refactor TensorAdd func by template and remove gradient_accumulation in eager * Remove needless target name * eager, test=develop * eager, test=develop * Use overload instead of template * Remove legacy code * Remove legacy code * selectedrows, test=develop * Remove DataType test * eager, test=develop * eager, test=develop * support gan, test=develop * Using Tensor directly instead of using EagerTensor * support gradient_accumulation * make test_imperative_lod_tensor_to_selected_rows longer * make test_imperative_lod_tensor_to_selected_rows longer * refine code * ptb, test=develop * Rename all EagerTensor to Tensor * Rename some EagerTensor to Tensor * rename EagerTensor to EagerVariable * eager, test=develop * eager, test=develop * eager, test=develop * eager, test=develop * add more test * eager, test=develop * Support copiable selected rows and merge develop * save load, eager, test=develop * save load, eager, test=develop * refine, test=develop * remove useless _set_value method * refine, test=develop * refine, test=develop * revert static_runner, test=develop * EagerTensor to Tensor, test=develop * refine, test=develop * refine, test=develop * clear grad, test=develop * merge, develop * merge, develop * merge, test=develop * merge, test=develop * Support quant and part of slice * support legacy static save * extend slim tests time * remove imperative on inference * remove imperative on inference * merge develop * fix typo * fix typo * split slice related code into 2 part for imperative and eager * split slice from inference * split slice from inference * fix test_tensor_register_hook * support custom op in eager mode * fix inference deps error * split eager utils from custom operator * fix type match * fix typo
Co-authored-by: Wang Huan <wanghuan29@baidu.com>
Co-authored-by: Weilong Wu <veyron_wu@163.com>
Co-authored-by: wanghuancoder <wanghuancoder@163.com>
-
- 22 Feb, 2022 3 commits
-
-
Committed by Aganlengzi
* [custom kernel]Delete useless * change RegType enum names * mod notes * merge * update
-
Committed by xiongkun
* change Vector to std::vector and provide MixVector class as a helper wrapper class * solve the multi-gpu hang problem * remove the duplicate template instantiation * Copy vector to cpu * add CopyToCPU * xxx * final version: fix the problem of all reduce * remove mixvector dependence * fix * merge * fix code * fix by CI
-
Committed by Chen Weihang
* unify register macro * rename declare macro * fix infrt error
-
- 20 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-