- 16 1月, 2023 1 次提交
-
-
由 Charles-hit 提交于
* polish static grad op maker gen * fix some bugs * fix static code gen * solve conflict * modify composite grad maker name
-
- 10 1月, 2023 2 次提交
- 09 1月, 2023 1 次提交
-
-
由 Jiabin Yang 提交于
* proto type of composite grad in paddle * proto type of composite grad in paddle * refactor composite api with phi * fix compile error * support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * fix compile error * add extra output in op_compat * remove debug log * fix clang compile error * support prim switch flag * support prim switch flag * fix dygraph error * merge develop * add code_gen * add necessary files without codegen * fix code_gen bug * add deps * modify igmnore * add ignore * delete std cout * add composite logic for backward.py * add tanh first order grad composite * support enable_prim flag for static graph * throw expection when both GrapOpMaker and GradCompOpMaker not been registered * reorganize the directory of prim api tests * fix windows error * add eager_utils * add eager_utils * modify code gen * add composite parse * add unittest for get_grad_op_desc * code optimize * fix static test on windows * support generate static graph code for imag and real op * fix windows compile error in test_static_prim * merge develop * disable test eager in inference * prim code gen * disable eager compile in inference * rm other file * rm gitignore file * code_style * add eager test * code_style * merge develop * remove useless files * modify static test * support bool flag from singlton * merge develop * recover git ignore * fix conflict * recover git ignore for generated op * fix test compile error * remove some tests * add python test * fix some name issue * add composite code gen * modify backward yaml * fix static composite grad maker code gen * remove addtional files * add some static funcs unit test * fix some bugs * fix composite grad maker register code gen * optimize some functions Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com> Co-authored-by: Nwangruting <wangruting@baidu.com> Co-authored-by: Ncxxly <chenxx_id@163.com> Co-authored-by: Ncharles-hit <wanghao107@baidu.com> Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
-
- 04 1月, 2023 2 次提交
-
-
由 Aurelius84 提交于
-
由 HongyuJia 提交于
* execute use kernel_key first * change OpKernelType->KernelKey * fix py3 compile error, remove redundant header files * fix build_strategy_test * fix DataType::RAW * fix custom_type test: operator_test.cc * fix transform place * fix backends_are_same_class * try fix place TransDataDevice * support all KernelKey * fix TransformData * fix place_are_same_class * fix merge * fix test_params_no_grad * fix specific place of GetExpectedKernelType * fix specific place of GetExpectedKernelType * fix GetKernelTypeForVar * fix dtype error * fix fetch_v2 * change GetKernelTypeForVar * fix interpreter * fix typo error * polish codes * polish codes * polish codes * fix conflict
-
- 23 12月, 2022 2 次提交
-
-
由 QingshuChen 提交于
-
由 houj04 提交于
-
- 14 12月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 12 12月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
* Enable to print device info. * Enhance the nan and inf checking for cpu. * Implement a common print function. * Unify the check of complex numbers. * Rewrite the omp method. * Count and print the number of nan and inf. * Change the print content. * Add unittest.
-
- 05 12月, 2022 1 次提交
-
-
由 huangjiyi 提交于
-
- 05 11月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 02 11月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor. (#47095) * Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor. * Add a FLAGS to control whether abort when meets inf/nan and polish codes. * Fix unittest. * Change the computing of mean.
-
- 27 10月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* make all cpp tests dynamic linked to libpaddle.so * add comments * keep old cc_test for some tests * fix some ut * make some ut use cc_test_old * fix typos and fit for win32 * fix lib path * fix some tests * skip lite test * fit for rocm * fit for cinn * fit for mac * fit for win32 * skip inference ut * skip windows * fix coverage
-
- 26 10月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove using lodtensor part2 * resolve code format error * resolve conflict * resolve conflict * replace added frameworrk tensor
-
- 19 10月, 2022 1 次提交
-
-
由 Wang Xin 提交于
-
- 17 10月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
* namespace modify * update by comment
-
由 pangyoki 提交于
* skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr * update ut * test_dist_allreduce_op failed * fix test_dist_allreduce_op * add ut * fix nccl cpu compile * fix
-
- 13 10月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove class ScopeBase * reopen test
-
- 12 10月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
* [Zero-Dim] support input 0D Tensor for unary api * fix CI
-
- 11 10月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove using lodtensor part1 * polish history code format
-
- 28 9月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
由 Leo Chen 提交于
-
- 22 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 19 9月, 2022 1 次提交
-
-
由 zyfncg 提交于
* fix wrong eigen header include * fix complie bug * fix nan_inf_utils_detail * fix resource_manager * fix conv_miopen_helper
-
- 09 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* add operator<< for BuildStrategy * add fake_coalesce * fit allreduce mode for new_exe * remove dubeg code * follow comments
-
- 06 9月, 2022 1 次提交
-
-
由 Hui Zhang 提交于
* using mkldnn * using with mkldnn macro * fix use mkldnn
-
- 01 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* refine cmake of framework * add deps for dense tensor * fix deps * remove alloc(ctx) * add depends on mkldnn
-
- 31 8月, 2022 1 次提交
-
-
由 Hui Zhang 提交于
* add del dropout op pass to jit pe enigne * add delete dropout test
-
- 30 8月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 25 8月, 2022 1 次提交
-
-
由 danleifeng 提交于
* update brpc version;test=develop
-
- 18 8月, 2022 1 次提交
-
-
由 zhangxiaoci 提交于
* change to async mode for xpu multi-card training in static graph mode * minor bugfix * irrelevant. move to another pr * move change to other pr * fix stream issue * fix 'stream not meet with current context' error * fix branch diverge, test=kunlun
-
- 08 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* clean tensor.h * fix gather_nd
-
- 02 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 01 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
- 29 7月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile
-
- 26 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Set more attrs in ReplaceScaleLossGradOp * Fix typos * Fix CI errors * Add UT
-
- 19 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Rename BOOST_GET macros * Fix conflicts
-
- 15 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 02 7月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* fix init() * delete test_device_context * replace CPUDeviceContext with CPUContext * fix test_scalar * remove dot_op.cc * fix compile
-