- 09 1月, 2023 11 次提交
-
-
由 ykkk2333 提交于
* migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun * fix dlrm throughput problem, test=kunlun * add xpu einsum, fill_diagonal, and diagonal kernels, test=kunlun
-
由 gem5 提交于
-
由 张春乔 提交于
* Dy2St_RandomHorizontalFlip * Dy2St_RandomHorizontalFlip
-
由 Jiabin Yang 提交于
* proto type of composite grad in paddle * proto type of composite grad in paddle * refactor composite api with phi * fix compile error * support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * fix compile error * add extra output in op_compat * remove debug log * fix clang compile error * support prim switch flag * support prim switch flag * fix dygraph error * merge develop * add code_gen * add necessary files without codegen * fix code_gen bug * add deps * modify igmnore * add ignore * delete std cout * add composite logic for backward.py * add tanh first order grad composite * support enable_prim flag for static graph * throw expection when both GrapOpMaker and GradCompOpMaker not been registered * reorganize the directory of prim api tests * fix windows error * add eager_utils * add eager_utils * modify code gen * add composite parse * add unittest for get_grad_op_desc * code optimize * fix static test on windows * support generate static graph code for imag and real op * fix windows compile error in test_static_prim * merge develop * disable test eager in inference * prim code gen * disable eager compile in inference * rm other file * rm gitignore file * code_style * add eager test * code_style * merge develop * remove useless files * modify static test * support bool flag from singlton * merge develop * recover git ignore * fix conflict * recover git ignore for generated op * fix test compile error * remove some tests * add python test * fix some name issue * add composite code gen * modify backward yaml * fix static composite grad maker code gen * remove addtional files * add some static funcs unit test * fix some bugs * fix composite grad maker register code gen * optimize some functions Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com> Co-authored-by: Nwangruting <wangruting@baidu.com> Co-authored-by: Ncxxly <chenxx_id@163.com> Co-authored-by: Ncharles-hit <wanghao107@baidu.com> Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
-
由 HongyuJia 提交于
-
由 Ryan 提交于
* add RandomResizedCrop * fix code style * fix first loop * add uni test
-
由 wangzhen38 提交于
-
由 骑马小猫 提交于
* remove dumplicate element_size * remove tensor_methon_* method
-
由 zhaoyingli 提交于
* [AutoParalle] balancing the calculation of global_norm in data parallel * fix unittest * update cond pure_data_parallel
-
由 xiongkun 提交于
* [Dy2static] Add ParameterRecorder * filter by shape(tensor)==0 * fix code by review * fix random failed in CI. (especially coverage) * fix bugs * remove API changes to avoid static CI approval
-
由 LiYuRio 提交于
* comm_context and static init * refactor: move to phi/core/distributed * refactor: avoid mutable_data usage * fix: windows sock * fix: device without nccl Co-authored-by: Wen Sun <syl1887415157@126.com>
-
- 07 1月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Enable standalone executor for fleet training * Update code * Replace use_standalone_executor utils in auto parallel * Update code * Diable standalone executor for test_pass_sharding * Update code * Set sequential run for auto parallel * Fix dist_attr bug * Set sequential run for auto parallel
-
- 06 1月, 2023 20 次提交
-
-
由 Yuanle Liu 提交于
-
由 Guanghua Yu 提交于
-
由 RuohengMa 提交于
* add bitwise and, bitwise not, bitwise or and bitwise xor * correct typo
-
由 JYChen 提交于
* add 0-d support for paddle.kthvalue * add 0-d support for paddle.mode * fix coverage test for device * fix check-bug in windows * change axis check from LT to LE * add shape & value check for grad when input is 0d tensor
-
由 HongyuJia 提交于
-
由 wangzhen38 提交于
-
由 seemingwang 提交于
* zero-tensor * remove unused * zero_dim_xpu * relocate * add value test * fix syntax
-
由 Yuanle Liu 提交于
-
由 Aurelius84 提交于
* [D2SCinn]Add test_cinn unittest and param_grad into skip_gc_vars * remove print
-
由 HongyuJia 提交于
* acquire custom_deivce stream * regulate file name and unittest
-
由 jiangcheng 提交于
* flatten op support 0D-tensor * add test in zero dim py * fix shape should be list * short code for ci-coverage * add backward test * simple code for ci coverage * add axis check * add 0D-tensor test in test_flatten_contiguous_range_op.py * add axis error test for Coverage CI * add more test for CI-Coverage * add more test for CI-Coverage
-
由 Wen Sun 提交于
* fix: fix hidden virtual funcs * fix: add default impl
-
由 Weilong Wu 提交于
-
由 Weilong Wu 提交于
-
由 MarDino 提交于
-
由 Thomas Young 提交于
-
由 Nyakku Shigure 提交于
-
由 张春乔 提交于
-
由 niuliling123 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 05 1月, 2023 8 次提交
-
-
由 Feiyu Chan 提交于
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
-
由 Siming Dai 提交于
* support 0D for paddle.sort/argsort * support 0D tensor for paddle.sort/argsort in xpu * fix bug * fix grad and add value assertion
-
由 xiaoxiaohehe001 提交于
-
由 Leo Guo 提交于
-
由 HappyHeavyRain 提交于
* generate the static graph code of ops * modify the isclose comment * modify the clip comment in nn.py * reset nn.py
-
由 sneaxiy 提交于
-
由 Zhang Jun 提交于
* update nearest_interp, expand_v2, cast for stable diffusion * update nearest_interp, expand_v2, cast for stable diffusion * correct shape rank * Update expand_v2_op.cc
-
由 Jianghai 提交于
* add pool flops * add annotations and tests
-