- 12 1月, 2023 17 次提交
-
-
由 Wen Sun 提交于
* refactor: migrate comm checks * refactor: add check in comm context * feat: add gloo static check * refactor: add place param in static check
-
由 zhangkaihuo 提交于
-
由 jameszhang 提交于
* Fix reduce func bug in process_group_bkcl Also catch up with a recent process_group PR that failed to add XPU branch. Note that reduce is still accomplished by allreduce for xpu. Fix this should xccl lib be updated. * fix compile issue for non-XPU
-
由 YuanRisheng 提交于
-
由 tianshuo78520a 提交于
* test=ninja;test=document_fix * test=ninja;test=document_fix * test=ninja;test=document_fix * add ninja * update dockerfile * update dockerfile * update dockerfile * update dockerfile * update dockerfile * test=cuda117 * update ce dockerfile * update ce dockerfile
-
由 gem5 提交于
-
由 xiaoxiaohehe001 提交于
-
由 wenbin 提交于
* compile fix * fix compile * compile fix * add more preln
-
由 FlyingQianMM 提交于
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss * [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor * [Zero-Dim] support input 0D Tensor for fmax,fmin,complex
-
由 wangzhen38 提交于
-
由 Leo Guo 提交于
xpu2_op_list.cc. test=kunlun
-
由 Yuanle Liu 提交于
-
由 jiangcheng 提交于
-
由 YuanRisheng 提交于
* rename kernel * delete sig * modify code according comment * fix ci bugs
-
由 yuehuayingxueluo 提交于
-
由 zhaoyingli 提交于
* recovery annotation * bugfix
-
由 Nyakku Shigure 提交于
* remove Travis CI build status badge from README.md * empty commit, test=document_fix
-
- 11 1月, 2023 18 次提交
-
-
由 Wen Sun 提交于
-
由 whs 提交于
* Add tools for quantization-aware training 1. Expose an API named paddle.quantization.QAT 2. Define a wrapper class to insert quanters into model for QAT 3. Add some functions in QuantConfig for QAT 4. Add unittest for QAT * Add QuantedConv2D and QuantedLinear for QAT * Add paddle.nn.quant.qat to setup.py
-
由 Wen Sun 提交于
-
由 Yiqun Liu 提交于
* Implement a common PointerArray. * Polish codes. * Add including of header file. * Add the branch of kFix8. * Fix compiling error. * Add alignas hint to fix the performance drop. * Optimize the H2D copy in stack_grad. * Rename the macro. * Fix align hint for different compilers. * Polish the define of PADDLE_ALIGN. * Fix compiling error. * Remove the align hint on windows.
-
由 zhangxin81 提交于
* fix paddle_infer_contrib include
-
由 niuliling123 提交于
-
由 Aurelius84 提交于
* [D2SCinn]Fix self.infer_program always build cinn pass without cache * fix infer op size
-
由 kangguangli 提交于
-
由 yuehuayingxueluo 提交于
* add FusedLinear pass * add fused_op_list and renname PASSES to OP_FUSION * add fused_passes_list to constants.py * add test_passes.py * fix test_fused_passes.py * fix add if float(paddle.version.cuda()) >= 11.6: * renamed test_fused_passes.py * fix CMakeList.txt
-
由 duanyanhui 提交于
-
由 wangzhen38 提交于
-
由 wenbin 提交于
* compile fix * fix compile * compile fix
-
由 Wangzheee 提交于
-
由 Ryan 提交于
* add enable_to_static and drop some methods of ProgramTranslator * fix code style * fix cant import enable_to_static and update unitest * change unitest and rollback code of PT * fix can't import as of utils * roll back PT * fix roll back * add some unitest * add unitest and fix codestyle bug in api.py * finish all unitest * remove ProgramTranslator * fix code style * restore test_program_translator * api.py remove get_func * TestDygraphToStaticCode * fix check_type and import err * roll back PT without getcode * roll back pt with get_code * convert_to_static * fix import __all__
-
由 Linjie Chen 提交于
-
由 Linjie Chen 提交于
* fix nll_loss * fix nll_loss * update * update * update * fix
-
由 houj04 提交于
-
由 姜永久 提交于
* rm retain_grad_flag for tests * modify transpose op * retain grads for xpu tests * lint * modify xpu test
-
- 10 1月, 2023 5 次提交
-
-
由 limingshu 提交于
* add stack grad kernel optimization * add basic optimization kernel for stack_grad_kernel * optimization of stack_grad_kernel for last dim stack and change code format with pre-commit
-
由 Wen Sun 提交于
* refactor: gloo comm context migration * fix: headers & avoid mutable_data usage * fix: cmake gloo dep * style: rename funcs * refactor: move to new files * fix: gloo deps * refactor: simplify create device
-
由 FlyingQianMM 提交于
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss * [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor
-
由 risemeup1 提交于
-
由 Xiaoxu Chen 提交于
-