- 14 Apr, 2023 8 commits
-
-
Committed by Feiyu Chan
1. Modify the set_value op to use Scalars to represent the attr `values`, instead of a bunch of attributes of various types; (#52408)
2. Add a program converter, with the set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats (the differences are mainly some operators with incompatible updates in their definitions);
3. The program version and operator version map are now always saved when serializing `paddle::framework::ProgramDesc`, to identify the version;
4. Provide an option `legacy_format=false` in the serialization of `paddle::framework::ProgramDesc`, which decides whether to convert the ProgramDesc back to a legacy format that paddle 2.4.2 or earlier versions can load and execute;
5. Deserialization of `paddle::framework::ProgramDesc` now automatically detects whether the bytes it receives are in legacy format (contain any of the operators that have been incompatibly updated and have any attribute of type `Scalar`) and converts them to the new format. If you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. Though it is not recommended, it can be used for testing.
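The conversion between the old and new attribute layouts can be pictured with a toy model. This is only a sketch: plain dicts stand in for operator descriptions, and the attribute names and the helpers `to_new_format` and `to_legacy_format` are hypothetical, chosen for illustration rather than taken from Paddle's program converter.

```python
# Illustrative toy: dicts stand in for operator descriptions. The attribute
# names and helpers below are assumptions for this sketch, not Paddle's API.
TYPED_ATTRS = {bool: "bool_values", int: "int64_values", float: "fp32_values"}


def to_new_format(op):
    """Merge the per-type value attributes into a single `values` list."""
    attrs = dict(op["attrs"])
    values = []
    for name in TYPED_ATTRS.values():
        values.extend(attrs.pop(name, []))
    attrs["values"] = values
    return {"type": op["type"], "attrs": attrs}


def to_legacy_format(op):
    """Split `values` back into one attribute per Python type."""
    attrs = dict(op["attrs"])
    values = attrs.pop("values", [])
    for py_type, name in TYPED_ATTRS.items():
        # bool is a subclass of int, so compare exact types.
        attrs[name] = [v for v in values if type(v) is py_type]
    return {"type": op["type"], "attrs": attrs}


legacy_op = {"type": "set_value", "attrs": {"axes": [0], "fp32_values": [1.5, 2.0]}}
new_op = to_new_format(legacy_op)
round_trip = to_legacy_format(new_op)
```

In this toy, converting to the new layout and back preserves the values, which is the property a `legacy_format` round trip would need.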
-
Committed by Jiabin Yang
* add more infer var type * fix split error * fix ut * fix top_k infer vartype * fix top_k infer vartype
-
Committed by zhupengyang
-
Committed by duanyanhui
-
Committed by 骑马小猫
* support uint16 python op in d2s * convert uint16 -> bfloat16 in docstring
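For context on why uint16 and bfloat16 appear together: a bfloat16 value is the upper 16 bits of the corresponding float32 encoding, so it fits in a uint16 container. The sketch below assumes that is the relationship this commit relies on. The helper names are hypothetical, and the conversion truncates rather than rounds.

```python
import struct


def float_to_bf16_bits(x):
    """Return the upper 16 bits of the float32 encoding of x (truncation, no rounding)."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    return bits >> 16


def bf16_bits_to_float(b):
    """Expand a 16-bit bfloat16 pattern back to a Python float."""
    (x,) = struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))
    return x


one_bits = float_to_bf16_bits(1.0)
restored = bf16_bits_to_float(float_to_bf16_bits(-2.5))
```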
-
Committed by Kim Yann
-
Committed by Yiqun Liu
* Unify the static amp codes of fp16 and bf16. * Polish apis and add unittest. * Add operator stats collecting tools for program. * Add the check of number of bfloat16 operators in unittest. * Add warning for operator not supported for amp. * Add testing of BF16 O1 and O2.
-
Committed by ronnywang
-
- 13 Apr, 2023 20 commits
-
-
Committed by Wangzheee
* Paddle-Trt: Replace fc, mul, matmul, and matmul_v2 with matrix_multiply
-
Committed by superwinner1
* No55 add channel_shuffle FP16/BF16 support and tests
-
Committed by Difer
* add_fp_bf_for_dot & bf_for_cross * fix error * fix some error * fix some error * change something * fix magic number
-
Committed by Zhang Zheng
* [AMP OP&Test] Support fp16&bf16 in reduce_max
-
Committed by zhoutianzi666
* fix bilinear_interp_v2 && some other bugs in trt 7011 * add version check in test_trt_convert_bilinear_interp_v2.py
-
Committed by niuliling123
-
Committed by chenxujun
-
Committed by Zman
* Add GaussianNLLLoss API. * Change `rtol` `atol`. Check `var` in dynamic graph * remove assertTrue * update unittest * update unittest for ci-coverage. add broadcast with same dim. * Supply static err print. * Repair note and example. * Split unittest. * empty commit. * for standard commit. * for standard commit. * Add int dynamic graph test. * Repair parameters name. * Repair unittest parameters name. * Repair unittest parameters name * Repair unittest parameters name * Repair unittest parameters name * add square in code-block * fit few notes. * fit few notes. * fit few notes. * fit few notes. * add few interpretations. * add few interpretations. * add few interpretations. * fix import. * fix space. * empty commit for ci.
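As background for the GaussianNLLLoss entry, the standard per-element Gaussian negative log-likelihood can be sketched in plain Python. This shows the textbook formula only. The function name, the `full` and `eps` parameters, and the variance clamping are assumptions of this sketch, not a description of the code in the commit.

```python
import math


def gaussian_nll(mean, target, var, full=False, eps=1e-6):
    """Gaussian negative log-likelihood for one scalar prediction."""
    var = max(var, eps)  # clamp the variance for numerical stability
    loss = 0.5 * (math.log(var) + (mean - target) ** 2 / var)
    if full:
        # Optional constant term of the Gaussian log-density.
        loss += 0.5 * math.log(2 * math.pi)
    return loss


example = gaussian_nll(1.0, 3.0, 4.0)
```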
-
Committed by Sanbu
-
Committed by chenxujun
-
Committed by TaoTao Li
* add auto parallel tuner options in launch * add ut for launch in auto_parallel tuner fix code format * fix ci-coverage
-
Committed by Vvsmile
* adjust test_elementwise_sub's tolerance, max_relative_error of grad and atol/rtol of output to 1e-3 * fix the dtype in setUp * fix the elementwise_sub optest * modify elementwise_sub optest * fix and add bf16/fp16 to elementwise_sub * fix elementwise_sub bugs * fix bugs * fix elementwise_sub op * fix the data type * fix elementwise_sub * fix elementwise * fix elementwise_sub * fix bugs * fix elementwise sub * fix elementwise_sub * remove scalar and vector
-
Committed by zhupengyang
-
Committed by WangZhen
* Fix _param_grad_names when like 'param@GRAD@GRAD'
-
Committed by Guoxia Wang
-
Committed by jiangcheng
* [CINN] optest add cinn check test * replace set self.check_cinn to pass check_cinn by function parameter * fix ci bug * add cinn atol/rtol
-
Committed by zhangyuqin1998
-
Committed by lzydev
* fix bug in cross_entropy in static mode * fix ci-coverage
-
Committed by Kim Yann
* rem cncl in new test * rem cncl in build sh * rem cncl in old test
-
Committed by gaoziyuan
Co-authored-by: NZhang Jun <ewalker@live.cn>
-
- 12 Apr, 2023 12 commits
-
-
Committed by ShenLiang
-
Committed by Yulong Ao
* [Auto Parallel] Speedup the completion process * [Auto Parallel] Skip the property of dist_context when deepcopying * [Auto Parallel] Remove the unnecessary print * [Auto Parallel] Move some changes from 2.4 branch to develop * Update engine.py * [Auto Parallel] Fix a bug
-
Committed by liuruyan
-
Committed by 张春乔
* remove c_comm_init_hccl_op.cc and c_gen_hccl_id_op.cc * remove gen_hccl_id_op.cc
-
Committed by CHANGer
-
Committed by Aurelius84
-
Committed by qizhaoaoe
* fix dtype cast in amp. * add test case and update docs. * remove set_prim.
-
Committed by gaoziyuan
-
Committed by Wei Shengyu
* add bf16 support and bf16/fp16 unittest for pool2d * add include files * dbg * reformat * reformat * modify code according to review comment * remove duplicate code * remove dup code * remove useless include * dbg
-
Committed by RedContritio
* move python/paddle/fluid/tests/unittests/xpu to test/xpu * update CMakeLists.txt * remove xpu in fluid/tests/unittests/ * add path to op_test_xpu * fix incorrect path * update test script * fix test_adadelta_op_xpu error
-
Committed by Guoxia Wang
* [AMP OP&Test] support bf16 for batchnorm * codestyle * Update batch_norm_grad_kernel.cu * Update batch_norm_kernel.cu * fix codestyle * fix * fix * fix * fix * fix * Update batch_norm_kernel.cc
-
Committed by Huihuang Zheng
* [Do NOT merge] Expr PR on Composite * Expr PR on Composite * Revert some composite experiment * Remove unnecessary composite code * Add rsqrt as sub primitives
-