- 22 Mar, 2023 28 commits
-
-
Committed by joanna.wozna.intel
-
Committed by ShenLiang
-
Committed by Ghost Screaming
* Add fused_feed_forward pass for semi-automatic static graph training.
* Add fused_feedforward property in parallel_executor.cc
* Polish code.
* Polish fused feed_forward pass code. Support use_dropout1 and use_dropout2 option.
* Support model parallel in fused_feedforward pass.
-
Committed by Sławomir Siwek
* extract common methods to reuse
* add header for transpose ops
* fused_transpose
* Split big function
* transpose2 tests
* fused_transpose
* Apply extra attributes
* add pbtxt file
* update pbtxt
* Merge develop
* add more strict op compats
* code style
* remove mkldnn_data_type
* unify SetOutMemDescWithReshape2FuseSupport
* adjust quantize-dequantize for transpose
* remove appendact
* transpose2 quantization
* fix int8 tests
* adjust transpose_op to current develop
* delete fusion code from transpose_kernel
* add fused transpose to NHWC unittest
* change order
-
Committed by Bo Zhang
* test_logit_op
* add cudaKernel to replace eigen impl
* bf16 unit test CI
-
Committed by houj04
-
Committed by Zhang Zheng
* [AMP OP&Test] Fix fp16 check_grad when user_defined_grads are not None
* fix cond
-
Committed by HongyuJia
* [CustomOP Optional] CustomOP supports optional Tensor
* fix test_custom_concat, add pytest to CMakeLists
-
Committed by LoneRanger
* remove net_drawer.py
* remove memory_analysis.py
* remove test_memory_analysis.py
-
Committed by niuliling123
-
Committed by kangguangli
* fix raw_program_optimizer not apply when using amp
* fix CI
-
Committed by Zhang Ting
-
Committed by wangxiaoning
* max comp
* fix
* add test
* fix
* fix
* fix
* fix
* fix test
* fix api
-
Committed by sneaxiy
* add fused_linear_param_grad_add_kernel
* fix compile error
* remove flag
* fix ci compile error
* fix ci compile error
* revert pylayer revision
* fix ci ut
* improve performance
-
Committed by Yuanle Liu
-
Committed by iSerendipity
-
Committed by Jiabin Yang
* allow return none when stop_gradient=True
* remove useless code
* refine code
* refine code
* fix test cast
* change more test
* add more tests
-
Committed by Bo Zhang
* test_accuracy_op
* add create_test_fp/bf16_class
* cast after calculation
* change convert_uint16_to_float_ifneed
* delete TestAccuracyOpFp32 according to PR comment
* fix the rtol setting rules in bfloat16 forward
-
Committed by Zheng-Bicheng
-
Committed by Zhang Zheng
-
Committed by Zhang Zheng
-
Committed by Haohongxiang
-
Committed by zhaoyingli
* [AutoParallel] support bloom
* fix import
* align amp and bf16
* update func name
* clipbyglobalnorm and add_n support bf16
* upgrade amp strategy api
* update bf16 unittest
* fix static clip
---------
Co-authored-by: liangjianzhong <liangjianzhong@baidu.com>
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
-
Committed by Difer
-
Committed by kangguangli
* remove DataFeeder.feed_parallel and decorate_reader
* fix CI
* fix CI
-
Committed by Ainavo
* replace assert false with AssertionError
* modify the redundant parts of the configuration file
-
Committed by cyber-pioneer
* fix batch_norm custom_vjp dtype
* add nhwc test example
* fix typo
-
Committed by Yuang Liu
-
- 21 Mar, 2023 12 commits
-
-
Committed by Ainavo
-
Committed by HongyuJia
* [Fix typo] Fix typo error in grad_node_info.h
* fix varbase_patch_methods.py typo error
* fix more typo errors
-
Committed by ShenLiang
-
Committed by GGBond8488
* remove fluid.data
* fix typro
* fix somme unitest error
* fix conflicts
* fix sample code error
* fxi sample coder error
* fxi sample code error
* fxi sample code error
* fix xpu test error
* fix xpu test error
* Delete ps_pb2.py
* fix test error
* fix typro
* fix sample code error
* fix comments
* fix test norm op data
* fix sample code error
* fix conflicts
-
Committed by xiaoguoguo626807
* init
* close cinn
* close cinn
* add public_python_api
-
Committed by cyber-pioneer
* simplify batch_norm composite rule
* polish code
-
Committed by Charles-hit
* add public_python_api in prim test
* fix test_squeeze2_op
-
Committed by iSerendipity
* move DataType from paddle::experimental to phi
* convert namespace
* convert namespace
* convert namespace
* clarify namespace
* convert more datatype
* Revert "convert more datatype" This reverts commit 083b462959e6a22d4d8767707b628b95b396642e.
* convert more in auto_code_generator
* fix conflicts for XPU
* fix namespace conflicts
* fix errors
* Revert "fix errors" This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876.
* fix errors
* fix formatting
-
Committed by ShenLiang
* fix flash_attention
* Update mp_layers.py
-
Committed by Ainavo
-
Committed by Ainavo
-
Committed by ShenLiang
* set order for hybridparallel
* fix bug
* fix ->
* fix ->
* fix ->
* add topology
* fix utest
-