- 24 Jun, 2022 1 commit
-
-
Committed by Yulong Ao
* [Auto Parallel] Use a fast completion for data parallelism * remove unuse cuSparse function * [Auto Parallel] Fix some bugs of the fast dp completion * [Auto Parallel] Add the cmake statements * [Auto Parallel] Make the unittest adapt to the new interface * [Auto Parallel] Modify the timeout of the unittest * [Auto Parallel] Remove unnecessary comments Co-authored-by: zhouwei25 <zhouwei25@baidu.com>
-
- 23 Jun, 2022 14 commits
-
-
Committed by niuliling123
-
Committed by Matsumoto Ruko
-
Committed by taixiurong
-
Committed by Leo Chen
-
Committed by zyfncg
* move trace into api.yaml * add trace unittest * fix trace test * fix generate op
-
Committed by Nyakku Shigure
* rewrite all code examples, test=document_fix * refine arguments, test=document_fix * fix desc format error, test=document_fix * capitalize the first letter, test=document_fix * refine api desc, test=document_fix * fix wrong COPY-FROM label in Model docs, test=document_fix * refine returns, test=document_fix * refine returns, test=document_fix * add a blank line in code block, test=document_fix
-
Committed by Nyakku Shigure
* add outputs for code examples, test=document_fix * missing code example update, test=document_fix * missing returns, test=document_fix * fix the case of proper nouns, test=document_fix * fix SqueezeNet code example (add a blank line), test=document_fix * fix ShuffleNetV2 args style, test=document_fix * missing paper links, test=document_fix * missing `optional`, test=document_fix * refine parameter `pretrained`, test=document_fix * add parameter `kwargs`, test=document_fix * refine parameter `num_classes`, test=document_fix * capitalize the first letter, test=document_fix * missing output of code example, test=document_fix * fix wrong api label, test=document_fix * refine returns, test=document_fix * add missing `.` after paper link, test=document_fix * remove extra `:`, test=document_fix * :math:\`0.0\`, test=document_fix
-
Committed by zhangbo9674
-
Committed by Aurelius84
* [Dy2Stat]Support nonlocal mechanism in IF ast transformer * support prune return vars in cond * fix unittest * fix unittest * fix static check
-
Committed by ccrrong
* add cast trt converter
-
Committed by Shijie
-
Committed by Shijie
* Fix test_fuse_resnet_unit failure * Fix test_imperative_auto_mixed_precision failure * Fix sparse_attention_op error * Fix sparse_attention_op error
-
Committed by zlsh80826
* Reduce gather op unit tests size and increase the timeout * Add NVIDIA_TF32_OVERRIDE for multi-processes environment * Remove record test for device event ut
-
Committed by Sylwester Fraczek
* sylwek prototype params to int8 pass * trying to make warmup work * wip * wip * change test to cpp test * review fixes, refactoring * more refactoring * add erasevars * change test to fixture * rename pass and reorder erasevars and graphsaferemovenodes * fix * more refactoring and fixed bug * formatting * remove scale count * enfroce message too short * remove erasevars erasevars couldbe cauuse of memory issues some other fixes * add count of successfull fuses to name of new nodes * FindVar -> GetVar and use ConvResidual pattern * use tensor->clear() instead of new variable * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc Co-authored-by: Tomasz Socha <tomasz.socha@intel.com> * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc Co-authored-by: Tomasz Socha <tomasz.socha@intel.com> * Update paddle/fluid/inference/tests/api/analyzer_lexical_analysis_gru_tester.cc Co-authored-by: Tomasz Socha <tomasz.socha@intel.com> * add log (review fix)c * review fix (2 functions to one) * code review: Conv->QuantizeConv * revert * fix formatting * remove unused functions * add paddle enforce Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
-
- 22 Jun, 2022 10 commits
-
-
Committed by sneaxiy
-
Committed by ccrrong
* fix arg_max converter
-
Committed by Jackwaterveg
* improve MSRAInitializer * improve the doc
-
Committed by WJJ1995
* fixed multihead matmul fuse pass * Add unittests * rm scale op * fixed code style * fixed code style * resolve testcase falied * add note
-
Committed by Feiyu Chan
* add codegen for get_expected_kernel, add argument mapping for selected_rows kernels, fix other bugs in codegen for operators. * move bernoulli, erf, mv, poisson, trunc, erf to api.yaml and corresponding backward api to backward.yaml * generate EmptyGradOpMaker for ops without grad op * add code to generate all possible kernel signatures for infrt
-
Committed by heliqi
-
Committed by zhoutianzi666
* add fc, multihead_mul, shape tensor infer, slice
-
Committed by zhangkaihuo
-
Committed by xiongkun
Therefore, instead of N forward passes sharing a single vec scope, each of the N forward passes now uses its own vec scope.
-
Committed by tianshuo78520a
* test=gpups
-
- 21 Jun, 2022 15 commits
-
-
Committed by Guanghua Yu
-
Committed by gongweibao
-
Committed by Yiqun Liu
-
Committed by Zhen Wang
* Fix some bugs about the usage of cinn lib. * Fix some unit test errors.
-
Committed by Yao Zihang
-
Committed by cifar10
-
Committed by Chenxiao Niu
-
Committed by zhangbo9674
* refine layer to * refine code * add ut
-
Committed by cifar10
-
Committed by cifar10
-
Committed by zhaoying9105
-
Committed by Sławomir Siwek
* consolidate conv act passes * generalize conv_activation * integrate conv+act tests * code style format * whitespaces * remove timeout from old tests * implement comments from review * restore ut * whitespace * code style * transpose * fixes after review * method for gettin act * Change Paddle_enforce error type * code format * add missing opcompats
-
Committed by fwenguang
-
Committed by cambriconhsq
* [MLU] add mlu kernel for elementwise_max_grad * [MLU] modify mlu kernel elementwise_min_grad impl
-
Committed by Allen Guo
* add argmin and argsort ops (#800) * add argmin and arsort ops * Add dot bmm ops (#803) * add bmm * add dot op * clean CreateConst * clean CreateCast * add activation ops (#808) * add activation ops * fix 1function-redefined error
-