- 19 8月, 2021 8 次提交
-
-
由 lilong12 提交于
-
由 Peihan 提交于
* add slim resnet50 quant model in pr-ci-inference * enable resnet50_quant multi_thread4_trt_int8_bz1 * remove LOG(FATAL)
-
由 Yiqun Liu 提交于
Add dimension check for inverse to avoid dividing by 0 error when input's shape is [0, 0, 0]. (#34996)
-
由 ceci3 提交于
* fix batch_norm and instance norm when input is []
-
由 王明冬 提交于
-
由 tianshuo78520a 提交于
* notest;test=gpu-inference * notest;test=gpu-inference * notest;test=gpu-inference * notest;test=gpu-inference * fix error * notest;test=gpu-inference * notest;test=gpu-inference * notest;test=gpu-inference * test=gpu-inference
-
由 tianshuo78520a 提交于
-
由 Aurelius84 提交于
* add device_context * add gtest for device_event_gpu * Remvoe duplicate DeviceType * push for test * add unittest * fix macros * fix MSVC using usage
-
- 18 8月, 2021 19 次提交
-
-
由 lzzyzlbb 提交于
* [npu]add rmsprop op
-
由 xiongkun 提交于
* Add NPU kernel for norm Op: float16 and float32 * fix code for code review * fix for code review * add type for paddle_throw * remove unnecessary head file.\nAdd more testcase * remove a broadcast
-
由 littletomatodonkey 提交于
* fix pad outliers err * fix pad api input type and doc * fix example of pad * add unittest for pad3d * fix unittest * fix error format * fix pad doc
-
由 wanghuancoder 提交于
* code refactoring, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
由 Peihan 提交于
-
由 Jackwaterveg 提交于
* test=develop * test=develop
-
由 Jackwaterveg 提交于
* test=develop * test=develop
-
由 WangXi 提交于
[Hybrid Performance] Move the cast op of AMP which cast fp32 param to fp16 param to the optimizer (#34965)
-
由 ronnywang 提交于
-
由 Fan Zhang 提交于
[CPU-PSLIB] Add consistency insepection of use_var_list and data_generator data, test=develop (#34463)
-
由 XGZhang 提交于
-
由 Chen Weihang 提交于
* fix ext_tensor.cast failed bug * remove useless deps * fix windows cmake failed * try to fix windows make failed * fix make error on windwos
-
由 Zhanlue Yang 提交于
* Add function to disable paddle signal handler Paddle used google::InstallFaultSignalHandler to handle selected system signals, mainly for debugging and bug report purposes. However, this can be conflicted with other python packages whoever captures similar signals. Such python package involves tvm and more To resolve this issue, we support a function to disable signal handler * Remove signal test from WIN32 platform * Remove redundant return from disable_signal_handler() function * Add detailed messages to en_doc
-
由 lilong12 提交于
-
由 WangXi 提交于
-
由 wawltor 提交于
-
由 Leo Chen 提交于
* add retry for HcclGetRootInfo * refine code * reduce retry interval
-
由 Guoxia Wang 提交于
* support class center sample of PartialFC
-
由 Wangzheee 提交于
* unitest_quant_dequant * fix * fix * deleted: test_trt_quant_conv2d_dequant_fuse_pass.py * fix
-
- 17 8月, 2021 13 次提交
-
-
由 Roc 提交于
-
由 Aganlengzi 提交于
-
由 tianshuo78520a 提交于
* fix op-benchmark * test=document_fix
-
由 chentianyu03 提交于
* copy boost optional.hpp to paddle * copy boost optional.hpp to paddle * move directions * del fluid/utils * modify .hpp to .h * move directions * modify to paddle::optional * add modification description * format code stype for the files in paddle/utils * format code stype
-
由 Jacek Czaja 提交于
* - disabled caching of layer norm - fix in compilation - compilation fix - transpose caching disabled - compilation fix - more compilation fixes - sum caching disabled - compilation fix * - LRN with disabled cache * lint fixes
-
由 chentianyu03 提交于
* add exclude rules of pre-commit to paddle/utils and third_party * remove exclude direction distributed/third_party * remove exclude of paddle/utils for format cpplint check
-
由 WeiXin 提交于
* polish unittest. * polish code * polish code
-
由 shangliang Xu 提交于
* [bug fix] fix unfold negative_size_param
-
由 Peihan 提交于
* add mkl multi-thread test cases * fix codestyle * fix codestyle & enable ernie mkl test
-
由 Hui Zhang 提交于
* dygraph support more ctc grad scale * scale for 1.x * fix unitest * fix unitest * format code * fix unittest * fix log info * unittest cov * fix format;notest,test=cpu,coverage * skip ctc_loss egs;test=cpu * warpctc grad cov;test=coverage * add dygraph test;test=coverage * format;test=cpu,coverage * format;test=cpu * add api compat;test=cpu * add cpu test * rename * rename * fix * fix test * format * eigen cpu * eigen gpu grad pass * cuda gpu pass * format * fix ci
-
由 Zeng Jinle 提交于
* add inplace passes and tests * update * fix use_cuda undefined fix compile error of op compat * add more ut * fix CPU CI error * check adam unique * fix mac/windows ci, improve coverage * fix ci error * follow weihang's comment * fix BlockDesc::MoveFrom * follow qiuliang's comment * update * follow huihuang's comments
-
由 zhiboniu 提交于
-
由 Kaipeng Deng 提交于
* fix drop_last not work in IterableDataset. test=develop
-