- 09 4月, 2020 1 次提交
-
-
由 mozga-intel 提交于
* Remove the NGraph engine from PDPD repository 1. Each operator was removed from the operator's directory 2. Each test was removed from the unittest directory 3. The parallel executor support was removed from the PDPD 4. The CMake file was removed from the PDPD 5. The NG flags were removed from the repository test=develop * Remove ngraph from: 1. Cmake file 2. Python file test=develop
-
- 08 4月, 2020 1 次提交
-
-
由 石晓伟 提交于
* change the cmake and apis of lite engine, test=develop * change the cmake of lite engine, test=develop
-
- 03 4月, 2020 1 次提交
-
-
由 channings 提交于
* update linspace, equal operators to API 2.0, test=develop * equal support higher performance CUDA kernel, test=develop * update comment of equal&linspace operator, test=develop * update comment of equal&linspace operator, test=develop
-
- 01 4月, 2020 1 次提交
-
-
由 Adam 提交于
-
- 26 3月, 2020 2 次提交
-
-
由 Zhaolong Xing 提交于
* add dynamic plugin support. test=develop * change emb eltwise layernorm to math function test=develop * add emb eltwise layernorm test=develop * can run dynamic shape ernie test=develop * fix ci test=develop * add ut for trt ernie dynamic test=develop * refine dynamic shape c++ interface. test=develop * fix comments test=develop * fix comments test=develop
-
由 xujiaqi01 提交于
* get pslib version * test=develop
-
- 11 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* 1. add embedding eltwise layernorm fuse 2. add embedding eltwise layernorm op 3. refine inplace_add_relu 4. refine fc_eltwise_layernorm test=develop * 1. refine fc test=develop * fix comments test=develop * fix comments test=develop
-
- 03 3月, 2020 1 次提交
-
-
由 石晓伟 提交于
* change the function in op_teller, test=develop * correct the commit-id, test=develop
-
- 02 3月, 2020 1 次提交
-
-
由 zhou wei 提交于
fix bug that sourcecode of third_party can't be cached correctly,and add cache for xbyak and openblas (#22772)
-
- 28 2月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 25 2月, 2020 1 次提交
-
-
由 hutuxian 提交于
* Add two types of Metric Calculator: MultiTaskCalculator & CmatchRankCalculator. * Add a config for DynamicAdjustChannelNum function to denote whether we will discard the remaining instances when they are not be distributed evenly. * Remove CPU code in Pull/PushSparse and we will add it back when testing it fully. * Fix some known issues: such as copying persistable vars after one epoch running.
-
- 18 2月, 2020 1 次提交
-
-
由 zhouwei25 提交于
-
- 17 2月, 2020 1 次提交
-
-
由 Adam 提交于
-
- 14 2月, 2020 1 次提交
-
-
由 flame 提交于
* support golang inference
-
- 13 2月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 12 2月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 11 2月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
* update * update test=develop * update compile set test=develop * update compile set test=develop * update test=develop * update test=develop * update test=develop * update compile setting test=develop * update compile setting test=develop * update run demo test=develop * update test=develop * update test=develop * fix test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update format test=develop * update format test=develop * update style test=develop * update style test=develop * change style test=develop * change style test=develop * change style test=develop * add dataset unittest test=develop * update test=develop * update for record test=develop * udpate style for record test=develop * update for record test=develop * update for record test=develop * update for record test=develop * fix format test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop
-
- 04 2月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 20 1月, 2020 1 次提交
-
-
由 Wilber 提交于
* modify lite commit id to support var_conv_2d cascade. test=develop * modify lite commit id. test=develop
-
- 19 1月, 2020 1 次提交
-
-
由 石晓伟 提交于
* update external lite, test=develop * switch WITH_TESTING to OFF, test=develop
-
- 15 1月, 2020 1 次提交
-
-
由 Wilber 提交于
修改了依赖lite的commit id:lite支持了var_conv_2d的级联使用
-
- 14 1月, 2020 4 次提交
-
-
由 Leo Chen 提交于
* fix compile issue, test=develop * force link libiomp5 when mklml is enabled, test=develop
-
由 silingtong123 提交于
* remove the useless third_party library from C++ inference library * revert removing the install directory
-
由 zhouwei25 提交于
faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
-
由 xujiaqi01 提交于
* add collective communication library in fleet to replace mpi * test=develop
-
- 11 1月, 2020 1 次提交
-
-
由 Wilber 提交于
- 添加了fluid-lite子图方式运行resnet的单测 - 修改了依赖Lite的git commit id
-
- 10 1月, 2020 2 次提交
-
-
由 Zhen Wang 提交于
* add bn and relu fuse pass * add op attr assert and dtype assert * fix some inputs&&outputs bugs for the fused op and pattern. * add the unittest for fuse_bn_act_pass. test=develop * use normative enforce statements. test=develop * add the cpu test. test=develop * add the support of batch_size=1 for the bn with relu op. test=develop * add the error type for paddle throws. test=develop * add fused_batch_norm_act and fused_batch_norm_act_grad to op_has_unsed_vars_white_list. test=develop
-
由 baojun 提交于
-
- 09 1月, 2020 2 次提交
- 06 1月, 2020 1 次提交
-
-
由 Adam 提交于
-
- 04 1月, 2020 1 次提交
-
-
由 Adam 提交于
-
- 03 1月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
* Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc. test=develop * Call CUDA driver api to launch the kernel compiled by nvrtc. test=develop * Disable for mac and windows. test=develop * Refine the codes to support manually specified num_threads and workload_per_thread. test=develop * Refine the CUDA kernel to support large dims. test=develop * Add DeviceCodePool to manage all device codes. * Add the first implementation fusion_group op. * Add unit-test for fusion_group op. * Add the check of result. * Add the check of nvrtc in unit-test. test=develop * Add comment to explain the inputs, outputs and features of fusion_group op. test=develop * Disable fusion_group op for mac and windows. test=develop * Make the compiling of device code return status instead of hanging up. test=develop * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API. * Unify fusion_group_op's input and output names. test=develop * Add the check of CUDA driver library in unittest. test=develop * Refine the calling of PADDLE_ENFORCE. test=develop
-
- 26 12月, 2019 3 次提交
- 25 12月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 24 12月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 16 12月, 2019 2 次提交