- 09 Jul, 2019 1 commit
-
-
Committed by LielinJiang
* fix transform matrix bug, test=develop
* modify API.spec
-
- 08 Jul, 2019 8 commits
-
-
Committed by Zhaolong Xing
* Fix Mask rcnn predictor
  1. refine memory optim algorithm to support the model with the block op.
  2. output diff: modify the affine channel fuse
  3. add condition_block_infer op
  add interface for setting trt calib table dir
  test=develop
* add the missing files. test=develop
-
Committed by 石晓伟
* update anakin-engine interfaces for content-dnn test=develop
* support only-gpu mode of Anakin modify eltwise parse test=develop
* modification for thread-safe test=develop
* Integrated template instance test=develop
* increase template parameters test=develop
* support MLU predictor test=develop
* update anakin cmake files test=develop
* update TargetWrapper::set_device
* update the initialization of anakin subgraph test=develop
* use the default constructor of base class test=develop
* load model from buffer with length test=develop
* modify the access level of class test=develop
* support anakin for bitmain arch test=develop
* remove files
* checkout cmakelists test=develop
-
Committed by tianshuo78520a
-
Committed by Leo Zhao
-
Committed by gongweibao
-
Committed by guru4elephant
remove benchmark folder, since there is a benchmark repo already, distributed benchmark will be maintained in fleet repo (#18537) test=develop
-
Committed by guru4elephant
* add random port
-
Committed by Tao Luo
* add mkldnn shapeblob cache clear strategy test=develop
* refine with comments test=develop
* make cache clear strategy safer test=develop
* add lock for GetShapeBlobSize test=develop
-
- 07 Jul, 2019 1 commit
-
-
Committed by chengduo
test=develop
-
- 05 Jul, 2019 5 commits
-
-
Committed by zhaoyuchen2018
* Fix topk cannot handle 1D vector bug
  Add path to handle 1D vector
  test=develop
  Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
* refine code
  test=develop
  Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
Committed by 石晓伟
* remove the obsolete cmake options, test=develop
* remove unittests, test=develop
* delete options in paddle/scripts/paddle_build.sh
-
Committed by Jiabin Yang
* test=develop, fix docker with paddle nccl problem
* test=develop, hide no_support api and add ut for it
-
Committed by LielinJiang
* add_distributions_of_normal_and_uniform
* paddle/fluid/API.spec
* modify API.spec
* modified paddle/fluid/API.spec, test=develop
* modify paddle/fluid/API.spec, test=develop
* modify paddle/fluid/API.spec, test=develop
* fix some comment, test=develop
* modify API.spec, test=develop
* add comment for init function, modify hard code, test=develop
* modify API.spec, test=develop
* modify API.spec, test=develop
* make unit test function shorter, test=develop
* modify paddle/fluid/API.spec
-
Committed by bingyanghuang
-
- 04 Jul, 2019 5 commits
-
-
Committed by tensor-tang
remove pycpuinfo deps in core
-
Committed by qingqing01
* Refine Infershape in activation_op for double_grad.
-
Committed by qingqing01
* make it support float/int learning as input.
-
Committed by chengduo
-
Committed by chengduo
* enhance execution error info test=develop
-
- 03 Jul, 2019 13 commits
-
-
Committed by 石晓伟
* remove the obsolete cmake options, test=develop
* remove unittests, test=develop
-
Committed by pkpk
test=develop
-
Committed by Jiabin Yang
-
Committed by Tao Luo
test=develop
-
Committed by Tao Luo
test=develop
-
Committed by zhoukunsheng
-
Committed by adaxi123
* Adding communication part in readme, test=develop
* modify_readme_contents,test=develop
* Update README_cn.md
-
Committed by zhoukunsheng
* test=develop support Tensor input for chunk_eval op
* test=develop fix testcase for chunk_eval op
* test=develop fix typos in nn.py
-
Committed by zhoukunsheng
-
Committed by zhoukunsheng
-
Committed by zhoukunsheng
-
Committed by zhoukunsheng
-
Committed by hutuxian
move the thread-check code from train_from_dataset to a single function
add UT for the thread check function
-
- 02 Jul, 2019 7 commits
-
-
Committed by Leo Zhao
* rename mkldnn set/get_cur_thread_id() to set/get_cur_mkldnn_session_id() test=develop
* update session id definition and adjust logic for default behavior test=develop
* reset logic in mkldnn reuse as most of cases work in default. test=develop
-
Committed by Zeng Jinle
-
Committed by Kaipeng Deng
-
Committed by Tao Luo
test=develop
-
Committed by Yi Liu
1. Since the allreduce op has 4 reduce types, we split these four reduce types into four ops.
2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and removed the device-specific DeviceContext parameter in the template, as we already know the target DeviceContext.
3. We removed the newly added Collective op role to reduce the complexity of program and graph analysis.
-
Committed by tianshuo78520a
* fix the problem that the api.spec file does not get the class comment
* cat new.spec
* check api.spec
* test=develop
-
Committed by guru4elephant
make fleet support mpi job submit directly.
-