- 04 3月, 2020 2 次提交
-
-
由 Zeng Jinle 提交于
* add recorded cuda memory apis, fix typo, test=develop * add more ut, test=develop * follow comments, test=develop * fix py35 incompatible issues, test=develop
-
由 石晓伟 提交于
* encapsulate the PaddleTensorToLoDTensor, test=develop * serialize the pd_tensor, test=develop * serialize tensors to file, test=develop
-
- 03 3月, 2020 3 次提交
-
-
由 Zhang Ting 提交于
-
由 Zhang Ting 提交于
* add fluid.device_guard to specify the device type for Op
-
由 石晓伟 提交于
* change the function in op_teller, test=develop * correct the commit-id, test=develop
-
- 02 3月, 2020 7 次提交
-
-
由 Zhen Wang 提交于
* update ScopeBufferedSSAGraphExecutor&AsyncSSAGraphExecutor&ThreadedSSAGraphExecutor&FastThreadedSSAGraphExecutor&ParallelSSAGraphExecutor&ParallelExecutor for fetching unmerged results. * add the unit test for fetch_unmerged. * update ut for multi-card and multi-cpu. * add the error message and the user suggestion in FetchOpHandle. test=develop
-
由 wangchaochaohu 提交于
-
由 zhongpu 提交于
-
由 Chen Weihang 提交于
* add lodtensor share memory & serialization, test=develop * fix windows compile error, test=develop * deal vartype pickle & fix unittest matching error message, test=develop * update timeout variable name, test=develop * refactor memory map implement, test=develop * clear mmap file discripter when exit unexpectedly, test=develop * remove the child process fd in advance, test=develop * remove mmap fds after Queue.put in child process, test=develop * add hard unittests for register exit func, test=develop * fix python2 compatibility problem in unittest, test=develop * fix exception unittest error, test=develop * polish code based review comment, test=develop
-
由 liu zhengxi 提交于
* fix inference c api lod, test=develop * fix capi lod problem and enrich tests, test=develop * delete useless header files and alter const_cast, test=develop
-
由 wangchaochaohu 提交于
* add profiler_help.h to refine the code test=develop
-
由 hutuxian 提交于
* user can call dataset.set_download_cmd to set its customized download cmd * add UT to cover this scenario
-
- 01 3月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* Add the codegen and auto fusion for sum Op in fusion group
-
- 28 2月, 2020 2 次提交
-
-
由 tianshuo78520a 提交于
-
由 Kaipeng Deng 提交于
-
- 27 2月, 2020 3 次提交
-
-
由 zhaoyuchen2018 提交于
* Refine adam op, test=develop * Fuse kernels together to reduce cpu time. * Refine paddle enforce, test=develop * Remove some comments, test=develop * Refine code,test=develop * Refine cuda kernel, test=develop * Refine code according to comments, test=develop
-
由 wangguanzhong 提交于
-
由 FlyingQianMM 提交于
* Correct CPU gradients of the argsort op, form a network to test its forward and backward process, test=develop * fix dynamic threshold error in test_argsort_op, test=develop
-
- 26 2月, 2020 4 次提交
-
-
由 Adam 提交于
-
由 guofei 提交于
As the title
-
由 liu zhengxi 提交于
* refine pointer and c-api prototype, test=develop * fix new c api profile bug, test=develop * add unit tests, test=develop
-
由 Leo Chen 提交于
* support cond in clone, test=develop * refine code, test=develop * refine code, test=develop * follow comments, test=develop * refine code, test=develop
-
- 25 2月, 2020 7 次提交
-
-
由 Zhang Ting 提交于
* add framework overhead ratio, test=develop * print GpuMemcpy overhead, test=develop
-
由 zhouwei25 提交于
-
由 chengjuntao 提交于
* register fp16 for assign op, test=develop * add op test for fp16, test=develop
-
由 zhangchunle 提交于
-
由 dyning 提交于
-
由 GaoWei8 提交于
-
由 hutuxian 提交于
* Add two types of Metric Calculator: MultiTaskCalculator & CmatchRankCalculator. * Add a config for DynamicAdjustChannelNum function to denote whether we will discard the remaining instances when they are not be distributed evenly. * Remove CPU code in Pull/PushSparse and we will add it back when testing it fully. * Fix some known issues: such as copying persistable vars after one epoch running.
-
- 24 2月, 2020 4 次提交
-
-
由 ShenLiang 提交于
* add partial_sum_op, test=develop * modify the Paddle Error Message, test=develop * modify the Paddle Error Message, test=develop * modify the bug for python3, test=develop * modify the ut for ci, test=develop * mv to contrib, test=develop * use check_variable_and_dtype, test=develop * fix ci, test=develop * fix conflict, test=dvelop * add partial concat, test=develop * fix the conflict, test=develop * fix the error, test=develop * rm SSE4, test=develop
-
由 wangchaochaohu 提交于
* add support for the driver api callback and fix the profiler name show bug
-
由 ShenLiang 提交于
* add partial_concat, test=develop * fix the grids and blocks, test=develop * fix the Paddle_Enforce, test=develop * fix the doc of op, test=develop * fix the doc, test=develop * fix the doc of the op, test=develop * replace -1 with None, test=develop
-
由 GaoWei8 提交于
* Add an interface of disabling FC padding * fix bert regression * polish fc padding interface * recover pass function * fix argument error * fix mkldnn error
-
- 23 2月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 22 2月, 2020 2 次提交
-
-
由 Yibing Liu 提交于
test=develop
-
由 tangwei12 提交于
* add sync communicator and implement
-
- 21 2月, 2020 3 次提交
-
-
由 Yiqun Liu 提交于
-
由 flame 提交于
-
由 Huihuang Zheng 提交于
1. Add PADDLE_ENFORCE to Check Sequence Length of RecurrentOp. 2. Also enrich PADDLE_ENFORCE error messages.
-
- 20 2月, 2020 1 次提交
-
-
由 flame 提交于
-