- 05 Sep, 2019 2 commits
-
-
Committed by Yiqun Liu
* Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc. test=develop
* Call CUDA driver API to launch the kernel compiled by nvrtc. test=develop
* Disable for Mac and Windows. test=develop
* Refine the code to support manually specified num_threads and workload_per_thread. test=develop
* Refine the CUDA kernel to support large dims. test=develop
-
Committed by Tao Luo
* remove assert.h
* change PADDLE_ASSERT_MSG to PADDLE_ENFORCE test=develop
* fix tensorrt paddle_enforce test=develop
-
- 03 Sep, 2019 3 commits
- 02 Sep, 2019 1 commit
-
-
Committed by zhouwei25
-
- 01 Sep, 2019 2 commits
-
-
Committed by Jacek Czaja
* - First set of modifications - Compilation fixes - compilation fix - Another compilation fix - Moved AcquireSoftmaxPrimitiveDescriptor call into handler - MKL-DNN Softmax PD refactor test=develop - Compilation fix test=develop - another compilation fix - cosmetics test=develop - Compilation fix - Fix to crash when softmax backward is created
* - Fixes after review of softmax refactoring test=develop
-
Committed by Zeng Jinle
* add retry_allocator for gpu, test=develop
* follow chengduoZH's comments, test=develop
* follow huihuang's comments, test=develop
* change f,l in enforce.h to be file,line, test=develop
* increase code coverage by adding unittests, test=develop
* fix CMakeLists.txt, test=develop
-
- 30 Aug, 2019 3 commits
-
-
Committed by Jacek Czaja
- Refactor step 1 - Compilation fix - Yet another compilation fix - Even more compilation fix - Lint fixes test=develop - Removed deprecated PADDLE_ENFORCE occurrence test=develop - Candidate fix to BN forward - Lint fixes test=develop - Refactoring in data_layout_transform - compilation fix - Another compilation fix - Step further into darkness - Yet another compilation fix - Yet another compilation fix - missing header - compilation fix - Added MKLDNN -> Paddle conversion in fetch op test=develop - Compilation fix test=develop - Lint test=develop - Mul fix - Fix to MKLDNN MUL op and Elementwise MUL UT test=develop - Workaround for different representation of weights with groups, Paddle vs MKL-DNN. test=develop - Candidate fix for 5D convolution with groups - Refactor of fix for conv3d and conv2d in fetch op test=develop - Compilation fix - Still same compilation fix - Compilation fix - Compilation fix - Reverted refactoring of fixes - Adapted test_conv2d_int8_mkldnn so it expects data in NCHW format not NHWC test=develop - minor fix in UT test=develop - Lint fixes test=develop
-
Committed by liuwei1031
-
Committed by Zeng Jinle
-
- 28 Aug, 2019 1 commit
-
-
Committed by Zeng Jinle
* add signal message to stderr, test=develop
* add unittests for ugly SignalHandle, test=develop
-
- 27 Aug, 2019 2 commits
- 20 Aug, 2019 2 commits
-
-
Committed by Tao Luo
* replace part of PADDLE_ASSERT with PADDLE_ENFORCE test=develop
* remove unused fallback_alloc_size_
* add unit-test of CUDAPinnedAllocator test=develop
-
Committed by Yihua Xu
* Implement the operator with sparse matrix multiply
* Update the URL of mklml library. test=develop
* Disable MKLML implementation on non-Linux platforms. test=develop
* Ignore the deprecated status for windows test=develop
-
- 19 Aug, 2019 1 commit
-
-
Committed by Zeng Jinle
* make PADDLE_ENFORCE_EQ support types that cannot be converted to string, test=develop
* follow huihuang's comments, test=develop
-
- 16 Aug, 2019 2 commits
-
-
Committed by Zeng Jinle
-
Committed by Zeng Jinle
-
- 15 Aug, 2019 1 commit
-
-
Committed by Adam
test=develop
-
- 12 Aug, 2019 2 commits
-
-
Committed by gongweibao
Polish fleet API to support cuda collective mode and nccl2 mode
-
Committed by wopeizl
* add tensorrt support for windows
-
- 09 Aug, 2019 1 commit
-
-
Committed by Zhang Ting
* optimize error message, test=develop * optimize error message, test=develop
-
- 05 Aug, 2019 1 commit
-
-
Committed by liuwei1031
* fix warpctc.dll not found issue, test=develop
* revert the linux platform change, test=develop
* delete warpctc_lib_path.h.in, test=develop
* add SetPySitePackagePath function
* fix warpctc.dylib not found issue on Mac, test=develop
* improve the paddle lib path setting logic, test=develop
* fix mac ci issue caused by test_warpctc_op unittest, test=develop
* tweak code, test=develop
-
- 01 Aug, 2019 2 commits
-
-
Committed by Zeng Jinle
* fix gpu_info, test=develop
* fix reserving gpu memory calculation bug, add fraction=1 unittest, test=develop
* fix bug again for reserving size, test=develop
-
Committed by Jacek Czaja
test=develop - Extracted key generation from FWD and GRAD into separate function test=develop - Compilation fix test=develop - another compilation test=develop
-
- 31 Jul, 2019 1 commit
-
-
Committed by Huihuang Zheng
GPU allocation uses fraction of available memory, also fix the GetUsed without lock
-
- 30 Jul, 2019 1 commit
-
-
Committed by Jacek Czaja
test=develop - optimization of TID to string test=develop
-
- 29 Jul, 2019 1 commit
-
-
Committed by Huihuang Zheng
-
- 27 Jul, 2019 1 commit
-
-
Committed by Huihuang Zheng
Also fix a dependency error which may cause compile error
-
- 25 Jul, 2019 1 commit
-
-
Committed by lidanqing
* change INT8 to template so that checking dst_dt with if-else could be removed. CI will be enabled after fixing reviews
* reverse user_residual_memory_p and user_bias_memory_p declaration scope test=develop
-
- 23 Jul, 2019 2 commits
-
-
Committed by Jacek Czaja
test=develop - compilation fix - Yet another compilation fix - Even yet another compilation fix - Surprise! Again compilation fix - lint fixes test=develop - Fix to workspace acquire of LRN test=develop - Fix to hash of BWD LRN test=develop - fix to lrn BWD PD acquire test=develop - Fixing LRN PD creation test=develop - cosmetic fix in comment test=develop - Fixes after review test=develop
-
Committed by chengduo
* support sparse gradients test=develop
-
- 19 Jul, 2019 1 commit
-
-
Committed by Jacek Czaja
test=develop
-
- 18 Jul, 2019 2 commits
-
-
Committed by zhouwei25
Optimize the content of error reporting information, print error code and official document web sites (#18671) optimize the error reporting information of cuda related API index on develop: 130ac177 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
-
Committed by Zeng Jinle
* feature/auto_growth_allocator, test=develop
* add unittest of AlignedAllocator, test=develop
* try to turn on auto_growth to test on CI, test=develop
* fix segmentation fault in mixed_vector.h, test=develop
* add unittests, test=develop
-
- 16 Jul, 2019 2 commits
-
-
Committed by liuwei1031
-
Committed by Jacek Czaja
* - Added partial draft of pooling acquire - Workspace support - compilation fix - Added draft of pooling backward reimplementation - Segfault fix - reverted 'any' for diff_dst creation in pooling - Lint fixes test=develop - lint fixes test=develop - Further lint fixes test=develop
* - Fixes after review test=develop
* - Lint fixes test=develop
* - Even more lint fixes test=develop
-
- 11 Jul, 2019 2 commits
-
-
Committed by Tao Luo
* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy test=develop
* enhance MkldnnPostReset test=develop
* add comments for mkldnn_cache_capacity field test=develop
-
Committed by gongweibao
-