- 19 11月, 2019 2 次提交
-
-
由 yaoxuefeng 提交于
* fix auc drop first commit test=develop * update datanorm op * update datanorm with enforce test=develop * update test=develop * update format test=develop * update format * update format test=develop * add unit test test=develop * update unit test test=develop * update format test=develop * update format test=develop * update API description test=develop * update API description test=develop * update format test=develop * fix codes as comments test=develop * fix description as comments test=develop * fix description as comments test=develop * update codes.. test=develop
-
由 Zeng Jinle 提交于
* polish jit trace codes, test=develop * polish codes again by removing var_id, test=develop
-
- 18 11月, 2019 8 次提交
-
-
由 Zeng Jinle 提交于
* fix warnings oof gcc 8 compilation, test=develop * fix boost::bad_get, test=develop * refine PADDLE_ENFORCE, test=develop
-
由 Jeng Bai-Cheng 提交于
* Fix TensorRT detection bug 1. Add new search path for TensorRT at tensorrt.cmake 2. Add better debug message 3. Fix the bug of detection of TensorRT version In NVIDIA official docker image, TensorRT headers are located at `/usr/include/x86_64-linux-gnu` and TensorRT libraries are located at `/usr/lib/x86_64-linux-gnu`, so using `-DTENSORRT_ROOT` will fail to detect TensorRT. There is no debug/warning message to tell developer that TensorRT is failed to be detected. In later version of TensorRT (e.g. v6), `NV_TENSORRT_MAJOR` is defined at `NvInferVersion.h` instead of `NvInfer.h`, so add compatibility fix. * Fix TensorRT variables in CMake 1. Replace `${TENSORRT_ROOT}/include` with `${TENSORRT_INCLUDE_DIR}` 2. Replace `${TENSORRT_ROOT}/lib` with `${TENSORRT_LIBRARY}` Manually type path may locate incorrect path of TensorRT. Use the paths detected by system instead. * Fix TensorRT library path 1. Add new variable - `${TENSORRT_LIBRARY_DIR}` 2. Fix TensorRT library path inference_lib.cmake and setup.py.in need the path of TensorRT library instead of the file of TensorRT library, so add new variable to fix it. * Add more general search rule for TensoRT Let system detect architecture instead of manually assign it, so replace `x86_64-linux-gnu` with `${CMAKE_LIBRARY_ARCHITECTURE}`. * Add more general search rule for TensorRT Remove duplicate search rules for TensorRT libraries. Use `${TENSORRT_LIBRARY_DIR}` to get full path of libnvinfer.so test=develop
-
由 danleifeng 提交于
-
由 Zhang Ting 提交于
* modified error message for conv and conv_transpose, test=develop * modified doc of conv and conv_transpose op, test=develop * modified the expression for error message, test=develop * modified error message for group_norm op, test=develop * modified detail of Attr(data_format) or Attr(data_layout) * add ValueError in API doc for maxout op, test=develop
-
由 liym27 提交于
* add API switch_case. test=develop add Nest * modify code according to reviews: 1.Attr(branch_index) support 'uint8' and 'int64' besides 'int32'. 2.remove useless code. test=develop * replace fluid.layers.data with fluid.data and polish API document. test=develop
-
由 guofei 提交于
-
由 WangXi 提交于
-
由 Zeng Jinle 提交于
-
- 16 11月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* Support more ops in post training quantization, and save the output scale in quantized op. * Update docs in post training quantization and qat
-
- 15 11月, 2019 3 次提交
-
-
由 xujiaqi01 提交于
* fix cache table bug * add save_paddle_inference_model * fix hdfs util bug * test=develop
-
由 xujiaqi01 提交于
* copy some feasigns and corresponding embeddings from one sparse table to another * copy all feasigns and corresponding embeddings from one sparse table to another * copy all dense params from one table to another * copy some local vars to other local vars
-
由 ruri 提交于
-
- 14 11月, 2019 6 次提交
-
-
由 Kaipeng Deng 提交于
-
由 hong 提交于
-
由 Zeng Jinle 提交于
* friendly trace interface, test=develop * refine TracedLayer, test=develop * add some docs, test=develop
-
由 whs 提交于
-
由 Tao Luo 提交于
* add input type and dtype check template, and update some APIs check * refine check template, and update some APIs check in nn.py * update some APIs check in loss.py test=develop
-
由 joanna.wozna.intel 提交于
test=develop
-
- 13 11月, 2019 1 次提交
-
-
由 gongweibao 提交于
use 2 cards test=develop
-
- 12 11月, 2019 7 次提交
-
-
由 Tao Luo 提交于
* Split some APIs from nn.py to loss.py test=develop * fix test_detection unit-test test=develop
-
由 zhaoyuchen2018 提交于
* Add Asypadding for conv fusion. test=develop reference: pr/20042 * Fix eigen build link error * Change back file mode * Use math function & add more checks.
-
由 WangXi 提交于
-
由 lilong12 提交于
modify the implementation of save_persistables and save_inference_model for fleet collective mode (#20802) * modify the implementation of save_persistables and save_inference_model functions for fleet collective, test=develop * add ut, test=develop
-
由 Bai Yifan 提交于
-
由 ceci3 提交于
* fix instance norm * update unitest,test=develop
-
由 lilong12 提交于
* set the default value of alpha for prelu to 0.25, test=develop * add the call to __syncthreads(), test=develop * fix the implementation of cpu prelu, test=develop * repair the implementation of element mode prelu, test=develop * modify test_prelu_op.py, test=develop
-
- 11 11月, 2019 3 次提交
-
-
由 Huihuang Zheng 提交于
-
由 Huihuang Zheng 提交于
-
由 Yiqun Liu 提交于
* Add the check of lod_level between compile-time and runtime. test=develop * Fix bug in check_compile_vs_runtime. test=develop * Fix the check of output when it is dispensiable or intermediate. test=develop * Share lod of x to out in match_matrix_tensor op in compile-time. * Implement GetLoDLevel in InferShapeContext. * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op. test=develop * Enable check_compile_vs_runtime in test_match_matrix_tensor. * Add the implementation of SetLoDLevel in InferShapeContext. * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead. * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead. * Refine some ops and unittests. test=develop * Fix a typo. test=develop * Remove the check of var type, and change int to int32_t. test=develop * Add unittest for Get/SetLoDLevel. test=develop
-
- 08 11月, 2019 5 次提交
-
-
由 Tao Luo 提交于
* split some APIs from nn.py to rnn.py * split some APIs from nn.py to sequence_lod.py test=develop * fix unit-test bug test=develop * fix test_layers unit-test bug test=develop
-
由 joanna.wozna.intel 提交于
* Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdb, reversing changes made to 2ce6473f. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd7. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop
-
由 juncaipeng 提交于
-
由 LielinJiang 提交于
-
由 liym27 提交于
fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997) * fix bug in pool/conv/conv_transpose: 1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation; 2. fix bug of func _get_padding_with_SAME in test_conv/conv_transpose_op.py; 3. fix bug of the computation process in function conv2dtranspose_forward_naive. test=develop * change test to make the data of different dimensions different. test=develop
-
- 07 11月, 2019 2 次提交
-
-
由 Adam 提交于
* Add asymetric padding support for mkldnn pooling test=develop * Add asymetric padding support for mkldnn conv test=develop * Add asymetric padding support for mkldnn conv_transpose test=develop
-
由 Huihuang Zheng 提交于
These ops are useful in control flow.
-
- 06 11月, 2019 2 次提交
-
-
由 hong 提交于
* fix uniform random; test=develop * add uniform random test; test=develop
-
由 Wojciech Uss 提交于
test=develop
-