- 20 11月, 2019 6 次提交
-
-
由 Pei Yang 提交于
added splitter "__" between weight name and suffix number to avoid conflicts.
-
由 juncaipeng 提交于
* support set model_filename and params_filename in post_training_quantization, test=develop
-
由 Dong Daxiang 提交于
test=develop
-
由 Zhang Ting 提交于
* optimize assign op to avoid copy data from GPU to GPU, test=develop * modified GetkernelTypeForVar and just avoid device transform, test=develop
-
由 Liufang Sang 提交于
-
由 Zeng Jinle 提交于
* make Docker to gcc 8.2, test=develop * add -std=c11 to grpc.cmake, test=develop
-
- 19 11月, 2019 8 次提交
-
-
由 zhouwei25 提交于
-
由 Chen Weihang 提交于
* fix PADDLE_ENFORCE ci check bug, test=develop, test=document_fix * fix PADDLE_ENFORCE match error, test=develop, test=document_fix
-
由 Kaipeng Deng 提交于
-
由 danleifeng 提交于
-
由 Adam 提交于
test=develop
-
由 Zeng Jinle 提交于
-
由 yaoxuefeng 提交于
* fix auc drop first commit test=develop * update datanorm op * update datanorm with enforce test=develop * update test=develop * update format test=develop * update format * update format test=develop * add unit test test=develop * update unit test test=develop * update format test=develop * update format test=develop * update API description test=develop * update API description test=develop * update format test=develop * fix codes as comments test=develop * fix description as comments test=develop * fix description as comments test=develop * update codes.. test=develop
-
由 Zeng Jinle 提交于
* polish jit trace codes, test=develop * polish codes again by removing var_id, test=develop
-
- 18 11月, 2019 12 次提交
-
-
由 Zeng Jinle 提交于
* fix warnings oof gcc 8 compilation, test=develop * fix boost::bad_get, test=develop * refine PADDLE_ENFORCE, test=develop
-
由 zhouwei25 提交于
fix bug when build openblas with a computer that has installed openblas before,test=develop (#21160)
-
由 Jeng Bai-Cheng 提交于
* Fix TensorRT detection bug 1. Add new search path for TensorRT at tensorrt.cmake 2. Add better debug message 3. Fix the bug of detection of TensorRT version In NVIDIA official docker image, TensorRT headers are located at `/usr/include/x86_64-linux-gnu` and TensorRT libraries are located at `/usr/lib/x86_64-linux-gnu`, so using `-DTENSORRT_ROOT` will fail to detect TensorRT. There is no debug/warning message to tell developer that TensorRT is failed to be detected. In later version of TensorRT (e.g. v6), `NV_TENSORRT_MAJOR` is defined at `NvInferVersion.h` instead of `NvInfer.h`, so add compatibility fix. * Fix TensorRT variables in CMake 1. Replace `${TENSORRT_ROOT}/include` with `${TENSORRT_INCLUDE_DIR}` 2. Replace `${TENSORRT_ROOT}/lib` with `${TENSORRT_LIBRARY}` Manually type path may locate incorrect path of TensorRT. Use the paths detected by system instead. * Fix TensorRT library path 1. Add new variable - `${TENSORRT_LIBRARY_DIR}` 2. Fix TensorRT library path inference_lib.cmake and setup.py.in need the path of TensorRT library instead of the file of TensorRT library, so add new variable to fix it. * Add more general search rule for TensoRT Let system detect architecture instead of manually assign it, so replace `x86_64-linux-gnu` with `${CMAKE_LIBRARY_ARCHITECTURE}`. * Add more general search rule for TensorRT Remove duplicate search rules for TensorRT libraries. Use `${TENSORRT_LIBRARY_DIR}` to get full path of libnvinfer.so test=develop
-
由 liuwei1031 提交于
cudaStreamSynchronize randomly hang when used in multi-thread environment, replace it with cudaStreamQuery API on windows
-
由 danleifeng 提交于
-
由 Zhang Ting 提交于
* modified error message for conv and conv_transpose, test=develop * modified doc of conv and conv_transpose op, test=develop * modified the expression for error message, test=develop * modified error message for group_norm op, test=develop * modified detail of Attr(data_format) or Attr(data_layout) * add ValueError in API doc for maxout op, test=develop
-
由 liym27 提交于
* add API switch_case. test=develop add Nest * modify code according to reviews: 1.Attr(branch_index) support 'uint8' and 'int64' besides 'int32'. 2.remove useless code. test=develop * replace fluid.layers.data with fluid.data and polish API document. test=develop
-
由 Zhaolong Xing 提交于
* refine trt int8 for dynamic range set test=develop * refine trt int8 test=develop
-
由 guofei 提交于
-
由 WangXi 提交于
-
由 Zeng Jinle 提交于
-
由 Chen Weihang 提交于
* add ci check for error message specification, test=develop, test=document_fix * replace spec url & refine failed message, test=develop, test=document_fix
-
- 16 11月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* Support more ops in post training quantization, and save the output scale in quantized op. * Update docs in post training quantization and qat
-
- 15 11月, 2019 5 次提交
-
-
由 xujiaqi01 提交于
* fix cache table bug * add save_paddle_inference_model * fix hdfs util bug * test=develop
-
由 Yihua Xu 提交于
-
由 GaoWei8 提交于
* solve cmake fails on inference_download_and_uncompress test=develop * solve cmake fails on inference_download_and_uncompress test=develop
-
由 xujiaqi01 提交于
* copy some feasigns and corresponding embeddings from one sparse table to another * copy all feasigns and corresponding embeddings from one sparse table to another * copy all dense params from one table to another * copy some local vars to other local vars
-
由 ruri 提交于
-
- 14 11月, 2019 8 次提交
-
-
由 Kaipeng Deng 提交于
-
由 hong 提交于
-
由 Zeng Jinle 提交于
* friendly trace interface, test=develop * refine TracedLayer, test=develop * add some docs, test=develop
-
由 Chen Weihang 提交于
-
由 Chen Weihang 提交于
-
由 whs 提交于
-
由 Chen Weihang 提交于
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134) * add examples for error msg spec, test=develop * change ENFORCE to ENFORCE_**, test=develop * add more already exists examples, test=develop
-
由 zhaoyuchen2018 提交于
* Improve topk performance. give 200000 data to compute topk, before opt: cost 1s after opt: cost 0.0028s. * Refine return value. * Add cuda util funtions. * Fix ComputeBlockSize bug & refine comments. Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-