- 03 Apr, 2020 4 commits
-
-
Committed by zhaoyuchen2018
The elementwise function was used before its definition, which failed in CUDA 8; move it ahead.
-
Committed by channings
* update linspace, equal operators to API 2.0, test=develop
* equal support higher performance CUDA kernel, test=develop
* update comment of equal&linspace operator, test=develop
* update comment of equal&linspace operator, test=develop
-
Committed by zhaoyuchen2018
* improve elementwise performance.
* Add contiguous check, test=develop
-
Committed by Chen Weihang
* add op inout check macro, test=develop
* fix enforce_test, test=develop
-
- 02 Apr, 2020 5 commits
-
-
Committed by liym27
* Add unittest for transformer prediction in dygraph_to_static.
* fix bug in fill_constant api.
* Make transpose support size 0. test=develop
-
Committed by zhongpu
* use global conv cache; test=develop
* use singleton cache; test=develop
* fix format error; test=develop
* add cudnn helper header; test=develop
* fix header error; test=develop
* fix mac unittest; test=develop
* fix mac unittest; test=develop
* fix file format; test=develop
* fix include file error, test=develop
* remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop
* fix test_elementwise_mul_op_dim, test=develop

Co-authored-by: Nphlrain <phliuhongyu@126.com>
-
Committed by Adam
* Delete is_test from activation operators test=develop
* Revert unneeded changes test=develop
-
Committed by Kaipeng Deng
* add inplace_abn_op. test=develop
- 01 Apr, 2020 2 commits
-
-
Committed by wangchaochaohu
* refine the error message of tensor_array_read_write Op
-
Committed by xujiaqi01
* add fleet pslib pull and push sparse op and push dense op
* test=develop
-
- 31 Mar, 2020 1 commit
-
-
Committed by GaoWei8
-
- 30 Mar, 2020 1 commit
-
-
Committed by Jacek Czaja
-
- 29 Mar, 2020 1 commit
-
-
Committed by zhaoyuchen2018
* Improve elementwise performance. Elementwise performance is poor when execution walks into CommonGradBroadcastCUDA; add some new kernels for different data patterns.
* Add some cuda kernel to speedup common broadcast cases. test=develop
* Add more test cases and fix cuda kernel bug. test=develop
* Remove tests as CPU precision fails. test=develop
* Refine SplitDims, test=develop
* Change file mode, test=develop
-
- 26 Mar, 2020 2 commits
-
-
Committed by Zhaolong Xing
* add dynamic plugin support. test=develop
* change emb eltwise layernorm to math function test=develop
* add emb eltwise layernorm test=develop
* can run dynamic shape ernie test=develop
* fix ci test=develop
* add ut for trt ernie dynamic test=develop
* refine dynamic shape c++ interface. test=develop
* fix comments test=develop
* fix comments test=develop
-
Committed by liym27
-
- 25 Mar, 2020 6 commits
-
-
Committed by Zeng Jinle
-
Committed by Zeng Jinle
-
Committed by Zeng Jinle
-
Committed by Liufang Sang
* fix compile error in win gpu test=develop
* fix compile error in win gpu test=develop
* fix compile error in win gpu test=develop
-
Committed by Zeng Jinle
-
Committed by Zeng Jinle
-
- 24 Mar, 2020 1 commit
-
-
Committed by wangchaochaohu
* refine the transpose conv using v7 to choose algorithm
-
- 23 Mar, 2020 1 commit
-
-
Committed by Adam
-
- 22 Mar, 2020 1 commit
-
-
Committed by Liufang Sang
* add dequantize_log_op and make pyramid hash support int8 weight test=develop
* add unittest and update pyramid hash op test=develop
* remove paddle_enforce test=develop
* fix error message test=develop
* remove incorrect commit test=develop
* fix error message in log_dequantize test=develop
* change 2019 to 2020 test=develop
* remove useless check_grad test=develop
-
- 21 Mar, 2020 1 commit
-
-
Committed by Zeng Jinle
-
- 20 Mar, 2020 3 commits
-
-
Committed by Zeng Jinle
* sequential reader stage 1, test=develop
* fix ut, test=develop
* fix iterable=False reset bug, add some logs and polish code, test=develop
* inference feed partial data, test=develop
* Turn on keep_order=True for test, test=develop
* enhance ut to test more cases, test=develop
* test commit for reverting
* Revert "test commit for reverting", test=develop This reverts commit 80aef42e.
* add ut of merged and unmerged results, test=develop
* add more uts for coverages and add en doc of api, test=develop
* follow comments, test=develop
* change note style, test=develop
-
Committed by Wilber
Update the embedding_eltwise_layernorm fuse pass and fused kernel to support multiple inputs
-
Committed by Zeng Jinle
* add double grad implementation for dygraph, test=develop
* polish code, add uts, test=develop
* fix place bug, test=develop
* polish codes, add more uts for coverages, test=develop
* add no_grad_set, test=develop
* add star gan ut, test=develop
* follow comments, test=develop
-
- 19 Mar, 2020 4 commits
-
-
Committed by songyouwei
* move __getitem__ to cpp
* bug fix
* add type check and gil release
* support negative step with omitted ends test=develop
* code refine test=develop
* bug fix test=develop
* slice always return different pyobj test=develop
-
Committed by Sylwester Fraczek
-
Committed by Zhaolong Xing
test=develop
-
Committed by Liufang Sang
* fix div zero test=develop
* fix div zero test=develop
* add hostdevice function test=develop
* add eps when is zero test=develop
-
- 17 Mar, 2020 4 commits
-
-
Committed by Feiyu Chan
Add an approximation option for gelu, defaulting to False (only the Eigen kernel is added; the code for computing gelu with MKLDNN is temporarily removed)
-
Committed by Adam
-
Committed by GaoWei8
-
Committed by Zhang Ting
-
- 16 Mar, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 13 Mar, 2020 2 commits
-
-
Committed by wangchaochaohu
* add fusion group test for backward and refine code
-
Committed by chengjuntao
* fix bug which input shape, test=develop
* add error type, test=develop
-