- 15 3月, 2019 3 次提交
-
-
由 qingqing01 提交于
* Support Sync Batch Norm. * Note, do not enable it in one device. Usage: build_strategy = fluid.BuildStrategy() build_strategy.sync_batch_norm = True binary = fluid.compiler.CompiledProgram(tp).with_data_parallel( loss_name=loss_mean.name, build_strategy=build_strategy)
-
由 Yibing Liu 提交于
* Impl fp16 compute kernel for slice_op test=develop * Use data() to replace mutable_data()
-
由 Zeng Jinle 提交于
Fix cross_entropy2_op numeric error
-
- 14 3月, 2019 12 次提交
-
-
由 sneaxiy 提交于
test=develop
-
由 sneaxiy 提交于
test=develop
-
由 sneaxiy 提交于
-
由 Zeng Jinle 提交于
Revert "PaddingRNN model memory optimize"
-
由 sneaxiy 提交于
test=develop
-
由 sneaxiy 提交于
test=develop
-
由 Zeng Jinle 提交于
test=develop
-
由 liuwei1031 提交于
2. remove lod check in reshape op test=develop
-
由 Wojciech Uss 提交于
* Add cpu_quantize_squash_pass for C-API quantization test=develop * add cpu_quantize_squash_pass teste * fix lint: add include memory unorderd_map and unordered_set test=develop * lint fix 2 * fixes test=develop * refactored test=develop * fix windows ci test=develop
-
由 Zeng Jinle 提交于
PaddingRNN model memory optimize
-
由 chengduo 提交于
* make timeline.py compatible test=develop * Follow comment test=develop
-
由 Tao Luo 提交于
add all_kernels_must_compute_runtime_shape example for speedup infershape
-
- 13 3月, 2019 13 次提交
-
-
由 tianshuo78520a 提交于
-
由 lujun 提交于
Fix api.spec
-
由 sneaxiy 提交于
test=develop
-
由 guomingz 提交于
* resolve #15618 Backgroud: the PR #15398 raised the box_coder op performance regression, we optimized the code via the more efficency leveraging opemmp.
-
由 sneaxiy 提交于
test=develop
-
由 tianshuo78520a 提交于
-
-
由 Kaipeng Deng 提交于
Add the inverse trigonometric function
-
由 Yan Xu 提交于
* fix broadcast with mp mode * polish code test=develop * fix bcast strategy test=develop * fic cpplint test=develop * fix py3 failed test=develop * fix comment test=develop * update comment test=develop
-
由 baojun 提交于
* remove concast_cast and refactor code test=develop * reduce flag use test=develop
-
由 chengduo 提交于
test=develop
-
由 tink2123 提交于
test=develop
-
由 Yan Xu 提交于
-
- 12 3月, 2019 12 次提交
-
-
由 Zhen Wang 提交于
Fix NaN bugs for static quantization strategy (mutil-cards train).
-
由 tianshuo78520a 提交于
-
由 luotao1 提交于
test=develop
-
由 Qiyang Min 提交于
Imperative transfer gru unit
-
由 Zhen Wang 提交于
Add clone function for IrGraph.
-
由 wopeizl 提交于
test=develop
-
由 Zhen Wang 提交于
Add channel wise quant op and channel wise dequant op.
-
由 Zhen Wang 提交于
-
由 tink2123 提交于
test=develop
-
由 luotao1 提交于
test=develop
-
由 luotao1 提交于
-
由 Zhen Wang 提交于
-