- 18 2月, 2020 2 次提交
-
-
由 123malin 提交于
-
由 yaoxuefeng 提交于
* update * update test=develop * update compile set test=develop * update compile set test=develop * update test=develop * update test=develop * update test=develop * update compile setting test=develop * update compile setting test=develop * update run demo test=develop * update test=develop * update test=develop * fix test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update format test=develop * update format test=develop * update style test=develop * update style test=develop * change style test=develop * change style test=develop * change style test=develop * add dataset unittest test=develop * update test=develop * update for record test=develop * udpate style for record test=develop * update for record test=develop * update for record test=develop * update for record test=develop * fix format test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop
-
- 17 2月, 2020 1 次提交
-
-
由 Wilber 提交于
[cherry-pick] #22551 当一个模型中有多个fc_lstm子图的时候,且其中fc共用了同一个persistable的bias,此时不应该将bias节点删除,只将非persistable的节点去除即可。
-
- 13 2月, 2020 1 次提交
-
-
由 tangwei12 提交于
* add thread barrier for the compiled program
-
- 12 2月, 2020 1 次提交
-
-
由 guofei 提交于
This PR makes assign op support LoDTensorArray and enable the loop_vars in while_loop to support tuple or list.
-
- 11 2月, 2020 1 次提交
-
-
由 Wilber 提交于
[cherry-pick] #22509 支持不依赖nccl进行编译。 多卡下,如果没有打开WITH_NCCL开关编译,多卡不能通信,则只能选择一张卡使用
-
- 07 2月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add dequant scale squash * Correct dequant-scale squash test test=release/1.7
-
- 05 2月, 2020 3 次提交
-
-
由 Wilber 提交于
[cherry-pick] #22384 and #22371 22384增加了WITH_NCCL开关 22371修改了fluid依赖lite的commit id
-
由 xujiaqi01 提交于
* add GeneralRoleMaker which is for general usage * test=develop
-
由 xujiaqi01 提交于
* fix copy table bug (#22432) * fix copy table bug of lost some feasign * test=develop * add hdfs ls retry time and sleep time, fix save inference (#22433) * add hdfs ls retry time and sleep time, fix save inference * test=develop
-
- 04 2月, 2020 3 次提交
-
-
由 xujiaqi01 提交于
* add collective communication library in fleet to replace mpi * test=develop
-
由 石晓伟 提交于
-
由 Michał Gallus 提交于
test=release/1.7 * Enable quantize to reorder to nchw as well * Correct FC MKL-DNN input dim requirements to accept 3D * Improve DNNL FC format, error and 3D input handling * Improve error checking in FC * Improve PADDLE_ENFORCE messages in fc-related files * Remove data layout attribute from obligatory pass args * Fix message in fc_mkldnn_pass to be logically correct
-
- 21 1月, 2020 1 次提交
-
-
由 lidanqing 提交于
-
- 20 1月, 2020 1 次提交
-
-
由 tangwei12 提交于
* add half_async in the communicator * fix DistributedStrategy
-
- 19 1月, 2020 1 次提交
-
-
由 Adam 提交于
-
- 15 1月, 2020 3 次提交
- 14 1月, 2020 1 次提交
-
-
由 Zhen Wang 提交于
-
- 09 1月, 2020 4 次提交
-
-
由 zhongpu 提交于
* test Optimizer in dygraph, test=develop * add optest for Optimizer in dygraph, test=develop * fix adagrad optimizer, test=develop * fix dpsgd optimizer, test=develop * fix test_optimizer.py, test=develop * fix dpsgd optimizer, this op only support cpu, test=develop * add optest for optimizer, test=develop * add description for dpsgd, test=develop * add rmsprop to white_list in unused_var_check.cc, test=develop * polish code style, test=develop * polish code style, test=develop * delete seed attribute for DpsgdOptimizer, test=develop * change testing to debugging, test=develop
-
由 joanna.wozna.intel 提交于
-
由 Yiqun Liu 提交于
* Polish the PADDLE_ENFORCE in fusion_group pass related codes. test=develop * Correct the unittest because of the change relu_grad's formula. test=develop
-
由 wangchaochaohu 提交于
* add support for nested profiling event and printing in different level
-
- 07 1月, 2020 3 次提交
-
-
由 liu zhengxi 提交于
-
由 Yiqun Liu 提交于
test=develop
-
由 bingyanghuang 提交于
-
- 06 1月, 2020 3 次提交
-
-
由 silingtong123 提交于
-
由 Jacek Czaja 提交于
-
由 Huihuang Zheng 提交于
-
- 05 1月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 03 1月, 2020 2 次提交
-
-
由 Yiqun Liu 提交于
* Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc. test=develop * Call CUDA driver api to launch the kernel compiled by nvrtc. test=develop * Disable for mac and windows. test=develop * Refine the codes to support manually specified num_threads and workload_per_thread. test=develop * Refine the CUDA kernel to support large dims. test=develop * Add DeviceCodePool to manage all device codes. * Add the first implementation fusion_group op. * Add unit-test for fusion_group op. * Add the check of result. * Add the check of nvrtc in unit-test. test=develop * Add comment to explain the inputs, outputs and features of fusion_group op. test=develop * Disable fusion_group op for mac and windows. test=develop * Make the compiling of device code return status instead of hanging up. test=develop * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API. * Unify fusion_group_op's input and output names. test=develop * Add the check of CUDA driver library in unittest. test=develop * Refine the calling of PADDLE_ENFORCE. test=develop
-
由 Michał Gallus 提交于
-
- 29 12月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
* fix seqconv_eltadd_relu pass during multi-threads predictor, test=develop * fix attention_lstm_fuse_pass during multi-threads inference, test=develop * fix embedding_fc_lstm_fuse_pass during multi-threads inference, test=develop * fix fc_lstm_fuse_pass during multi-threads inference, test=develop * fix seq_concat_fc_fuse_pass during multi-threads inference, test=develop
-
- 27 12月, 2019 1 次提交
-
-
由 石晓伟 提交于
* fix multi-thread error of fc_gru_fuse_pass.cc, test=develop * export FLAGS and GLOG symbols, test=develop
-
- 25 12月, 2019 2 次提交
-
-
由 Pei Yang 提交于
-
由 qingqing01 提交于
* Pack imperative/layer into paddle_framework.so
-
- 24 12月, 2019 1 次提交
-
-
由 Aurelius84 提交于
* optimize adam speed by removing _finish_update test=develop * fix SparseAdamFunctor param list test=develop * Remove scale_op in expect_list of adam_op test=develop * fix test optimizer loss assert error test=develop * fix test optimizer loss assert error test=develop * modify PADDLE_ENFORCE usage test=develop * fix op_type in lamb_op.cc test=develop * fix errors ostream format bug test=develop * add betaPowOut in ngraph op test=develop * fix ngraph::op api for gcc8 test=develop * clean code test=develop * modify struct into class test=develop * remove code of beta1Tensor in lamb_op test=develop
-
- 20 12月, 2019 1 次提交
-
-
由 Thunderbrook 提交于
* general table * add sparse table test=develop * no cvm test=develop * add no_cvm test=develop * add note test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * add key of optimizer test=develop * solve pslib stop core test=develop * barrier test=develop * add notes test=develop * add table id in cache shuffle test=develop * table id test=develop * code style test=develop
-
- 19 12月, 2019 1 次提交
-
-
由 WangXi 提交于
-