- 08 10月, 2018 3 次提交
-
-
由 chengduoZH 提交于
test=develop
-
由 Xin Pan 提交于
test=develop
-
由 dzhwinter 提交于
* "fix operators cmake" * "rerun ci" test=develop
-
- 01 10月, 2018 1 次提交
-
-
由 Michal Gallus 提交于
test=develop
-
- 30 9月, 2018 5 次提交
-
-
由 sneaxiy 提交于
-
由 tensor-tang 提交于
test=develop
-
由 Tao Luo 提交于
test=develop
-
由 dzhwinter 提交于
* "fix compile error" * "fix ci" * rerun ci test=develop * test=develop rerun ci
-
- 29 9月, 2018 11 次提交
-
-
由 luotao1 提交于
test=develop
-
由 wangguibao 提交于
test=develop
-
由 chengduo 提交于
test=develop
-
由 luotao1 提交于
test=develop
-
由 luotao1 提交于
-
由 Xin Pan 提交于
test=develop
-
由 Xin Pan 提交于
test=develop
-
由 Xin Pan 提交于
test=develop
-
由 Dun 提交于
* refine reduce by cub * optimize KernelDepthwiseConvFilterGrad * optimize depthwise conv and reduce mean and reduce sum * fix bug: dilation * cuda arch and cuda 8 compatible
-
由 Xin Pan 提交于
test=develop
-
由 Xin Pan 提交于
test=develop
-
- 28 9月, 2018 12 次提交
-
-
由 Xin Pan 提交于
Add API.spec test=develop
-
由 Tao Luo 提交于
test=develop
-
由 Jacek Czaja 提交于
test=develop
-
由 JiabinYang 提交于
-
由 Tao Luo 提交于
test=develop
-
由 dzhwinter 提交于
* flags * "follow comment"
-
由 Jacek Czaja 提交于
test=develop
-
由 Tao Luo 提交于
-
由 Xin Pan 提交于
scope's API modifies its internal state. And scope's API can be called from multiple threads during traing. Hence, we need locks to protect the scope's internal states. We can optimize it in the future. But the current solution is buggy. test=develop
-
由 Wu Yi 提交于
* show detail error log on ci * test * fix memopt and dist * update apispec * will fix different batch issue test=develop
-
由 Yan Chunwei 提交于
- add naive executor - fix concurrency performance issue
-
由 Dang Qingqing 提交于
test=develop
-
- 27 9月, 2018 8 次提交
-
-
由 chengduo 提交于
* add GraphNum test=develop * add graph number check in parallelExecutor test=develop * fix transformer_model bug test=develop * fix graph num
-
由 Jacek Czaja 提交于
extended test_text_classification ot use new op
-
由 minqiyang 提交于
test=develop
-
由 chengduo 提交于
test=develop
-
由 Jacek Czaja 提交于
-
由 Jacek Czaja 提交于
- Added draft of new operator - Added fused embedding fc lstm files - First time embedding_fc_lstm_fuse_pass was invoked in test_text_classification - Added Embedding pattern - Not crashing - Enabled draft of embedding_fc_lstm pass (does it job) - First working (Seqcompute only) version - Removed diagnostic comment - First enabling of BatchCompute - Disabling pass for embedding with is_sparse and is_distributed - Cosmetics - Style - Style
-
由 chengduo 提交于
-
由 qingqing01 提交于
* Add CUDA implementation for generate_proposals_op. * Clean code. * Update code.
-