- 09 8月, 2019 3 次提交
-
-
由 Kaipeng Deng 提交于
* add trilinear interp. test=develop * fix unittest. test=develop * add python api and test_layers. test=develop * refine API.spec. test=develop * fix format. test=develop * add python API test. test=develop * format code. test=develop * refine code strcuture. test=develop * fix format * fix doc. test=develop * fix converage. test=develop * fix format. test=develop
-
由 chengduo 提交于
* Enhance fuse optimization op pass test=develop
-
由 chengduo 提交于
* Add call stack info during runtime and compile time test=develop * Rename operator_call_stack test=develop * Add unit test test=develop * follow comment test=develop
-
- 08 8月, 2019 1 次提交
-
-
由 Leo Chen 提交于
* fix memory overlapping of fetch var (return of executor.run), test=develop * fix wrong usage of ParallelExecutor in op_test, test=develop * remove useless parameter and simplify code * avoid tensor destruct untimely, test=develop * add testcase independent of OpTest, test=develop
-
- 06 8月, 2019 2 次提交
-
-
由 Kevin 提交于
* fix overflow by int32 mul test=develop * fix reference nullptr * fix codestyle test=develop * modify to point in ContextProjectFunctor test=develop * modify to point in ContextProjectFunctor test=develop * modify . to -> test=develop * add var_conv_2d op test=develop * edit api.spec test=develop * ignore unittest if with_mkl=off test=develop * fix python3 division test=develop * fix ignore unittest bug test=develop * remove useless code test=develop * modify api.spec test=develop * modify default_grad.spec test=develop
-
由 Zeng Jinle 提交于
-
- 05 8月, 2019 1 次提交
-
-
由 Liufang Sang 提交于
* test=develop support Tensor input for ctc_align_op * test=develop add some comment
-
- 04 8月, 2019 1 次提交
-
-
由 Dong Daxiang 提交于
make listen and server as exclusive run
-
- 02 8月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
* open gc by default, test=develop * fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop * fix conditional_block op eager deletion bug, test=develop * add some comments to reviewers, test=develop
-
由 石晓伟 提交于
* add fusion_seqpool_cvm_concat test=develop * simplify pass, test=develop * fix code style, test=develop
-
- 01 8月, 2019 2 次提交
-
-
由 wawltor 提交于
* test=develop Add the op of unique_with_counts, the op is calc the unqiue input of data, and output the corresponding indices and count of data. * test=develop Check the input and dtype in the op of unique_with_counts * test=develop test=document_preview update the API.spec for `unique_with_counts`, at the same time, optimize the python api in the op of `unique_with_count` * test=develop test=document_preview Fix some python api problem in the op of `unique_with_counts`, and change the error messsage in this op. * Fix some API problem in the op of `unique_with_counts` test=develop test=document_preview * test=develop test=document_preview Fix the api sample of op `unique_with_counts`, and update api.spec
-
由 LielinJiang 提交于
* fix depthwise conv gpu kernel bug, test=develop * add more depthwise conv test, test=develop
-
- 31 7月, 2019 2 次提交
-
-
由 HaoRen 提交于
* support center loss * change tensor copy api to high level api tensorcopy * test=develop rewrite the center_loss cuda_kernel to make it faster and add document of the center loss api,also update test function * test=document_preview test=develop update document of center loss * test=document_preview test=develop modify API.spec modify test code remove nouse const_cast
-
由 Dong Daxiang 提交于
make dist unit test exclusive run
-
- 30 7月, 2019 2 次提交
-
-
由 danleifeng 提交于
-
由 chengduo 提交于
test=develop
-
- 29 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* remove legacy memory optimization codes, test=develop * follow huihuang's comments,test=develop * follow luotao's comments, test=develop
-
- 28 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 27 7月, 2019 1 次提交
-
-
由 chengduo 提交于
* open fuse optimization ops test=develop
-
- 26 7月, 2019 2 次提交
-
-
由 Adam 提交于
-
由 Zeng Jinle 提交于
* first version memory optimize pass, test=develop * remove move_tensor_sharing_pass, test=develop * refine code comments, add unittests, test=develop * turn off memory_optimize by default, test=develop * follow huihuang's comments, test=develop * follow chengduoZH's comments, test=develop * fix grammar error, add const qualifier, fix pass_test exception message, test=develop * follow chengduoZH's comments 2nd, test=develop
-
- 25 7月, 2019 1 次提交
-
-
由 guru4elephant 提交于
* split test_dist_se_resnext.py into 4 testcases
-
- 24 7月, 2019 3 次提交
-
-
由 Bob Zhu 提交于
* extend matmul op to support multiple head multiplication With the support of multiple head, the multiplication of two big matrixes is split into multiplication of several (head_number) small matrixes. e.g. if Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of [6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].
-
由 whs 提交于
* Make lod reset op support for append lod level. * Fix API.spec test=develop * Fix unitest. test=develop * Add python api for lod append. test=develop * Fix API.spec test=develop * Fix format of doc. test=develop * Fix unitest. test=develop * Fix doc. test=develop
-
由 chengduo 提交于
* prun backward ops test=develop
-
- 23 7月, 2019 2 次提交
- 22 7月, 2019 3 次提交
-
-
由 Huihuang Zheng 提交于
The change includes 3 things: 1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1. 2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values. 3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests. test=develop
-
由 Tao Luo 提交于
test=develop
-
由 guru4elephant 提交于
* split different comm method for mnist distributed training
-
- 19 7月, 2019 3 次提交
-
-
由 Huihuang Zheng 提交于
Test PaddingRNN on V100 GPU device. Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU. GPU memory (MiB): 6414 (this PR) vs 6837 (without this PR) Speed (steps/s): 10.28 (this PR) vs 9.89 (without this PR)
-
由 Adam 提交于
test=develop
-
由 tangwei12 提交于
* add check of executor, test=develop
-
- 18 7月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
* feature/auto_growth_allocator, test=develop * add unittest of AlignedAllocator, test=develop * try to turn on auto_growth to test on CI, test=develop * fix segmentation fault in mixed_vector.h, test=develop * add unittests, test=develop
-
由 hutuxian 提交于
* hash_op support int64 hash_size * add corresponding UT
-
- 15 7月, 2019 2 次提交
-
-
由 guru4elephant 提交于
* make auc op compatible with 1 dim
-
由 guru4elephant 提交于
test=develop
-
- 12 7月, 2019 2 次提交
- 11 7月, 2019 1 次提交
-
-
由 gongweibao 提交于
-