- 14 8月, 2019 1 次提交
-
-
由 chengduo 提交于
Use CUDAPinnedPlace in buffered_reader
-
- 13 8月, 2019 1 次提交
-
-
由 Jiawei Wang 提交于
* instag lod tensor impl * First PR for instag * First PR for instag * Before adding Selection Rows. * Change name from instag to filter_instag, add upgrade the impl of filter_instag * Change name from instag to filter_instag, add upgrade the impl of filter_instag * Fix yapf error in gradient_checker.py to pass Travis-CI * Fix Filter Instag Grad test=develop * Fix Filter Instag Grad test=develop * 1) Fix API.spec, add filter_instag Op. 2) Add Vector Support for CUDA. test=develop * Impl Loss_weight and empty output handler * change Loss Weight datatype to Float32, and add Loss Weight as 2nd output * 1) Support Tensor Input(without LOD) 2) Add Unit test * Filter By Instag Final test=develop * Update API.spec for filter_by_instag test=develop * Update API.spec for filter_by_instag 2 test=develop * Add Filter By Instag Coverage * code format of test_layers.py * code format test_layers.py test=develop * Make API args more readable test=develop * Make API args more readable and pass code format test=develop * Filter By Instag Op, Rename Map to Index Map test=develop * Filter By Instag Op, code format err in filter_by_instag_op.cc test=develop * Filter by instag op: code format of cpp files test=develop * Filter by instag Op: Api spec modification test=develop * Filter by instag Op: Api spec doc id modification test=develop * Filter by instag Op: Api spec and doc preview test=develop test=document_preview * Filter By Instag Op, fix doc erro test=document_preview test=develop * Filter By Instag Op, fix doc err and Api spec test=document_preview test=develop * Filter By Instag Op, fix Api spec test=document_preview test=develop * Filter By Instag Op, fix Paddle Encoforce deprecated warning test=document_preview test=develop * Filter By Instag Op, fix Paddle Encoforce deprecated and code format warning test=document_preview test=develop
-
- 12 8月, 2019 5 次提交
-
-
由 huangjun12 提交于
* add hard_swish activation op (new op) test=develop * remove redundancy files * modify document content of HardSwish OP * add API test in test_layers.py * add dynamic_graph for test_hard_swish
-
由 joanna.wozna.intel 提交于
test=develop
-
由 wangguanzhong 提交于
-
由 gongweibao 提交于
Polish fleet API to support cuda collective mode and nccl2 mode
-
由 Kevin 提交于
Fix seq_pool failed when input dims is too large. Resolve issue #3023
-
- 09 8月, 2019 4 次提交
-
-
由 Zeng Jinle 提交于
-
由 ShenLiang 提交于
* add eye op,test=document_preview test=develop * fix the API.spec, test=develop * fix the document, test=document_preview test=develop * add unitest for CI coverage, test=develop
-
由 Kaipeng Deng 提交于
* add trilinear interp. test=develop * fix unittest. test=develop * add python api and test_layers. test=develop * refine API.spec. test=develop * fix format. test=develop * add python API test. test=develop * format code. test=develop * refine code strcuture. test=develop * fix format * fix doc. test=develop * fix converage. test=develop * fix format. test=develop
-
由 Zhang Ting 提交于
* optimize error message, test=develop * optimize error message, test=develop
-
- 06 8月, 2019 2 次提交
-
-
由 Yiqun Liu 提交于
* Add the check of lod in sequence_softmax kernel. test=develop * Refine the comments. test=develop
-
由 Kevin 提交于
* fix overflow by int32 mul test=develop * fix reference nullptr * fix codestyle test=develop * modify to point in ContextProjectFunctor test=develop * modify to point in ContextProjectFunctor test=develop * modify . to -> test=develop * add var_conv_2d op test=develop * edit api.spec test=develop * ignore unittest if with_mkl=off test=develop * fix python3 division test=develop * fix ignore unittest bug test=develop * remove useless code test=develop * modify api.spec test=develop * modify default_grad.spec test=develop
-
- 05 8月, 2019 2 次提交
-
-
由 pawelpiotrowicz 提交于
test=develop
-
由 Liufang Sang 提交于
* test=develop support Tensor input for ctc_align_op * test=develop add some comment
-
- 02 8月, 2019 3 次提交
-
-
由 hutuxian 提交于
-
由 Zeng Jinle 提交于
* open gc by default, test=develop * fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop * fix conditional_block op eager deletion bug, test=develop * add some comments to reviewers, test=develop
-
由 石晓伟 提交于
* add fusion_seqpool_cvm_concat test=develop * simplify pass, test=develop * fix code style, test=develop
-
- 01 8月, 2019 3 次提交
-
-
由 wawltor 提交于
* test=develop Add the op of unique_with_counts, the op is calc the unqiue input of data, and output the corresponding indices and count of data. * test=develop Check the input and dtype in the op of unique_with_counts * test=develop test=document_preview update the API.spec for `unique_with_counts`, at the same time, optimize the python api in the op of `unique_with_count` * test=develop test=document_preview Fix some python api problem in the op of `unique_with_counts`, and change the error messsage in this op. * Fix some API problem in the op of `unique_with_counts` test=develop test=document_preview * test=develop test=document_preview Fix the api sample of op `unique_with_counts`, and update api.spec
-
由 Jacek Czaja 提交于
test=develop - Extracted key generation from FWD and GRAD into separate function test=develop - Compilation fix test=develop - another compilation test=develop
-
由 LielinJiang 提交于
* fix depthwise conv gpu kernel bug, test=develop * add more depthwise conv test, test=develop
-
- 31 7月, 2019 7 次提交
-
-
由 liuwei1031 提交于
* fix security issue, test=develop * bug fix, test=develop * throw an exception when null pointer data with non-zero length PaddleBuf is passed, test=develop
-
由 Zhaolong Xing 提交于
* Fix Mask rcnn predictor 1. refine memory optim algorithm to support the model with the block op. 2. output diff : modify the affine channel fuse 3. add condition_block_infer op add interface for setting trt calib table dir test=develop * add the missing files. test=develop * 1 add trt fp16 support test=develop
-
由 chengduo 提交于
* update parallel.py test=develop
-
由 HaoRen 提交于
* support center loss * change tensor copy api to high level api tensorcopy * test=develop rewrite the center_loss cuda_kernel to make it faster and add document of the center loss api,also update test function * test=document_preview test=develop update document of center loss * test=document_preview test=develop modify API.spec modify test code remove nouse const_cast
-
由 Leo Zhao 提交于
test=develop
-
由 wopeizl 提交于
Optimize the error report information when loadcombine fail to open model files test=develop (#18888)
-
由 baojun 提交于
* upgrade ngraph to v0.24 test=develop * simplify io test=develop
-
- 30 7月, 2019 2 次提交
-
-
由 Jacek Czaja 提交于
test=develop - optimization of TID to string test=develop
-
由 danleifeng 提交于
-
- 28 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 26 7月, 2019 4 次提交
- 25 7月, 2019 4 次提交
-
-
由 FDInSky 提交于
* test=develop fix cpu roi_align_op backward bug
-
由 Bai Yifan 提交于
-
由 lidanqing 提交于
* change INT8 to template so that checking dst_dt with if-else could be removed. CI will be enabled after fixing reviews * reverse user_residual_memory_p and user_bias_memory_p declaration scope test=develop
-
由 JesseyXujin 提交于
-
- 24 7月, 2019 1 次提交
-
-
由 Bob Zhu 提交于
* extend matmul op to support multiple head multiplication With the support of multiple head, the multiplication of two big matrixes is split into multiplication of several (head_number) small matrixes. e.g. if Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of [6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].
-