1. 22 8月, 2019 6 次提交
    • C
      Update CompiledProgram (#18919) · 4278518f
      chengduo 提交于
      * use PE for compiler
      test=develop
      4278518f
    • Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153) · 2e3ee579
      翟飞跃 提交于
      * Implement the operator with sprase matrix multiply
      
      * Update the URL of mklml library.
      
      test=develop
      
      * Disable MKLML implematation when using no-linux.
      
      test=develop
      
      * optimize bp with mkl sparse matrix
      test=develop
      2e3ee579
    • L
      Enhance OpTest to check the consistency of operators when using and not using inplace (#19101) · a9d5fc51
      Leo Chen 提交于
      * add pybind interface to get all inplace ops, test=develop
      
      * enhance OpTest to check whether the consistency of operator when using and not using inplace, test=develop
      
      * handle corner cases in op_test, test=develop
      
      * support outputs without tensor holder_, like XShape in reshape_op, test=develop
      
      * fix bug, some op has GradOpMaker, but actually no grad_op in OpInfoMap, test=develop
      
      * use reshape_grad instead of reshape in FlattenGradOp, test=develop
      
      * fix error debug dims info for variables like XShape, test=develop
      
      * change computational order in sum_op to relieve computation difference using inplace, test=develop
      
      * add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop
      
      * follow sneaxiy's comments, test=develop
      
      * remove unused DefaultGradOpDescMaker in mkldnn op, test=develop
      a9d5fc51
    • A
      Supports diagonal initialization in uniform_random op (#19299) · 0d29cf18
      Aurelius84 提交于
      * add diag init in Uniform_random op test=develop
      
      * modify api.spec test=develop
      
      * fix unform_batch_size_like maker test=develop
      
      * add diag_num and diag_step assert check test=develop
      0d29cf18
    • C
      [Speedup] Make dygraph data parallel faster (#19280) · 5a579df9
      chengduo 提交于
      * update parallel.py
      test=develop
      5a579df9
    • C
      Split test_parallel_executor_seresnext to three unit test (#19239) · 6a163231
      chengduo 提交于
      * increase test_parallel_executor_seresnext time limit
      test=develop
      
      * split test_parallel_executor_seresnext
      test=develop
      
      * temporally disable reduce_and_allreduce test because of the random failure.
      test=develop
      
      * split gpu and cpu
      test=develop
      6a163231
  2. 21 8月, 2019 2 次提交
  3. 20 8月, 2019 2 次提交
  4. 19 8月, 2019 7 次提交
  5. 18 8月, 2019 1 次提交
  6. 16 8月, 2019 3 次提交
  7. 15 8月, 2019 9 次提交
  8. 14 8月, 2019 4 次提交
  9. 13 8月, 2019 3 次提交
    • L
      UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op (#19011) · c548e370
      lidanqing 提交于
      * integrations problem
      test=develop
      
      * add batch_norm_mkldnn_op backward-reuse test and guassian seed=0 test
      test=develop
      c548e370
    • J
      Instag Implemention (#18394) · 6ac32d09
      Jiawei Wang 提交于
      * instag lod tensor impl
      
      * First PR for instag
      
      * First PR for instag
      
      * Before adding Selection Rows.
      
      * Change name from instag to filter_instag, add upgrade the impl of filter_instag
      
      * Change name from instag to filter_instag, add upgrade the impl of filter_instag
      
      * Fix yapf error in gradient_checker.py to pass Travis-CI
      
      * Fix Filter Instag Grad test=develop
      
      * Fix Filter Instag Grad test=develop
      
      * 1) Fix API.spec, add filter_instag Op. 2) Add Vector Support for CUDA. test=develop
      
      * Impl Loss_weight and empty output handler
      
      * change Loss Weight datatype to Float32, and add Loss Weight as 2nd output
      
      * 1) Support Tensor Input(without LOD) 2) Add Unit test
      
      * Filter By Instag Final test=develop
      
      * Update API.spec for filter_by_instag test=develop
      
      * Update API.spec for filter_by_instag 2 test=develop
      
      * Add Filter By Instag Coverage
      
      * code format of test_layers.py
      
      * code format test_layers.py test=develop
      
      * Make API args more readable test=develop
      
      * Make API args more readable and pass code format test=develop
      
      * Filter By Instag Op, Rename Map to Index Map test=develop
      
      * Filter By Instag Op, code format err in filter_by_instag_op.cc  test=develop
      
      * Filter by instag op: code format of cpp files test=develop
      
      * Filter by instag Op: Api spec modification test=develop
      
      * Filter by instag Op: Api spec doc id modification test=develop
      
      * Filter by instag Op: Api spec and doc preview  test=develop test=document_preview
      
      * Filter By Instag Op, fix doc erro test=document_preview test=develop
      
      * Filter By Instag Op, fix doc err and Api spec test=document_preview test=develop
      
      * Filter By Instag Op, fix Api spec test=document_preview test=develop
      
      * Filter By Instag Op, fix Paddle Encoforce deprecated warning test=document_preview test=develop
      
      * Filter By Instag Op, fix Paddle Encoforce deprecated and code format warning test=document_preview test=develop
      6ac32d09
    • W
      Fix the error of op `ones_like` document,change the output variable · 0019eb37
      wawltor 提交于
      test=document_preview
      test=develop
      Fix the error of op `ones_like` document, change the output variable from x to out.
      0019eb37
  10. 12 8月, 2019 3 次提交
    • H
      Add hard swish op (new op) (#19001) · 20f18930
      huangjun12 提交于
      * add hard_swish activation op (new op)
      test=develop
      
      * remove redundancy files
      
      * modify document content of HardSwish OP
      
      * add API test in test_layers.py
      
      * add dynamic_graph for test_hard_swish
      20f18930
    • G
      Polish fleet API to support cuda collective mode and nccl2 mode. (#18966) · 29d87812
      gongweibao 提交于
      Polish fleet API to support cuda collective mode and nccl2 mode
      29d87812
    • K
      Refine embedding Api doc (#18820) · 744279fe
      Kevin 提交于
      * fix overflow by int32 mul test=develop
      
      * fix reference nullptr
      
      * fix codestyle test=develop
      
      * modify to point in ContextProjectFunctor test=develop
      
      * modify to point in ContextProjectFunctor test=develop
      
      * modify . to -> test=develop
      
      * refine embedding padding_idx doc test=develop
      
      * fix math:padding_idx preview bug test=develop
      
      * modify API.spec test=develop
      
      * fix spell error test=develop
      
      * refine dtype parm desc test=develop
      744279fe