1. 20 8月, 2019 1 次提交
    • W
      optimize the realization of cuda dropout (#19136) · 6e326ca2
      wangchaochaohu 提交于
      * cuda optimie for dropout
      
      * remove tmp swp file
      
      * fix compile error test=develop
      
      * test=develop optimize the cuda realization of dropout op
      
      * remove unsed code test=develop
      
      * remove tmp file test=develop
      6e326ca2
  2. 19 8月, 2019 6 次提交
  3. 18 8月, 2019 1 次提交
  4. 16 8月, 2019 2 次提交
  5. 15 8月, 2019 2 次提交
  6. 14 8月, 2019 3 次提交
  7. 13 8月, 2019 1 次提交
    • J
      Instag Implemention (#18394) · 6ac32d09
      Jiawei Wang 提交于
      * instag lod tensor impl
      
      * First PR for instag
      
      * First PR for instag
      
      * Before adding Selection Rows.
      
      * Change name from instag to filter_instag, add upgrade the impl of filter_instag
      
      * Change name from instag to filter_instag, add upgrade the impl of filter_instag
      
      * Fix yapf error in gradient_checker.py to pass Travis-CI
      
      * Fix Filter Instag Grad test=develop
      
      * Fix Filter Instag Grad test=develop
      
      * 1) Fix API.spec, add filter_instag Op. 2) Add Vector Support for CUDA. test=develop
      
      * Impl Loss_weight and empty output handler
      
      * change Loss Weight datatype to Float32, and add Loss Weight as 2nd output
      
      * 1) Support Tensor Input(without LOD) 2) Add Unit test
      
      * Filter By Instag Final test=develop
      
      * Update API.spec for filter_by_instag test=develop
      
      * Update API.spec for filter_by_instag 2 test=develop
      
      * Add Filter By Instag Coverage
      
      * code format of test_layers.py
      
      * code format test_layers.py test=develop
      
      * Make API args more readable test=develop
      
      * Make API args more readable and pass code format test=develop
      
      * Filter By Instag Op, Rename Map to Index Map test=develop
      
      * Filter By Instag Op, code format err in filter_by_instag_op.cc  test=develop
      
      * Filter by instag op: code format of cpp files test=develop
      
      * Filter by instag Op: Api spec modification test=develop
      
      * Filter by instag Op: Api spec doc id modification test=develop
      
      * Filter by instag Op: Api spec and doc preview  test=develop test=document_preview
      
      * Filter By Instag Op, fix doc erro test=document_preview test=develop
      
      * Filter By Instag Op, fix doc err and Api spec test=document_preview test=develop
      
      * Filter By Instag Op, fix Api spec test=document_preview test=develop
      
      * Filter By Instag Op, fix Paddle Encoforce deprecated warning test=document_preview test=develop
      
      * Filter By Instag Op, fix Paddle Encoforce deprecated and code format warning test=document_preview test=develop
      6ac32d09
  8. 12 8月, 2019 5 次提交
  9. 09 8月, 2019 4 次提交
  10. 06 8月, 2019 2 次提交
    • Y
      Add the check of lod in sequence_softmax kernel. (#18996) · a445c335
      Yiqun Liu 提交于
      * Add the check of lod in sequence_softmax kernel.
      test=develop
      
      * Refine the comments.
      test=develop
      a445c335
    • K
      Add var_conv_2d op (#18518) · e681d655
      Kevin 提交于
      * fix overflow by int32 mul test=develop
      
      * fix reference nullptr
      
      * fix codestyle test=develop
      
      * modify to point in ContextProjectFunctor test=develop
      
      * modify to point in ContextProjectFunctor test=develop
      
      * modify . to -> test=develop
      
      * add var_conv_2d op test=develop
      
      * edit api.spec test=develop
      
      * ignore unittest if with_mkl=off test=develop
      
      * fix python3 division test=develop
      
      * fix ignore unittest bug test=develop
      
      * remove useless code test=develop
      
      * modify api.spec test=develop
      
      * modify default_grad.spec test=develop
      e681d655
  11. 05 8月, 2019 2 次提交
  12. 02 8月, 2019 3 次提交
  13. 01 8月, 2019 3 次提交
    • W
      Add the op of unique_with_counts, expand count function of the op unique (#18720) · 3ab1866c
      wawltor 提交于
      * test=develop
      Add the op of unique_with_counts, the op is calc the unqiue input of data, and output the corresponding indices and count of data.
      
      * test=develop
      Check the input and dtype in the op of unique_with_counts
      
      * test=develop
      test=document_preview
      update the API.spec for `unique_with_counts`, at the same time, optimize the python api in the op of `unique_with_count`
      
      * test=develop
      test=document_preview
      Fix some python api problem in the op of `unique_with_counts`, and change the error messsage in this op.
      
      * Fix some API problem in the op of `unique_with_counts`
      test=develop
      test=document_preview
      
      * test=develop
      test=document_preview
      Fix the api sample of op `unique_with_counts`, and update api.spec
      3ab1866c
    • J
      - Removed passing X from FWD to GRAD via device context (#18911) · 5cf2d385
      Jacek Czaja 提交于
      test=develop
      
      - Extracted key generation from FWD and GRAD into separate function
      
      test=develop
      
      - Compilation fix
      
      test=develop
      
      - another compilation
      
      test=develop
      5cf2d385
    • L
      Fix depthwise conv gpu kernel bug (#18582) · 22fa4c2d
      LielinJiang 提交于
      * fix depthwise conv gpu kernel bug, test=develop
      * add more depthwise conv test, test=develop
      22fa4c2d
  14. 31 7月, 2019 5 次提交
    • L
      fix several security bugs reported by security team (#18831) · 0d996908
      liuwei1031 提交于
      * fix security issue, test=develop
      
      * bug fix, test=develop
      
      * throw an exception when null pointer data with non-zero length PaddleBuf is passed, test=develop
      0d996908
    • Z
      Trt fp16 support (#18860) · 61238d31
      Zhaolong Xing 提交于
      * Fix Mask rcnn predictor
          1. refine memory optim algorithm to support the model with the block op.
          2. output diff : modify the affine channel fuse
          3. add condition_block_infer op
      add interface for setting trt calib table dir
      test=develop
      
      * add the missing files.
      test=develop
      
      * 1 add trt fp16 support
      test=develop
      61238d31
    • C
      [DyGraph] Make multi-card program faster (#18892) · 20859c08
      chengduo 提交于
      * update parallel.py
      test=develop
      20859c08
    • H
      Add center Loss Op Support (#18681) · 24f85431
      HaoRen 提交于
      * support center loss
      * change tensor copy  api to high level api tensorcopy
      
      * test=develop rewrite the center_loss cuda_kernel to make it faster
      and add document of the center loss api,also update test function
      
      * test=document_preview test=develop
      update document of center loss
      
      * test=document_preview test=develop
      modify API.spec modify test code remove nouse const_cast
      24f85431
    • L
      use mkl to accelerate gelu_grad (#18099) · 86e494eb
      Leo Zhao 提交于
      test=develop
      86e494eb