1. 06 9月, 2019 1 次提交
  2. 05 9月, 2019 12 次提交
    • 1
      fix the diff between async mode and async_half mode (#19535) · 2f037c31
      123malin 提交于
      * test=develop,  communicator merge add => merge average
      2f037c31
    • J
      Refactor dygraph (#19107) · e9233d1c
      Jiabin Yang 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * test=develop, refoctor name to make it easier to understand
      
      * test=develop, refoctor name to make it easier to understand
      
      * test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
      e9233d1c
    • M
      add feed_var_names to Prune interface (#19589) · dca9b6c5
      mapingshuo 提交于
      * Fix bug: add feed_vars to the prune function
      dca9b6c5
    • T
      fix bug of communicator flag, test=develop (#19635) · f45cb1c2
      tangwei12 提交于
      f45cb1c2
    • Y
      Integrate NVRTC to support compiling CUDA kernel at runtime (#19422) · 42b5bec6
      Yiqun Liu 提交于
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      42b5bec6
    • T
      unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) · 3ae939e4
      Tao Luo 提交于
      * remove assert.h
      
      * change PADDLE_ASSERT_MSG to PADDLE_ENFORCE
      
      test=develop
      
      * fix tensorrt paddle_enforce
      
      test=develop
      3ae939e4
    • L
    • T
      fix scope lock bug on infer (#19624) · e3e98ed6
      tensor-tang 提交于
      e3e98ed6
    • A
      Add distributions of Categorical and MultivariateNormal (#18263) · 6364ebc4
      Aurelius84 提交于
      * add_distributions_of_normal_and_uniform
      
      * paddle/fluid/API.spec
      
      * modify API.spec
      
      * modified paddle/fluid/API.spec, test=develop
      
      * modify paddle/fluid/API.spec, test=develop
      
      * modify paddle/fluid/API.spec, test=develop
      
      * fix some comment, test=develop
      
      * modify API.spec, test=develop
      
      * Add distributions of Categorical and MultivariateNormal test=develop
      
      * fix pylint codestyle test=develop
      
      * fix conflict file test=develop
      
      * edit API.spec test=develop
      
      * improve sample code test=develop
      
      * modify api.spec test=develop
      6364ebc4
    • Z
      Enable inplace support for some ops (#19612) · 710767d8
      Zeng Jinle 提交于
      * enable inplace for affine_channel op, dropout op, test=develop
      
      * remove dropout inplace for ngraph fails, test=develop
      710767d8
    • F
      add a argument for softshrink python api (#19396) · a18cf5e1
      FDInSky 提交于
      * test=develop add a argument for softshrink python api
      
      * test=develop fix doc format 
      
      test=develop fix doc format
      
      * test=develop fix API.spec
      
      test=develop fix API.spec
      a18cf5e1
    • T
      paddle::framework::vectorize() templatization (#19627) · d6c85c96
      Tao Luo 提交于
      test=develop
      d6c85c96
  3. 04 9月, 2019 8 次提交
  4. 03 9月, 2019 10 次提交
  5. 02 9月, 2019 7 次提交
  6. 01 9月, 2019 2 次提交
    • J
      [MKL-DNN] Refactoring Softmax (#19312) · cef95ee3
      Jacek Czaja 提交于
      * - First set of modifications
      
      - Compilation fixes
      
      - compilation fix
      
      - Another compilation fix
      
      - Moved AcquireSoftmaxPrimitiveDescriptor call into handler
      
      - MKL-DNN Softmax PD refactor
      
      test=develop
      
      - Compilation fix
      
      test=develop
      
      - another compilation fix
      
      - cosmetcis
      
      test=develop
      
      - Compilation fix
      
      - Fix to crash when softmax backward is created
      
      * - Fixes after review of softmax refactoring
      
      test=develop
      cef95ee3
    • Z
      Add retry_allocator for gpu (#19409) · 0a73f720
      Zeng Jinle 提交于
      * add retry_allocator for gpu, test=develop
      
      * follow chengduoZH's comments, test=develop
      
      * follow huihuang's comments,test=develop
      
      * change f,l in enforce.h to be file,line, test=develop
      
      * increase code coverage by adding unittests, test=develop
      
      * fix CMakeLists.txt, test=develop
      0a73f720