1. 12 9月, 2019 1 次提交
  2. 11 9月, 2019 10 次提交
  3. 10 9月, 2019 7 次提交
  4. 09 9月, 2019 6 次提交
  5. 08 9月, 2019 1 次提交
  6. 07 9月, 2019 1 次提交
  7. 06 9月, 2019 6 次提交
  8. 05 9月, 2019 8 次提交
    • 1
      fix the diff between async mode and async_half mode (#19535) · 2f037c31
      123malin 提交于
      * test=develop,  communicator merge add => merge average
      2f037c31
    • J
      Refactor dygraph (#19107) · e9233d1c
      Jiabin Yang 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * test=develop, refoctor name to make it easier to understand
      
      * test=develop, refoctor name to make it easier to understand
      
      * test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
      e9233d1c
    • M
      add feed_var_names to Prune interface (#19589) · dca9b6c5
      mapingshuo 提交于
      * Fix bug: add feed_vars to the prune function
      dca9b6c5
    • T
      fix bug of communicator flag, test=develop (#19635) · f45cb1c2
      tangwei12 提交于
      f45cb1c2
    • Y
      Integrate NVRTC to support compiling CUDA kernel at runtime (#19422) · 42b5bec6
      Yiqun Liu 提交于
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      42b5bec6
    • T
      unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) · 3ae939e4
      Tao Luo 提交于
      * remove assert.h
      
      * change PADDLE_ASSERT_MSG to PADDLE_ENFORCE
      
      test=develop
      
      * fix tensorrt paddle_enforce
      
      test=develop
      3ae939e4
    • L
    • T
      fix scope lock bug on infer (#19624) · e3e98ed6
      tensor-tang 提交于
      e3e98ed6