1. 20 3月, 2018 1 次提交
    • S
      CMake refine for HIP support. · e50205e7
      sabreshao 提交于
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external module such as eigen may need HIP port.
      4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
      e50205e7
  2. 16 3月, 2018 6 次提交
  3. 15 3月, 2018 7 次提交
    • Q
      Fix bug in LRN operator. (#9124) · 1cd700d8
      qingqing01 提交于
      1cd700d8
    • R
      change the dir of docs · 64775126
      ranqiu 提交于
      64775126
    • Q
      Fix a critical bug in softmax_with_cross_entropy_op backward. (#9120) · b5a16dca
      qingqing01 提交于
      * Fix a critical bug in softmax_with_cross_entropy_op, which will lead to the wrong gradients.
      
      * Enhance unit testing.
      b5a16dca
    • T
      Implement Select OP (#9088) · 1e4c504e
      Thuan Nguyen 提交于
      * Fix old documentation for channel_recv
      
      * Initial design of CSP select
      
      * Redesign channel implementation for Select Op
      
      * Remove unecessary header
      
      * Initial checkin of select op, currently will read all the conditional_op in the cases block and also pull out all channels involved in the select.
      
      * Init python select op API
      
      * Python select bug fix when checking op creates block
      
      * Add case_to_execute as (a) input to select, (b) into the passed inputs into the select op
      
      * Add in addition code for select op
      
      * Init fibonacci test from python
      
      * implement fibonnaci sequence test
      
      * update fib unit test
      
      * Improve select test cases
      
      * Shorten non-pep-8-ed lines
      
      * Add methods on channel needed by select op
      
      * Fix compile issues, finish implementation, still need to debug code
      
      * Fix issue with fibonncci test, it works now!
      
      * Change QueueMessage callback to take in an ChannelAction enum, fix select unit test
      
      * Fix case attributes
      
      * Fix issue with select control flow
      
      * Make cases - previously on each selectcase conditional_block - attributes to select
      
      * Use class constants for type of channel
      
      * Change select op to take in "cases" attribute
      
      * return boolean from select callback function to tell Channel if this RECV or SEND should be executed
      
      * Improve attributes and inputs comments on select op
      
      * Fix issues with python unit test
      
      * Assert fibonacci final output
      
      * Fix issue when channel name / channel var is null for "default" case in select op
      
      * Assert base select test output
      
      * Make QueueMessage use shared pointer and modify the order of the callback
      
      * Fixing the order in which the callback is called
      
      * Move channel utility methods to paddle/fluid/operators/concurrency/channel_util
      
      * Create channel_util and move channel util methods
      
      * Fix crash when calling select_op
      
      * Fix deadlock
      
      * Fix issue of channel destructor deadlock
      
      * Fix precommit issues
      
      * Accidentally checked in changes to beam_search_op, reverting change.
      
      * Fix dependency issue in concurrency cmake
      
      * add device_context dependency for concurrency target
      1e4c504e
    • D
      [Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53
      dzhwinter 提交于
      * "add softmax cudnn functor support"
      
      * "add testing"
      
      * "refine cmakelist"
      
      * "sequence softmax forward speed up"
      
      * "add softmax grad"
      
      * "fix sequence softmax test"
      
      * "add double precision'
      
      * "fix softmax test"
      
      * "add softmax cudnn support"
      
      * "fix softmax cudnn test"
      
      * "add softmax to nn.py"
      
      * "fix compile bug"
      
      * "refine cmakelist"
      
      * "fix ci"
      
      * "fix based on comment"
      
      * "fix based on comments"
      
      * "fix ci"
      128adf53
    • K
      Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017) · e26f1123
      Kexin Zhao 提交于
      * add fp16 mul op support
      
      * small fix
      
      * fix bug
      
      * small fix
      
      * fix PADDLE_WITH_CUDA compiling issue
      
      * reorg code
      
      * test for pybind
      
      * treate as float16 as uint16_t in pybind
      
      * bind np.float16 to paddle float16
      
      * small fix
      
      * clean code
      
      * remove redundancy
      
      * fix mul_op test
      
      * address comments
      
      * small fix
      
      * add is_float16_supported func
      e26f1123
    • D
      "exported scatter to python" (#9038) · 71400711
      dzhwinter 提交于
      * "exported scatter to python"
      
      * Revert ""exported scatter to python""
      
      This reverts commit 38745a62.
      
      * "polish scatter and export to python"
      71400711
  4. 14 3月, 2018 4 次提交
  5. 13 3月, 2018 5 次提交
  6. 12 3月, 2018 10 次提交
  7. 10 3月, 2018 5 次提交
  8. 09 3月, 2018 2 次提交