1. 28 Aug, 2019 1 commit
    • Fix the correctness of async mode at distributed training (#18863) · 65c73684
      tangwei12 committed
      * fix correctness of the communicator
      
      * fix a bug in send thread when sending var context is empty, test=develop
      
      * add lookup_table_prefetch_op and prefetch optimize, test=develop
      
      * remove remote prefetch GPU supported
      
      * word2vec force with CPU, test=develop
      
      * test dist remote lookup table force with CPU, test=develop
  2. 30 Mar, 2019 1 commit
  3. 24 Jan, 2019 2 commits
  4. 11 Dec, 2018 1 commit
  5. 10 Dec, 2018 2 commits
  6. 08 Dec, 2018 1 commit
  7. 07 Dec, 2018 2 commits
  8. 06 Dec, 2018 3 commits
  9. 27 Nov, 2018 1 commit
    • Make NCE_OP more efficient and support SelectedRows (#14469) · 56a4912b
      tangwei12 committed
      * Fix truncated normal.
      
      * Fix.
      
      * Make nce support more distribution.
      
      * Fix API.spec.
      
      * Fix python API.
      
      * Fix.
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix sampler.
      
      * Fix order of arguments in python API.
      test=develop
      
      * NCE add selectedrows support
      
      * NCE update weighted sampling
      
      * fix bugs in nce_op, and assign_value_op optimized
      
      * fix bugs in nce_op, revert assign_value_op
      
      * nce_op optimize
      
      * nce_op optimize
      
      * nce_op optimize
      
      * add selectedRows test later
      
      test=develop
      
      * add selectedRows supported
      
      * add selectedRows supported
      
      test=develop
      
      * add selectedRows supported
      
      * add nce selectedRows supported, test=develop
      
      * add nce selectedRows supported
      
      * add nce selectedRows supported, test=develop
      
      * fix height in nce, test=develop
      
      * add ut
      
      * add ut, test=develop
      
      * make AutoGrownIndex inline
      test=develop
      
      * fix tiny error, test=develop
  10. 16 Nov, 2018 1 commit
    • Make nce support more distribution. (#13549) · 17226782
      whs committed
      * Fix truncated normal.
      
      * Fix.
      
      * Make nce support more distribution.
      
      * Fix API.spec.
      
      * Fix python API.
      
      * Fix.
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix sampler.
      
      * Fix order of arguments in python API.
      test=develop
  11. 19 Apr, 2018 1 commit
  12. 12 Feb, 2018 1 commit
  13. 10 Feb, 2018 2 commits
  14. 22 Jan, 2018 1 commit
    • Fix bug of nce_op · af8cb820
      Yang Yu committed
      * also divide by num_samples when returning the cost of nce_op
  15. 26 Dec, 2017 1 commit
  16. 12 Dec, 2017 1 commit
    • Refine device context (#6433) · 61ec0b95
      QI JUN committed
      The main changes are:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
  17. 06 Dec, 2017 1 commit
  18. 29 Nov, 2017 1 commit
  19. 28 Nov, 2017 3 commits
  20. 16 Nov, 2017 1 commit
  21. 08 Nov, 2017 1 commit
    • Add nce op · 3d276277
      wanghaoshuang committed
      1. Add nce forward and backward kernel for CPU