1. 12 8月, 2019 2 次提交
    • G
      Polish fleet API to support cuda collective mode and nccl2 mode. (#18966) · 29d87812
      gongweibao 提交于
      Polish fleet API to support cuda collective mode and nccl2 mode
      29d87812
    • K
      Refine embedding Api doc (#18820) · 744279fe
      Kevin 提交于
      * fix overflow by int32 mul test=develop
      
      * fix reference nullptr
      
      * fix codestyle test=develop
      
      * modify to point in ContextProjectFunctor test=develop
      
      * modify to point in ContextProjectFunctor test=develop
      
      * modify . to -> test=develop
      
      * refine embedding padding_idx doc test=develop
      
      * fix math:padding_idx preview bug test=develop
      
      * modify API.spec test=develop
      
      * fix spell error test=develop
      
      * refine dtype parm desc test=develop
      744279fe
  2. 11 8月, 2019 2 次提交
    • Y
      add save cache model api in fleet& add slots shuffle in dataset module & add... · 9150cf50
      yaoxuefeng 提交于
      add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
      
      * add ctr related metric layer test=develop
      
      * add save cache and slots shuffle test=develop
      
      * add save cache and slots shuffle test=develop
      
      * fix error
      
      * fix error
      
      * fix style for ci
      
      * fix for comments
      
      * change SlotsShuffle input to std::strinf for generality
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix stylr
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * change non-const reference to pointer
      
      * fix style
      
      * fix style
      
      * fix style test=develop
      
      * fix style  test=develop
      
      * add return ins num in ctr metric op
      
      * change dtype to float in metric_op.py
      
      * fix error test=develop
      
      * fix style test=develop
      
      * fix API spec
      
      * fix API spec
      
      * fix API spec test=develop
      
      * add UT test=develop
      9150cf50
    • Z
      c51eb6bb
  3. 10 8月, 2019 2 次提交
  4. 09 8月, 2019 6 次提交
  5. 08 8月, 2019 4 次提交
    • J
      add fleet util, add some interface in hdfs util (#18752) · a99bc64c
      jiaqi 提交于
      * add fleet util (fleet/utils/fleet_util.py): functions for users' convenience
      * add some interface in hdfs util : hdfs is_file、hdfs cat
      a99bc64c
    • M
      [WIP] Add Imdb train demo (#18895) · 4ad7c9d5
      mapingshuo 提交于
      * add train demo for imdb text classification task
      
      * make inference library release data_feed dataset dataset_factory data_feed_factory
      
      * add String Data Generator
      
      * new feature of train demo: save model params
      
      * New feature of train demo: set training config using gflags
      
      * change code style for CI
      
      * add readme and dataset for imdb demo trainer
      4ad7c9d5
    • W
      update roi doc in roi_pool and roi_align (#19036) · e50f527f
      wangguanzhong 提交于
      * update roi doc in roi_pool and roi_align, test=develop
      e50f527f
    • L
      Fix memory overwriting of tensors returned by executor (#19030) · 8f537354
      Leo Chen 提交于
      * fix memory overlapping of fetch var (return of executor.run), test=develop
      
      * fix wrong usage of ParallelExecutor in op_test, test=develop
      
      * remove useless parameter and simplify code
      
      * avoid tensor destruct untimely, test=develop
      
      * add testcase independent of OpTest, test=develop
      8f537354
  6. 07 8月, 2019 1 次提交
  7. 06 8月, 2019 4 次提交
  8. 05 8月, 2019 6 次提交
  9. 04 8月, 2019 1 次提交
  10. 02 8月, 2019 5 次提交
  11. 01 8月, 2019 4 次提交
    • J
      adjust ins weight according to nid slot (#18784) · 768059b3
      jiaqi 提交于
      adjust ins weight according to nid slot , user can specify adjust_ins_weight in strategy
      768059b3
    • W
      Add the op of unique_with_counts, expand count function of the op unique (#18720) · 3ab1866c
      wawltor 提交于
      * test=develop
      Add the op of unique_with_counts, the op is calc the unqiue input of data, and output the corresponding indices and count of data.
      
      * test=develop
      Check the input and dtype in the op of unique_with_counts
      
      * test=develop
      test=document_preview
      update the API.spec for `unique_with_counts`, at the same time, optimize the python api in the op of `unique_with_count`
      
      * test=develop
      test=document_preview
      Fix some python api problem in the op of `unique_with_counts`, and change the error messsage in this op.
      
      * Fix some API problem in the op of `unique_with_counts`
      test=develop
      test=document_preview
      
      * test=develop
      test=document_preview
      Fix the api sample of op `unique_with_counts`, and update api.spec
      3ab1866c
    • L
      Fix depthwise conv gpu kernel bug (#18582) · 22fa4c2d
      LielinJiang 提交于
      * fix depthwise conv gpu kernel bug, test=develop
      * add more depthwise conv test, test=develop
      22fa4c2d
    • W
      Fix unitest of light nas. (#18931) · c92b78b0
      whs 提交于
      test=develop
      c92b78b0
  12. 31 7月, 2019 3 次提交
    • J
      set fleet_send_batch_num a default value according to trainer num · 233746d8
      jiaqi 提交于
      (1) set fleet_send_batch_num a default value according to trainer num, the previous 80000 is fixed,if trainer num is much less or larger than 100,global shuffle may have timeout error.
      
      (2) fix load one table bug, add barrier
      233746d8
    • C
      [DyGraph] Make multi-card program faster (#18892) · 20859c08
      chengduo 提交于
      * update parallel.py
      test=develop
      20859c08
    • H
      Add center Loss Op Support (#18681) · 24f85431
      HaoRen 提交于
      * support center loss
      * change tensor copy  api to high level api tensorcopy
      
      * test=develop rewrite the center_loss cuda_kernel to make it faster
      and add document of the center loss api,also update test function
      
      * test=document_preview test=develop
      update document of center loss
      
      * test=document_preview test=develop
      modify API.spec modify test code remove nouse const_cast
      24f85431