1. 04 Aug, 2019 1 commit
  2. 02 Aug, 2019 5 commits
  3. 01 Aug, 2019 4 commits
    • J
      adjust ins weight according to nid slot (#18784) · 768059b3
      jiaqi committed
      adjust ins weight according to nid slot; the user can specify adjust_ins_weight in strategy
      768059b3
    • W
      Add the op of unique_with_counts, expand count function of the op unique (#18720) · 3ab1866c
      wawltor committed
      * test=develop
      Add the op of unique_with_counts; the op computes the unique elements of the input data and outputs the corresponding indices and counts.
      
      * test=develop
      Check the input and dtype in the op of unique_with_counts
      
      * test=develop
      test=document_preview
      Update the API.spec for `unique_with_counts` and optimize the Python API of the op `unique_with_counts`
      
      * test=develop
      test=document_preview
      Fix some Python API problems in the op of `unique_with_counts`, and change the error message in this op.
      
      * Fix some API problems in the op of `unique_with_counts`
      test=develop
      test=document_preview
      
      * test=develop
      test=document_preview
      Fix the api sample of op `unique_with_counts`, and update api.spec
      3ab1866c
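As a rough illustration of what the commit message above describes (unique values, an index per input element, and a count per unique value), here is a minimal pure-Python sketch. It is not the Paddle kernel or its Python API: the function name mirrors the op only for readability, and keeping unique values in first-occurrence order is an assumption.

```python
def unique_with_counts(values):
    """Return (unique, index, count) for a flat sequence of hashable values.

    unique: distinct values, in order of first occurrence (assumed ordering)
    index:  for each input element, its position in `unique`
    count:  for each unique value, how many times it occurs in the input
    """
    position = {}  # value -> its position in `unique`
    unique, index, count = [], [], []
    for v in values:
        if v not in position:
            position[v] = len(unique)
            unique.append(v)
            count.append(0)
        slot = position[v]
        index.append(slot)
        count[slot] += 1
    return unique, index, count


if __name__ == "__main__":
    print(unique_with_counts([2, 3, 3, 1, 5, 3]))
```

The three outputs are related by `values[i] == unique[index[i]]` and `sum(count) == len(values)`, which is a convenient sanity check for any implementation.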
    • L
      Fix depthwise conv gpu kernel bug (#18582) · 22fa4c2d
      LielinJiang committed
      * fix depthwise conv gpu kernel bug, test=develop
      * add more depthwise conv test, test=develop
      22fa4c2d
    • W
      Fix unit test of light nas. (#18931) · c92b78b0
      whs committed
      test=develop
      c92b78b0
  4. 31 Jul, 2019 5 commits
  5. 30 Jul, 2019 3 commits
  6. 29 Jul, 2019 2 commits
    • Z
      Remove legacy C++ memory optimization codes (#18834) · 8008ab4e
      Zeng Jinle committed
      * remove legacy memory optimization codes, test=develop
      
      * follow huihuang's comments, test=develop
      
      * follow luotao's comments, test=develop
      8008ab4e
    • T
      add clear_model interface in fleetwrapper (#18815) · 52c1431e
      Thunderbrook committed
      * dump slot
      
      * test
      
      * proto
      
      * dump slot
      
      * test
      
      * proto
      
      * code style
      
      * code style
      
      * code style
      
      * style
      
      * add delete after unseen days
      
      * add unseen days
      
      * code style
      
      * conflict solve
      test=develop
      
      * add clear model
      
      * code style
      test=develop
      
      * code style
      test=develop
      52c1431e
  7. 28 Jul, 2019 2 commits
  8. 27 Jul, 2019 2 commits
  9. 26 Jul, 2019 2 commits
    • A
      Add LeakyReLU MKLDNN support (#18762) · ee022279
      Adam committed
      ee022279
    • Z
      Feature/mem opt pass refactor (#18735) · a802da65
      Zeng Jinle committed
      * first version memory optimize pass, test=develop
      
      * remove move_tensor_sharing_pass, test=develop
      
      * refine code comments, add unittests, test=develop
      
      * turn off memory_optimize by default, test=develop
      
      * follow huihuang's comments, test=develop
      
      * follow chengduoZH's comments, test=develop
      
      * fix grammar error, add const qualifier, fix pass_test exception message, test=develop
      
      * follow chengduoZH's comments 2nd, test=develop
      a802da65
  10. 25 Jul, 2019 4 commits
  11. 24 Jul, 2019 5 commits
    • B
      Extend Matmul to support matrix multiplication with multiple heads (#18570) · 220eef60
      Bob Zhu committed
      * extend matmul op to support multiple head multiplication
      
      With multiple-head support, the multiplication of two big matrices is
      split into the multiplication of several (head_number) small matrices. E.g. if
      Mat A is [3, 24] and Mat B is [24, 4], then multiplying A and B with head_number
      set to 4 splits Mat A into 4 matrices of [3, 6] and Mat B into 4 matrices of
      [6, 4]. The final result is 4 matrices of [3, 4], i.e. [3, 16].
      220eef60
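The shapes in the commit message above can be checked with a small NumPy sketch. This is not the Paddle matmul kernel: the function name is hypothetical, and the layout (contiguous slices along the shared dimension, per-head results concatenated along columns) is an assumption read off the [3, 24] x [24, 4] example.

```python
import numpy as np


def multihead_matmul(a, b, head_number):
    """Split the shared dimension into `head_number` chunks, multiply
    chunk by chunk, and concatenate the per-head results column-wise."""
    m, k = a.shape
    k_b, n = b.shape
    if k != k_b:
        raise ValueError("inner dimensions must match")
    if k % head_number != 0:
        raise ValueError("inner dimension must be divisible by head_number")
    step = k // head_number
    blocks = [
        a[:, h * step:(h + 1) * step] @ b[h * step:(h + 1) * step, :]
        for h in range(head_number)
    ]  # each block has shape [m, n]
    return np.concatenate(blocks, axis=1)  # shape [m, head_number * n]


if __name__ == "__main__":
    a = np.arange(3 * 24, dtype=np.float64).reshape(3, 24)
    b = np.arange(24 * 4, dtype=np.float64).reshape(24, 4)
    print(multihead_matmul(a, b, head_number=4).shape)
```

Under this layout, summing the per-head [3, 4] blocks recovers the ordinary product `a @ b`, which is one way to sanity-check the split.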
    • W
      Add python API for appending LoD level (#18702) · 075e1cf7
      whs committed
      * Make the lod reset op support appending a LoD level.
      
      * Fix API.spec
      test=develop
      
      * Fix unit test.
      test=develop
      
      * Add python api for lod append.
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix format of doc.
      test=develop
      
      * Fix unit test.
      test=develop
      
      * Fix doc.
      test=develop
      075e1cf7
    • C
      Enhance backward process (#18700) · 8259f141
      chengduo committed
      * prune backward ops
      test=develop
      8259f141
    • J
      Modify auc doc. Add output variable description, previously was the scalar... · 25c9b57b
      JesseyXujin committed
      Modify auc doc. Add output variable description; the output was previously a scalar type and is now a tuple type. test=develop (#18771)
      
      25c9b57b
    • T
      add slot to sparse table (#18686) · d8396281
      Thunderbrook committed
      The change includes 2 things:
      
      1. Saving the delta model and shrinking the table were previously controlled by the same parameter; now delete_after_unseen_days is added to control shrinking the table.
      2. Values in the sparse table previously had no slot; now a slot is added to the sparse table, and DownpureCtrAccessor is added to support the new meta.
      test=develop
      d8396281
  12. 23 Jul, 2019 3 commits
    • J
      support patch data, add load_one_table, fix bug (#18509) · d18aabb4
      jiaqi committed
      (1) support patch data (merge slots of instances with the same line id, modify a dense layer that changes its size)
      (2) add fleet load_one_table interface, which supports loading from a paddle model and loading from a pslib model
      (3) fix a push sparse bug that caused push sparse to cost more time (about 10% in my test case)
      (4) when some slots are not in one of your networks (join/update, etc.), data feed, collect label info, and push/pull sparse will skip these slots instead of throwing an error
      (5) add more debug info in TrainFilesWithProfiler
      d18aabb4
    • C
      Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664) · fd3aad6c
      chengduo committed
      * support sparse gradients
      test=develop
      fd3aad6c
    • Y
      supports distributed classification (#18690) · 157211c4
      Yi Liu committed
      * supports distributed classification training
      * update API.spec
      * fix even division in python3
      * change "index_range" to "index_num" in shard_index operator
      test=document_preview
      test=develop
      157211c4
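To make the `index_num` and Python 3 division points above concrete, here is a minimal pure-Python sketch of sharding class indices across workers. It is not the Paddle operator: the parameter names other than `index_num` (`nshards`, `shard_id`, `ignore_value`) and the ceiling-division shard size are assumptions made for illustration. The key detail is that `/` yields a float in Python 3, so integer division `//` is needed when computing shard sizes.

```python
def shard_index(indices, index_num, nshards, shard_id, ignore_value=-1):
    """Map global indices in [0, index_num) to local indices on one shard.

    Indices that belong to another shard are replaced by `ignore_value`.
    """
    if not 0 <= shard_id < nshards:
        raise ValueError("shard_id must be in [0, nshards)")
    # Integer (floor) division; `/` would produce a float in Python 3.
    shard_size = (index_num + nshards - 1) // nshards
    return [
        i % shard_size if i // shard_size == shard_id else ignore_value
        for i in indices
    ]


if __name__ == "__main__":
    labels = [1, 6, 12, 19]
    print(shard_index(labels, index_num=20, nshards=2, shard_id=0))
    print(shard_index(labels, index_num=20, nshards=2, shard_id=1))
```

With 20 classes and 2 shards, each shard owns 10 consecutive class ids, so every label is kept (as a local id) on exactly one shard and ignored on the other.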
  13. 22 Jul, 2019 2 commits
    • Z
      d07ad4c6
    • H
      Fix random test_recurrent_op failure (#18718) · a3028bb7
      Huihuang Zheng committed
      The change includes 3 things:
      
      1. Set CPU_NUM to 1 in the tests, because otherwise the ParallelExecutor prints a warning that CPU_NUM is not set and falls back to the default of 1.
      
      2. The old tests compare two RNNs, a hand-written simple RNN and the same RNN built by Paddle, but initialized the RNN weights with numpy random and Paddle random separately. Fixed it by setting the weight and bias values explicitly.
      
      3. Also set the numpy random seed in the tests. The allowed difference between the two RNNs can now be smaller in the tests (rtol reduced from 0.1 and 0.2 to 0.01).
      
      test=develop
      a3028bb7
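The testing idea in points 2 and 3 above (share explicit weights between two implementations and fix the NumPy seed so a tight tolerance is safe) can be sketched as follows. This is not the actual test_recurrent_op code: both RNN functions, `make_inputs`, and the seed value are hypothetical stand-ins, with a second NumPy implementation playing the role of the Paddle-built RNN.

```python
import numpy as np


def rnn_stepwise(x, w, u, b):
    """Hand-written simple RNN: h_t = tanh(x_t W + h_{t-1} U + b)."""
    h = np.zeros(u.shape[0])
    outputs = []
    for x_t in x:
        h = np.tanh(x_t @ w + h @ u + b)
        outputs.append(h)
    return np.stack(outputs)


def rnn_precomputed(x, w, u, b):
    """Same recurrence, with the input projection computed up front."""
    xw = x @ w + b
    h = np.zeros(u.shape[0])
    outputs = np.empty_like(xw)
    for t in range(x.shape[0]):
        h = np.tanh(xw[t] + h @ u)
        outputs[t] = h
    return outputs


def make_inputs(seed=123, steps=5, in_dim=4, hidden=3):
    """Seed NumPy, then draw one set of inputs and weights to share."""
    np.random.seed(seed)
    x = np.random.uniform(-1.0, 1.0, size=(steps, in_dim))
    w = np.random.uniform(-0.1, 0.1, size=(in_dim, hidden))
    u = np.random.uniform(-0.1, 0.1, size=(hidden, hidden))
    b = np.random.uniform(-0.1, 0.1, size=hidden)
    return x, w, u, b


if __name__ == "__main__":
    x, w, u, b = make_inputs()
    # Both implementations receive identical weights, so rtol=0.01 is safe.
    np.testing.assert_allclose(
        rnn_stepwise(x, w, u, b), rnn_precomputed(x, w, u, b), rtol=0.01
    )
    print("outputs match within rtol=0.01")
```

Because the weights are passed in rather than initialized separately inside each implementation, any remaining difference comes only from floating-point ordering, not from random initialization.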