1. 03 9月, 2019 3 次提交
  2. 02 9月, 2019 2 次提交
  3. 31 8月, 2019 1 次提交
    • H
      Paddlebox Framework (#18982) · c756b5d2
      hutuxian 提交于
      * Support looking up embeddings from BoxPS.
      * Add a _pull_box_sparse op, for now this op is not exposed to users.
      * Add a BoxHelper class, providing 'BeginPass', 'EndPass', 'FeedPass' functions and so on.
      * Add 'BoxPSDataset' in python code.
      * Add a compile options WITH_BOX_PS and a MACRO PADDLE_WITH_BOX_PS.
      * Add UT.
      * More concrete information pls refer to: https://github.com/PaddlePaddle/Paddle/pull/18982
      c756b5d2
  4. 30 8月, 2019 5 次提交
    • J
      [MKL-DNN] Fix to face model on AVX512 platforms (#19282) · ecd9f330
      Jacek Czaja 提交于
      - Refactor step 1
      
      - Compilation fix
      
      - Yet another compilation fix
      
      - Even more compilation fix
      
      - Lint fixes
      
      test=develop
      
      - Removed deprectaed PADDLE_ENFORCE occurance
      
      test=develop
      
      - Candidate fix to BN forward
      
      - Lint fixes
      
      test=develop
      
      - Refactoring in data_layout_transform
      
      - compilation fix
      
      - Another comppilation fix
      
      - Step further into darkness
      
      - Yet another compilation fix
      
      - Yet another compilation fix
      
      - missing header
      
      - compilation fix
      
      - Added MKLDNN -> Paddle conversion in fetch op
      
      test=develop
      
      - Compilation fix
      
      test=develop
      
      - Lint
      
      test=develop
      
      - Mul fix
      
      - Fix to MKLDNN MUL op and Elementwise MUL UT
      
      test=develop
      
      - Workaround for diffrent weights with groups representation Paddle vs
        MKL-DNN.
      
      test=develop
      
      - Candidate fix for 5D convolution with groups
      
      - Refactor of fix for conv3d and conv2d in fetch op
      
      test=develop
      
      - Compilation fix
      
      - Still same compilation fix
      
      - Compilation fix
      
      - Compilation fix
      
      - Reverted refactoring of fixes
      
      - Adapted test_conv2d_int8_mkldnn so it exects data in NCHW format
        not NHWC
      
      test=develop
      
      - minor fix in UT
      
      test=develop
      
      - Lint fixes
      
      test=develop
      ecd9f330
    • Y
      add thread scope stat accurate metrics test=develop (#19480) · 10ca3f96
      yaoxuefeng 提交于
      * add thread scope stat accurate metrics test=develop
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style test=develop
      
      * fix style test=develop
      
      * fix style test=develop
      
      * fix style test=develop
      
      * fix style test=develop
      
      * fix style test=develop
      
      * fix style test=develop
      
      * fix conflict
      
      * fix style
      
      * fix style test=develop
      
      * fix error test=develop
      
      * fix error test=develop
      10ca3f96
    • T
      remove unused assert.h (#19529) · 02270b3e
      Tao Luo 提交于
      test=develop
      02270b3e
    • C
      Support feed single persistable variable to PE (#19417) · e340df01
      chengduo 提交于
      * update executor feed
      e340df01
    • Y
      Add a pass to replace dropout_op with scale_op when is_test is true (#19297) · fcec365d
      Yiqun Liu 提交于
      * Add simplify_with_basic_ops_pass to replace dropout_op with scale_op when is_test is true.
      test=develop
      
      * Delete dropout_op directly when upscale_in_train is true.
      test=develop
      
      * Improve the debug string, adding the print of op_desc information.
      
      * Fix the case when dropout's input x is reused as the next op's output.
      
      * Add the pass to inference.
      test=develop
      
      * Change the log level.
      test=develop
      
      * Add unittest for inplace case.
      
      * Add comment to explain the pass.
      
      * Apply the pass for CPU inference.
      test=develop
      
      * Fix the typo.
      test=develop
      
      * Add the check of AttrType.
      test=develop
      fcec365d
  5. 29 8月, 2019 3 次提交
    • T
      support debug each output of each ins (#19004) · 1fe468d3
      Thunderbrook 提交于
      * dump slot
      
      * test
      
      * proto
      
      * dump slot
      
      * test
      
      * proto
      
      * code style
      
      * code style
      
      * code style
      
      * style
      
      * add delete after unseen days
      
      * add unseen days
      
      * code style
      
      * conflict solve
      test=develop
      
      * add clear model
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * support debug tensor of each ins
      test=develop
      
      * support debug tensor of each ins
      test=develop
      
      * learning rate
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * unitest
      
      * style
      
      * style
      
      * multi phase
      
      * add channel
      
      * code style
      
      * style
      
      * style
      
      * unitest
      
      * style
      
      * define
      
      * define
      test=develop
      
      * style
      test=develop
      
      * rm define
      test=develop
      
      * linux
      
      * linux
      test=develop
      
      * style
      test=develop
      
      * output format
      test=develop
      
      * windows ci
      test=develop
      1fe468d3
    • Z
      5c8f210c
    • C
      Increase num_iteration_per_drop_scope (#19075) · b6d1d890
      chengduo 提交于
      * increase num_iteration_per_drop_scope
      test=develop
      
      * Fix bug of while_op
      test=develop
      
      * fix bug of whileOp
      test=develop
      b6d1d890
  6. 28 8月, 2019 1 次提交
    • T
      Fix the correctness of async mode at distributed training (#18863) · 65c73684
      tangwei12 提交于
      * fix correctness of the communicator
      
      * fix a bug in send thread when sending var context is empty, test=develop
      
      * add lookup_table_prefetch_op and prefetch optimize, test=develop
      
      * remove remote prefetch GPU supported
      
      * word2vec force with CPU, test=develop
      
      * test dist remote lookup table force with CPU, test=develop
      65c73684
  7. 27 8月, 2019 1 次提交
  8. 23 8月, 2019 1 次提交
  9. 22 8月, 2019 2 次提交
    • L
      Enhance OpTest to check the consistency of operators when using and not using inplace (#19101) · a9d5fc51
      Leo Chen 提交于
      * add pybind interface to get all inplace ops, test=develop
      
      * enhance OpTest to check whether the consistency of operator when using and not using inplace, test=develop
      
      * handle corner cases in op_test, test=develop
      
      * support outputs without tensor holder_, like XShape in reshape_op, test=develop
      
      * fix bug, some op has GradOpMaker, but actually no grad_op in OpInfoMap, test=develop
      
      * use reshape_grad instead of reshape in FlattenGradOp, test=develop
      
      * fix error debug dims info for variables like XShape, test=develop
      
      * change computational order in sum_op to relieve computation difference using inplace, test=develop
      
      * add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop
      
      * follow sneaxiy's comments, test=develop
      
      * remove unused DefaultGradOpDescMaker in mkldnn op, test=develop
      a9d5fc51
    • T
      stronger the error message of tensor's mutable_data (#19303) · e3c68bde
      Tao Luo 提交于
      * stronger the error message of tensor's mutable_data
      
      test=develop
      
      * update error message
      
      test=develop
      e3c68bde
  10. 21 8月, 2019 1 次提交
  11. 19 8月, 2019 5 次提交
  12. 16 8月, 2019 1 次提交
  13. 15 8月, 2019 2 次提交
  14. 14 8月, 2019 2 次提交
  15. 13 8月, 2019 1 次提交
  16. 12 8月, 2019 3 次提交
  17. 11 8月, 2019 1 次提交
    • Y
      add save cache model api in fleet& add slots shuffle in dataset module & add... · 9150cf50
      yaoxuefeng 提交于
      add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
      
      * add ctr related metric layer test=develop
      
      * add save cache and slots shuffle test=develop
      
      * add save cache and slots shuffle test=develop
      
      * fix error
      
      * fix error
      
      * fix style for ci
      
      * fix for comments
      
      * change SlotsShuffle input to std::strinf for generality
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix stylr
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * change non-const reference to pointer
      
      * fix style
      
      * fix style
      
      * fix style test=develop
      
      * fix style  test=develop
      
      * add return ins num in ctr metric op
      
      * change dtype to float in metric_op.py
      
      * fix error test=develop
      
      * fix style test=develop
      
      * fix API spec
      
      * fix API spec
      
      * fix API spec test=develop
      
      * add UT test=develop
      9150cf50
  18. 10 8月, 2019 1 次提交
  19. 09 8月, 2019 2 次提交
  20. 08 8月, 2019 2 次提交
    • J
      fix QueueDataset queue size (#19016) · fc038da7
      jiaqi 提交于
      * fix QueueDataset queue size,set queue size = batch size * 100, to avoid too many instances in channel when training is much slower than reading data.
      fc038da7
    • L
      Fix memory overwriting of tensors returned by executor (#19030) · 8f537354
      Leo Chen 提交于
      * fix memory overlapping of fetch var (return of executor.run), test=develop
      
      * fix wrong usage of ParallelExecutor in op_test, test=develop
      
      * remove useless parameter and simplify code
      
      * avoid tensor destruct untimely, test=develop
      
      * add testcase independent of OpTest, test=develop
      8f537354