1. 29 11月, 2019 1 次提交
  2. 28 11月, 2019 8 次提交
  3. 27 11月, 2019 3 次提交
    • H
      Support data_norm gpu kernel (#21325) · 47a82e38
      hutuxian 提交于
      * support data_norm_op run in CUDA
      * add two parameters sync_stats & summary_decay_rate
      * add UT
      47a82e38
    • Y
      Support numpy bridge (enabled by default in dygraph mode) (#20983) · d5ff79e5
      Youwei Song 提交于
      * add numpy bridge
      
      * fix template compile
      
      * add unittest, add default
      test=develop
      
      * fix unittest
      test=develop
      
      * fix unittest
      test=develop
      
      * zero_copy=True for to_variable,
      test=develop
      
      * bug fix
      test=develop
      
      * disable deprecated NumPy API
      test=develop
      
      * use better design of NumpyAllocator
      test=develop
      
      * fix Py_None check
      test=develop
      
      * reset c++ tracer when jump out dygraph guard
      test=develop
      
      * refine PADDLE_ENFORCE_xx format
      test=develop
      
      * bug fix of tracer switch
      test=develop
      
      * update decref
      test=develop
      d5ff79e5
    • M
      INT8 Fully-connected (#17641) · 5d7d5482
      Michał Gallus 提交于
      * Implement Int8 FC
      
      * Integrate FC into INT8v2
      
      test=develop
      
      * int8 FC: transpose weights before computing scales
      
      test=develop
      
      * Add support for activation_type string in FC
      
      test=develop
      
      * Disable MKL-DNN's FC in VGG16 and 19
      
      test=develop
      
      * Disable FC quantization when mkldnn FC is disabled
      
      test=develop
      
      * Solve PADDLE_ENFORCES in FC int8
      
      * Fix Paddle enforces and remove const cast
      
      test=develop
      
      * Fix style changes
      
      test=develop
      
      * Fix quantizer_tester test and add fc quantization
      
      test=develop
      
      * Fix FC test fail on CUDA
      
      * Remove unnecessary log from quantize placement pass
      
      test=develop
      
      * Add Thread ID to FC hash key
      
      test=develop
      
      * Add comments to MKL-DNN FC Kernel
      
      test=develop
      
      * Refactor quantizer
      
      test=develop
      
      * Fix linter issues
      
      test=develop
      
      * Fix crash in slim googlenet
      
      test=develop
      
      * Fix PADDLE_ENFORCE messages
      
      test=develop
      5d7d5482
  4. 26 11月, 2019 9 次提交
  5. 25 11月, 2019 7 次提交
  6. 24 11月, 2019 3 次提交
  7. 22 11月, 2019 4 次提交
    • L
      add dequantize_abs_max op and modify lookup_table op (#20899) · f0b15184
      Liufang Sang 提交于
      * add int8 kernel to lookup_table op and add dequantize op test=develop
      
      * change paddle_enforce to paddle_enforce_eq test=develop
      
      * change copyright and change some not suitable code test=develop
      
      * remove debug log test=develop
      
      * replace GetInputType with IndicateVarDataType test=develop
      
      * fix EmptyGradMaker test=develop
      
      * fix diff between cpu and gpu test=develop
      
      * use memcopy when int8_t test=develop
      f0b15184
    • H
      support cvm_op run in gpu (#21300) · a6ce2306
      hutuxian 提交于
      Previously, CVM OP was only able to run in CPU. This PR implements its GPU kernel.
      What's more, we improve the UTs about CVM OP.
      a6ce2306
    • C
      Polish some PE code details (#21274) · 95250852
      Chen Weihang 提交于
      * polish code details, test=develop
      
      * futher polish hint msg, test=develop
      95250852
    • Y
      fix bug of issue #21259 (#21287) · 0fd1281e
      Yi Liu 提交于
      pass the argument `allow_out_of_range` of one_hot op to c++ back end.
      0fd1281e
  8. 21 11月, 2019 5 次提交
    • X
      fix fs_client_param bug (#21212) · 319d2ba9
      xujiaqi01 提交于
      * fix fs_client_param bug, user can set this config through fleet_desc_file or fleet config
      * test=develop
      319d2ba9
    • T
      solve pslib core in stop worker (#21263) · 0d17c1b8
      Thunderbrook 提交于
      * general table
      
      * add sparse table
      test=develop
      
      * no cvm
      test=develop
      
      * add no_cvm
      test=develop
      
      * add note
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * add key of optimizer
      test=develop
      
      * solve pslib stop core
      test=develop
      
      * barrier
      test=develop
      
      * add notes
      test=develop
      0d17c1b8
    • Z
    • Z
      open dygraph op test, test=develop (#19787) · c4ede95c
      zhongpu 提交于
      * open dygraph op test, test=develop
      
      * modify to_variable, test=develop
      
      * modify input and output for dygraph, test=develop
      
      * modify input and output for dygraph(fix bug), test=develop
      
      * fix input processing of dygraph op test, test=develop
      
      * fix bug, test=develop
      
      * fix op test, test=develop
      
      * fix forward bug for dygraph, test=develop
      
      * fix mkldnn op test for forward, test=develop
      
      * update nn.py for dygraph, test=develop
      
      * fix crop_tensor_op, test=develop
      
      * fix elementwise_mul_op, test=develop
      
      * fix fill_op, test=develop
      
      * fix some mkldnn op, test=develop
      
      * open backward op test for dygraph, test=develop
      
      * delete log, test=develop
      
      * close backward op test for dygraph, test=develop
      
      * fix bug for edit_distance_op and test_lstm_cudnn_op, test=develop
      
      * fix optest backward bug for dygraph, test=develop
      
      * fix optest backward bug for dygraph, test=develop
      
      * close backward op test for dygraph, test=develop
      
      * close backward op test for dygraph, test=develop
      
      * open dygraph op test, test=develop
      
      * fix op test for dygraph, fix GradOpDescMaker, test=develop
      
      * fix bug for linear_chain_crf_op.h, test=develop
      
      * remove log, test=develop
      
      * remove log, test=develop
      
      * remove log for op_test.py, test=develop
      
      * remove log for op_test.py, test=develop
      
      * fix bug for var_conv_2d_op, change PADDLE_ENFORCE, test=develop
      
      * fix PADDLE_ENFORCE_EQ for hierarchical_sigmoid_op.cc, test=develop
      
      * fix bug for test_increment_ngraph_op.py, test=develop
      
      * fix lod for op test in dygraph, test=develop
      
      * refactor op_test.py to reduce redundant code, test=develop
      
      * fix lod optest, modify InputVar/OutputVar to HasInput/HasOutput, test=develop
      
      * remove debug log, test=develop
      
      * remove redundant code in base.py, test=develop
      
      * fix some error in optest, test=develop
      
      * fix ClearNoNeedBufferInputs function's bug for LoDTensor, test=develop
      
      * refactor op_test.py, test=develop
      
      * remove redundant writing, test=develop
      
      * fix error(get tensor of the grad variable), test=develop
      
      * fix test_concat_mkldnn test_conv2d_mkldnn, test=develop
      
      * fix optest.py for get tensor of LoDTensor, test=develop
      
      * fix optest.py for get tensor of LoDTensor, test=develop
      
      * fix optest.py for get tensor of LoDTensor, test=develop
      
      * fix some redundant code, test=develop
      
      * reslove conflict and rewrite paddle error message, test=develop
      c4ede95c
    • X
      fix fleet util bug (#21254) · eca66f31
      xujiaqi01 提交于
      * fix fleet util bug in save paddle inference model
      * test=develop
      eca66f31