1. 09 Oct, 2019 2 commits
  2. 29 Sep, 2019 1 commit
  3. 28 Sep, 2019 2 commits
    • Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02
      Committed by qingqing01
      * Custom ops must be written according to the framework OP spec.
      * Package fluid_framework.so and headers into whl.
      * Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
      * Export some C-APIs to merge OpInfo between core.so and custom_op.so.
      * Add unit testing.
      * Update API.spec.
      1a3eef02
    • Dygraph Layer kwargs & param getter setter (#19901) · 54e07994
      Committed by Youwei Song
      * opt FC
      
      * opt rest of dygraph.nn
      
      * new param shape check and unittest
      
      * add kwargs for Layer
      
      * add new set_value api
      
      * use property decorator
      
      * update API.spec, test=develop
      
      * use UserList, separate gettersetters, test=develop
      
      * update test_custom_layer_with_kwargs, test=develop
      
      * fix UserList compatibility, test=develop
      
      * fix UserList compatibility, test=develop
      
      * keep FC._w, test=develop
      
      * add unittests, Conv3D bug fix, test=develop
      
      * clean code, test=develop
      
      * fix dygraph guard in unittest, test=develop
      
      * add property setters, remove unused param in tracer, test=develop
      
      * tracer none check, test=develop
      
      * merge, test=develop
      
      * refine, test=develop
      
      * bug fix in prelu and conv3d_transpose, test=develop
      
      * rm __set__, test=develop
      
      * set tensor value instead of assign op
      
      * fix property setter call, test=develop
      
      * fix api.spec, test=develop
      
      * fix doc sample, test=develop
      54e07994
  4. 27 Sep, 2019 1 commit
    • Optimize/optimize dygraph api (#19999) · 39ff0f9c
      Committed by Jiabin Yang
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, Add Variable api and refine dygraph related API
      
      * test=develop, Add Variable api and refine dygraph related API
      
      * test=develop, refine test for new api and error info
      
      * test=develop, refine error info and test_layers
      
      * test=develop, add API.spec
      
      * test=devleop, fix to_string python2 and python3 compat error and refine doc
      
      * test=devleop, add API spec
      
      * test=devleop, update API spec
      
      * test=devleop, update API spec
      
      * test=develop, invoke ci
      
      * test=develop, fix example code
      
      * test=develop, update API spec
      
      * test=develop, add compat test and fix inplace compat dict error
      39ff0f9c
  5. 26 Sep, 2019 1 commit
    • Add new data layer (#19916) · 88af4ab6
      Committed by Huihuang Zheng
      The new "fluid.data" differs from the old "fluid.layers.data" in the following ways:
      
      1. Add shape and dtype check.
      2. Remove "append_batch_size" parameter. We won't offer this in the new data layer because other deep learning platforms don't have this kind of data layer pre-processing. It may confuse users.
      3. Remove "stop gradient" parameter because the data layer doesn't do back-propagation.

      TODO:
      The data fed to the data layer by the executor is now checked; should the feed data of readers also be checked in the future?
      88af4ab6
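The shape and dtype check listed for the new data layer can be pictured with a small pure-Python sketch. This is not Paddle's implementation: the helper name `check_feed` and the convention that `None` or `-1` marks a dimension of any size are assumptions made for illustration only.

```python
def check_feed(declared_shape, declared_dtype, fed_shape, fed_dtype):
    """Raise ValueError if fed data does not match the declared shape/dtype.

    Hypothetical helper (not a Paddle API): a declared dimension of
    None or -1 is assumed to accept any size.
    """
    if declared_dtype != fed_dtype:
        raise ValueError(
            "dtype mismatch: declared %s, fed %s" % (declared_dtype, fed_dtype))
    if len(declared_shape) != len(fed_shape):
        raise ValueError(
            "rank mismatch: declared %d dims, fed %d dims"
            % (len(declared_shape), len(fed_shape)))
    for axis, (want, got) in enumerate(zip(declared_shape, fed_shape)):
        if want is None or want == -1:
            continue  # variable-size dimension, e.g. the batch axis
        if want != got:
            raise ValueError(
                "shape mismatch on axis %d: declared %d, fed %d"
                % (axis, want, got))
    return True


# A batch of 8 images fed into a layer declared with a free batch axis.
check_feed([None, 3, 224, 224], "float32", (8, 3, 224, 224), "float32")
```

Failing early with a message that names the offending axis is the point of such a check: a mismatch is reported at feed time rather than surfacing later inside an operator.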
  6. 25 Sep, 2019 2 commits
    • Avoid treating broadcast as initialization operation (#19857) · 5920d69d
      Committed by ShenLiang
      * treat broadcast as non-initial, test=develop
      
      * rename the class name
      
      * rename the class name, test=develop
      5920d69d
    • Add support for new QAT models (#18970) · 4286a627
      Committed by Wojciech Uss
      * Add support for new QAT models
      
      test=develop
      Co-Authored-By: Michał Gallus <michal.gallus@intel.com>
      Co-Authored-By: Wojciech Uss <wojciech.uss@intel.com>
      
      * fixed fps results
      
      test=develop
      
      * fix top5 accuracy drop problem
      
      * updated for new QAT models
      
      * skip quantizing average pooling - dirty but working
      
      * add missing pass
      
      * added missing conv+brelu fuse pass
      
      * removed a call to non-existent pass
      
      test=develop
      
      * renamed pass
      
      test=develop
      
      * Adjust finding pooling scale to newest QAT models
      
      * Remove unnecessary code from quantization_mkldnn_pass
      
      * Copy Pooling input scale to output scale in QAT
      
      * Refactor & remove unused code in QAT
      
      * Incorporate fp32 FC into QAT
      
      test=develop
      
      * Enable graph drawing with debug flag
      
      test=develop
      
      * Add tests for QATv2
      
      * Fix paths for QATv2 models
      
      test=develop
      
      * Add option to save transformed int8 qat model
      
      test=develop
      
      * Remove redundant lines from qat mkldnn pass
      
      test=develop
      
      * Delegate disablement of avg pooling to qat
      
      test=develop
      
      * fix CI bug, test=develop
      
      * Follow Wangzhen's Review, test=develop
      
      * Update API.spec
      
      test=develop
      
      * Name False in (is_unsigned, TensorScale) tuple
      
      test=develop
      4286a627
  7. 21 Sep, 2019 1 commit
    • Feature/auto prune in dygraph (#19757) · 45425411
      Committed by Jiabin Yang
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix compile error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * test=develop, refactor name to make it easier to understand

      * test=develop, refactor name to make it easier to understand

      * test=develop, fix multi-gpu failed problem, add Tracer tests, change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
      
      * support auto prune in dygraph mode
      
      * test=develop, support auto prune
      
      * test=develop, merge develop conflict
      
      * test=develop, fix test_layer and test_tracer ut
      
      * test=develop, fix bug which may cause stop_gradient disabled with a list of backward inputs
      45425411
  8. 17 Sep, 2019 2 commits
    • add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208) · 88628016
      Committed by liym27
      add support for parameter inference when the argument starts or ends is a list containing integers and tensor variables;
      test=develop,test=document_preview
      
      improve slice op according to review(from hongyu). test=develop
      
      fix slice op according to review: infer_flags, test=develop
      
      fix slice op: improve the overloaded operator __getitem__ to support attrs (starts and ends) that are Variables.
      test=develop,test=document_preview
      
      fix test_slice_op: add TestSliceOp_decs_dim_6 to resolve conflict with test_slice_ngraph_op. test=develop
      
      add stop_gradient=True when attr(starts) or attr(ends) is tensor Variable.
      test=develop,test=document_preview
      88628016
    • Fix example error of Variable and Operator (#19821) · 2729c174
      Committed by chengduo
      * fix example error
      test=develop
      
      * Remove set_desc
      test=develop
      2729c174
  9. 16 Sep, 2019 1 commit
  10. 09 Sep, 2019 1 commit
    • Fix train error when test_program.clone is executed after optimizer.minimize (#19397) · c78a4781
      Committed by Chen Weihang
      * add prune when test_program.clone is executed after optimizer.minimize
      
      * add unittest, test=develop
      
      * add resnet and transformer test case, test=develop
      
      * add regularization for optimizer & program compare function, test=develop
      
      * add lstm unittest, test=develop
      
      * polish code based on review comment, test=develop
      
      * adapt to interface change in framework._prune, test=develop
      
      * update API.spec, test=develop
      c78a4781
  11. 05 Sep, 2019 2 commits
    • Refactor dygraph (#19107) · e9233d1c
      Committed by Jiabin Yang
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix compile error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * test=develop, refactor name to make it easier to understand

      * test=develop, refactor name to make it easier to understand

      * test=develop, fix multi-gpu failed problem, add Tracer tests, change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
      e9233d1c
    • add feed_var_names to Prune interface (#19589) · dca9b6c5
      Committed by mapingshuo
      * Fix bug: add feed_vars to the prune function
      dca9b6c5
  12. 04 Sep, 2019 1 commit
    • add detach API for Variable in dygraph mode, test=develop (#19477) · 4d26274d
      Committed by zhongpu
      * add to and detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add detach for Variable in dygraph, test=develop
      
      * add exception check, test=develop
      4d26274d
  13. 23 Aug, 2019 1 commit
  14. 21 Aug, 2019 1 commit
  15. 19 Aug, 2019 1 commit
  16. 12 Aug, 2019 1 commit
  17. 05 Aug, 2019 1 commit
  18. 27 Jul, 2019 1 commit
  19. 23 Jul, 2019 1 commit
    • supports distributed classification (#18690) · 157211c4
      Committed by Yi Liu
      * supports distributed classification training
      * update API.spec
      * fix even division in python3
      * change "index_range" to "index_num" in shard_index operator
      test=document_preview
      test=develop
      157211c4
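The division fix and the `index_num` attribute mentioned above fit together in the sharding rule. The following is a minimal pure-Python sketch, assuming the formula documented for Paddle's `shard_index` op (ceil-divide `index_num` by `nshards`, then map indices that fall in the current shard to local offsets). The function is illustrative and is not the operator's code; the default `ignore_value=-1` is also an assumption.

```python
def shard_index(values, index_num, nshards, shard_id, ignore_value=-1):
    """Map global class indices to shard-local indices (illustrative only)."""
    if not 0 <= shard_id < nshards:
        raise ValueError("shard_id must be in [0, nshards)")
    # Floor division (//) keeps the result an int in Python 2 and 3;
    # plain / would produce a float in Python 3.
    shard_size = (index_num + nshards - 1) // nshards
    low = shard_id * shard_size
    high = low + shard_size
    # Indices owned by another shard are replaced by ignore_value.
    return [v - low if low <= v < high else ignore_value for v in values]


# 20 classes split over 2 shards: shard 0 owns [0, 10), shard 1 owns [10, 20).
labels = [16, 1]
on_shard_0 = shard_index(labels, index_num=20, nshards=2, shard_id=0)
on_shard_1 = shard_index(labels, index_num=20, nshards=2, shard_id=1)
```

Naming the attribute `index_num` makes explicit that it is the total number of indices rather than a range per shard, which is what the ceil-division above relies on.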
  20. 11 Jul, 2019 1 commit
  21. 01 Jul, 2019 1 commit
  22. 28 Jun, 2019 2 commits
    • Fix/program doc (#17908) · 43f64a17
      Committed by Jiabin Yang
      * test=develop, add some comments for Program.clone
      
      * test=develop, add API.spec
      
      * test=develop, refine comments
      
      * refine Program doc and clone doc
      
      * test=develop, refine doc
      43f64a17
    • Add is_compiled_with_cuda (#18356) · 871cc15e
      Committed by chengduo
      * add cuda_is_available
      test=develop
      
      * Fix api.spec
      test=develop
      
      * fix api doc
      test=develop
      871cc15e
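A typical use of such a query is choosing a device at start-up. The sketch below assumes `paddle.fluid` is importable and exposes `is_compiled_with_cuda()`; the helper name `pick_device` is hypothetical, and the code falls back to the CPU when Paddle is not installed.

```python
def pick_device():
    """Return "gpu" if a CUDA build of Paddle is importable, else "cpu"."""
    try:
        import paddle.fluid as fluid
    except ImportError:
        return "cpu"  # Paddle (or its fluid namespace) is not available
    # Look the function up defensively in case the installed version lacks it.
    check = getattr(fluid, "is_compiled_with_cuda", None)
    return "gpu" if callable(check) and check() else "cpu"


device = pick_device()
```

Note that the check reports how the installed binary was built, not whether a usable GPU is present on the machine.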
  23. 27 Jun, 2019 1 commit
    • supports collective communicated training (#18175) · b7128bac
      Committed by HaoRen
      * fix prepare context redundant code problem, optimize executor by caching create_varaiables
      test=develop
      
      * supports collective training in executor
      
      * make fetch_list runnable with variables, add more unittest for use_program_cache
      test=develop
      
      * fix comment
      test=develop
      
      * use unique name for nccl_id
      
      * supports output to stream in program_to_code
      
      * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
      
      * set op role in collective training
      
      * add collective op role
      
      * remove orig file
      
      * add build optimizer by strategy
      
      * add collective strategy
      
      * refine collective strategy
      
      * add multi-process role maker
      
      * refine strategy building factory so that we can easily plug in more strategies
      
      * scale loss grad in collective sgd transpiler
      
      * add support for distributed fc
      
      * code format
      
      * revert some features for dist fc
      
      * add support for distributed fc training
      
      * fix prepare context redundant code problem, optimize executor by caching create_varaiables
      test=develop
      
      * supports collective training in executor
      
      * make fetch_list runnable with variables, add more unittest for use_program_cache
      test=develop
      
      * use unique name for nccl_id
      
      * supports output to stream in program_to_code
      
      * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
      
      * set op role in collective training
      
      * add collective op role
      
      * fix comment
      test=develop
      
      * remove orig file
      
      * add build optimizer by strategy
      
      * add collective strategy
      
      * refine collective strategy
      
      * add multi-process role maker
      
      * refine strategy building factory so that we can easily plug in more strategies
      
      * scale loss grad in collective sgd transpiler
      
      * add support for distributed fc
      
      * code format
      
      * revert some features for dist fc
      
      * add support for distributed fc training
      
      * test=develop
      add collective op unittest standard
      
      * test=develop
      remove the test_collective directory
      
      * test=develop
      remove the test_collective directory
      
      * remove slicegather test
      
      * code format for reducescatter
      
      * update attr of shard_index_op
      
      * Modify macro nccl_helper
      
      * remove test without distribute
      
      * macro collective_helper
      
      * macro update
      
      * test=develop
      update support python3.5
      
      * test=develop change gpu memory use to 0.1 when test
      
      * test=develop
      update ut equal func
      
      * test=develop
      set flags to 1.5
      
      * test=develop fix pickle dump py35
      
      * test=develop
      fix divide in slice and add sync_comm_stream
      update atol and rtol to 1e-05
      rm shard_index op and test
      modify read input from file to read from memory
      remove origin_program in framework and add i/o in c_sync_calc_stream
      
      * test=develop update unittest sync operator I/O
      b7128bac
  24. 21 Jun, 2019 1 commit
  25. 20 Jun, 2019 1 commit
  26. 16 Jun, 2019 1 commit
  27. 15 Jun, 2019 1 commit
  28. 14 Jun, 2019 1 commit
  29. 13 Jun, 2019 1 commit
  30. 12 Jun, 2019 1 commit
    • combine noavx and avx package (#17889) · 5c06bff2
      Committed by tensor-tang
      * support avx and noavx core
      
      * add catch and give some log
      
      test=develop
      
      * fix build
      
      test=develop
      
      * add missing package
      
      test=develop
      
      * fix pybind name
      
      test=develop
      
      * fix import error
      
      test=develop
      
      * combine noavx core
      
      test=develop
      
      * add requirements
      
      test=develop
      
      * fix unknown message
      
      test=develop
      
      * fix api spec
      
      test=develop
      
      * refine and clean
      
      test=develop
      
      * update
      
      * pass dist ut
      
      * follow comments
      
      test=develop
      
      * refine scripts
      
      test=develop
      5c06bff2
  31. 11 Jun, 2019 1 commit
    • Pipeline Concurrency (#17402) · 969e6378
      Committed by hutuxian
      Add Pipeline Concurrency Train Mode:
      - Cpp: pipeline_trainer & section_worker
      - Python: PipelineOptimizer
      - Add a new data_feed type: PrivateInstantDataFeed
      - Add a test demo of the pipeline trainer; the test model is a GNN
      - win32 is not supported for now
      969e6378
  32. 10 Jun, 2019 1 commit
  33. 06 Jun, 2019 1 commit
  34. 04 Jun, 2019 1 commit
    • one hot support tensor depth (#16972) · 82358bfd
      Committed by Hongyu Liu
      * support some input tensor remain on cpu; test=develop
      
      * fix input = none; test=develop
      
      * fix unfound bug; test=develop
      
      * fix proto None case; test=develop
      
      * fix bug; test=develop
      
      * fix proto null bug; test=develop
      
      * remove conv check; test=develop
      
      * fix test bug; test=develop
      
      * move fill constant; test=develop
      
      * no change in proto; test=develop
      
      * fix bug; test=develop
      
      * change attr depth name; test=develop
      
      * remove remain cpu; test=develop
      
      * fix bug; test=develop
      
      * merge develop; test=develop
      
      * fix one_hot bug; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * fix python api bug; test=develop
      82358bfd