1. 11 Jun, 2021 1 commit
    • R
      [cherry-pick] use the required instruction to determine if the environment... · f57ae4d7
      Ren Wei (任卫) committed
       [cherry-pick] use the required instruction to determine if the environment fits the sample code's requirements. (#32766) (#33451)
      
      1 put an instruction, for example # required: gpu, in the sample code
      2 this sample code will run only in GPU-equipped CI pipelines and be skipped in other pipelines
      3 a CI pipeline can specify its capacity with the shell environment variable SAMPLE_CODE_TEST_CAPACITY
      
      2.1 Documentation revision plan
      
      see #32766 for more information
      f57ae4d7
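The three steps in this commit message can be sketched roughly as follows. This is only an illustration, not the code from the repository: the function names, the comma-separated format of both the `# required:` line and `SAMPLE_CODE_TEST_CAPACITY`, and the default of `cpu` are all assumptions.

```python
import os
import re

# Assumed directive form: a comment line such as "# required: gpu" or "# required: gpu, xpu".
REQUIRED_RE = re.compile(r"^\s*#\s*required:\s*(.+)$", re.IGNORECASE | re.MULTILINE)


def parse_required(sample_code):
    """Return the capabilities a sample declares; {'cpu'} if none (assumed default)."""
    match = REQUIRED_RE.search(sample_code)
    if match is None:
        return {"cpu"}
    return {item.strip().lower() for item in match.group(1).split(",") if item.strip()}


def pipeline_capacity(environ=None):
    """Read the capacity from SAMPLE_CODE_TEST_CAPACITY (comma-separated, assumed)."""
    environ = os.environ if environ is None else environ
    raw = environ.get("SAMPLE_CODE_TEST_CAPACITY", "")
    capacity = {item.strip().lower() for item in raw.split(",") if item.strip()}
    return capacity | {"cpu"}  # assumption: every pipeline can run CPU samples


def should_run(sample_code, environ=None):
    """Run a sample only if the pipeline offers everything the sample requires."""
    return parse_required(sample_code) <= pipeline_capacity(environ)
```

With this sketch, a sample starting with `# required: gpu` would be run when the variable is set to `gpu` and skipped when it is unset.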
  2. 18 May, 2021 1 commit
  3. 07 May, 2021 1 commit
  4. 30 Apr, 2021 1 commit
    • P
      Add 12 inplace APIs including auto generated (#32573) (#32699) · 097d5f52
      pangyoki committed
      * add relu6_ hardsigmoid_ leaky_relu_ Inplace APIs
      
      * add softmax_with_cross_entropy_ Inplace API
      
      * add clip_ scale_ add_ subtract_ Inplace APIs
      
      * add wlist
      
      * fix parameter of scale api
      
      * add add_n_ Inplace API and remove log_ Inplace API
      
      * fix elementwise_add_ and elementwise_sub_ broadcast problem
      
      * elementwise inplace API gives an error message before running the op
      
      * use broadcast_shape in elementwise inplace op
      
      * add 8 inplace APIs that are auto generated
      
      * add unittest for all inplace apis
      
      * add decorator for inplace apis in static mode
      
      * fix windows blas fail of exp inplace api, change array_equal to allclose
      
      * add flatten inplace api
      
      * add flatten unittest
      
      * fix flatten unittest
      
      * add decorator
      
      * fix grad.numpy in test_pylayer_op
      
      * unsupport softmax_with_cross_entropy_
      
      * add test_inplace_softmax_with_cross_entropy to static_mode_white_list
      
      * delete __all__ in inplace_utils
      
      * delete activation inplace function and add Tensor.inplace_func
      
      * change paddle.inplace_ to Tensor.inplace_
      
      * fix little problem
      
      * add paddle in inplace_utils
      097d5f52
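The broadcast-related items above (checking shapes before the op runs, using broadcast_shape in elementwise inplace ops) can be illustrated with a small pure-Python sketch. It assumes NumPy-style broadcasting rules and assumes that an in-place result must keep the shape of the first operand; the function names are hypothetical and this is not the Paddle implementation.

```python
def broadcast_shape(x_shape, y_shape):
    """Return the NumPy-style broadcast of two shapes, or raise ValueError."""
    result = []
    # Walk both shapes from the trailing dimension, padding the shorter one with 1s.
    for i in range(1, max(len(x_shape), len(y_shape)) + 1):
        x_dim = x_shape[-i] if i <= len(x_shape) else 1
        y_dim = y_shape[-i] if i <= len(y_shape) else 1
        if x_dim != y_dim and 1 not in (x_dim, y_dim):
            raise ValueError(f"shapes {x_shape} and {y_shape} cannot be broadcast")
        result.append(y_dim if x_dim == 1 else x_dim)
    return tuple(reversed(result))


def check_inplace_shape(x_shape, y_shape):
    """Raise before running the op if the result would not fit into x."""
    out_shape = broadcast_shape(x_shape, y_shape)
    if out_shape != tuple(x_shape):
        raise ValueError(
            f"in-place result shape {out_shape} differs from input shape {tuple(x_shape)}"
        )
```

Under these assumptions, adding a `(3,)` operand into a `(2, 3)` tensor in place passes the check, while adding a `(2, 3)` operand into a `(3,)` tensor is rejected up front.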
  5. 29 Apr, 2021 2 commits
  6. 27 Apr, 2021 1 commit
  7. 26 Apr, 2021 1 commit
  8. 25 Apr, 2021 3 commits
    • R
      Cleanup the unused codes for samplecode testing (#32525) · 2328921f
      Ren Wei (任卫) committed
      * update testcases
      
      * remove unused codes
      
      * update the docstring for sampcd_processor
      
      * no need to import the six module
      
      * I don't know why there was a leading space either, but now that there are unit tests, removing it causes no problems
      
      * add unittests for print_signatures; add the first case for 'required' mechanism when executing sample code testing
      
      * there is no paddle installed in PR-CI-APPROVAL
      
      test=document_fix
      2328921f
    • P
      support python39 and delete python35 in Dockerfile (#32385) · 78fc74b0
      pangyoki committed
      * support python39 and delete python35
      
      * support python39 in Dockerfile.centos
      
      * fix ubuntu18 bug
      
      * update Dockerfile.ubuntu setuptools
      
      * fix centos py39 errors
      
      * fix centos py39 error2
      78fc74b0
    • W
      paddle.save/load support nested structure and layer (#32446) · 727b28d7
      WeiXin committed
      * support save/load binary format tensor
      
      * Fix error when create cudaplace
      
      * Fix error when create cudaplace
      
      * Fix error when create cudaplace
      
      * get device context from pool.
      
      * move define of 'SerializeToStream' and 'DeserializeFromStream' to 'lod_tensor.cc' and 'selected_rows.cc'.
      
      * support complex object
      
      * improve coverage.
      
      * improve coverage
      
      * improve coverage.
      
      * fix a bug.
      
      * polish API
      
      * save/load program
      
      * paddle.save/load: layer
      
      * deal with conflict
      
      * if PY2, block test_paddle_save_load.TestSaveLoadLayer
      
      * polish code.
      
      * polish code
      
      * edit unittest
      
      * The condition for object to be identified as state_dict becomes strict
      
      * use 'core._cuda_synchronize'
      727b28d7
  9. 24 Apr, 2021 1 commit
    • R
      print the real name for Functions instead of the ArgSpec (#32379) · ef8671e5
      Ren Wei (任卫) committed
      * print the real name for Functions instead of the ArgSpec
      
      class function method
      
      * some APIs' names are not __module__ + __name__
      
      so we discard them temporarily.
      
      * update the logging format for console
      
      * omit the top level of paddle package.
      
      * these APIs have been removed.
      
      test=document_fix
      
      * Another error occurred
      
      * print_signatures.py's stdout is redirected to the spec file, so it should not print any other info.
      
      so sad.
      
      * print the error msg to stderr
      
      * disable the __init__ magic method
      
      * update unittest for sampcd_processor.py
      
      update unittest for sampcd_processor.py
      
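      * PR-CI-APPROVAL's python interpreter name is not 'python3'.
      
      it's python3.9;
      it does not have paddle installed yet.
      
      Surprisingly, this statement cannot be used in the CI pipeline: python3 is reported as not found
      Surprisingly, this statement cannot be used in the CI pipeline because paddle is not installed in the environment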
      
      * testing only extract api from __all__
      
      paddle module(the top module) does not have __add__
      test=document_fix
      
      * should import paddle here
      
      * update the mechanism of extracting and executing for the sample-codes test.
      
      update the logic for extracting and executing code
      improve the printed output
      
      * good code style
      ef8671e5
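A rough sketch of the idea behind several items in this commit: build an API's real name from `__module__` and `__name__`, print its signature, and send any error text to stderr so that a stdout redirected to a spec file stays clean. The helper names are hypothetical and only the standard `inspect` module is used; this is not the actual print_signatures.py.

```python
import inspect
import sys


def real_name(obj):
    """Build '<module>.<name>' from the object itself rather than from an alias."""
    return f"{obj.__module__}.{obj.__name__}"


def format_signature(obj):
    """Return 'real.name(signature)', or None if the object cannot be inspected."""
    try:
        signature = inspect.signature(obj)
        return f"{real_name(obj)}{signature}"
    except (AttributeError, TypeError, ValueError) as exc:
        # stdout may be redirected to a spec file, so diagnostics go to stderr.
        print(f"cannot inspect {obj!r}: {exc}", file=sys.stderr)
        return None
```

The standard library shows why the real name can differ from the name a user types: on a POSIX system `os.path.join` reports its module as `posixpath`, so its real name under this sketch is `posixpath.join`.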
  10. 23 Apr, 2021 2 commits
  11. 21 Apr, 2021 2 commits
  12. 19 Apr, 2021 2 commits
    • L
      [NPU] cherry-pick gc/dataloader/save&load/optimization from ascendrc to develop (#32294) · cbe5c9f8
      Leo Chen committed
      * [NPU] support GarbageCollector for npu (#31874)
      
      * support GarbageCollector for npu
      
      * fix typo
      
      * fix gather_grad
      
      * disable NPUDefaultStreamGarbageCollector on NPU
      
      * [NPU] support npu for memcpy op (#31808)
      
      * support npu for memcpy op
      
      * add ut
      
      * fix ut
      
      * fix typo
      
      * [NPU] fix bug of using temp vector (#31963)
      
      * fix bug when beta1_pow on cpu (#31995)
      
      * [NPU] support npu profiler (#31684)
      
      * support npu profiler
      
      * add python api
      
      * fix bugs
      
      * add wrapper for incomplete type
      
      * update profile proto
      
      * record npu wait
      
      * add xpu placeholder
      
      * fix adam (#32016)
      
      * [NPU] enable async copy and  add wait before sync operation (#31956)
      
      * enable async copy and  add wait before sync operation
      
      * remove unnecessary wait
      
      * add FillNpuTensorWithConstant
      
      * refine
      
      * fix fill_constant
      
      * make TensorFromVector/TensorToVector sync
      
      * [NPU] Support dataloader on npu place. (#31867)
      
      * [NPU] Wait on NPUPlace (#32086)
      
      * [NPU] fix cast op (#32121)
      
      * fix npu kernel of cast op to handle casting to same dtype
      
      * add comments
      
      * [NPU] support cann 20.3 (#32044)
      
      * fix compile problem on cann 20.3
      
      * fix ut
      
      * fix test_mul
      
      * fix check_finite_and_scale
      
      * fix lookup_table_v2_grad
      
      * fix cmake
      
      * support print op
      
      * [NPU] Support npu save load (#31893)
      
      * support save load for NPU
      
      * add save load npu unittest
      
      * support np.array transform in NPU
      
      * fix errors
      
      * delete dygraph in unittest
      
      * add Wait
      
      * fix unittest
      
      * fix review comment
      
      * fix unittest problem
      
      * fix little problem
      
      * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performance (#32196)
      
      * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performance
      
      * refine code
      
      * fix NPUDeviceContext in all c++ unittest (#32198)
      
      * fix NPUDeviceContext in all c++ unittest
      
      * refine log
      Co-authored-by: pangyoki <pangyoki@126.com>
      
      * [NPU] Remove TensorFromVector and avoid sync copy in npu op kernel for better performance (#31994)
      
      * enable async copy and  add wait before sync operation
      
      * remove unnecessary wait
      
      * add FillNpuTensorWithConstant
      
      * refine
      
      * fix fill_constant
      
      * change TensorFromVector to FillNpuTensorWithConstant
      
      * fix ignored api
      
      * delete extra unittest
      
      * fix little error
      
      * fix update_loss_scaling_op_npu and check_finite_and_unscale_op_npu
      
      * change TensorCopySync to TensorCopy
      
      * delete useless Wait and add StreamWait
      
      * fix npu_stream error
      
      * fix check_finite_and_unscale_op_npu TensorCopy
      
      * only save stream wait
      
      * fix NPUDeviceContext in all c++ unittest
      
      * delete wait
      Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
      
      * delete useless unittest file (#32206)
      
      * Fix op test (#32231)
      
      * fix conditional block (#32243)
      
      * fix adam bug again (#32246)
      
      * fix compile
      
      * fix ut
      
      * fix ut
      Co-authored-by: liym27 <33742067+liym27@users.noreply.github.com>
      Co-authored-by: pangyoki <pangyoki@126.com>
      cbe5c9f8
    • R
      update `get_api_md5`, using the real api name as the map's key (#32224) · 21dc044a
      Ren Wei (任卫) committed
      * get_api_md5 should prefer the real name over the alias names
      
      * case for ArgSpec style. update the unittests
      
      test=document_fix
      21dc044a
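To illustrate what keying the map by the real API name means, here is a minimal sketch. The `(alias, real_name, spec_text)` tuple layout and the function name `api_md5_map` are assumptions made for the example; the actual `get_api_md5` and the spec file format may differ.

```python
import hashlib


def api_md5_map(entries):
    """Map each real API name to the MD5 hex digest of its spec text.

    entries: iterable of (alias, real_name, spec_text) tuples (hypothetical layout).
    Aliases that point at the same real name collapse into a single key.
    """
    digests = {}
    for _alias, real, spec_text in entries:
        digests[real] = hashlib.md5(spec_text.encode("utf-8")).hexdigest()
    return digests
```

Because the key is the real name, two aliases of the same function produce one entry instead of two, which keeps comparisons between spec files stable when an alias is added or removed.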
  13. 14 Apr, 2021 3 commits
  14. 13 Apr, 2021 3 commits
  15. 12 Apr, 2021 2 commits
  16. 10 Apr, 2021 1 commit
  17. 09 Apr, 2021 1 commit
  18. 07 Apr, 2021 2 commits
    • Y
      Check added ut on windows (#31826) · e09f4db9
      YUNSHEN XIE committed
      * added ut check on windows,notest,test=windows_ci
      
      * debug,notest,test=windows_ci
      
      * debug,notest,test=windows_ci
      
      * fix bug,notest,test=windows_ci
      
      * added ut check
      
      * test for new ut add on windows
      
      * test,notest,test=windows_ci
      
      * fix bug,notest,test=windows_ci
      
      * test
      
      * test
      
      * test
      
      * test,notest,test=windows_ci
      
      * test,notest,test=windows_ci
      
      * check added ut on windows
      
      * only fetch upstream develop
      
      * modified according to comments
      
      * Update run_unittests.sh
      
      * Update run_unittests.sh
      e09f4db9
    • P
      update name of develop whl package and upgrade gcc 4.8.2 to gcc 5.4 (#31240) · f5186c3c
      pangyoki committed
      * update develop whl package name
      
      * distinguish cpu and gpu name
      
      * fix ref_gcc
      
      * change whl name
      
      * upgrade gcc 4.8 to 5.4 in ubuntu_dev
      
      * update gcc4.8 to 5.4 in centos
      
      * Upgrade pip from 18.0 to 20.0.1
      
      * change 2.1.0_dev0 to 2.1.0.dev0 in gpu version
      f5186c3c
  19. 03 Apr, 2021 1 commit
  20. 02 Apr, 2021 3 commits
    • Y
      use busybox run test on windows openblas (#31728) · 290be88d
      YUNSHEN XIE committed
      * use busybox run test on windows openblas
      
      * fix error
      
      * fix disable_quick and nightly label issue
      
      * add retry on windows openblas
      
      * fix bug
      
      * use one file to run cpu and gpu tests
      
      * fix with grep warning
      
      * fix syntax error
      
      * change run_unittest to run_unittest_gpu
      
      * Update run_unittests.sh
      
      fix error
      290be88d
    • W
      support save/load single tensor (#31756) · 43367e4b
      WeiXin committed
      * support save/load single tensor
      
      * compatibility modification according to unittest
      
      * Some python2.7 installations don't have the 'copyreg' module
      
      * Handle a syntax error.
      
      * Dealing with compatibility problems on Mac.
      
      * Dealing with compatibility problems on Mac.
      
      * edit unittest to improve coverage.
      
      * Modify the code according to the review comments
      
      * Reduce redundant code.
      
      * support for static graph loading dygraph state_dict
      
      * edit code according to CI
      
      * edit unittest
      
      * edit unittest
      
      * delete redundant file
      
      * edit code according to Comments
      
      * edit english doc
      
      * edit english doc
      
      * edit English DOC.
      
      * get/set_tensor->get/set_value; return_numpy=False
      
      * get/set_tensor->get/set_value; return_numpy=False
      
      * edit unittest
      
      * edit unittest
      
      * polish code.
      43367e4b
    • S
      graph engine (#31226) · 94736d60
      seemingwang committed
      * graph engine demo
      
      * upload unsaved changes
      
      * fix dependency error
      
      * fix shard_num problem
      
      * py client
      
      * remove lock and graph-type
      
      * add load direct graph
      
      * add load direct graph
      
      * add load direct graph
      
      * batch random_sample
      
      * batch_sample_k
      
      * fix num_nodes size
      
      * batch brpc
      
      * batch brpc
      
      * add test
      
      * add test
      
      * add load_nodes; change add_node function
      
      * change sample return type to pair
      
      * resolve conflict
      
      * resolved conflict
      
      * resolved conflict
      
      * separate server and client
      
      * merge pair type
      
      * fix
      
      * resolved conflict
      
      * fixed segmentation fault; high-level VLOG for load edges and load nodes
      
      * random_sample return 0
      
      * rm useless loop
      
      * test:load edge
      
      * fix ret -1
      
      * test: rm sample
      
      * rm sample
      
      * random_sample return future
      
      * random_sample return int
      
      * test fake node
      
      * fixed here
      
      * memory leak
      
      * remove test code
      
      * fix return problem
      
      * add common_graph_table
      
      * random sample node &test & change data-structure from linkedList to vector
      
      * add common_graph_table
      
      * sample with srand
      
      * add node_types
      
      * optimize nodes sample
      
      * recover test
      
      * random sample
      
      * destruct weighted sampler
      
      * GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * pybind sample nodes api
      
      * pull nodes with step
      
      * fixed pull_graph_list bug; add test for pull_graph_list by step
      
      * add graph table;name
      
      * add graph table;name
      
      * add pybind
      
      * add pybind
      
      * add FeatureNode
      
      * add FeatureNode
      
      * add FeatureNode Serialize
      
      * add FeatureNode Serialize
      
      * get_feat_node
      
      * avoid local rpc
      
      * fix get_node_feat
      
      * fix get_node_feat
      
      * remove log
      
      * get_node_feat return  py:bytes
      
      * merge develop with graph_engine
      
      * fix threadpool.h head
      
      * fix
      
      * fix typo
      
      * resolve conflict
      
      * fix conflict
      
      * recover lost content
      
      * fix pybind of FeatureNode
      
      * recover cmake
      
      * recover tools
      
      * resolve conflict
      
      * resolve linking problem
      
      * code style
      
      * change test_server port
      
      * fix code problems
      
      * remove shard_num config
      
      * remove redundant threads
      
      * optimize start server
      
      * remove logs
      
      * fix code problems by reviewers' suggestions
      Co-authored-by: Huang Zhengjie <270018958@qq.com>
      Co-authored-by: Weiyue Su <weiyue.su@gmail.com>
      Co-authored-by: suweiyue <suweiyue@baidu.com>
      Co-authored-by: luobin06 <luobin06@baidu.com>
      Co-authored-by: liweibin02 <liweibin02@baidu.com>
      94736d60
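Several items in this commit concern neighbor sampling (random_sample, batch random_sample, storing edges in a vector rather than a linked list, seeding the sampler). The toy Python sketch below illustrates that general idea only. The class and method names are hypothetical, the real graph engine is a C++ service with a client and server, and none of its actual interfaces are reproduced here.

```python
import random


class ToyGraphTable:
    """In-memory adjacency table; each node keeps its edges in a list (vector-like)."""

    def __init__(self, seed=0):
        self._edges = {}
        self._rng = random.Random(seed)  # seeded so that samples are reproducible

    def add_edge(self, src, dst, weight=1.0):
        self._edges.setdefault(src, []).append((dst, weight))

    def random_sample(self, node, k):
        """Return up to k distinct edges of node as (neighbor, weight) pairs, chosen uniformly."""
        edges = self._edges.get(node, [])
        return self._rng.sample(edges, min(k, len(edges)))

    def batch_random_sample(self, nodes, k):
        """Sample up to k edges for every node in one call."""
        return [self.random_sample(node, k) for node in nodes]
```

A list gives constant-time indexed access, which uniform sampling relies on; that is one plausible reason for moving from a linked list to a vector, although the commit message does not state the motivation.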
  21. 31 Mar, 2021 1 commit
    • Z
      [Parallel UT]Improve Parallel UT level on Windows/Linux (#31377) · b05f6142
      Zhou Wei committed
      * [Parallel UT]improve Parallel UT level on Windows/Linux
      
      * [Parallel UT]improve Parallel UT level on Windows/Linux
      
      * [Parallel UT]Improve Parallel UT level on Windows/Linux
      
      * [Parallel UT]Improve Parallel UT level on Windows/Linux
      
      * fix CI
      b05f6142
  22. 30 Mar, 2021 3 commits
  23. 23 Mar, 2021 1 commit
  24. 22 Mar, 2021 1 commit