1. 23 Apr, 2021 2 commits
  2. 21 Apr, 2021 2 commits
  3. 19 Apr, 2021 2 commits
    • [NPU] cherry-pick gc/dataloader/save&load/optimization from ascendrc to develop (#32294) · cbe5c9f8
      Leo Chen committed
      * [NPU] support GarbageCollector for npu (#31874)
      
      * support GarbageCollector for npu
      
      * fix typo
      
      * fix gather_grad
      
      * disable NPUDefaultStreamGarbageCollector on NPU
      
      * [NPU] support npu for memcpy op (#31808)
      
      * support npu for memcpy op
      
      * add ut
      
      * fix ut
      
      * fix typo
      
      * 【NPU】fix bug of using temp vector (#31963)
      
      * fix bug when beta1_pow on cpu (#31995)
      
      * [NPU] support npu profiler (#31684)
      
      * support npu profiler
      
      * add python api
      
      * fix bugs
      
      * add wrapper for incomplete type
      
      * update profile proto
      
      * record npu wait
      
      * add xpu placeholder
      
      * fix adam (#32016)
      
      * [NPU] enable async copy and  add wait before sync operation (#31956)
      
      * enable async copy and  add wait before sync operation
      
      * remove unnecessary wait
      
      * add FillNpuTensorWithConstant
      
      * refine
      
      * fix fill_constant
      
      * make TensorFromVector/TensorToVector sync
      
      * [NPU] Support dataloader on npu place. (#31867)
      
      * [NPU] Wait on NPUPlace (#32086)
      
      * [NPU] fix cast op (#32121)
      
      * fix npu kernel of cast op to handle casting to same dtype
      
      * add comments
      
      * [NPU] support cann 20.3 (#32044)
      
      * fix compile problem on cann 20.3
      
      * fix ut
      
      * fix test_mul
      
      * fix check_finite_and_scale
      
      * fix lookup_table_v2_grad
      
      * fix cmake
      
      * support print op
      
      * [NPU] Support npu save load (#31893)
      
      * support save load for NPU
      
      * add save load npu unittest
      
      * support np.array transform in NPU
      
      * fix errors
      
      * delete dygraph in unittest
      
      * add Wait
      
      * fix unittest
      
      * fix review comment
      
      * fix unittest problem
      
      * fix little problem
      
      * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performance (#32196)
      
      * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performance
      
      * refine code
      
      * fix NPUDeviceContext in all c++ unittest (#32198)
      
      * fix NPUDeviceContext in all c++ unittest
      
      * refine log
      Co-authored-by: pangyoki <pangyoki@126.com>
      
      * [NPU] Remove TensorFromVector and avoid sync copy in npu op kernel for better performance (#31994)
      
      * enable async copy and  add wait before sync operation
      
      * remove unnecessary wait
      
      * add FillNpuTensorWithConstant
      
      * refine
      
      * fix fill_constant
      
      * change TensorFromVector to FillNpuTensorWithConstant
      
      * fix ignored api
      
      * delete extra unittest
      
      * fix little error
      
      * fix update_loss_scaling_op_npu and check_finite_and_unscale_op_npu
      
      * change TensorCopySync to TensorCopy
      
      * delete useless Wait and add StreamWait
      
      * fix npu_stream error
      
      * fix check_finite_and_unscale_op_npu TensorCopy
      
      * only save stream wait
      
      * fix NPUDeviceContext in all c++ unittest
      
      * delete wait
      Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
      
      * delete useless unittest file (#32206)
      
      * Fix op test (#32231)
      
      * fix conditional block (#32243)
      
      * fix adam bug again (#32246)
      
      * fix compile
      
      * fix ut
      
      * fix ut
      Co-authored-by: liym27 <33742067+liym27@users.noreply.github.com>
      Co-authored-by: pangyoki <pangyoki@126.com>
    • update `get_api_md5`, using the real api name as the map's key (#32224) · 21dc044a
      Ren Wei (任卫) committed
      * get_api_md5 should prefer the real name over the alias names
      
      * case for ArgSpec style. update the unittests
      
      test=document_fix
  4. 14 Apr, 2021 3 commits
  5. 13 Apr, 2021 3 commits
  6. 12 Apr, 2021 2 commits
  7. 10 Apr, 2021 1 commit
  8. 09 Apr, 2021 1 commit
  9. 07 Apr, 2021 2 commits
    • Check added ut on windows (#31826) · e09f4db9
      YUNSHEN XIE committed
      * added ut check on windows,notest,test=windows_ci
      
      * debug,notest,test=windows_ci
      
      * debug,notest,test=windows_ci
      
      * fix bug,notest,test=windows_ci
      
      * added ut check
      
      * test for new ut add on windows
      
      * test,notest,test=windows_ci
      
      * fix bug,notest,test=windows_ci
      
      * test
      
      * test
      
      * test
      
      * test,notest,test=windows_ci
      
      * test,notest,test=windows_ci
      
      * check added ut on windows
      
      * only fetch upstream develop
      
      * modified according to comment
      
      * Update run_unittests.sh
      
      * Update run_unittests.sh
    • update name of develop whl package and upgrade gcc 4.8.2 to gcc 5.4 (#31240) · f5186c3c
      pangyoki committed
      * update develop whl package name
      
      * distinguish cpu and gpu name
      
      * fix ref_gcc
      
      * change whl name
      
      * upgrade gcc 4.8 to 5.4 in ubuntu_dev
      
      * update gcc4.8 to 5.4 in centos
      
      * Upgrade pip from 18.0 to 20.0.1
      
      * change 2.1.0_dev0 to 2.1.0.dev0 in gpu version
  10. 03 Apr, 2021 1 commit
  11. 02 Apr, 2021 3 commits
    • use busybox run test on windows openblas (#31728) · 290be88d
      YUNSHEN XIE committed
      * use busybox run test on windows openblas
      
      * fix error
      
      * fix disable_quick and nightly label issue
      
      * add retry on windows openblas
      
      * fix bug
      
      * use one file to run cpu and gpu tests
      
      * fix with grep warning
      
      * fix syntax error
      
      * change run_unittest to run_unittest_gpu
      
      * Update run_unittests.sh
      
      fix error
    • support save/load single tensor (#31756) · 43367e4b
      WeiXin committed
      * support save/load single tensor
      
      * compatibility modification according to unittest
      
      * Some python2.7 don't have 'copyreg' modules
      
      * Handle a syntax error.
      
      * Dealing with compatibility problems on Mac.
      
      * Dealing with compatibility problems on Mac.
      
      * edit unittest to improve coverage.
      
      * Modify the code according to the review comments
      
      * Reduce redundant code.
      
      * support for static graph loading dygraph state_dict
      
      * edit code according to CI
      
      * edit unittest
      
      * edit unittest
      
      * delete redundant file
      
      * edit code according to Comments
      
      * edit english doc
      
      * edit english doc
      
      * edit English DOC.
      
      * get/set_tensor->get/set_value; return_numpy=False
      
      * get/set_tensor->get/set_value; return_numpy=False
      
      * edit unittest
      
      * edit unittest
      
      * polish code.
    • graph engine (#31226) · 94736d60
      seemingwang committed
      * graph engine demo
      
      * upload unsaved changes
      
      * fix dependency error
      
      * fix shard_num problem
      
      * py client
      
      * remove lock and graph-type
      
      * add load direct graph
      
      * add load direct graph
      
      * add load direct graph
      
      * batch random_sample
      
      * batch_sample_k
      
      * fix num_nodes size
      
      * batch brpc
      
      * batch brpc
      
      * add test
      
      * add test
      
      * add load_nodes; change add_node function
      
      * change sample return type to pair
      
      * resolve conflict
      
      * resolved conflict
      
      * resolved conflict
      
      * separate server and client
      
      * merge pair type
      
      * fix
      
      * resolved conflict
      
      * fixed segmentation fault; high-level VLOG for load edges and load nodes
      
      * random_sample return 0
      
      * rm useless loop
      
      * test:load edge
      
      * fix ret -1
      
      * test: rm sample
      
      * rm sample
      
      * random_sample return future
      
      * random_sample return int
      
      * test fake node
      
      * fixed here
      
      * memory leak
      
      * remove test code
      
      * fix return problem
      
      * add common_graph_table
      
      * random sample node &test & change data-structure from linkedList to vector
      
      * add common_graph_table
      
      * sample with srand
      
      * add node_types
      
      * optimize nodes sample
      
      * recover test
      
      * random sample
      
      * destruct weighted sampler
      
      * GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * pybind sample nodes api
      
      * pull nodes with step
      
      * fixed pull_graph_list bug; add test for pull_graph_list by step
      
      * add graph table;name
      
      * add graph table;name
      
      * add pybind
      
      * add pybind
      
      * add FeatureNode
      
      * add FeatureNode
      
      * add FeatureNode Serialize
      
      * add FeatureNode Serialize
      
      * get_feat_node
      
      * avoid local rpc
      
      * fix get_node_feat
      
      * fix get_node_feat
      
      * remove log
      
      * get_node_feat return  py:bytes
      
      * merge develop with graph_engine
      
      * fix threadpool.h head
      
      * fix
      
      * fix typo
      
      * resolve conflict
      
      * fix conflict
      
      * recover lost content
      
      * fix pybind of FeatureNode
      
      * recover cmake
      
      * recover tools
      
      * resolve conflict
      
      * resolve linking problem
      
      * code style
      
      * change test_server port
      
      * fix code problems
      
      * remove shard_num config
      
      * remove redundant threads
      
      * optimize start server
      
      * remove logs
      
      * fix code problems by reviewers' suggestions
      Co-authored-by: Huang Zhengjie <270018958@qq.com>
      Co-authored-by: Weiyue Su <weiyue.su@gmail.com>
      Co-authored-by: suweiyue <suweiyue@baidu.com>
      Co-authored-by: luobin06 <luobin06@baidu.com>
      Co-authored-by: liweibin02 <liweibin02@baidu.com>
  12. 31 Mar, 2021 1 commit
    • [Parallel UT]Improve Parallel UT level on Windows/Linux (#31377) · b05f6142
      Zhou Wei committed
      * [Parallel UT]improve Parallel UT level on Windows/Linux
      
      * [Parallel UT]improve Parallel UT level on Windows/Linux
      
      * [Parallel UT]Improve Parallel UT level on Windows/Linux
      
      * [Parallel UT]Improve Parallel UT level on Windows/Linux
      
      * fix CI
  13. 30 Mar, 2021 3 commits
  14. 23 Mar, 2021 1 commit
  15. 22 Mar, 2021 1 commit
  16. 19 Mar, 2021 1 commit
  17. 16 Mar, 2021 4 commits
  18. 12 Mar, 2021 1 commit
  19. 09 Mar, 2021 2 commits
    • Prec on mac (#31382) · 0b3c2296
      YUNSHEN XIE committed
      * add precision on mac
      
      * added judge
      
      * match file_ut.json on mac
      
      * fix code format error
      
      * fix code format error
      
      * fix error caused by length of ut_lists exceeds the limit
      
      * fix format error,notest,test=cpu
      
      * fix code format error
      
      * add windows judge on get_pr_ut
    • Prec on windows exclude check_added_ut (#31372) · 390cebee
      YUNSHEN XIE committed
      * add precision test for windows ci exclude check_added_ut
      
      * fix error
      
      * added PRECISION_TEST parameters
      
      * fix format error
  20. 08 Mar, 2021 1 commit
  21. 04 Mar, 2021 3 commits