- 24 5月, 2021 7 次提交
-
-
由 ShenLiang 提交于
* fix pipeline * fix mp pp dp * fix utest of hybrid parallel * add utest for tuple
-
由 tianshuo78520a 提交于
This reverts commit 0e5d832c.
-
由 wangguanzhong 提交于
-
由 Jacek Czaja 提交于
* - bump up oneDNN to 2.2.2 (should reduce perf drops of mobilenet) * - more recnet onednn 2.2.2 (some more bugfixes)
-
由 gongweibao 提交于
-
由 seemingwang 提交于
* graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundent threads * optimize start server * remove logs * fix code problems by reviewers' suggestions * move graph files into a folder * code style change * remove graph operations from base table * optimize get_feat function of graph engine * fix long long count problem * remove redandunt graph files * remove unused shell * recover dropout_op_pass.h * fix potential stack overflow when request number is too large & node add & node clear & node remove Co-authored-by: NHuang Zhengjie <270018958@qq.com> Co-authored-by: NWeiyue Su <weiyue.su@gmail.com> Co-authored-by: Nsuweiyue <suweiyue@baidu.com> Co-authored-by: Nluobin06 <luobin06@baidu.com> Co-authored-by: Nliweibin02 <liweibin02@baidu.com> Co-authored-by: Ntangwei12 <tangwei12@baidu.com>
-
由 limingshu 提交于
-
- 22 5月, 2021 2 次提交
-
-
由 wangguanzhong 提交于
-
由 jakpiase 提交于
* added support for most matmul cases * added more functionality * full functionality of matmul op, fp32 only * added bf16 tests and functionality * added formatting * changes after review * minor change * added reviewers suggestions
-
- 21 5月, 2021 7 次提交
-
-
由 chentianyu03 提交于
* replace complex in set tensor from and to numpy * replace complex template in cast op
-
由 王明冬 提交于
-
由 Feng Xing 提交于
* optimize softmax with cross entropy hard label * label ignore_index cleaning
-
由 tianshuo78520a 提交于
* fix model_bhecnmark ci * fix model_bhecnmark ci
-
由 Leo Chen 提交于
-
由 YUNSHEN XIE 提交于
* update conda build script for cuda11 * update conda build script * modified wheel name * update conda_build * fix error * add cudnn8.1 for cuda11.2 * fix format error
-
由 pangyoki 提交于
* cast indices and label if their type is not consistent * fix bug * add unittest
-
- 20 5月, 2021 8 次提交
-
-
由 TTerror 提交于
* fix gather op and add logsumexp op on kunlun * update xpu depence * update tests and fix elementwise_add
-
由 Baibaifan 提交于
-
由 Aurelius84 提交于
* Support convert sublayers in Sequential Container * remove paddle.jit.set_code_level
-
由 liym27 提交于
-
由 chentianyu03 提交于
* add complex template file * add numtraits for complex template * add complex template type register * modify specify template of complex * modify specify template of complex * modify specify template of complex * modify specify template of complex * make TensorCheckerVisitor support complex type * fix operator= error * add complex template * add complex template type * add complex template type to pyarray transform * add complex template type to pyarray transform * remove complex type for dlpack register * set dlpack supprot complex type * set dlpack supprot complex type * set dlpack supprot complex type * remove explict for complex constructor * add complex unit test file
-
由 seemingwang 提交于
-
由 zhangchunle 提交于
-
由 limingshu 提交于
-
- 19 5月, 2021 10 次提交
-
-
由 WeiXin 提交于
* fix test_paddle_save_load and test_paddle_save_load_binary * fix unittest:test_paddle_save_load and test_paddle_save_load_binary * delete *.pyc * add comment for unittest
-
由 wuhuanzhou 提交于
* CI skip inference test if only python files modified, test=develop * fix compilation error on ROCM, test=develop * fix cmake error on PR-CI-ROCM-Compile, test=develop
-
由 石晓伟 提交于
-
由 zhulei 提交于
* [Rocm] fix test of random_crop_op * [Rocm] fix test of random_crop_op * [Rocm] fix test of random_crop_op & simple_rnn_op * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & logsumexp
-
由 GT-Zhang 提交于
* Fix slow data reading, In the old version, one epoch read time of this data set was about 5371 seconds(MacBook Pro Retina, 13-inch, Early 2015 2.7 GHz), and a batch took 211 seconds, It's too painful to use. Now decompress the data in advance (about 10 seconds). Each epoch of reading takes about 3 seconds(MacBook Pro Retina, 13-inch, Early 2015 2.7 GHz), and a batch takes 0.017 seconds more. * Run CI, test=allcase * fix qq group number. test=document_fix fix qq group number. test=document_fix * fix qq group number. test=document_fix fix qq group number. test=document_fix
-
由 YUNSHEN XIE 提交于
* remove ut from parallel_ut list * remove some timeout ut
-
由 Aurelius84 提交于
* BugFix StaticAanlysis with gast.Subscript * remove codes
-
由 Jacek Czaja 提交于
-
由 Chen Weihang 提交于
-
由 Zhou Wei 提交于
-
- 18 5月, 2021 6 次提交
-
-
由 pangyoki 提交于
* Output indices of top_k npu op change to int64 * fix accuracy npu bug * fix errors * change cast method to FillNpuTensorWithConstant * change cast method to FillNpuTensorWithConstant
-
由 joanna.wozna.intel 提交于
* Update paths to Quant models * Update description
-
由 liuyuhui 提交于
-
由 wuhuanzhou 提交于
-
由 Aurelius84 提交于
* Enhance grep syntax to avoid random failed * Enhance grep syntax to avoid random failed
-
由 Aurelius84 提交于
* Add param_guard in ParameterList to support @to_static * Refactor param_guard of @to_static * fix unittest failed * add more unittest
-