- 24 Sep, 2021 6 commits
-
-
Committed by Weilong Wu
* Add linalg.solve op, test=develop * Fix a bug caused by accidental deletion * updated description and fix a bug: missing a comma * Add linalg.solve op, test=develop * updated solve op backward logic * updated solve op backward logic again * Add linalg.solve Op, test=develop * Updated and modified to fit CI requirements * Fix a bug * 1)Add more test cases; 2)Fix a wrong usage in reduces operation; 3)Remove redundant code * Remove redundant comments * 1)Removed redundant code; 2)Updated to enhance code robustness * Removed redundant code * Updated API documents
-
Committed by seemingwang
* graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundant threads * optimize start server * remove logs * fix code problems by reviewers' suggestions * move graph files into a folder * code style change * remove graph operations from base table * optimize get_feat function of graph engine * fix long long count problem * remove redundant graph files * remove unused shell * recover dropout_op_pass.h * fix potential stack overflow when request number is too large & node add & node clear & node remove * when sample k is larger than neighbor num, return directly * using random seed generator of paddle to speed up * fix bug of random sample k * fix code style * fix code style * add remove graph to fleet_py.cc * fix blocking_queue problem * fix style * fix * recover capacity check * add remove graph node; add set_feature * add remove graph node; add set_feature * add remove graph node; add set_feature * add remove graph node; add set_feature * fix distributed op combining problems * optimize * remove logs Co-authored-by: Huang Zhengjie <270018958@qq.com> Co-authored-by: Weiyue Su <weiyue.su@gmail.com> Co-authored-by: suweiyue <suweiyue@baidu.com> Co-authored-by: luobin06 <luobin06@baidu.com> Co-authored-by: liweibin02 <liweibin02@baidu.com> Co-authored-by: tangwei12 <tangwei12@baidu.com>
-
Committed by Liu-xiandong
-
Committed by baoachun
-
Committed by baoachun
* add multihead_matmul trt converter test case * move attribute check to op_teller
-
Committed by wawltor
* add the shape check for the matmul * remove the test case for the linear
-
- 23 Sep, 2021 7 commits
-
-
Committed by liutiexing
* add align for WorkQueue * WorkQueue update * Revert "WorkQueue update" This reverts commit 14ce793dbb204f8ddec63c34b3b72a73c7cdb93a. * optimize WorkQueue
-
Committed by Peihan
-
Committed by Wilber
-
Committed by From00
-
Committed by Li Min
-
Committed by TTerror
* add argmax and iou_similarity for kunlun * add argmax and iou_similarity for kunlun * add argmax and iou_similarity for kunlun
-
Committed by wuhuanzhou
add pass_desc_py_proto depends
-
- 22 Sep, 2021 24 commits
-
-
Committed by Tomasz Socha
* Fix copy elision warning * Remove redundant code
-
Committed by Zhang Zheng
-
Committed by Sing_chan
Move variable UPLOAD_TP_FILE to the beginning; otherwise it can't be initialized when running the build-whl task (#35895)
-
Committed by zhaoyingli
-
Committed by zhangbo9674
* split minimize() to step() + update() * add unscale and step for grad_scaler * add unittest * refine code in minimize * delete step in loss_scaler * fix example bug * refine comment * refine unittest * add unittest
-
Committed by ronnywang
* add randperm_op_npu * fix test_set_value_op_npu
-
Committed by TeslaZhao
* Pass compat of conv_transpose_bias_mkldnn_fuse_pass * Fix a bug in the strided_slice op where the axes parameter accessed memory out of bounds * Fix a bug in the transpose op where the perm param accessed memory out of bounds * op:transpose_op supports bool type
-
Committed by huangxu96
Add new APIs: paddle.linalg.det & paddle.linalg.slogdet. API aliases: paddle.det & paddle.slogdet
-
Committed by yeliang2258
-
Committed by Peihan
* support ernie-int8 test and prune op attribute test * remove using and use namespace * remove macro and use shell instead * Revert "remove macro and use shell instead" This reverts commit 615964b149d7de7825b341936b42be22a4bc0091. * fix grammar error * fix shell error
-
Committed by wanghuancoder
-
Committed by Zeng Jinle
-
Committed by JingZhuangzhuang
-
Committed by Wangzheee
-
Committed by JingZhuangzhuang
* support nnadapter and ascend310 * modify code * add anchor_generator convert test * add gelu convert test * add conv2d convert test * modify anchor_operator convert test * modify conv2d test * modify conv2d convert test * modify conv2d convert test * modify conv2d convert test * modify conv2d test * fix WITH_PYTHON compile error * modify test file * modify test file * modify test file * modify test file * modify test file * modify test file * modify test file * modify test file Co-authored-by: xiaoxiaohehe001 <hiteezsf@163.com> Co-authored-by: jiweibo <jiweibo@baidu.com>
-
Committed by joanna.wozna.intel
-
Committed by wanghuancoder
* fix feed, test=develop * delete one test case, test=develop
-
Committed by wanghuancoder
-
Committed by wanghuancoder
* refine gc for new_executor, test=develop * refine, test=develop * refine, test=develop * merge, test=develop
-
Committed by Aurelius84
* Modify H2D and D2H as kQueue::Sync * fix interface error
-
Committed by zhouweiwei2014
* support extern third_party lapack on Linux/Windows/Mac * fix ci
-
Committed by Feiyu Chan
-
Committed by zhangbo9674
-
Committed by wangguanzhong
-
- 21 Sep, 2021 2 commits
-
-
Committed by Guoxia Wang
-
Committed by Adam Osewski
* Create stateful OneDNNAXPYHandler object. This makes it possible to call it multiple times without recreating the oneDNN primitives every time. * Prepare SGDOpKernel to reuse its implementation from OneDNN kernel. * OneDNN SGD kernel. * Update call to use new OneDNNAXPYHandler object api. * Setup seed in proper place. * Enable OneDNN kernel only for single case. * For dense param and sparse grad. * Small refactor. * Enable oneDNN by op attr or by cmd line flag. * Use int64_t type for number of elements. * Support dense param and grad from OneDNN kernel. * Enable SGD OneDNN kernel when using MP BF16 optimizer. * Force non-copyable/movable OneDNNAXPYHandler. * Reuse OneDNNAXPYHandler for sparse tensors in SUM op. * Fix SFINAE rules. * Remove recording event inside AXPY. * Get rid of internal primitive caching. * Stop using PP cache mechanism to store mem and primitive obj. * Handler obj store and reuse needed desc & prim * Do not derive from MKLDNNHandlerT
-
- 19 Sep, 2021 1 commit
-
-
Committed by limingshu
* Optimization of pool2d grad, first commit. * remove useless print codes * refine codes * refine codes * seal more operation into template specialization * fix template struct error in MaxPool2dGrad. * Fix header including error * refine code with comment * Seal the param-preparation codes into function for common use. * Seal the param-preparation codes into function for common use. * Seal the param-preparation into function and make it common for other kernels * polish code and erase useless template specialization * Rerun trigger * rerun trigger
-