- 02 8月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 31 7月, 2021 1 次提交
-
-
由 Peihan 提交于
* add resnet50 trt test in pr-ci-inference test
-
- 30 7月, 2021 10 次提交
-
-
由 Huihuang Zheng 提交于
-
由 jakpiase 提交于
* test version of matmul_v2 * added matmul_v2 grad kernel * minor changes * minor changes * minor change for CI approval * CI fix * CI fix * trigger CI * changes after review, not working yet * moved ops to anonymous namespaces * changes after review
-
由 Leo Chen 提交于
-
由 jakpiase 提交于
* test version of matmul_v2 * added matmul_v2 grad kernel * minor changes * minor changes * minor change for CI approval * CI fix * CI fix * added squeeze and squeeze2 kernels * CI fix * CI fix * CI fix * disabled tests when compiled with cuda * added setting format_tag by strides * added sigmoid BF16 FWD/BWD and gelu BF16 BWD * changes after review * Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD" This reverts commit 6e3f76720b545abfcff9f6052b46b73a1e745cae. * Revert "Merge branch 'matmul_v2_grad' into squeeze2_op" This reverts commit 06fcf67843a4a7884eccdf67a02a03575e1d4cb8, reversing changes made to 6e3f76720b545abfcff9f6052b46b73a1e745cae. * minor change * added reshape1/2 kernels * moved some functions into private block * CI fix * CI fix * CI fix
-
由 wangguanqun 提交于
* add trainer desc config to distributed strategy * code style modified
-
由 jakpiase 提交于
* added expand_v2 bf16/fp32 kernel * minor change * CI fix * added missing test file * added formatting * reduced binary size * CI fix
-
由 Qi Li 提交于
* [NPU] add clip and clip_grad on NPU, test=develop * address review comments, test=develop * update, test=develop
-
由 Haohongxiang 提交于
-
由 Leo Chen 提交于
-
由 zyfncg 提交于
* Support setitem by None index * remove unreachable code * Add Checkpoint for set_value_op because add a new attribute
-
- 29 7月, 2021 8 次提交
-
-
由 Zeng Jinle 提交于
* add fix op run order pass * add ut for fix_op_run_order * fix ci error * improve coverage * improve coverge again and fix cpu test case * follow some comments
-
由 seemingwang 提交于
* graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundent threads * optimize start server * remove logs * fix code problems by reviewers' suggestions * move graph files into a folder * code style change * remove graph operations from base table * optimize get_feat function of graph engine * fix long long count problem * remove redandunt graph files * remove unused shell * recover dropout_op_pass.h * fix potential stack overflow when request number is too large & node add & node clear & node remove * when sample k is larger than neigbor num, return directly * using random seed generator of paddle to speed up * fix bug of random sample k * fix code style * fix code style * fix blocking_queue problem * fix style * fix * recover capacity check Co-authored-by: NHuang Zhengjie <270018958@qq.com> Co-authored-by: NWeiyue Su <weiyue.su@gmail.com> Co-authored-by: Nsuweiyue <suweiyue@baidu.com> Co-authored-by: Nluobin06 <luobin06@baidu.com> Co-authored-by: Nliweibin02 <liweibin02@baidu.com> Co-authored-by: Ntangwei12 <tangwei12@baidu.com>
-
由 gongweibao 提交于
-
由 Yuang Liu 提交于
-
由 Huihuang Zheng 提交于
As the title
-
由 Leo Chen 提交于
-
由 wenbin 提交于
* tile op * more uts * disable tile if trt6.0 * typo * fix timeout issue * opteller * opteller remove duplicate code * comments. test=document_fix * modify PADDLE_ENFORCE. * fix reduce_mean issue
-
由 seemingwang 提交于
-
- 28 7月, 2021 7 次提交
-
-
由 Leo Chen 提交于
* support ScaleTensor for scale npu kernel * add more tests for adam npu * fix compile * fix unittest * refine adam optimizer
-
由 zlsh80826 提交于
* trt buildEngineWithConfig is deprecated * add trt version control
-
由 jiangcheng 提交于
See https://github.com/PaddlePaddle/Paddle/pull/33949 for details
-
由 jiangcheng 提交于
This PR added optional boolean is_parameter and stop_gradient in the VarDesc proto, and remove them during save_inference_model
-
由 Wangzheee 提交于
-
由 Aganlengzi 提交于
* [NPU] add NPU ops&uts of compare, test=develop * testing * try style-format * [NPU] update compare_op_npu uts * [NPU] fix code sytle of test_compare_op_npu.py
-
由 jiangcheng 提交于
When Graph has sub-graph, apply pass to it and all sub-graph. And add single test script .
-
- 27 7月, 2021 3 次提交
-
-
由 Feng Xing 提交于
This PR add some primitives api related files. Those files are almost empty for the moment, but will be filled later.
-
由 Qi Li 提交于
* fix build error of gtest dependency, test=develop * update when WITH_TESTING is OFF, test=develop
-
由 Aurelius84 提交于
Revert "Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181)" (#34348)" (#34384) This reverts commit 577fdde5.
-
- 26 7月, 2021 7 次提交
-
-
由 furnace 提交于
* [NPU] add tril_triu * [NPU] delete debug codes * [NPU] add more test cases, and api test * [NPU] optimize codes style
-
由 danleifeng 提交于
* psgpu:edit cuda remote_streams; test=develop
-
由 ronnywang 提交于
-
由 zyfncg 提交于
* Support getitem by ellipsis index in dynamic mode * change some code style * Support getitem by none index in dynamic mode * modify a comments style and remove useless code
-
由 Qi Li 提交于
-
由 furnace 提交于
* [NPU] add cumsum * [NPU] delete debug codes * [NPU] add attr flatten and unittests, and api tests * [NPU] delete comment codes * [NPU] add attr flatten and axis exclusive check * [NPU] delete skipIf
-
由 furnace 提交于
* [NPU] add hard_sigmoid * [NPU] delete check_dygraph=False and max_relative_error * [NPU] delete debug codes * [NPU] add more test cases * [NPU] add api test TestHardsigmoidAPI * [NPU] temp delete hard_sigmoid for resovle conficts * [NPU] resolve conflicts
-
- 23 7月, 2021 3 次提交
-
-
由 Aurelius84 提交于
Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181)" (#34348) This reverts commit 609f8225.
-
由 will-jl944 提交于
* logical ops support int8, int16, int32, int64, float, double * update docs of logical ops * fix npu and xpu logical ops * fix npu and xpu logical ops * fix bug in xpu logical op code * update test_logical_op_npu and test_logical_op_xpu * correct error type
-
由 ronnywang 提交于
* add index_sample_op_npu and tests * update
-