1. 22 Nov, 2021 19 commits
  2. 20 Nov, 2021 1 commit
  3. 19 Nov, 2021 20 commits
    • Add corner case in scale calculation (#37352) · 4d891c00
      joanna.wozna.intel committed
    • Add dygraph triple grad test, broadcast case (#37377) · bb2733fa
      Weilong Wu committed
    • bug fix shard_index (#37042) · b505ff96
      lilong12 committed
    • add new API paddle.nn.initializer.Orthogonal and calculate_gain (#37163) · 62ad3594
      zhouweiwei2014 committed
      * add new API paddle.nn.initializer.Orthogonal and calculate_gain
      
      * fix comment
      
      * fix comment
    • Fix runtime graph on gpt, add debug message (#37361) · af83e79a
      LiYuRio committed
    • Optimize cinn_cache_key by replace GraphToProgram to Dot string (#37317) · edc3496f
      jiangcheng committed
      * optimize cache-key by replace GraphToProgram to Dot string
      
      * fix compile failure bug
    • Fix CI bug caused by type of TensorMeta (#37373) · d29cc7b4
      zyfncg committed
      * rename TensorBase interface data_type() to dtype()
      
      * rename type to dtype of TensorMeta
      
      * merge the code
      
      * merge the code
      
      * fix the problem when merge conflict
      
      * fix bug of ci caused by type of tensor_meta
      
      * changes cmake to clear cache
    • da5fb1d4
    • Add fuse_resnet_unit pass (#36818) · 3cd3bf29
      wuhuanzhou committed
      * GeneratePass support attr condition and mapping, test=develop
      
      * fix coverage, test=develop
      
      * Add fuse_resnet_unit pass, test=develop
      
      * fix CI errors, test=develop
      
      * fix CI errors, test=develop
      
      * fix unittest error when compiling without CUDA, test=develop
      
      * fix static ci error, test=develop
      
      * limit kernel size must equal 1, test=develop
    • F
    • fix bug in save_inference_model (#37362) · 77bca4de
      wangguanqun committed
    • Update OP-benchamrk CI scripts (#37360) · 2e758325
      tianshuo78520a committed
      Update OP-benchamrk CI scripts 
    • Refactor dygraph to eager (#37318) · b962f5fe
      Jiabin Yang committed
      * Add EagerTensor and tests
      
      * remove useless enforce
      
      * remove comment in cmake
      
      * fix test_error
      
      * add depends on python
    • optimize graph-engine sample api's data-transfer process (#37341) · 9fc11db7
      seemingwang committed
      * graph engine demo
      
      * upload unsaved changes
      
      * fix dependency error
      
      * fix shard_num problem
      
      * py client
      
      * remove lock and graph-type
      
      * add load direct graph
      
      * add load direct graph
      
      * add load direct graph
      
      * batch random_sample
      
      * batch_sample_k
      
      * fix num_nodes size
      
      * batch brpc
      
      * batch brpc
      
      * add test
      
      * add test
      
      * add load_nodes; change add_node function
      
      * change sample return type to pair
      
      * resolve conflict
      
      * resolved conflict
      
      * resolved conflict
      
      * separate server and client
      
      * merge pair type
      
      * fix
      
      * resolved conflict
      
      * fixed segment fault; high-level VLOG for load edges and load nodes
      
      * random_sample return 0
      
      * rm useless loop
      
      * test:load edge
      
      * fix ret -1
      
      * test: rm sample
      
      * rm sample
      
      * random_sample return future
      
      * random_sample return int
      
      * test fake node
      
      * fixed here
      
      * memory leak
      
      * remove test code
      
      * fix return problem
      
      * add common_graph_table
      
      * random sample node &test & change data-structure from linkedList to vector
      
      * add common_graph_table
      
      * sample with srand
      
      * add node_types
      
      * optimize nodes sample
      
      * recover test
      
      * random sample
      
      * destruct weighted sampler
      
      * GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * pybind sample nodes api
      
      * pull nodes with step
      
      * fixed pull_graph_list bug; add test for pull_graph_list by step
      
      * add graph table;name
      
      * add graph table;name
      
      * add pybind
      
      * add pybind
      
      * add FeatureNode
      
      * add FeatureNode
      
      * add FeatureNode Serialize
      
      * add FeatureNode Serialize
      
      * get_feat_node
      
      * avoid local rpc
      
      * fix get_node_feat
      
      * fix get_node_feat
      
      * remove log
      
      * get_node_feat return  py:bytes
      
      * merge develop with graph_engine
      
      * fix threadpool.h head
      
      * fix
      
      * fix typo
      
      * resolve conflict
      
      * fix conflict
      
      * recover lost content
      
      * fix pybind of FeatureNode
      
      * recover cmake
      
      * recover tools
      
      * resolve conflict
      
      * resolve linking problem
      
      * code style
      
      * change test_server port
      
      * fix code problems
      
      * remove shard_num config
      
      * remove redundent threads
      
      * optimize start server
      
      * remove logs
      
      * fix code problems by reviewers' suggestions
      
      * move graph files into a folder
      
      * code style change
      
      * remove graph operations from base table
      
      * optimize get_feat function of graph engine
      
      * fix long long count problem
      
      * remove redandunt graph files
      
      * remove unused shell
      
      * recover dropout_op_pass.h
      
      * fix potential stack overflow when request number is too large & node add & node clear & node remove
      
      * when sample k is larger than neigbor num, return directly
      
      * using random seed generator of paddle to speed up
      
      * fix bug of random sample k
      
      * fix code style
      
      * fix code style
      
      * add remove graph to fleet_py.cc
      
      * fix blocking_queue problem
      
      * fix style
      
      * fix
      
      * recover capacity check
      
      * add remove graph node; add set_feature
      
      * add remove graph node; add set_feature
      
      * add remove graph node; add set_feature
      
      * add remove graph node; add set_feature
      
      * fix distributed op combining problems
      
      * optimize
      
      * remove logs
      
      * fix MultiSlotDataGenerator error
      
      * cache for graph engine
      
      * fix type compare error
      
      * more test&fix thread terminating problem
      
      * remove header
      
      * change time interval of shrink
      
      * use cache when sample nodes
      
      * remove unused function
      
      * change unique_ptr to shared_ptr
      
      * simplify cache template
      
      * cache api on client
      
      * fix
      
      * reduce sample threads when cache is not used
      
      * reduce cache memory
      
      * cache optimization
      
      * remove test function
      
      * remove extra fetch function
      
      * graph-engine data transfer optimization
      Co-authored-by: Huang Zhengjie <270018958@qq.com>
      Co-authored-by: Weiyue Su <weiyue.su@gmail.com>
      Co-authored-by: suweiyue <suweiyue@baidu.com>
      Co-authored-by: luobin06 <luobin06@baidu.com>
      Co-authored-by: liweibin02 <liweibin02@baidu.com>
      Co-authored-by: tangwei12 <tangwei12@baidu.com>
    • 【PTen】Rename TensorMeta member type to dtype (#37277) · c13edf66
      zyfncg committed
      * rename TensorBase interface data_type() to dtype()
      
      * rename type to dtype of TensorMeta
      
      * merge the code
      
      * merge the code
      
      * fix the problem when merge conflict
    • [PTen] Add copy_to and to method for Tensor (#37262) · 5a000900
      Chen Weihang committed
      * add copy_to and to method for Tensor
      
      * polish msg format
      
      * fix details error
      
      * fix copy_to test compile failed
      
      * fix typo
    • Add paddle.incubate.graph_send_recv API (#37205) · 39012536
      Siming Dai committed
      * add cpu version, using set: sum, min, max
      
      * add cpu version: mean
      
      * improve cpu code and fix dynamic memory allcation problem
      
      * fix arg error, add index judge, delete fp16
      
      * fix bug in CudaAtomicMax and CudaAtomicMin
      
      * add CUDA version
      
      * fix grad_op bug for index
      
      * add op test, add correct cpu grad op
      
      * Add correct CUDA Mean grad
      
      * [Add] Successful MEAN and SUM
      
      * [Add] Successful MIN and MAX in CPU
      
      * [Add] Successful MIN and MAX in CUDA
      
      * fix windows dtype ci
      
      * fix ROCM ci by adding HIP flag
      
      * rename fused_gather_scatter to send_recv
      
      * unify name as send and recv
      
      * change zero index return time
      
      * add send_recv incubate api
      
      * fix index data type, add unittest case for API
      
      * delete redundant input tensor
      
      * fix en example and docs, add default value in pool_type
      
      * add shape judge and max grid judge
      
      * fix comment
      
      * fix index type bug
      
      * add const &
      
      * fix en docs
      
      * delete numpy in examples
      
      * add unittest for int input
      
      * fix send_recv comment
      
      * change send_recv to graph_send_recv
    • [fleet_executor] Parse pipeline config (#37319) · ca088f92
      Yuang Liu committed
    • [fleet_executor] Add interceptor register (#37338) · f11e843a
      WangXi committed
    • [PTen] Add compatible reshape method for Tensor (#37281) · 715fd051
      Chen Weihang committed
      * add reshape method for Tensor
      
      * fix typo
      
      * fix typo
      
      * fix conflit with develop