1. 04 3月, 2022 7 次提交
    • Z
      Fix bug caused by split infershape (#40116) · 45385371
      zyfncg 提交于
      * fix bug caused by split infershape
      
      * revert infer_shape of split
      
      * revert split
      45385371
    • C
      [Phi] Remove cholsky solve deps with svd helper (#40119) · 28fd30cd
      Chen Weihang 提交于
      * remove cholsky solve deps with svd helper
      
      * fix shape infer bug
      28fd30cd
    • zhouweiwei2014's avatar
      【Phi】Migrate bitwise_and/bitwise_or/bitwise_xor/bitwise_not op into phi (#40031) · 03eb792d
      zhouweiwei2014 提交于
      * Migrate bitwise_and/or/xor/not op into phi
      
      * fix CI
      03eb792d
    • L
      clean distribution_helper, index_impl, aligned_vector code in fluid (#40071) · b9672a1e
      Leo Chen 提交于
      * clean distribution_helper, index_impl, aligned_vector code in fluid
      
      * fix conflicts
      b9672a1e
    • C
      [phi]move reduce gpu impl funcs into pten/kernels/funcs (#39990) · e2e2d531
      chentianyu03 提交于
      * move reduce gpu impl funcs into pten/kernels/funcs
      
      * change reduce header name and namespace
      
      * fix spell word error
      
      * change mutable_data to dev_ctx.Alloc
      
      * modify place to devcontex
      
      * format code style
      
      * fix build error
      
      * fix build error
      
      * fix conflict
      e2e2d531
    • C
      [Phi] Change input vec tensor to pointer type (#40078) · 50d5bf79
      Chen Weihang 提交于
      * change input vec tensor to pointer
      
      * update input between
      
      * fix format error
      
      * resolve conflict
      
      * resolve conflict
      50d5bf79
    • H
      Move conv to pten (#39354) · d50fb43e
      hong 提交于
      * move conv to pten
      
      * move conv to pten; test=develop
      
      * fix bug;
      
      * add conv cudnn impl; test=develop
      
      * update
      
      * update operator; test=develop
      
      * fix bug; test=develop
      
      * move operator and prepared_operator to develop; test=develop
      
      * resolve conflict; test=develop
      
      * remove useless code;test=develop
      
      * add depency ; test=develop
      
      * fix bug;
      
      * add sig.cc ; test=develop
      
      * fix use_op error; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * add conv3d register; test=develop
      
      * fix star gan and conv_nn_grad test failed; test=develop
      
      * add header; test=develop
      
      * manul to recover to develop;
      
      * resolve confilct; test=develop
      
      * remove useless code
      
      * fix bug;
      
      * remove conv2d_cudnn; test=develop
      
      * fix bugs; test=develop
      
      * fix cpu rocm compile bugs; test=develop
      
      * fix blas error; test=develop
      
      * fix compile bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows error; test=develop
      
      * resolve confilct; test=develop
      d50fb43e
  2. 03 3月, 2022 9 次提交
    • 0
      move eye, lerp infershape to phi (#40105) · 1c205883
      0x45f 提交于
      1c205883
    • F
      Move compare OPs to phi (#39970) · 0969a4eb
      From00 提交于
      * Move compare OPs to phi
      
      * Fix bug
      
      * Use BroadcastKernel and ElementwiseKernel in phi
      0969a4eb
    • Y
      [Phi]Delete kernel registry of elementwise_sub op in Fluid (#40039) · cac00e0b
      YuanRisheng 提交于
      * delete elementwise_sub kernel registry
      
      * fix compile bugs in xpu ci
      
      * fix bugs when run inference ci
      cac00e0b
    • X
      [phi] transfer pad kernel into phi and pass the test_pad_op (#40012) · 9f74b84e
      xiongkun 提交于
      * add pad forward
      
      * fix error
      
      * transfer pad and pass the test_pad_op
      9f74b84e
    • Z
      [PHI] Code auto-generate for Sparse API (#40060) · 31d3d857
      zyfncg 提交于
      * suppport sparse api in yaml
      
      * support auto-gen code of sparse api
      
      * do some refactor
      
      * add unittest test_sparse_conv_api
      
      * add unitest file
      Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
      31d3d857
    • F
      [Phi] move gaussian_random (#39932) · 00bbb8c5
      furnace 提交于
      [Phi] move gaussian_random kernel
      00bbb8c5
    • N
      Modified Reduce for XPU2 (#38918) · 909d1e61
      niuliling123 提交于
      1. set xpu2 block_size = 64
      2. fix a bug when reduce_num is too large
      909d1e61
    • Z
      Implement SparseConv3d kernel (#39784) · 6bf85eaf
      zhangkaihuo 提交于
      * sparse conv3d: gpu code
      6bf85eaf
    • H
      Move bn to pten (#39347) · ebd0f512
      hong 提交于
      * add bn cpu version; test=develop
      
      * move batch norm to pten
      
      * move batch norm to pten; test=develop
      
      * fix bug; test=develop
      
      * fix func::tranpose depend bug; test=develop
      
      * fix compile bugs; test=develop
      
      * fix use_op batch_norm bug; test=develop
      
      * fix cudnn bn add relu test; test=develop
      
      * fix pten context build and double grad bug; test= develop
      
      * remve useless code; test=develop
      
      * add batch norm gpu fp16 support; test=develop
      
      * fix test bn op bug; test=develop
      
      * remove output dtype set; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * fix applay pass to program bug; test=develop
      
      * revert to develop; test=develop
      
      * fix rocm bug; test=develop
      
      * revert operator to develop; test=develop
      
      * fix pre_commit; test=develop
      
      * fix statci check error; test=develop
      
      * resolve conflict; test=develop
      
      * ana batch norm bug;
      
      * revert batch norm op
      
      * resolve conlict
      
      * fix nan inf and speed bug; test=develop
      
      * fix bug; test=develop
      
      * fix error; test=develop
      
      * test expand op; test=develop
      
      * fix bug; test=develop
      
      * resolve confilct
      
      * resolve confilct; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
      
      * change mutable data to ctx alloc; test=develop
      
      * make format same with ci; test=develop
      
      * fix format error with ci; test=develop
      ebd0f512
  3. 02 3月, 2022 11 次提交
  4. 01 3月, 2022 10 次提交
  5. 28 2月, 2022 3 次提交
    • S
      Move index sample (#39905) · 1b585b28
      seemingwang 提交于
      * graph engine demo
      
      * upload unsaved changes
      
      * fix dependency error
      
      * fix shard_num problem
      
      * py client
      
      * remove lock and graph-type
      
      * add load direct graph
      
      * add load direct graph
      
      * add load direct graph
      
      * batch random_sample
      
      * batch_sample_k
      
      * fix num_nodes size
      
      * batch brpc
      
      * batch brpc
      
      * add test
      
      * add test
      
      * add load_nodes; change add_node function
      
      * change sample return type to pair
      
      * resolve conflict
      
      * resolved conflict
      
      * resolved conflict
      
      * separate server and client
      
      * merge pair type
      
      * fix
      
      * resolved conflict
      
      * fixed segment fault; high-level VLOG for load edges and load nodes
      
      * random_sample return 0
      
      * rm useless loop
      
      * test:load edge
      
      * fix ret -1
      
      * test: rm sample
      
      * rm sample
      
      * random_sample return future
      
      * random_sample return int
      
      * test fake node
      
      * fixed here
      
      * memory leak
      
      * remove test code
      
      * fix return problem
      
      * add common_graph_table
      
      * random sample node &test & change data-structure from linkedList to vector
      
      * add common_graph_table
      
      * sample with srand
      
      * add node_types
      
      * optimize nodes sample
      
      * recover test
      
      * random sample
      
      * destruct weighted sampler
      
      * GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * pybind sample nodes api
      
      * pull nodes with step
      
      * fixed pull_graph_list bug; add test for pull_graph_list by step
      
      * add graph table;name
      
      * add graph table;name
      
      * add pybind
      
      * add pybind
      
      * add FeatureNode
      
      * add FeatureNode
      
      * add FeatureNode Serialize
      
      * add FeatureNode Serialize
      
      * get_feat_node
      
      * avoid local rpc
      
      * fix get_node_feat
      
      * fix get_node_feat
      
      * remove log
      
      * get_node_feat return  py:bytes
      
      * merge develop with graph_engine
      
      * fix threadpool.h head
      
      * fix
      
      * fix typo
      
      * resolve conflict
      
      * fix conflict
      
      * recover lost content
      
      * fix pybind of FeatureNode
      
      * recover cmake
      
      * recover tools
      
      * resolve conflict
      
      * resolve linking problem
      
      * code style
      
      * change test_server port
      
      * fix code problems
      
      * remove shard_num config
      
      * remove redundent threads
      
      * optimize start server
      
      * remove logs
      
      * fix code problems by reviewers' suggestions
      
      * move graph files into a folder
      
      * code style change
      
      * remove graph operations from base table
      
      * optimize get_feat function of graph engine
      
      * fix long long count problem
      
      * remove redandunt graph files
      
      * remove unused shell
      
      * recover dropout_op_pass.h
      
      * fix potential stack overflow when request number is too large & node add & node clear & node remove
      
      * when sample k is larger than neigbor num, return directly
      
      * using random seed generator of paddle to speed up
      
      * fix bug of random sample k
      
      * fix code style
      
      * fix code style
      
      * add remove graph to fleet_py.cc
      
      * fix blocking_queue problem
      
      * fix style
      
      * fix
      
      * recover capacity check
      
      * add remove graph node; add set_feature
      
      * add remove graph node; add set_feature
      
      * add remove graph node; add set_feature
      
      * add remove graph node; add set_feature
      
      * fix distributed op combining problems
      
      * optimize
      
      * remove logs
      
      * fix MultiSlotDataGenerator error
      
      * cache for graph engine
      
      * fix type compare error
      
      * more test&fix thread terminating problem
      
      * remove header
      
      * change time interval of shrink
      
      * use cache when sample nodes
      
      * remove unused function
      
      * change unique_ptr to shared_ptr
      
      * simplify cache template
      
      * cache api on client
      
      * fix
      
      * reduce sample threads when cache is not used
      
      * reduce cache memory
      
      * cache optimization
      
      * remove test function
      
      * remove extra fetch function
      
      * graph-engine data transfer optimization
      
      * support graph_split load&query
      
      * remove logs
      
      * change shards to pointer vector
      
      * use inference
      
      * remove test code
      
      * renorm op
      
      * simplify renorm op
      
      * recover local changes
      
      * recover renorm op kernel
      
      * fix init
      
      * add blanklines in renorm doc
      
      * fix import
      
      * fix import
      
      * add renorm to init.py
      
      * merge
      
      * move index_sample op
      
      * Delete api.h
      
      * Delete api.cc
      
      * fix
      
      * remove logs
      
      * recover infer shape of grad
      
      * recover changes
      
      * change shape
      
      * fix label
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      Co-authored-by: NHuang Zhengjie <270018958@qq.com>
      Co-authored-by: NWeiyue Su <weiyue.su@gmail.com>
      Co-authored-by: Nsuweiyue <suweiyue@baidu.com>
      Co-authored-by: Nluobin06 <luobin06@baidu.com>
      Co-authored-by: Nliweibin02 <liweibin02@baidu.com>
      Co-authored-by: Ntangwei12 <tangwei12@baidu.com>
      1b585b28
    • Z
      Add sparse conv3d kernel (#39879) · bc99a76c
      zhangkaihuo 提交于
      * fix incorrect dims settings
      
      * sparse conv3d
      
      * fix out dims
      
      * test performance
      
      * test large shape success
      
      * opt scatter, double performance
      
      * test float16
      
      * remove profiling code
      
      * remove pten
      
      * opt code lines
      
      * correct boundary judgment
      
      * only cpu
      
      * test ci
      
      * test ci
      
      * remove the including paddle/fluid header; extract the conmmon function
      
      * opt code lines
      
      * use DenseTensor::data() instead of mutable_data
      
      * return rulebook for backward
      
      * specify layout
      
      * rename:conv -> sparse_conv3d
      bc99a76c
    • F
      [Phi] move truncated_gaussian_random kernel (#39971) · 23aa7a36
      furnace 提交于
      * [Phi] move truncated_gaussian_random, copy kernels
      
      * [Phi] move truncated_gaussian_random, kernel register
      
      * [Phi] move truncated_gaussian_random, delete useless codes
      23aa7a36