1. 17 1月, 2022 1 次提交
  2. 10 1月, 2022 1 次提交
    • H
      Add gpu kernel for new api : linalg.lstsq (#38621) · 405103d8
      Haohongxiang 提交于
      * add lstsq gpu kernel
      
      * update
      
      * add docs_en
      
      * modify ut
      
      * fix bugs
      
      * modify example in docs_en
      
      * remove lstsq_op.cu from ROCM cmake
      
      * modify docs_en
      
      * modify docs_en
      
      * modify docs_en
      
      * remove unneccessary TensorCopy
      405103d8
  3. 06 1月, 2022 1 次提交
  4. 05 1月, 2022 1 次提交
  5. 31 12月, 2021 4 次提交
  6. 30 12月, 2021 2 次提交
  7. 29 12月, 2021 1 次提交
  8. 28 12月, 2021 4 次提交
  9. 27 12月, 2021 1 次提交
  10. 24 12月, 2021 8 次提交
    • W
      add nansum api to math (#38137) · 6554cc10
      wangguanqun 提交于
      * add nansum api
      
      * delete layerhelper
      
      * add nansum to all and tensor_method_func
      
      * update doc
      
      * update doc
      
      * update doc
      6554cc10
    • S
      renorm op (#38130) · 6982871d
      seemingwang 提交于
      * graph engine demo
      
      * upload unsaved changes
      
      * fix dependency error
      
      * fix shard_num problem
      
      * py client
      
      * remove lock and graph-type
      
      * add load direct graph
      
      * add load direct graph
      
      * add load direct graph
      
      * batch random_sample
      
      * batch_sample_k
      
      * fix num_nodes size
      
      * batch brpc
      
      * batch brpc
      
      * add test
      
      * add test
      
      * add load_nodes; change add_node function
      
      * change sample return type to pair
      
      * resolve conflict
      
      * resolved conflict
      
      * resolved conflict
      
      * separate server and client
      
      * merge pair type
      
      * fix
      
      * resolved conflict
      
      * fixed segment fault; high-level VLOG for load edges and load nodes
      
      * random_sample return 0
      
      * rm useless loop
      
      * test:load edge
      
      * fix ret -1
      
      * test: rm sample
      
      * rm sample
      
      * random_sample return future
      
      * random_sample return int
      
      * test fake node
      
      * fixed here
      
      * memory leak
      
      * remove test code
      
      * fix return problem
      
      * add common_graph_table
      
      * random sample node &test & change data-structure from linkedList to vector
      
      * add common_graph_table
      
      * sample with srand
      
      * add node_types
      
      * optimize nodes sample
      
      * recover test
      
      * random sample
      
      * destruct weighted sampler
      
      * GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * pybind sample nodes api
      
      * pull nodes with step
      
      * fixed pull_graph_list bug; add test for pull_graph_list by step
      
      * add graph table;name
      
      * add graph table;name
      
      * add pybind
      
      * add pybind
      
      * add FeatureNode
      
      * add FeatureNode
      
      * add FeatureNode Serialize
      
      * add FeatureNode Serialize
      
      * get_feat_node
      
      * avoid local rpc
      
      * fix get_node_feat
      
      * fix get_node_feat
      
      * remove log
      
      * get_node_feat return  py:bytes
      
      * merge develop with graph_engine
      
      * fix threadpool.h head
      
      * fix
      
      * fix typo
      
      * resolve conflict
      
      * fix conflict
      
      * recover lost content
      
      * fix pybind of FeatureNode
      
      * recover cmake
      
      * recover tools
      
      * resolve conflict
      
      * resolve linking problem
      
      * code style
      
      * change test_server port
      
      * fix code problems
      
      * remove shard_num config
      
      * remove redundent threads
      
      * optimize start server
      
      * remove logs
      
      * fix code problems by reviewers' suggestions
      
      * move graph files into a folder
      
      * code style change
      
      * remove graph operations from base table
      
      * optimize get_feat function of graph engine
      
      * fix long long count problem
      
      * remove redandunt graph files
      
      * remove unused shell
      
      * recover dropout_op_pass.h
      
      * fix potential stack overflow when request number is too large & node add & node clear & node remove
      
      * when sample k is larger than neigbor num, return directly
      
      * using random seed generator of paddle to speed up
      
      * fix bug of random sample k
      
      * fix code style
      
      * fix code style
      
      * add remove graph to fleet_py.cc
      
      * fix blocking_queue problem
      
      * fix style
      
      * fix
      
      * recover capacity check
      
      * add remove graph node; add set_feature
      
      * add remove graph node; add set_feature
      
      * add remove graph node; add set_feature
      
      * add remove graph node; add set_feature
      
      * fix distributed op combining problems
      
      * optimize
      
      * remove logs
      
      * fix MultiSlotDataGenerator error
      
      * cache for graph engine
      
      * fix type compare error
      
      * more test&fix thread terminating problem
      
      * remove header
      
      * change time interval of shrink
      
      * use cache when sample nodes
      
      * remove unused function
      
      * change unique_ptr to shared_ptr
      
      * simplify cache template
      
      * cache api on client
      
      * fix
      
      * reduce sample threads when cache is not used
      
      * reduce cache memory
      
      * cache optimization
      
      * remove test function
      
      * remove extra fetch function
      
      * graph-engine data transfer optimization
      
      * support graph_split load&query
      
      * remove logs
      
      * change shards to pointer vector
      
      * use inference
      
      * remove test code
      
      * renorm op
      
      * simplify renorm op
      
      * recover local changes
      
      * recover renorm op kernel
      
      * fix init
      
      * add blanklines in renorm doc
      
      * fix import
      
      * fix import
      Co-authored-by: NHuang Zhengjie <270018958@qq.com>
      Co-authored-by: NWeiyue Su <weiyue.su@gmail.com>
      Co-authored-by: Nsuweiyue <suweiyue@baidu.com>
      Co-authored-by: Nluobin06 <luobin06@baidu.com>
      Co-authored-by: Nliweibin02 <liweibin02@baidu.com>
      Co-authored-by: Ntangwei12 <tangwei12@baidu.com>
      6982871d
    • T
      add gradient unittest and update code example for max/min (#38393) · ee69f437
      Tao Luo 提交于
      * add gradient unittest and update code example for max/min
      
      * update docs
      
      * remove _get_reduce_all_value
      ee69f437
    • Z
      new API inner&outer (#37706) · b463dff4
      zhiboniu 提交于
      b463dff4
    • zhouweiwei2014's avatar
      add new API/OP:paddle.Tensor.exponential_ (#38256) · 33185000
      zhouweiwei2014 提交于
      * add new API/OP:paddle.Tensor.exponential_
      
      * fix CI
      33185000
    • Z
      Add new API cholesky_solve (#38167) · 39f7c41f
      zhiboniu 提交于
      39f7c41f
    • zhouweiwei2014's avatar
      add new API/OP: paddle.poisson (#38117) · bcf86e5c
      zhouweiwei2014 提交于
      * add new API/OP:paddle.poisson
      
      * fix comment
      bcf86e5c
    • J
      Support test imperative basic in eager (#38313) · d48f7c89
      Jiabin Yang 提交于
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specific randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * support inference test
      
      * refine test and fix initializer failed
      Co-authored-by: Njim19930609 <jim19930609@gmail.com>
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      d48f7c89
  11. 23 12月, 2021 3 次提交
    • W
      add control/status API (#37885) · 21b7ed3e
      wuhuanzhou 提交于
      * add control/status API, test=develop
      
      * fix import error, test=develop
      
      * add is_grad_enabled unittest, test=develop
      
      * add code comment for example code and API, test=develop
      
      * add checking for type, test=develop
      
      * add api description, test=develop
      
      * fix docs index_en, test=document_fix
      
      * fix doc of is_floating_point, test=document_fix
      21b7ed3e
    • W
      Add erfinv API (#38295) · 6b59b58c
      wuhuanzhou 提交于
      * add erfinv API, test=develop
      
      * fix gradient accuracy error, test=develop
      
      * fix cuda compilation error on Windows, test=develop
      
      * fix M_2_SQRTPI undeclared identifier on Windows, test=develop
      6b59b58c
    • zhouweiwei2014's avatar
      add new API: paddle.clone;Tensor.element_size;nn.utils.parameters_to_vector (#38020) · 0eb03ed7
      zhouweiwei2014 提交于
      * add new API: paddle.clone;Tensor.element_size;nn.utils.parameters_to_vector
      
      * fix comment
      0eb03ed7
  12. 22 12月, 2021 1 次提交
  13. 21 12月, 2021 1 次提交
  14. 20 12月, 2021 2 次提交
  15. 18 12月, 2021 1 次提交
  16. 17 12月, 2021 1 次提交
  17. 16 12月, 2021 5 次提交
  18. 15 12月, 2021 1 次提交
  19. 13 12月, 2021 1 次提交
    • W
      add logit API (#37844) · b197bfe6
      wangzhen38 提交于
      * add Logit API
      
      * add unittest
      
      * conflict
      
      * pull conflit
      
      * pull conflit logit
      
      * fix unititest
      
      * fix code style
      
      * update docs style of
      
      * update en doc
      
      * fix docs en style
      
      * fix docs en style1
      
      * fix docs en style2
      
      * fix docs en style3
      
      * fix docs en style4
      
      * fix docs en style5
      
      * fix docs en style6
      
      * fix docs en style7
      
      * fix docs en style8
      
      * update by review
      
      * fix nan bug
      b197bfe6