1. 28 10月, 2021 1 次提交
    • X
      [Cherry-pick]FFT function enhancements and bugfixes (#36537) · 11b9f5f9
      Xiaoxu Chen 提交于
      * update fft api path (#36219)
      
      * update fft api path
      * add sample code for ihfft2
      Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
      
      * fix fft axis (#36321)
      
      fix: `-1` is used when fft's axis is `0`
      
      * use unified external error message for cufft api (#36114)
      
      * fft: modify sample code result (#36325)
      
      * dynamic load mkl as a fft backend when it is avaialble and requested (#36414)
      
      * add rocm support for fft api (#36415)
      
      * move signal apis
      
      * move fft and signal API path (#2)
      
      * move signal apis
      
      * move fft.py and signal.py to paddle/, fix typos
      
      * fix relative imports from fft.py and signal.py
      
      * fix typos in signal.py (#3)
      
      * move signal apis
      
      * move fft.py and signal.py to paddle/, fix typos
      
      * fix relative imports from fft.py and signal.py
      
      * fix typos
      
      * disable Cache when CUFFT_VERSION >= 10200 (#4)
      
      * move signal apis
      
      * move fft.py and signal.py to paddle/, fix typos
      
      * fix relative imports from fft.py and signal.py
      
      * fix typos
      
      * Add LRUCache for fft plans
      
      * add LRUCache for cuff and hipfft (#5)
      
      * move signal apis
      
      * move fft.py and signal.py to paddle/, fix typos
      
      * fix relative imports from fft.py and signal.py
      
      * fix typos
      
      * WIP: add cache
      
      * delete move constructor and operator= for CuFFTHandle and FFTConfig
      
      * remove log from CuFFTHandle and FFTConfig
      
      * add lrucache for fft rocm backend
      
      * disable LRUCache when CUFFT_VERSION >= 10200
      
      * disbale copy and move for hipFFTHandle; format code
      Co-authored-by: NXiaoxu Chen <chenxx_id@163.com>
      
      * remove debug message of cufftHandler
      
      * roll_op: support Tensor as input for shifts (#36727)
      
      * fix fftshift/ifftshift on static mode
      
      * update roll_op version
      
      * add more test cases for fftshift/ifftshift
      Co-authored-by: Nzhiboniu <31800336+zhiboniu@users.noreply.github.com>
      Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
      Co-authored-by: LJQ️ <33169170+lijiaqi0612@users.noreply.github.com>
      11b9f5f9
  2. 27 5月, 2021 1 次提交
  3. 02 4月, 2021 1 次提交
    • S
      graph engine (#31226) · 94736d60
      seemingwang 提交于
      * graph engine demo
      
      * upload unsaved changes
      
      * fix dependency error
      
      * fix shard_num problem
      
      * py client
      
      * remove lock and graph-type
      
      * add load direct graph
      
      * add load direct graph
      
      * add load direct graph
      
      * batch random_sample
      
      * batch_sample_k
      
      * fix num_nodes size
      
      * batch brpc
      
      * batch brpc
      
      * add test
      
      * add test
      
      * add load_nodes; change add_node function
      
      * change sample return type to pair
      
      * resolve conflict
      
      * resolved conflict
      
      * resolved conflict
      
      * separate server and client
      
      * merge pair type
      
      * fix
      
      * resolved conflict
      
      * fixed segment fault; high-level VLOG for load edges and load nodes
      
      * random_sample return 0
      
      * rm useless loop
      
      * test:load edge
      
      * fix ret -1
      
      * test: rm sample
      
      * rm sample
      
      * random_sample return future
      
      * random_sample return int
      
      * test fake node
      
      * fixed here
      
      * memory leak
      
      * remove test code
      
      * fix return problem
      
      * add common_graph_table
      
      * random sample node &test & change data-structure from linkedList to vector
      
      * add common_graph_table
      
      * sample with srand
      
      * add node_types
      
      * optimize nodes sample
      
      * recover test
      
      * random sample
      
      * destruct weighted sampler
      
      * GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * pybind sample nodes api
      
      * pull nodes with step
      
      * fixed pull_graph_list bug; add test for pull_graph_list by step
      
      * add graph table;name
      
      * add graph table;name
      
      * add pybind
      
      * add pybind
      
      * add FeatureNode
      
      * add FeatureNode
      
      * add FeatureNode Serialize
      
      * add FeatureNode Serialize
      
      * get_feat_node
      
      * avoid local rpc
      
      * fix get_node_feat
      
      * fix get_node_feat
      
      * remove log
      
      * get_node_feat return  py:bytes
      
      * merge develop with graph_engine
      
      * fix threadpool.h head
      
      * fix
      
      * fix typo
      
      * resolve conflict
      
      * fix conflict
      
      * recover lost content
      
      * fix pybind of FeatureNode
      
      * recover cmake
      
      * recover tools
      
      * resolve conflict
      
      * resolve linking problem
      
      * code style
      
      * change test_server port
      
      * fix code problems
      
      * remove shard_num config
      
      * remove redundent threads
      
      * optimize start server
      
      * remove logs
      
      * fix code problems by reviewers' suggestions
      Co-authored-by: NHuang Zhengjie <270018958@qq.com>
      Co-authored-by: NWeiyue Su <weiyue.su@gmail.com>
      Co-authored-by: Nsuweiyue <suweiyue@baidu.com>
      Co-authored-by: Nluobin06 <luobin06@baidu.com>
      Co-authored-by: Nliweibin02 <liweibin02@baidu.com>
      94736d60
  4. 06 11月, 2020 1 次提交
  5. 29 10月, 2020 1 次提交
  6. 20 4月, 2020 1 次提交
    • Z
      Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei 提交于
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
      
      * remove build_ex_string,test=develop
      
      * merge conflict,test=develop
      78170037