1. 21 Mar, 2022 1 commit
    • H
      Add yaml config part0 (#40020) · cc853e95
      hong committed
      * add add yaml
      
      * add elementwise add yaml; test=develop
      
      * add norm
      
      * update
      
      * add some yaml config; test=develop
      
      * fix bug; test=develop
      
      * fix compare error; test=develop
      
      * revert eager_gen.py
      
      * update; test=develop
      
      * remove useless code; test=develop
      
      * fix bug; test=develop
      
      * fix test error; test=develop
      
      * remove int_type; test=develop
      
      * fix type error; test=develop
      
      * format; test=develop
      
      * remove type register; test=develop
      
      * polish code; test=develop
      
      * fix ci error; test=develop
  2. 19 Mar, 2022 1 commit
  3. 16 Mar, 2022 1 commit
  4. 03 Mar, 2022 1 commit
  5. 01 Mar, 2022 1 commit
  6. 22 Feb, 2022 2 commits
  7. 20 Feb, 2022 1 commit
  8. 27 Jan, 2022 1 commit
  9. 06 Jan, 2022 1 commit
  10. 31 Dec, 2021 1 commit
  11. 28 Dec, 2021 1 commit
    • H
      Add API and op for take_along_axis (#38396) · 3310f519
      huangxu96 committed
      * add API and op for take_along_axis
      
      * fix compile dependency problem and add example code and doc
      
      * add unittest
      
      * delete some code for CI coverage
      
      * fix code style problem
      
      * fix as review
  12. 22 Dec, 2021 1 commit
  13. 20 Dec, 2021 1 commit
  14. 17 Dec, 2021 1 commit
  15. 16 Dec, 2021 1 commit
    • L
      Add float16 type for scatter op. (#38136) · 9bac4a76
      Li Min committed
      * Add float16 type for scatter op.
      
      * Add fp16 test for scatter op.
      
      * Add int and int64 support for scatter_grad on gpu.
      
      * Add int and int64 for check_variable_and_dtype routine.
      
      * Minors.
      
      * Code format.
  16. 15 Dec, 2021 1 commit
  17. 13 Dec, 2021 1 commit
    • Z
      [rot90] fix rot90 (#38042) · 201b7c9d
      zmxdream committed
      * [rot90] fix rot90
      
      * fix rot90
      
      * fix for ci. test=develop
      
      * fix rot90. test=develop
      
      * update. test=develop
      
      * update. test=develop
  18. 10 Dec, 2021 1 commit
  19. 08 Dec, 2021 1 commit
  20. 06 Dec, 2021 1 commit
    • Z
      [New API]add rot90 api (#37634) · 6ff19d66
      zmxdream committed
      * update
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * update. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix sample code. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix paddle.rot90 doc. test=develop
      
      * update ut. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix doc. test=develop
  21. 26 Oct, 2021 1 commit
  22. 09 Oct, 2021 1 commit
    • F
      Add new API 'tensordot' (#36273) · 21dc7f40
      From00 committed
      * Add new API tensordot
      
      * Set timeout value 400 for UT; Fix format for EN docs
      
      * Set timeout value 1000 for UT; Fix format for EN docs
      
      * Remove some input check
      
      * Coding style improvement: don't compare boolean values to True or False using ==
  23. 18 Sep, 2021 1 commit
    • F
      Add FFT related operators and APIs (#35665) · 11518a43
      Feiyu Chan committed
      * 1. add interface for fft;
      2. add data type predicate;
      3. fix paddle.roll.
      
      * add fft c2c cufft kernel
      
      * implement argument checking & op calling parts for fft_c2c and fftn_c2c
      
      * add operator and opmaker definitions
      
      * only register float and double for cpu.
      
      * add common code for implementing FFT, add pocketfft as a dependency
      
      * add fft c2c cufft kernel function
      
      * fix bugs in python interface
      
      * add support for c2r, r2c operators, op makers, kernels and kernel functors.
      
      * test and fix bugs
      
      * 1. fft_c2c function: add support for onesided=False;
      2. add complex<float>, complex<double> support for concat and flip.
      
      * 1. fft: fix python api bugs;
      2. shape_op: add support for complex data types.
      
      * fft c2c cufft kernel done with compile and link
      
      * fix shape_op, add mkl placeholder
      
      * remove mkl
      
      * complete fft c2c in gpu
      
      * 1. implement mkl-based fft, FFTC2CFunctor and common function exec_fft;
      2. change the design, add input and output typename as template parameter for all FFTFunctors, update pocketfft-based implementation.
      
      * complete fft c2c on gpu in ND
      
      * complete fft c2c on gpu in ND
      
      * complete fft c2c backward in ND
      
      * fix MKL-based implementation
      
      * Add frame op and CPU/GPU kernels.
      
      * Add frame op forward unittest.
      
      * Add frame op forward unittest.
      
      * Remove axis parameter in FrameFunctor.
      
      * Add frame op grad CPU/GPU kernels and unittest.
      
      * Add frame op grad CPU/GPU kernels and unittest.
      
      * Update doc string.
      
      * Update after review and remove librosa requirement in unittest.
      
      * Update grad kernel.
      
      * add fft_c2r op
      
      * Remove data allocation in TransCompute function.
      
      * add fft r2c onesided with cpu(pocketfft/mkl) and gpu
      
      * last fft c2r functor
      
      * fix C2R and R2C for cufft, because the direction is not an option in these cases.
      
      * add fft r2c onesided with cpu(pocketfft/mkl) and gpu
      
      * fix bugs in python APIs
      
      * fix fft_c2r grad kernel
      
      * fix bugs in python APIs
      
      * add cuda fft c2r grad kernel functor
      
      * clean code
      
      * fix fft_c2r python API
      
      * fill fft r2c result with conjugate symmetry (#19)
      
      fill fft r2c result with conjugate symmetry
      
      * add placeholder for unittests (#24)
      
      * simply parameterize test functions by auto-generating test cases from a param list (#25)
      
      * miscellaneous fixes for python APIs (#26)
      
      * add placeholder for unittests
      
      * resize fft inputs before computation if n or s is provided.
      
      * add complex kernels for pad and pad_grad
      
      * simplify argument checking.
      
      * add type promotion
      
      * add int to float or complex promotion
      
      * fix output data type for static mode
      
      * fix fft's input dtype dispatch, import fft to paddle
      
      * fix typos in axes checking (#27)
      
      * fix typos in axes checking
      
      * fix argument checking (#28)
      
      * fix argument checking
      
      * Add C2R Python layer normal and abnormal use cases (#29)
      
      * documents and single case
      
      * test c2r case
      
      * New C2R Python layer normal and exception use cases
      
      * complete rfft,rfft2,rfftn,ihfft,ihfft2,ihfftn unittest and doc string (#30)
      
      * Documentation of the common interfaces of c2r and c2c (#31)
      
      * Documentation of the common interfaces of c2r and c2c
      
      * clean c++ code  (#32)
      
      * clean code
      
      * Add numpy-based implementation of spectral ops (#33)
      
      * add numpy reference implementation of spectral ops
      
      * Add fft_c2r numpy based implementation for unittest. (#34)
      
      * add fft_c2r numpy implementation
      
      * Add deframe op and stft/istft api. (#23)
      
      * Add frame api
      
      * Add deframe op and kernels.
      
      * Add stft and istft apis.
      
      * Add deframe api. Update stft and istft apis.
      
      * Fix bug in frame_from_librosa function when input dims >= 3
      
      * Rename deframe to overlap_add.
      
      * Update istft.
      
      * Update after code review.
      
      * Add overlap_add op and stft/istft api unittest (#35)
      
      * Add overlap_add op unittest.
      
      * Register complex kernels of squeeze/unsqueeze op.
      
      * Add stft/istft api unittest.
      
      * Add unittest for fft helper functions (#36)
      
      * add unittests for fft helper functions. add complex kernel for roll op.
      
      * complete static graph unittest for all public api (#37)
      
      * Unittest of op with FFT C2C, C2R and r2c added (#38)
      
      * documents and single case
      
      * test c2r case
      
      * New C2R Python layer normal and exception use cases
      
      * Documentation of the common interfaces of c2r and c2c
      
      * Unittest of op with FFT C2C, C2R and r2c added
      Co-authored-by: lijiaqi <lijiaqi0612@163.com>
      
      * add fft related options to CMakeLists.txt
      
      * fix typos and clean code (#39)
      
      * fix invisible character in mkl branch and fix error in error message
      
      * clean code: remove docstring from unittest for signal.py.
      
      * always convert numpy array to paddle.Tensor to avoid comparing numpy dtype with paddle dtype. (#40)
      
      * always convert numpy array to paddle.Tensor to avoid comparing numpy dtype with paddle dtype.
      
      * fix CI Errors: numpy dtype comparison, thrust when cuda is not available (#41)
      
      1. always convert numpy array to paddle.Tensor to avoid comparing numpy dtype with paddle dtype.
      2. promote floating point tensor to complex tensor for fft_c2c and fft_c2r;
      3. fix unittest to catch UnImplementedError and RuntimeError;
      4. fix compile error by avoid using thrust when cuda is not available.
      5.  fix sample code, use paddle.fft instead of paddle.tensor.fft
      
      * remove inclusion of thrust, add __all__ list for fft (#42)
      
      * Add api doc and update unittest. (#43)
      
      * Add doc strings.
      * Update overlap_add op unittest
      
      * fix MKL-based FFT implementation (#44)
      
      * fix MKL-based FFT implementation, MKL CDFT's FORWARD DOMAIN is always REAL for R2C and C2R
      
      * remove code for debug (#45)
      
      * use dynload for cufft (#46)
      
      * use std::ptrdiff_t as datatype of stride (instead of int64_t) to avoid argument mismatch on some platforms.
      
      * add complex support for fill_zeros_like
      
      * use dynload for cufft
      
      * Update doc and unittest. (#47)
      
      * Add doc of frame op and overlap_add op.
      
      * Update unittest.
      
      * use dynload for cufft (#48)
      
      1. use dynload for cufft
      2. fix unittest;
      3. temporarily disable Rocm.
      
      * fix conflicts and merge upstream (#49)
      
      fix conflicts and merge upstream
      
      * fix compile error: only link dynload_cuda when cuda is available (#50)
      
      * fix compile error: only link dynload_cuda when cuda is available
      
      * fix dynload for cufft on windows (#51)
      
      1. fix dynload for cufft on windows;
      2. fix unittests.
      
      * add NOMINMAX to compile on windows (#52)
      
      add NOMINMAX to compile on windows
      
      * explicitly specify capture mode for lambdas (#55)
      
      explicitly specify capture mode for lambdas
      
      * fix fft sample (#53)
      
      * fix fft sample
      
      * update scipy and numpy version for unittests of fft (#56)
      
      update scipy and numpy version for unittests of fft
      
      * Add static graph unittests of frame and overlap_add api. (#57)
      
      * Remove cache of cuFFT & Disable ONEMKL (#59)
      
      1. replace numpy.fft with scipy.fft as numpy<1.20 does not support ortho norm
      2. remove cache of cufft plans;
      3. enhance error checking.
      4. default WITH_ONEMKL to OFF
      Co-authored-by: jeff41404 <jeff41404@gmail.com>
      Co-authored-by: root <root@bjyz-sys-gpu-kongming9.bjyz.baidu.com>
      Co-authored-by: KP <109694228@qq.com>
      Co-authored-by: lijiaqi <lijiaqi0612@163.com>
      Co-authored-by: Xiaoxu Chen <chenxx_id@163.com>
      Co-authored-by: lijiaqi0612 <33169170+lijiaqi0612@users.noreply.github.com>
  24. 14 Sep, 2021 1 commit
  25. 10 Sep, 2021 2 commits
    • S
      Fix warning (#34875) · 966f042d
      sunzhongkai588 committed
      * fix warning error , test=document_fix
      
      * fix warning error , test=document_fix
      
      * fix warning error , test=document_fix
      
      * fix warning error , test=document_fix
      
      * fix warning error , test=document_fix
      
      * fix warning error , test=document_fix
      
      * fix warning error , test=document_fix
    • Z
      add api_op fill_diagonal_tensor (#34515) · 98d047d7
      zhiboniu committed
  26. 07 Sep, 2021 1 commit
  27. 17 Aug, 2021 1 commit
  28. 16 Aug, 2021 1 commit
    • D
      add unique_consecutive_op (#34334) · 875cfd57
      duanboqiang committed
      * add unique_consecutive_op
      
      * add unique_consecutive_op
      
      * add unique_consecutive_op
      
      * add unique_consecutive_op
      
      * add unique_consecutive_op
      
      * add unique_consecutive_op
      
      * add unique_consecutive_op
      
      * add unique_consecutive_op
      
      * remove unity build
      
      * add unique_consecutive op
      
      * add unique_consecutive op
      
      * add enable static
      
      * add noqa
      
      * add space line
      
      * add default case.
      
      * add comma
      
      * add space line
      
      * modify unique_consecutive unittest
      
      * optimize ut coverage
      
      * rebase develop
      
      * improve coverage
      
      * update en docs
      
      * update en docs
      
      * update en docs
      
      * update en docs
      
      * update en docs
      
      * update en doc
  29. 26 Jul, 2021 1 commit
  30. 15 Jul, 2021 1 commit
  31. 08 Jul, 2021 1 commit
  32. 01 Jul, 2021 1 commit
  33. 23 Jun, 2021 1 commit
  34. 21 Jun, 2021 1 commit
  35. 11 Jun, 2021 1 commit
  36. 18 May, 2021 1 commit
  37. 30 Apr, 2021 1 commit
    • P
      Add 12 inplace APIs including auto generated (#32573) · 308073de
      pangyoki committed
      * add relu6_ hardsigmoid_ leaky_relu_ Inplace APIs
      
      * add softmax_with_cross_entropy_ Inplace API
      
      * add clip_ scale_ add_ subtract_ Inplace APIs
      
      * add wlist
      
      * fix parameter of scale api
      
      * add add_n_ Inplace API and remove log_ Inplace API
      
      * fix elementwise_add_ and elementwise_sub_ broadcast problem
      
      * elementwise inplace api give error message before run the op
      
      * use broadcast_shape in elementwise inplace op
      
      * add 8 inplace apis that are auto generated
      
      * add unittest for all inplace apis
      
      * add decorator for inplace apis in static mode
      
      * fix windows blas fail of exp inplace api, change array_equal to allclose
      
      * add flatten inplace api
      
      * add flatten unittest
      
      * fix flatten unittest
      
      * add decorator
      
      * fix grad.numpy in test_pylayer_op
      
      * unsupport softmax_with_cross_entropy_
      
      * add test_inplace_softmax_with_cross_entropy to static_mode_white_list
      
      * delete __all__ in inplace_utils
      
      * delete activation inplace function and add Tensor.inplace_func
      
      * change paddle.inplace_ to Tensor.inplace_
      
      * fix little problem
      
      * add paddle in inplace_utils
  38. 29 Apr, 2021 1 commit