1. 20 1月, 2021 6 次提交
    • W
      use nvtx push pop in timeline (#30567) · 90773473
      wanghuancoder 提交于
      * delete empty line of pybing.cc, test=develop
      
      * use nvtx push pop in timeline, test=develop
      
      * change year, test=develop
      
      * add #ifdef PADDLE_WITH_CUDA, test=develop
      
      * add #ifndef WIN32, test=develop
      
      * is_pushed to is_pushed_, test=develop
      90773473
    • C
      make abs op support complex types (#30375) · 358106fc
      chentianyu03 提交于
      * rewrite abs op
      
      * rewrite abs op and remove abs in activation
      
      * remove abs register in old codes
      
      * fix abs_grad type error
      
      * fix abs double_grad output name error
      
      * modify abs_grad, abs_grad_grad functor for windows building
      
      * format code style
      
      * fix the bug of result is nan when the divisor is zero
      
      * add missing abs attr and add abs for float16
      358106fc
    • W
      update. (#30585) · 2d5758c4
      Wilber 提交于
      2d5758c4
    • T
      disable test_analyzer_detect (#30541) · 9dd71c74
      Tao Luo 提交于
      9dd71c74
    • T
      add trainers for pserver (#30523) · c9e78a22
      tangwei12 提交于
      * add trainers for pserver
      
      Change-Id: I1a75793ec81ce126d07f4c47cae09b95d530bbc8
      c9e78a22
    • W
      add some RecordEvent, for dygraph timeline (#30299) · d1b25ed9
      wanghuancoder 提交于
      * add some RecordEvent, for dygraph timeline, test=develop
      
      * change GpuMemcpySync to memory::Copy, test=develop
      
      * fix compile problem, test=develop
      
      * fix compile problem, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      d1b25ed9
  2. 19 1月, 2021 9 次提交
  3. 18 1月, 2021 7 次提交
  4. 17 1月, 2021 1 次提交
  5. 16 1月, 2021 1 次提交
  6. 15 1月, 2021 6 次提交
    • Z
      add VecCastCUDAKernel (#30296) · c9a334e1
      Zhang Ting 提交于
      c9a334e1
    • P
      Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) · 13d75736
      pangyoki 提交于
      * add view strategy on squeeze,unsqueeze,reshape,flatten
      
      * add squeeze unittest
      
      * add unittests
      
      * use View strategy as name rather than Reuse Allacation
      
      * fix view api doc
      
      * fix format
      
      * use core.ops when input of reshape2 is Tensor
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * fix test_cross_entropy_loss error because of reshape2
      
      * add inplace strategy
      
      * add elementwise_add sub
      
      * let backward op not use inplace
      
      * grad op do not use inplace
      
      * fix memory increase error and add leaf error message
      
      * delete selected_rows
      
      * change op_function
      
      * little change
      
      * solve HandleViewBetweenInputAndOutput
      
      * add unittest and leaf error message
      
      * merge view error
      
      * optimize op_function_generator format and support sum inplace op
      
      * fix format of basic_engine
      
      * fix format for framework
      
      * little change of variable wrapper
      
      * add reshape, squeeze, unsqueeze, scatter api
      
      * add relu elu tanh softmax inplace api
      
      * fix test_squeeze_op unittest
      
      * fix test_relu_op unittest
      
      * fix comment problems
      
      * delete sample code of inplace api
      
      * add reference of grad_pending_nodes in basic_engine
      
      * fix unittest name
      
      * add inplace apis into wlist
      
      * fix error message
      
      * add PADDLE_ENFORCE for set grad op twice
      
      * fix head file error
      13d75736
    • Y
      Fix float64 bug in layer norm (#30452) · 008b0a8b
      Yang Zhang 提交于
      built-in `rsqrt` is shadowed
      008b0a8b
    • 715d8628
    • W
      fix cache key for inplaced elementwise ops (#30404) · 88fc7a7d
      Wojciech Uss 提交于
      88fc7a7d
    • W
      fix the rnn mask memory bug for out of read (#30459) · 3d49882e
      wawltor 提交于
      * fix the rnn mask memory bug for out of read
      
      * update the code for the rnn
      3d49882e
  7. 14 1月, 2021 5 次提交
  8. 13 1月, 2021 5 次提交
    • C
      skip quantizing ops in cpu inference (#30342) · 8e3a2940
      cc 提交于
      * skip quantizing ops in cpu inference, test=develop
      8e3a2940
    • A
      Added support for inference using quantization aware trained dygraph (#30288) · 7bbf3ac5
      alncat 提交于
      * added support for inference using qunatization aware trained dygraph
      
      * added support for inference using qunatization aware trained dygraph
      correct boost get usage
      
      * Delete incorrect warning message (#30196)
      
      * fix warning and no grad
      
      * clean redundant API alias in 2.0 - part 2 (#30013)
      
      * delete paddle.nn.functional.assign
      
      * fix dynamic to static error
      
      * just add the op error message for the matmul xpu (#30246)
      
       add the op error message for the matmul xpu
      
      * Add Static Variable Clone (#30208)
      
      Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat
      
      * use wget to replace curl to download the lcov file (#30229)
      
      * use wget to replace curl to download the lcov file
      
      * add cache for lcov
      
      * fix test_pool3d_op timeout issue (#30248)
      
      * Fix unittests bugs. (#30250)
      
      * modify error message based on comments (#30189)
      
      * modify error message based on comments
      
      * edit code according to review.
      
      * Correct spelling according to review.
      
      * Fix bug for 'save mutiple method' (#30218)
      
      * Fix bug for 'save mutiple method'
      
      * To pass coverage.
      
      * edit code to pass coverage.
      
      * edit code to pass coverage.
      
      * add unittest for coverage.
      
      * change for coverage.
      
      * edit for coverage.
      
      * added support for inference using qunatization aware trained dygraph
      
      * Alias from  paddle.fluid.layers.auc to paddle.static.auc (#30206)
      
      * add alias from  fluid.layers.auc to static.auc
      
      * Update __init__.py
      
      * added support for inference using qunatization aware trained dygraph
      correct boost get usage
      
      * corrected boost get usage
      
      * corrected naming issues and enforcing zero check
      
      * correct paddle enforce message
      
      * added more error checkings
      
      * corrected error report message and optimized code
      
      * corrected findvar usage
      
      * corrected paddle_enforce in scope
      
      * correct error messages
      
      * correct error reporting format
      Co-authored-by: NLielinJiang <50691816+LielinJiang@users.noreply.github.com>
      Co-authored-by: NXiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com>
      Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com>
      Co-authored-by: NHuihuang Zheng <zhhsplendid@gmail.com>
      Co-authored-by: NYUNSHEN XIE <1084314248@qq.com>
      Co-authored-by: NBai Yifan <me@ethanbai.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: NWeiXin <weixin10@baidu.com>
      Co-authored-by: NJiaqi Liu <liujiaqi06@baidu.com>
      7bbf3ac5
    • G
      Softmax backward optimize (#30249) · 180877e9
      GaoWei8 提交于
      * softmax backward optimize
      180877e9
    • Z
      fix bug on compiling inference shared lib with crypto;test=develop (#30269) · 10a8f3e5
      Zhang Jun 提交于
      * fix bug on compiling inference shared lib with crypto;test=develop
      
      * fix cmake bug when build inference lib using -DWITH_CRYPTO=OFF
      
      * update cmake
      
      * remove unnecessary enforce message
      10a8f3e5
    • H
      Fix Sleep Error in enforce.h (#30335) · 28e156c2
      Huihuang Zheng 提交于
      usleep function in <unistd.h> only takes argument less than 1,000,000. Current call can exceed this limit, we have to fix it. This PR can fix random CI error.
      28e156c2