1. 07 Mar, 2018 2 commits
    • Y
      Add Writer/Scanner · bcb80756
      Yu Yang committed
      Make vec<Tensor> serializable to RecordIO
    • K
      Integrate float16 into data_type_transform (#8619) · 266ccaa8
      kexinzhao committed
      * test cpu float16 data transform
      
      * add isnan etc
      
      * small fix
      
      * fix containsNAN test error
      
      * add data_type transform GPU test
      
      * add float16 GPU example
      
      * fix error
      
      * fix GPU test error
      
      * add context wait
  2. 01 Mar, 2018 2 commits
    • C
      Add tuple type (#8519) · 12a3cea0
      chengduo committed
      * add the type of tuple
      
      * add lod_tensor to tuple
    • A
      Add Go_op, Channel_create, channel_close, channel_send and channel_receive ops (#8593) · 0d878e4c
      Abhinav Arora committed
      * Adding Python boilerplate code for Go op
      
      * Add very basic test case
      
      * Adding the python logic for go routine
      
      * Fix syntax
      
      * Changing test to notest
      
      * Rename Routine to Go
      
      * Combining GoGuard and Go in one class
      
      * Modify test
      
      * Adding fluid close channel
      
      * Fixing __init__.py for calling fluid.go()
      
      * Adding stubs for channel methods and updating test case
      
      * Removing import *
      
      * Adding imports from concurrency
      
      * Initial commit of GO_OP (for varun)
      
      * Creating local scopes and go through them
      
      * Updated go op inputs persistability enforcement
      
      * Add thread execution; compile failing though
      
      * Fix go op
      
      * Cleaned up Go op
      
      * Fix yapf format issue
      
      * Readd warp ctc dir for unit tests
      
      * Updated make_channel, channel_send, channel_recv and channel_close
      
      * Moved thread function to another method, update unit tests
      
      * remove output var
      
      * Add stubs for channel operators
      
      * Updating concurrency with signatures
      
      * Updated the signature with return status
      
      * Fixed dtype in variables
      
      * Updating stub of ChannelSend + add infershape
      
      * Updating stub of ChannelRecv + add infershape
      
      * Updated signature
      
      * Adding the channel_create operator
      
      * Merge channel send+receive ops
      
      * Update concurrency tests using all operators
      
      * Updating the create op with ChannelHolder
      
      * Fix issues with channel_create_op
      
      * Add the implementation for channel_close op
      
      * Add channel close operator, fix channel close op
      
      * Adding the channel_send op
      
      * Comment channels C++ and Python code
      
      * Concurrency python api comment fix
      
      * Update unit test to add Status variable
      
      * Adding channel receive operator
      
      * Update concurrency test to demonstrate a complete CSP flow
      
      * Fix clang-format issues
      
      * Fixed "Out" parameter name
      
      * Fixing merge conflict in framework.py
      
      * Add channel ops to framework.py no_kernel_op_set
      
      * Separating channel_send and channel_recv operators
      
      * Documenting capacity type
      
      * Update concurrency test to create go block as child block of main program
      
      * Changing set status implementation
  3. 24 Feb, 2018 1 commit
  4. 10 Feb, 2018 1 commit
  5. 06 Feb, 2018 1 commit
  6. 05 Feb, 2018 1 commit
  7. 31 Jan, 2018 2 commits
    • F
      draft of Reader classes · f32ca636
      fengjiayi committed
    • D
      Fix/lod (#7714) · ae7d1c1f
      dzhwinter committed
      * "Need to re-design LoD "
      
      * "add lod design"
      
      * "fix lod gpu ptr pointer"
      
      * "removed commented code"
      
      * "fix CI"
      
      * "remove set lod in pybind"
      
      * "fix style check"
      
      * "fix CI"
      
      * "fix long type template error"
      
      * "pybind reorder to use Place"
      
      * "fix ci"
      
      * "fix ci"
      
      * fix ci
      
      * "separate as a new file"
      
      * "fix CI"
      
      * "fix ci"
      
      * small fix
      
      * "add test"
      
      * "fix adam op"
      
      * "fix lstmp op"
      
      * "fix adam op"
      
      * "follow comments"
      
      * "fix ci"
  8. 30 Jan, 2018 2 commits
    • L
      make inference_lib_dist · 9b5d41b6
      Luo Tao committed
    • Y
      Correct deps of threadpool (#7955) · 97014750
      Yi Wang committed
      * refine channel test
      
      * follow comments
      
      * Add dependency enforce to threadpool
      
      * Revert changes to channel_test.cc
      
      * Revert changes to channel_test.cc
      
      * Add #include "paddle/framework/macros.h"
  9. 29 Jan, 2018 1 commit
  10. 26 Jan, 2018 1 commit
    • K
      New Run() method for framework::Executor (#7807) · 788f5c6d
      kexinzhao committed
      * initial commit
      
      * add new executor run function
      
      * fix bug
      
      * fix multiple definition of feed_fetch_method issue
      
      * fix cmake
      
      * fix tensor copy error
      
      * refine executor code
      
      * add comments
      
      * temporary modification
      
      * address comments
      
      * fix bug
  11. 22 Jan, 2018 1 commit
  12. 21 Jan, 2018 1 commit
    • Q
      Data type transform (#7653) · 85671b8a
      Qiao Longfei committed
      * init complete data layout transform
      
      * can compile
      
      * test passed
      
      * optimize code
      
      * fix while_grad_op first step loss lod problem
      
      * optimize in out ptr for transform
      
      * add check
      
      * update copyright
      
      * clean code
      
      * add NeedTransformLayout
      
      * add comment
      
      * change the interface of data_type_transform
      
      * init data_type_transform_test
      
      * complete data_type_transform_test
      
      * add TransDataType to data_transform
  13. 20 Jan, 2018 1 commit
  14. 19 Jan, 2018 1 commit
  15. 17 Jan, 2018 1 commit
  16. 16 Jan, 2018 2 commits
  17. 12 Jan, 2018 1 commit
  18. 10 Jan, 2018 1 commit
  19. 08 Jan, 2018 1 commit
    • Q
      cpu gpu transform function (#7191) · 0f353ab4
      Qiao Longfei committed
      * add rename guard
      
      * add device_data_transform
      
      * add device_data_transform_test
      
      * modify GetExpectedKernelType
      
      * update operator.run
      
      * support test test_label_semantic_roles
      
      * optimize code
      
      * optimize code
      
      * rename GetActualKernelType to GetExpectedKernelType
      
      * fix chunk_eval_op and device_data_transform_test
      
      * add is_same_place to place
      
      * optimize code, refine rename_guard
      
      * refine rename guard, add GetKernelTypeForVar
      
      * optimize code
      
      * add some log
      
      * rename guard
      
      * use sub scope to create var
      
      * fix compile
      
      * add IsInitialized for Tensor
      
      * add VarIsTensor
      
      * fix op_registry_test
      
      * test
      
      * tmp disable priority
      
      * restore switch_kernel.md
      
      * code clean
  20. 05 Jan, 2018 2 commits
    • Y
      Add COWPtr and its unittest · 0cfb5465
      Yang Yu committed
      It will be used for LoD information in LoDTensor, since LoD is a
      copy-on-write field.
      
      Copying LoD information between operators is quite slow. For ResNet
      it costs roughly 10% of the total run time, including reading
      data.
    • D
      Feature/use cudnn (#7141) · 5593858d
      dzhwinter committed
      * "add c++ side kernel selection"
      
      * "add multiple kernel op test"
      
      * "kernel selection only support cudnn"
      
      * "better formatter"
      
      * "small fix with UseCPU"
      
      * "depends on change interface Get(Place, Library)"
      
      * "fix CI"
      
      * "fix python cudnn test"
      
      * "leave the register cudnn op to another PR"
      
      * "fix CI"
      
      * "use all kernel by default"
      
      * "fix CI"
  21. 04 Jan, 2018 1 commit
  22. 03 Jan, 2018 1 commit
  23. 02 Jan, 2018 1 commit
    • D
      Feature/transform (#7111) · 899a79cc
      dzhwinter committed
      * "fix data transform"
      
      * "data transformer"
      
      * "add device pool"
      
      * "add test"
      
      * "fix ci"
      
      * "fix datalayout implementation "
      
      * "fix based on comment"
  24. 28 Dec, 2017 4 commits
  25. 27 Dec, 2017 2 commits
  26. 26 Dec, 2017 2 commits
    • Q
      Add data transform fn (#6953) · f97f69fe
      Qiao Longfei committed
      * init data_transform
      
      * complete DataTransform
      
      * fix build error
      
      * add data_transform_test
      
      * add a register test for data_transform_fn
      
      * use function to simulate registration macro
      
      * add register macro
      
      * update test
      
      * clean code
      
      * restore unrelated code
      
      * update data transform test
      
      * generate unique name for REGISTER_DATA_TRANSFORM_FN
      
      * add const
      
      * follow comment
      
      * update KernelTypePair hash function
    • D
      "fix threadpool style" (#7017) · 80dafdf5
      dzhwinter committed
      * "fix threadpool style"
      
      * "remove header"
  27. 25 Dec, 2017 2 commits
    • Y
      Implement a simple threadpool (#6684) · 127bc2e0
      Yancey committed
      * implement a simple threadpool
      
      * unlock before cv.notify
      
      * add done function
      
      * add lock with GetAvailable function
      
      * delete done_
      
      * using call_once in GetInstance
      
      * update by comment
      
      * update comment
      
      * enhance unit test for multi threads task
    • Q
      add op_kernel_type_test · 313afc9c
      qiaolongfei committed
  28. 24 Dec, 2017 1 commit
    • D
      Feature/operator run place (#6783) · 735eba29
      dzhwinter committed
      * "change operator interface"
      
      * "move devicepool to device_context"
      
      * "fix operator test"
      
      * "fix op_registry Run interface"
      
      * "net op passed. Need to fix nccl multi-Context"
      
      * "add nccl group function"
      
      * "add nccl group function"
      
      * "fix gpu count exceed 32 error"
      
      * "fix recurrent op, nccl op"
      
      * "change the other operators interface with Place"
      
      * "fix typo"
      
      * "fix pybind"
      
      * "fix device in python side"
      
      * "fix pybind failed"
      
      * "add init for test"
      
      * "fix CI"