1. 29 3月, 2018 1 次提交
  2. 15 3月, 2018 1 次提交
    • T
      Implement Select OP (#9088) · 1e4c504e
      Thuan Nguyen 提交于
      * Fix old documentation for channel_recv
      
      * Initial design of CSP select
      
      * Redesign channel implementation for Select Op
      
      * Remove unecessary header
      
      * Initial checkin of select op, currently will read all the conditional_op in the cases block and also pull out all channels involved in the select.
      
      * Init python select op API
      
      * Python select bug fix when checking op creates block
      
      * Add case_to_execute as (a) input to select, (b) into the passed inputs into the select op
      
      * Add in addition code for select op
      
      * Init fibonacci test from python
      
      * implement fibonnaci sequence test
      
      * update fib unit test
      
      * Improve select test cases
      
      * Shorten non-pep-8-ed lines
      
      * Add methods on channel needed by select op
      
      * Fix compile issues, finish implementation, still need to debug code
      
      * Fix issue with fibonncci test, it works now!
      
      * Change QueueMessage callback to take in an ChannelAction enum, fix select unit test
      
      * Fix case attributes
      
      * Fix issue with select control flow
      
      * Make cases - previously on each selectcase conditional_block - attributes to select
      
      * Use class constants for type of channel
      
      * Change select op to take in "cases" attribute
      
      * return boolean from select callback function to tell Channel if this RECV or SEND should be executed
      
      * Improve attributes and inputs comments on select op
      
      * Fix issues with python unit test
      
      * Assert fibonacci final output
      
      * Fix issue when channel name / channel var is null for "default" case in select op
      
      * Assert base select test output
      
      * Make QueueMessage use shared pointer and modify the order of the callback
      
      * Fixing the order in which the callback is called
      
      * Move channel utility methods to paddle/fluid/operators/concurrency/channel_util
      
      * Create channel_util and move channel util methods
      
      * Fix crash when calling select_op
      
      * Fix deadlock
      
      * Fix issue of channel destructor deadlock
      
      * Fix precommit issues
      
      * Accidentally checked in changes to beam_search_op, reverting change.
      
      * Fix dependency issue in concurrency cmake
      
      * add device_context dependency for concurrency target
      1e4c504e
  3. 07 3月, 2018 2 次提交
    • Y
      Add Writer/Scanner · bcb80756
      Yu Yang 提交于
      Make vec<Tensor> can be serialized to RecordIO
      bcb80756
    • K
      Integrate float16 into data_type_transform (#8619) · 266ccaa8
      kexinzhao 提交于
      * test cpu float16 data transform
      
      * add isnan etc
      
      * small fix
      
      * fix containsNAN test error
      
      * add data_type transform GPU test
      
      * add float16 GPU example
      
      * fix error
      
      * fix GPU test error
      
      * add context wait
      266ccaa8
  4. 01 3月, 2018 2 次提交
    • C
      Add tuple type (#8519) · 12a3cea0
      chengduo 提交于
      * add the type of tuple
      
      * add lod_tensor to tuple
      12a3cea0
    • A
      Add Go_op, Channel_create, channel_close, channel_send and channel_receive ops (#8593) · 0d878e4c
      Abhinav Arora 提交于
      * Adding Python boilerplate code for Go op
      
      * Add very basic test case
      
      * Adding the python logic for go routine
      
      * Fix syntax
      
      * Changing test to notest
      
      * Rename Routine to Go
      
      * Combining GoGuard and Go in one class
      
      * Modify test
      
      * Adding fluid close channel
      
      * Fixing __init__.py for calling fluid.go()
      
      * Adding stubs for channel methods and updating test case
      
      * Removing import *
      
      * Adding imports from concurrency
      
      * Initial commit of GO_OP (for varun)
      
      * Creating local scopes and go through them
      
      * Updated go op inputs persistability enforcement
      
      * Add thread execution; compile failing though
      
      * Fix go op
      
      * Cleaned up Go op
      
      * Fix yapf format issue
      
      * Readd warp ctc dir for unit tests
      
      * Updated make_channel, channel_send, channel_recv and channel_close
      
      * Moved thread function to another method, update unit tests
      
      * remove output var
      
      * Add stubs for channel operators
      
      * Updating concurrency with signatures
      
      * Updated the signature with return status
      
      * Fixed dtype in variables
      
      * Updating stub of ChannelSend + add infershape
      
      * Updating stub of ChannelRecv + add infershape
      
      * Updated signature
      
      * Adding the channel_create operator
      
      * Merge channel send+receive ops
      
      * Update concurrency tests using all operators
      
      * Updating the create op with ChannelHolder
      
      * Fix issues with channel_create_op
      
      * Add the implementation for channel_close op
      
      * Add channel close operator, fix channel close op
      
      * Adding the channel_send op
      
      * Comment channels C++ and Python code
      
      * Concurrency python api comment fix
      
      * Update unit test to add Status variable
      
      * Adding channel receive operator
      
      * Update concurrency test to demonstrate a complete CSP flow
      
      * Fix clang-format issues
      
      * Fixed "Out" parameter name
      
      * Fixing merge conflict in framework.py
      
      * Add channel ops to framework.py no_kernel_op_set
      
      * Seperating channel_send and channel_recv operators
      
      * Documenting capacity type
      
      * Update concurrency test to create go block as child block of main program
      
      * Changing set status implementation
      0d878e4c
  5. 24 2月, 2018 1 次提交
  6. 10 2月, 2018 1 次提交
  7. 06 2月, 2018 1 次提交
  8. 05 2月, 2018 1 次提交
  9. 31 1月, 2018 2 次提交
    • F
      draft of Reader classes · f32ca636
      fengjiayi 提交于
      f32ca636
    • D
      Fix/lod (#7714) · ae7d1c1f
      dzhwinter 提交于
      * "Need to re-design LoD "
      
      * "add lod design"
      
      * "fix lod gpu ptr pointer"
      
      * "removed commented code"
      
      * "fix CI"
      
      * "remove set lod in pybind"
      
      * "fix style check"
      
      * "fix CI"
      
      * "fix long type template error"
      
      * "pybind reorder to use Place"
      
      * "fix ci"
      
      * "fix ci"
      
      * fix ci
      
      * "sperate as a new file"
      
      * "fix CI"
      
      * "fix ci"
      
      * small fix
      
      * "add test"
      
      * "fix adam op"
      
      * "fix lstmp op"
      
      * "fix adam op"
      
      * "follow comments"
      
      * "fix ci"
      ae7d1c1f
  10. 30 1月, 2018 2 次提交
    • L
      make inference_lib_dist · 9b5d41b6
      Luo Tao 提交于
      9b5d41b6
    • Y
      Correct deps of threadpool (#7955) · 97014750
      Yi Wang 提交于
      * refine channel test
      
      * follow comments
      
      * Add dependency enforce to threadpool
      
      * Revert changes to channel_test.cc
      
      * Revert changes to channel_test.cc
      
      * Add #include "paddle/framework/macros.h"
      97014750
  11. 29 1月, 2018 1 次提交
  12. 26 1月, 2018 1 次提交
    • K
      New Run() method for framework::Executor (#7807) · 788f5c6d
      kexinzhao 提交于
      * initial commit
      
      * add new executor run function
      
      * fix bug
      
      * fix multiple definition of feed_fetch_method issue
      
      * fix cmake
      
      * fix tensor copy error
      
      * refine executor code
      
      * add comments
      
      * temporary modification
      
      * address comments
      
      * fix bug
      788f5c6d
  13. 22 1月, 2018 1 次提交
  14. 21 1月, 2018 1 次提交
    • Q
      Data type transform (#7653) · 85671b8a
      Qiao Longfei 提交于
      * init complete data layout transform
      
      * can compile
      
      * test passed
      
      * optimize code
      
      * fix while_grad_op first step loss lod problem
      
      * optimize in out ptr for transform
      
      * add check
      
      * update copyright
      
      * clean code
      
      * add NeedTransformLayout
      
      * add comment
      
      * change the interface of data_type_transform
      
      * init data_type_transform_test
      
      * complete data_type_transform_test
      
      * add TransDataType to data_transform
      85671b8a
  15. 20 1月, 2018 1 次提交
  16. 19 1月, 2018 1 次提交
  17. 17 1月, 2018 1 次提交
  18. 16 1月, 2018 2 次提交
  19. 12 1月, 2018 1 次提交
  20. 10 1月, 2018 1 次提交
  21. 08 1月, 2018 1 次提交
    • Q
      cpu gpu transform function (#7191) · 0f353ab4
      Qiao Longfei 提交于
      * add rename guard
      
      * add device_data_transform
      
      * add device_data_transform_test
      
      * modify GetExpectedKernelType
      
      * update operator.run
      
      * support test test_label_semantic_roles
      
      * optimize code
      
      * optimize code
      
      * rename GetActualKernelType to GetExpectedKernelType
      
      * fix chunk_eval_op and device_data_transform_test
      
      * add is_same_place to place
      
      * optimize code, refine rename_guard
      
      * refine rename guard, add GetKernelTypeForVar
      
      * optimize code
      
      * add some log
      
      * rename guard
      
      * use sub scope to create var
      
      * fix compile
      
      * add IsInitialized for Tensor
      
      * add VarIsTensor
      
      * fix op_registry_test
      
      * test
      
      * tmp disable priority
      
      * restore switch_kernel.md
      
      * code clean
      0f353ab4
  22. 05 1月, 2018 2 次提交
    • Y
      Add COWPtr and its unittest · 0cfb5465
      Yang Yu 提交于
      It will be used for LoD information in LoDTensor since LoD is a copy
      on write field.
      
      It is pretty slow for copying LoD information between operators. For
      resnet it will cost roughly 10% time of whole time, including reading
      data.
      0cfb5465
    • D
      Feature/use cudnn (#7141) · 5593858d
      dzhwinter 提交于
      * "add c++ side kernel selection"
      
      * "add multiple kernel op test"
      
      * "kernel selection only support cudnn"
      
      * "better formatter"
      
      * "small fix with UseCPU"
      
      * "depends on change interface Get(Place, Library)"
      
      * "fix CI"
      
      * "fix python cudnn test"
      
      * "leave the register cudnn op to another PR"
      
      * "fix CI"
      
      * "use all kernel by default"
      
      * "fix CI"
      5593858d
  23. 04 1月, 2018 1 次提交
  24. 03 1月, 2018 1 次提交
  25. 02 1月, 2018 1 次提交
    • D
      Feature/transform (#7111) · 899a79cc
      dzhwinter 提交于
      * "fix data transform"
      
      * "data transformer"
      
      * "add device pool"
      
      * "add test"
      
      * "fix ci"
      
      * "fix datalayout implementation "
      
      * "fix based on comment"
      899a79cc
  26. 28 12月, 2017 4 次提交
  27. 27 12月, 2017 2 次提交
  28. 26 12月, 2017 2 次提交
    • Q
      Add data transform fn (#6953) · f97f69fe
      Qiao Longfei 提交于
      * init data_transform
      
      * complete DataTransform
      
      * fix build error
      
      * add data_transform_test
      
      * add a register test for data_transform_fn
      
      * use function to simulate registration macro
      
      * add register macro
      
      * update test
      
      * clean code
      
      * restore unrelated code
      
      * update data transform test
      
      * generate unique name for REGISTER_DATA_TRANSFORM_FN
      
      * add const
      
      * follow comment
      
      * update KernelTypePair hash function
      f97f69fe
    • D
      "fix threadpool style" (#7017) · 80dafdf5
      dzhwinter 提交于
      * "fix threadpool style"
      
      * "remove header"
      80dafdf5
  29. 25 12月, 2017 1 次提交
    • Y
      Implement a simple threadpool (#6684) · 127bc2e0
      Yancey 提交于
      * implement a simple threadpool
      
      * unlock before cv.notify
      
      * add done function
      
      * add lock with GetAvailable function
      
      * delete done_
      
      * using call_once in GetInstance
      
      * update by comment
      
      * update comment
      
      * enhance unit test for multi threads task
      127bc2e0