- 28 Dec, 2017 1 commit
-
-
Committed by Qiao Longfei
-
- 27 Dec, 2017 6 commits
-
-
Committed by Yu Yang
* Rename API of DeviceContext Make them as usual names. * Rename API of DeviceContext Make them as usual names. * Fix compile * Fix compile * Fix compile * Fix compile * Fix compile
-
Committed by dzhwinter
* "fix data transform" * "split into next PR"
-
Committed by gongweibao
* fix bugs
-
Committed by QI JUN
* add KernelTypeToString interface * cache memory in local scope * fix typo * refine trans logic
-
Committed by dzhwinter
* "refine kernel registrar" * "refine registrar with multikey" * "fix register" * "refine multikernel register" * "fix CI" * "fix CI" * "fix registry" * "switch GPU to CUDA" * "add register macro test case" * "fix CI"
-
Committed by QI JUN
* add memory switch mechanism in operator kernel switch
-
- 26 Dec, 2017 4 commits
-
-
Committed by Qiao Longfei
* init data_transform * complete DataTransform * fix build error * add data_transform_test * add a register test for data_transform_fn * use function to simulate registration macro * add register macro * update test * clean code * restore unrelated code * update data transform test * generate unique name for REGISTER_DATA_TRANSFORM_FN * add const * follow comment * update KernelTypePair hash function
-
Committed by Yang Yu
Cannot get var names recursively, since they could be the same.
-
Committed by Luo Tao
-
Committed by dzhwinter
* "fix threadpool style" * "remove header"
-
- 25 Dec, 2017 7 commits
-
-
Committed by dzhwinter
* "add data layout" * "need kernel registry support" * "fix data layout" * "reorder include headers" * "change enum to enum class" * "fix CI"
-
Committed by typhoonzero
-
Committed by Qiao Longfei
* init kernel hint * fix typo * rm unused code * add include in op_kernel.h * restore op_kernel since it will be moved to op_kernel_type * change force_cpu to use_cpu * fix compilation
-
Committed by dzhwinter
* "fix CopyFrom parameters" * "fix unused Place argument" * "fixed based on comment"
-
Committed by dzhwinter
-
Committed by Yancey
* implement a simple threadpool * unlock before cv.notify * add done function * add lock with GetAvailable function * delete done_ * using call_once in GetInstance * update by comment * update comment * enhance unit test for multi threads task
-
Committed by qiaolongfei
-
- 24 Dec, 2017 4 commits
-
-
Committed by qiaolongfei
-
Committed by dzhwinter
-
Committed by QI JUN
* refine OpKernelKey * refine codes * fix code style * follow comments
-
Committed by dzhwinter
* "change operator interface" * "move devicepool to device_context" * "fix operator test" * "fix op_registry Run interface" * "net op passed. Need to fix nccl multi-Context" * "add nccl group function" * "add nccl group function" * "fix gpu count exceed 32 error" * "fix recurrent op, nccl op" * "change the other operators interface with Place" * "fix typo" * "fix pybind" * "fix device in python side" * "fix pybind failed" * "add init for test" * "fix CI"
-
- 22 Dec, 2017 5 commits
-
-
Committed by fengjiayi
-
Committed by QI JUN
* add data layout * fix ci
-
Committed by QI JUN
-
Committed by dzhwinter
* "remove GPU Sync Interface" * "fix typo" * "fix type cast error" * "fix related Copy with stream" * "fix failed tests with DevicePool" * "fix stupid removed position error"
-
Committed by xuwei06
For an input argument with a list of variables, drop_empty_grad is not allowed because it makes the correspondence between a variable and its gradient ambiguous. Use REGISTER_OP_EX to register the op or call InputGrad(?,false) in GradOpDescMaker.
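The ambiguity described in this commit message can be illustrated with a small standalone sketch. It does not use the real GradOpDescMaker API: `MakeInputGrads`, the `needs_grad` flags, and the `kEmptyGrad` marker are hypothetical names chosen for illustration, and the `@GRAD` suffix is assumed as the gradient naming convention.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical placeholder name for a gradient that is not needed.
const char kEmptyGrad[] = "@EMPTY@";

// Builds the gradient names for a list-valued input (illustrative helper,
// not a Paddle API). With drop_empty == false every input keeps a slot, so
// grads[i] always belongs to inputs[i]. With drop_empty == true the
// unneeded gradients are removed and positions no longer line up.
inline std::vector<std::string> MakeInputGrads(
    const std::vector<std::string>& inputs,
    const std::vector<bool>& needs_grad, bool drop_empty) {
  std::vector<std::string> grads;
  for (std::size_t i = 0; i < inputs.size(); ++i) {
    if (needs_grad[i]) {
      grads.push_back(inputs[i] + "@GRAD");
    } else if (!drop_empty) {
      grads.push_back(kEmptyGrad);
    }
  }
  return grads;
}
```

With inputs `a`, `b`, `c` where only `b` needs no gradient, the dropped form has two entries, and its second entry belongs to `c` rather than `b`. A consumer that pairs variables and gradients by position can no longer tell which is which, which is presumably why the commit disallows dropping for list inputs.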
-
- 21 Dec, 2017 6 commits
-
-
Committed by typhoonzero
-
Committed by typhoonzero
-
Committed by fengjiayi
-
Committed by fengjiayi
-
Committed by fengjiayi
-
Committed by Yu Yang
* Rename XXDescBind --> XXDesc * Fix Compile
-
- 20 Dec, 2017 3 commits
- 18 Dec, 2017 1 commit
-
-
Committed by dzhwinter
* "add DeviceContextPool" * "add devicecontextpool in pybind" * "add comments in python side " * "fix static link error" * "fix CI error" * "add executor.py" * "fix CI error" * "add with gpu macro" * "remove comment out codes" * "add TODO items" * "update init devices"
-
- 14 Dec, 2017 1 commit
-
-
Committed by fengjiayi
-
- 13 Dec, 2017 1 commit
-
-
Committed by typhoonzero
-
- 12 Dec, 2017 1 commit
-
-
Committed by QI JUN
The main changes are:
  * take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
  * remove `eigen_device` interface in base class `DeviceContext`
  * remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
  * remove unused `platform::EigenDeviceConverter`
  * rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
  * rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
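As a rough illustration of the first change, the sketch below shows a functor templated on a device context type rather than on a place type. It is a minimal standalone sketch, not the project's code: `CPUDeviceContext` here is a simplified stand-in, and `ScaleFunctor` and `ScaleOnCpu` are hypothetical names introduced only for this example.

```cpp
#include <cassert>
#include <vector>

// Simplified stand-in for a device context; a real context would carry
// streams, library handles, and similar per-device resources.
struct CPUDeviceContext {};

// Primary template: the functor is parameterized on the device context
// type (hypothetical functor, for illustration only).
template <typename DeviceContext, typename T>
struct ScaleFunctor;

// CPU specialization. The context object is passed in directly, so the
// functor can reach device resources without converting from a place.
template <typename T>
struct ScaleFunctor<CPUDeviceContext, T> {
  void operator()(const CPUDeviceContext& ctx, T factor,
                  std::vector<T>* data) const {
    (void)ctx;  // unused in this toy CPU version
    for (T& v : *data) v *= factor;
  }
};

// Hypothetical helper showing how a kernel might invoke the functor.
inline std::vector<float> ScaleOnCpu(std::vector<float> data, float factor) {
  CPUDeviceContext ctx;
  ScaleFunctor<CPUDeviceContext, float>()(ctx, factor, &data);
  return data;
}
```

Under this assumed layout, a CUDA variant would be another specialization on a CUDA context type, which fits the GPU-to-CUDA renaming of the registration macros listed above.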
-