- 07 Dec, 2021 (12 commits)
-
-
Committed by Yulong Ao
* [Auto Parallel] Add the unified cluster representation
* [Auto Parallel] Add the graph class for physical mapping
* [Auto Parallel] Add the simple physical mapper
* Set the timeout of the mapper
* Merge the upstream develop unittests cmake files
* Fix a bug of the process group
* Remove mapper unittest from platforms which are not GPU
* Move the instantiation of process group after resharding
* Add the local id for devices
* Update the rank mapping format
* [Auto Parallel] Relaunch with the rank mapping file
* Remove the unnecessary json file
* Avoid entering get_device_proc_info for auto mapping
* Correct the mapper unit test
* Add some comments
* Remove the related files about mapping
* Update the unittest for auto mapping
* Remove unused rank_mapping unittest
* Improve the unittest coverage
* Improve the unittest coverage
* Improve the unittest of relaunch
* Fix the unittest problem in CI
* Improve the unittest of relaunch
* Remove unnecessary statements
* Update the unittest cmakefile
* Correct the cmakefile of auto parallel unittests
* Modify code based on the new elastic change
* Use the GPUs exclusively in the unittest
* Correct the cmakefile
* Set the timeout of the unittest
-
Committed by YuanRisheng
* add inplace op adaptation
* optimize inplace logic and fix bugs when running kernels that have args of vector<DenseTensor>
* move func in kernel_context.h into kernel_context.cc
* refactor logic that transforms variable to densetensor
* fix bugs when compiling
* update func name
* fix bugs when running windows-ci
-
Committed by zmxdream
* fix heter service. test=develop
* fix heter section worker in debug mode
-
Committed by wenbin
don't exit if requested_size < size
-
Committed by zyfncg
-
Committed by tianshuo78520a
-
Committed by Zuza
* quantize slice op
* correct test
* fix code formatting
-
Committed by jianghaicheng
-
Committed by Zhanlue Yang
-
Committed by Aurelius84
-
Committed by Leo Chen
-
Committed by Yuang Liu
-
- 06 Dec, 2021 (16 commits)
-
-
Committed by Baibaifan
-
Committed by Huihuang Zheng
1. Modify git tag for CINN
2. Support compile option "-DWITH_CINN=ON, -DWITH_TESTING=OFF"
-
Committed by heliqi
* add test_unsqueeze2_eltwise_fuse_pass
* fix name conflict
* rebuild CI
-
Committed by Chen Weihang
-
Committed by chentianyu03
-
Committed by zmxdream
* update
* update. test=develop
* fix. test=develop
* fix ut. test=develop
* fix ut. test=develop
* fix ut. test=develop
* update. test=develop
* fix ut. test=develop
* fix ut. test=develop
* fix sample code. test=develop
* fix ut. test=develop
* fix ut. test=develop
* fix ut. test=develop
* fix ut. test=develop
* fix paddle.rot90 doc. test=develop
* update ut. test=develop
* fix. test=develop
* fix. test=develop
* fix. test=develop
* fix doc. test=develop
-
Committed by Yuang Liu
-
Committed by Sing_chan
* test inference_api_test when run in windows-inference ci
* test whether the code runs correctly when a test fails
* put the failed test back
-
Committed by ronnywang
-
Committed by Aurelius84
-
Committed by danleifeng
-
Committed by Leo Chen
* enable sequential_run for standalone_executor
* add ut
* fix ut
-
Committed by wenbin
* adaptive_pool2d
* timeout
* merge develop
* rename ut
* new test
* remove old file
-
Committed by wenbin
* fix
* Update stack_op_plugin.cu comments
-
Committed by Chen Weihang
* fix reshape move storage error
* remove needless set type
* alloc tensor by shared storage
-
Committed by kuizhiqing
-
- 04 Dec, 2021 (1 commit)
-
-
Committed by zmxdream
This PR fixes the distributed service
-
- 03 Dec, 2021 (11 commits)
-
-
Committed by Yuang Liu
-
Committed by Weilong Wu
-
Committed by jianghaicheng
-
Committed by ronnywang
* refine structure for cuda and rocm
* update
* update
* update
* update
-
Committed by Leo Chen
* use sync h2d copy
* use stream safe allocator in memcpy_h2d
* remove wait
* add guard
-
Committed by tianshuo78520a
* fix
* test=document_fix
-
Committed by WangXi
-
Committed by Wilber
* update check_op_desc to support op_version and op_compat.
-
Committed by zyfncg
-
Committed by Zhanlue Yang
* Rearranged Eager AutoCodeGen directory structure
* Removed USE_OP in Eager AutoCodeGen
-
Committed by Sing_chan
* add trt_mobilenet demo test
* only deliver trt_root in run.sh
-