- 01 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* refine cmake of framework * add deps for dense tensor * fix deps * remove alloc(ctx) * add depends on mkldnn
-
- 17 8月, 2022 2 次提交
- 03 8月, 2022 1 次提交
-
-
由 fwenguang 提交于
-
- 01 8月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
由 danleifeng 提交于
Co-authored-by: seemingwang <zsasuke@qq.com> Co-authored-by: NDesmonDay <908660116@qq.com> Co-authored-by: Nseemingwang <seemingwang@users.noreply.github.com> Co-authored-by: NThunderbrook <a754913769@163.com> Co-authored-by: Nxuewujiao <105861147+xuewujiao@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-sys-hic-k8s-v100-box-a225-0693.yq01.baidu.com> Co-authored-by: NThunderbrook <52529258+Thunderbrook@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-inf-hic-k8s-a100-ab2-0009.yq01.baidu.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyaoxuefeng <yaoxuefeng@baidu.com> Co-authored-by: Nlxsbupt <luoxsbupt@163.com> Co-authored-by: Nmiaoli06 <106585574+miaoli06@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-inf-hic-k8s-a100-ab2-0008.yq01.baidu.com> Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: Nqingshui <qshuihu@gmail.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
- 29 7月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile
-
由 Leo Chen 提交于
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
- 20 7月, 2022 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 19 7月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* compile into one static library * fix xpu compile * fix xpu compile * fix inference compile * fix inference compile * add custom test * revert one file
-
- 14 7月, 2022 2 次提交
- 11 7月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 06 7月, 2022 1 次提交
-
-
由 houj04 提交于
-
- 05 7月, 2022 2 次提交
-
-
由 ronnywang 提交于
* Dataloader add custom device support * update test=document_fix
-
由 Ruibiao Chen 提交于
-
- 04 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 24 6月, 2022 2 次提交
-
-
由 王明冬 提交于
-
由 chenjian 提交于
* record memory and op supplement info * update * update * fix a bug * fix memory recording * fix a bug * update * update * fix a bug * update * fix a bug * fix a bug * fix a bug * Revert "fix a bug" This reverts commit c1d4df52762ba9ae7c7e27cd2ba4fc3a7ed9c7a5. * fix a bug * fix format * fix
-
- 14 6月, 2022 1 次提交
-
-
由 Wilber 提交于
* cmake-lint * update
-
- 10 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Refactor DeviceContextPool * Adjust header file order
-
- 07 6月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 02 6月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* support CUDAGraph for partial graph * add ut * fix ci * fix ut again because of eager mode * fix kunlun ci * fix win ci
-
- 01 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Add pinned memory to HostMemoryStats * Add macro for WrapStatAllocator * Fix CI errors
-
- 27 5月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support memory stats for CPU * Add UTs * Fix typos * Fix typos
-
- 21 5月, 2022 1 次提交
-
-
由 pangyoki 提交于
* delete PADDLE_WITH_TESTING in memory_block_desc * test FLAGS_allocator_strategy=naive_best_fit * delete flag naive_best_fit
-
- 19 5月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* refine enforce code * refine enforce code * fix compile failed * fix infrt failed
-
- 11 5月, 2022 1 次提交
-
-
由 Allen Guo 提交于
* update to popart v2.5.0 * use a specific version of sdk2.5.0
-
- 05 5月, 2022 2 次提交
-
-
由 Ruibiao Chen 提交于
* Add peak memory log for CI * Change VLOG to std::cout * Move print code to test_runner.py and paddle_gtest_main.cc * Fix typo * Fix conflicts * Updata message format * Fix CI errors * Add FLAGS_enable_gpu_memory_usage_log * Fix CI errors
-
由 Ruibiao Chen 提交于
-
- 27 4月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
* [DO NOT MERGE] test op_test * update with more related modifications * split op_test.py to use test=allcases for testing * split op_test.py to use test=allcases for testing
-
- 25 4月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 07 4月, 2022 1 次提交
-
-
由 liutiexing 提交于
* Profile Executors * update * fix ut * fix names * update * update
-
- 05 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* enable new executor by default * enable stream safe allocator * test=document_fix;test=coverage * do not use scope in op kernel * fit empty program for new executor * fix communication depend * fix test_sync_batch_norm * skip unsupported place * refine datatransfer * fit for dirtributed program * fix dependencpy * fix some ut
-
- 01 4月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* support pinned, test=develop * support async_write, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
由 From00 提交于
* Fix compilation error for gcc-54 * Remove const for gpuStream_t
-
- 30 3月, 2022 1 次提交
-
-
由 From00 提交于
Add new APIs for GPU memory monitoring (max_memory_allocated, max_memory_reserved, memory_allocated, memory_reserved) (#38657) * Add new API memory_reserved * Add memory_allocated, max_memory_reserved and max_memory_allocater * Fix CI error * Fix CI error * Enhance UT * Add FLAGS_memory_stats_opt * Add STATS macro functions * Add StatAllocator * Fix CI errors * Add UT * Fix CI errors
-