- 07 8月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
* fix cudnn 8.7+ bug on cudnnConvolutionBiasActivationForward * save_optimized_model_pass support tensorrt * update * update * fix compile * update * fix ut timeout
-
- 07 7月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
* fix WITH_SHARED_IR option type * rename WITH_INFERENCE_NVTX to WITH_NVTX and fix compile bug * update
-
- 03 7月, 2023 1 次提交
-
-
由 周周周 提交于
* commit * commit * commit * commit * final commit * use hook to collect shape and shape value
-
- 16 6月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 08 6月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 19 5月, 2023 1 次提交
-
-
由 zhoutianzi666 提交于
* decrease_peak_memory
-
- 16 1月, 2023 1 次提交
-
-
由 Zhang Jun 提交于
* add outvar name for nvtx mark * nly network created with kEXPLICIT_BATCH can setsetMaxBatchSize
-
- 05 12月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 01 12月, 2022 1 次提交
-
-
由 Wilber 提交于
* update memory_optimize pass
-
- 30 11月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 01 11月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 26 10月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove using lodtensor part2 * resolve code format error * resolve conflict * resolve conflict * replace added frameworrk tensor
-
- 19 10月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 25 4月, 2022 1 次提交
-
-
由 wenbin 提交于
-
- 15 3月, 2022 1 次提交
-
-
由 Jacek Czaja 提交于
* - Prototype of third solution - fix - compilation fixes - fix - fixe - fix - fix - compilation fix - comment fix - lint update mkldnn conv_elementwise_add_fuse_pass ut - NHWC changes to prelu - alhpa dims - UT fix - fix to UT - lint - Some fixes - added to BWD of prelu NHWC support - reverted removal of resetting cu_layout in clearing of caching * - Small changes * - compilation fix * - fix * - fix * lint * - fixes after internal review * - compilation fix * - lint
-
- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 30 12月, 2021 1 次提交
-
-
由 wenbin 提交于
* dynamic shape clone supported
-
- 28 4月, 2021 1 次提交
-
-
由 Jacek Czaja 提交于
* - Added clearing oneDNN per executor * - Executor is nt always having FLAGS_use_mkldnn set to true
-
- 04 2月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include second time, test=develop
-
- 28 12月, 2020 1 次提交
-
-
由 石晓伟 提交于
* flush denormals to zero, test=develop * add comments, test=develop
-
- 23 11月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 31 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* polish framework error message part 4 * fix type error * fix message error * polish by review comments
-
- 28 7月, 2020 1 次提交
-
-
由 arlesniak 提交于
* Added DNNL cache management for DyGraph * move FLAGS_use_mkldnn to more general CMakeLists, getu use of the flag in ClearGradients * missing file * Fixes after review * Bringing back original idea of place for 'use_mkldnn' flag to be accessible from platform nad imperative. * Removed duplicate and added docs * Fixes for CI
-
- 03 6月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 15 2月, 2020 1 次提交
-
-
由 flame 提交于
-
- 04 12月, 2019 1 次提交
-
-
由 Pei Yang 提交于
* make DisableGlogInfo able to mute all logs in inference.
-
- 14 11月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134) * add examples for error msg spec, test=develop * change ENFORCE to ENFORCE_**, test=develop * add more already exists examples, test=develop
-
- 21 1月, 2019 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 07 1月, 2019 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 30 11月, 2018 1 次提交
-
-
由 Wang Guibao 提交于
* AsyncExecutor: C++ side * Google naming conventions * Rename MultiExecutor to AsyncExecutor * pybind with async_executor * Naming convention * remove some flags and unused code * add refactored file of async_executor and data_feed * clear async executor interface and add data feed factory * split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto * Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch * refine async_executor_refactor.cc * add some files about datafeed * Revert "add some files about datafeed" This reverts commit 8ee8133a. * Interface rework * add MultiSlotDataFeed * Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side * update data_feed for add MultiSlotDataFeed * update datafeed and async_executor to run bow_net demo * fix bug that finish_set_filelist failed in multithread * delete finish_binding_memory_(flag), because it can not be marked under the current interface * Fix bug * update async_executor.py for support set_use_slots * update async_executor.py for support set_use_slots and set set_dense_slots * fix bug that when the number of files is less than the number of threads, it will fetch nan * remove redundant code, and make executor exit when set a illegal queue size * add batch_size check * add MultiSlotDesc * Revert "add MultiSlotDesc" This reverts commit 2e72ebfa. * add some checkpoint in DataFeedDesc * add CheckFile function in MultiSlotDataFeed * update something error info * fix deaded lock bug * Fix fetch variable * Merge error * fix code style in async_executor * using one lock blocking queue replace two lock blocking queue because of some bugs * update code style * add utest for data_feed * Fix fetch var * update utest for data_feed for multithread * update SetFileList info * fix bug in utest of data_feed * Add comments for python * Add comments for python code * Fix pybind.cc with new pybind11 version * add note for DataFeedDesc's set_use_slots function * Add save_model * update data_feed_test for multi-type * add comment for executor_thread_worker * Remove unused code * update data_feed_test for generate test data file * removed unnecessary interfaces and add comments * c++ style check * update data_feed.cc * AsyncExecutor: C++ side Google naming conventions Rename MultiExecutor to AsyncExecutor pybind with async_executor Naming convention remove some flags and unused code add refactored file of async_executor and data_feed clear async executor interface and add data feed factory split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch refine async_executor_refactor.cc add some files about datafeed Revert "add some files about datafeed" This reverts commit 8ee8133a. add MultiSlotDataFeed Interface rework Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side update datafeed and async_executor to run bow_net demo update async_executor.py for support set_use_slots Fix bug update async_executor.py for support set_use_slots and set set_dense_slots fix bug that when the number of files is less than the number of threads, it will fetch nan remove redundant code, and make executor exit when set a illegal queue size add MultiSlotDesc Revert "add MultiSlotDesc" This reverts commit 2e72ebfa. add some checkpoint in DataFeedDesc Fix fetch variable fix code style in async_executor Fix fetch var add utest for data_feed Add comments for python update utest for data_feed for multithread fix bug in utest of data_feed Add comments for python code Fix pybind.cc with new pybind11 version add note for DataFeedDesc's set_use_slots function update data_feed_test for multi-type Add save_model update data_feed_test for generate test data file removed unnecessary interfaces and add comments add comment for executor_thread_worker Remove unused code update data_feed.cc c++ style check * commit for code style * commit for code style * commit for code style * commit for code style * Comment away __init__ in async_executor.py * clang-format fix test=develop * use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc * commit for update code style * commit for update code style * Add async_executor demo; Remove some methods test=develop * commit for update code style * commit for update code style * commit for update code style * update API.spec * AsyncExecutor test=develop * AsyncExecutor test=develop * AsyncExecutor test=develop * AsyncExecutor test=develop * Fix API.spec test=develop * Fix API.spec test=develop * Fix windows build error test=develop * FIx windows build error test=develop * FIx windows build error test=develop * FIx windows build error test=develop * Fix Windows Build test=develop * Fix Windows Build test=develop * Fix Windows Build test=develop * Fix code style test=develop * Fix code style test=develop * update datafeed * Fix code style test=develop * update data_feed_test for test Tensor test=develop * Fix code style test=develop * Fix windows build failure test=develop * Fix code style and windows build failure test=develop * Fix PYTHON3.5 build failure test=develop * AsyncExecutor API test=develop
-
- 25 11月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 16 11月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
* fix inference on gpu out of mem the transfer logic in operator.cc will keep creating new scopes.
-
- 14 11月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 08 11月, 2018 1 次提交
-
-
由 minqiyang 提交于
Fix code to support cpplint syntax check test=develop
-
- 25 10月, 2018 1 次提交
-
-
由 Wojciech Uss 提交于
test=develop
-
- 17 10月, 2018 1 次提交
-
-
由 nhzlx 提交于
test=develop
-
- 16 10月, 2018 2 次提交