- 20 6月, 2023 1 次提交
-
-
由 zhangbo9674 提交于
* new_ir to shared * refine code * add ir lib path to env * refine type * refine code * fix bug * fix bug * refine code * refine code * close win * refine code * refine code * refine code * add win share * refine code * refie code * refine code * refine code * refien code * fix bug * fix bug * fix bug * solve conflict * solve conflict * fix bug * refine code * fix bug * fix bug * fix bug * fix bug * fix bug * refine code * fix interpretercore program bug * delete unuse code * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug * fix cinn bug * fix cinn bug * debug * fix cinn bug * delete unused code * fix cinn bug * fix cinn bug * fix ug * test win openblas * test win openblas * fix win openblas bug * polish code * fix win open blas bug * close win dll * fix flag bug * test for windows * fix compile bug
-
- 15 6月, 2023 1 次提交
-
-
由 hong 提交于
* add kernel dialect * change DenseTensorTypeStorage to DenseTensorType * add test case` * add first pd_op to kernel dialect * lower pd op to kernel dialect * update * update * remove useless code * add attrite print test * fix bug * update * update * update * update * polish code * fix bug * polish code and add python test * add test * fix test error * add env flag * fix bug * revert test env * change cc_test_old to cc_test * fix build_static bug * fix type test error * udpate cmake * disable test in windows * fix inference compile
-
- 09 6月, 2023 1 次提交
-
-
由 lzydev 提交于
* auto generate code for elementwise_max * auto generate code for elementwise_max * fix composite ops * fix bug of fmax
-
- 03 6月, 2023 1 次提交
-
-
由 RedContritio 提交于
* remove reduce_sum_op.h * support auto generate for static op reduce_sum * remove reduce_sum_op in CMakeLists.txt
-
- 30 5月, 2023 1 次提交
-
-
由 RedContritio 提交于
* support auto generate for activation_op relu6 * add generated_static_op for activation_op in CMakeLists.txt
-
- 26 5月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* create phi so * fix ci bugs * fix py3 bugs * add file * fix py3 bugs * fix windows bugs * perfect so * fix py3 bugs * delete all static target in phi * fix windows bugs * fix py3 bugs * fix ci bugs * fix windows bugs * fix bugs: gflags can't be linked by dynamic and static lib * fix bugs that can not load 3rd party * fix ci bugs * fix compile bugs * fix py3 bugs * fix conflict * fix xpu bugs * fix mac compile bugs * fix psgpu bugs * fix inference failed * deal with conflict * fix LIBRARY_PATH bug * fix windows bugs * fix onednn error * fix windows compile bugs * fix windows compile bugs * fix test_cuda_graph_static_mode_error aborted * fix windows bugs * fix mac-python3 error * fix hip compile bugs * change mode to static * change to static mode * fix ci bugs * fix py3 bugs * fix windows bugs * fix bugs * add static flag * add PADDLE_API * change position of PADDLE_API * fix windows bugs * change mode to dynamic lib * fix windows static bugs * deal with conflict * fix windows unit bug * fix coverage * deal with conflict * fix windows-inference * fix py3 bugs * fix bugs when compile type_info * fix compile bugs * fix py3 bugs * fix windows bugs * fix windows openblas * fix xpu bugs * fix enforce_test in windows * update code according comment * fix windows cmake bug * fix windows bugs * fix windows bugs * delete cinn unittest * fix cinn bugs --------- Co-authored-by: lzydev <1528794076@qq.com>
-
- 22 3月, 2023 1 次提交
-
-
由 HappyHeavyRain 提交于
* test_get_kernel * add invoke signature * change reduce_max * change frobenius_norm * reset reduce_max according to composite and change reduce_all * fix the bug when Scalar(*) * fix 'scalar when support_tensor' * change code according to review * change 'keep_signature' to 'manual_signature' and add some erro info * support optimizers autogen * change sgd yaml * change generate signature * fix test/cpp/new_executor/CM * reset signature generated function * change signature funciton * change signature funciton
-
- 20 3月, 2023 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 15 2月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* move profiler * add file * fix mac compile bugs * fix ci bugs * fix mac bugs * fix ci bugs * fix compile bugs * perfect code according comment
-
- 25 11月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Move stream_anayzer to interpreter * Refactor StreamAnalyzer * Refactor RunNextInstructionList * Remove no_data_transform_index * Fix typos * Fix data_transfer OpFuncType error * Add event for depend_op * Update transfer OpFuncType for heter place
-
- 27 10月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* make all cpp tests dynamic linked to libpaddle.so * add comments * keep old cc_test for some tests * fix some ut * make some ut use cc_test_old * fix typos and fit for win32 * fix lib path * fix some tests * skip lite test * fit for rocm * fit for cinn * fit for mac * fit for win32 * skip inference ut * skip windows * fix coverage
-
- 19 10月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support stream overlap for c_allreduce_sum * Test CI * Add notes * Add SingleStreamGuard for BuildOpFuncList
-
- 23 9月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Add ExecutionConfig and fix last-live-op bug for standalone executor * Improve code design
-
- 06 9月, 2022 1 次提交
-
-
由 danleifeng 提交于
-
- 30 8月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 04 8月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 02 8月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Refactor build_op_downstream_map for standalone executor * Add some comments
-
- 20 7月, 2022 2 次提交
-
-
由 Ruibiao Chen 提交于
* Add dependency for read op in standalone executor * Fix CI errors * Add UT * add_dependency -> dependency_utils * Fix CI errors
-
由 tianshuo78520a 提交于
-
- 14 7月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* Compilation optimization
-
- 23 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Multiplex Workqueue for InterpreterCore * Delete ResetWorkQueueOptions * Update code format
-
- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 14 4月, 2022 1 次提交
-
-
由 liutiexing 提交于
* executor perf statistics * fix ut * fix ut * fix ut * add ut * add ut
-
- 24 3月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 22 3月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* async prepare deps * fix bug that std::future is not set * add ut * refine code * fix standalone ut * disable prof
-
- 17 2月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* relocate code of interpretercore gc
-
- 26 1月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* update cmake file to remove fluid kernel * add pten declaration.h to where pybind.h used * fix sync_bn and tensorrt_engine * refine detection_library * fix interpreter_core * support eager legacy * fit eager legacy for pten * fall back to cpu if not found kernel * fix compile problem * fix compile problem * refine fallback logic * fit operator.run() * fix xpu compile * fit for new_exec * add REGISTER_OP_WITHOUT_GRADIENT * un-cache pt_kernel_context * fix compile * fix cudnn * fix compiling with on_infer * fix mkldnn * fix isfinite_v2 * fix xpu problem * fix op_device * refine fallback for xpu * fix xpu compile * merge develop * refine code format * fix compile * fix compile * add data_transfer * fix PreparePtenData * fix cpu context * merge develop * fix compile * fix error device context * fix xpu * fix dev_ctx
-
- 28 12月, 2021 1 次提交
-
-
由 From00 提交于
* fix reshape move storage error * remove needless set type * alloc tensor by shared storage * Utilize StreamSafeCUDAAllocator to support fast GC in new executor * Fix compile error for Windows and ROCm * Fix compile error for Windows * Modify UT stream_safe_cuda_alloc_test * Modify UT stream_safe_cuda_alloc_test * Rewrite fast GC * Rewrite fast GC * Fix compile error for BOOST_GET_CONST * Fix compile error for BOOST_GET_CONST * Changes default stream for StreamSafeCUDAAllocator * Fix a small CI error * Remove some redundant code * Fix conflict * Fix compile error for ROCm * Fix Windoes CI error * Fix CI error * Remove some unnecessary code * Fix CI error * Add UT for fast GC * Fix CI error * add device-agnostic stream class * add stream.h * fix ut * fix cpu compile * Use RWLock in GetAllocator * Fix CI error Co-authored-by: NChen Weihang <chenweihang@baidu.com> Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
-
- 23 12月, 2021 1 次提交
-
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update EventsWater * fix * split workqueue files * add more tests * fix * bugfix * bugfix * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-
- 16 12月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* fix cmake * not check execution time
-
- 15 12月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* refine test * add download_program target * update ut code * refine code * disable profiler * add comments * refine cmake * skip coverage ci
-
- 23 11月, 2021 1 次提交
-
-
由 Aurelius84 提交于
* Add transfer_layout/dtype op * clean useless codes * fix unused var * add optest in white.txt * split into data_transfer.cc * fix cmake * modify according reviewer comment * replace cast_op with transfer_dtype_op
-
- 12 11月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* split declaration and implementation * remove initdevices * refine VariableMetaInfo * add ut * fix compile
-
- 29 10月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* enable check_nan_inf and fix variable scope * add ut * fix bug * update ut * revert doc change * fix npu compile
-
- 23 9月, 2021 1 次提交
-
-
由 liutiexing 提交于
* add align for WorkQueue * WorkQueue update * Revert "WorkQueue update" This reverts commit 14ce793dbb204f8ddec63c34b3b72a73c7cdb93a. * optimize WorkQueue
-
- 17 9月, 2021 1 次提交
-
-
由 Aurelius84 提交于
* format code * format interface * polish interface * Remove std::memory_order * modify into SpinLock * remove fetch_context_pool_ * fix comment * modify into WorkQueueGroup * refine code * fix pointer * fix paddle_enforce * split into AsyncWorkQueue * polish code * specify std::memory_relax * fix atomic fetch_sub * fix num_thread
-
- 14 9月, 2021 1 次提交
-
-
由 Aurelius84 提交于
-
- 11 9月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* refactor gc, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * gc each tensor, test=develop * refine, test=develop
-
- 08 9月, 2021 2 次提交
-
-
由 wanghuancoder 提交于
* refactor new executor, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
由 liutiexing 提交于
* Split Tracker and WorkQueue * add WorkQueueGroup * add unittest * fix * update * update * fix compile
-