- 05 12月, 2022 1 次提交
-
-
由 huangjiyi 提交于
-
- 29 11月, 2022 1 次提交
-
-
由 LiYuRio 提交于
* remove lod_tensor_to_array, array_to_lod_tensor, DynamicRNN * remove less_equal, greater_than, greater_equal, equal, not_equal
-
- 15 11月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* cleanup unused code * unify is_int8 is_bfloat16 * Simplify matmul_v2 FWD kernel * remove RunKernel methods * remove import namespace * remove headers * clean fluid/phi cross imports * remove fluid axpy_handler * delete fluid methods * activations * OneDNNMemDesc * MKLDNNFormatForSize * MatchShapeToLayout * MKLDNNMemoryFormat * MKLDNNFormat * ReorderMKLDNNHandler * to_void_cast * review suggestions * interpolate * remove fluid depedency
-
- 11 11月, 2022 1 次提交
-
-
由 zyfncg 提交于
* generate static graph code for some ops by yaml * remove deleted files * update cmake * update cmake * udpate cmake
-
- 09 11月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* move fluid op generator into fluid * remove parsed op * resolve sig undef error * append python interp find logic * remove dup code
-
- 20 10月, 2022 1 次提交
-
-
由 JingZhuangzhuang 提交于
* Add infer prune function * Update phi.cmake * Update operators.cmake * add fusion op
-
- 18 10月, 2022 1 次提交
-
-
由 zyfncg 提交于
* support generating code of opmaker for backward op invoke forward op * gsupport code-gen of opmaker for sparse op * refind logic of choose phi kernrel * fix complie budg * fix code_gen bug * fix bug * fix kernel signature code-gen * fix complie bug of VarType * fix complie bug of VarType * fix test_sparse_conv_op * fix test_sparse_norm_op
-
- 09 9月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add dy2static support * update
-
- 30 8月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 25 8月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [NPU] add run_program_op_npu * add run_program_op_npu ut
-
- 19 8月, 2022 1 次提交
-
-
由 mengqingchun02 提交于
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * fix beam_search operator bugs on xpu. test=kunlun * fix beam_search operator bugs on xpu. test=kunlun * fix beam_search operator bugs on xpu. test=kunlun * fix beam_search operator bugs on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun * support beam_search_decode operator on xpu. test=kunlun
-
- 05 8月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
* move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft * make stft_op use fft from phi/kernels/funcs, clean code
-
- 15 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 14 7月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* Compilation optimization
-
- 12 7月, 2022 1 次提交
-
-
由 qipengh 提交于
-
- 24 6月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 14 6月, 2022 1 次提交
-
-
由 Wilber 提交于
* cmake-lint * update
-
- 10 6月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* make all phi kernels to 2(host/device) static libraries directly * fix calling kernel_declare * fix compile * fix cpu compile * fix rocm compile * fix xpu compile * fix xpu kp compile * fix inference compile
-
- 02 6月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* support CUDAGraph for partial graph * add ut * fix ci * fix ut again because of eager mode * fix kunlun ci * fix win ci
-
- 16 4月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 13 4月, 2022 1 次提交
-
-
由 levi131 提交于
* native commit for triple grad of sigmod * Updated unittests files * init functional jacobian api * Updated trible_test func * Updated gradient_checker & test_script * finish test with dtype float32 * add float64 test case * polish code * use atol=1e-5 with dtype float64 * fix for ci * set timeout for test_jacobian * fix dygraph grad to support high differential * polish API docstring * Updated gradient checker and some related files * fix double grad strip error for high differential * fix double grad strip error for high differential * Add Sigmoid triple grad tests * fix dygraph double grad dtype error when calling for high differential senario * Updated triple grad teses func * Use np.random to initialize ddx * Updated triple_grad_check func * add todo for gradient checker and refine some comments * remove additional code * add test for warnging in backward.py * format python code * support multi input in triple gradient checker * Add matmul triple grad kernel * Updated comments of TODO * Supported some special tests * Change code-format to follow CI std * Updated gradient_checker.py * Fix conflicts * Removed unnecessary printing log * Change code style to follow CI std * merge upstream * add_p * rm useless files * add sub_p mul_p div_p * add sqrt_p and tanh_p * add reshape_p * add broadcast_p * add broadcast_p fill_constant_p matmul_p reduce_p reshape_p transpose_p * add split_p and concat_p * add gather_p and scatter_add_p * add slice_select_p and slice_assign_p * add multi input check for add_p, sub_p, mul_p, div_p * update concat_p * refine gather_p and scatter_add_p * refine slice_assign_p and slice_select_p * add 9 test for prim ops * add more test and fix some bug * add more test * register proto * add shape valid check for broadcast_p op, and add keepdim attr into reduce_p op proto * support multi input and multi output for split_p and concat_p * fix slice bug for slice_select_p and slice_assign_p * dtype for axis attr should be long int * update dtype for axis attr int64_t * update for iscan CI * add more shape and dtype check * change IndexTensor into int32 dtype
-
- 05 4月, 2022 1 次提交
-
-
由 Guanghua Yu 提交于
-
- 28 3月, 2022 1 次提交
-
-
由 0x45f 提交于
* moving OP * move forward * move grad and infershape * code format * format code * fix code * fix code * fix CMakerLists.txt * fix comments * Refine CMakeLists for rocm ci
-
- 10 3月, 2022 1 次提交
-
-
由 Zhong Hui 提交于
* move segment_pool to phi. * mark summed ids as optional tensor. * fix as reviews.
-
- 24 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten to phi * fix infrt compile failed * resolve conflict
-
- 16 2月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
* move complex_functors.h and update all references to symbols within it
-
- 26 1月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* update cmake file to remove fluid kernel * add pten declaration.h to where pybind.h used * fix sync_bn and tensorrt_engine * refine detection_library * fix interpreter_core * support eager legacy * fit eager legacy for pten * fall back to cpu if not found kernel * fix compile problem * fix compile problem * refine fallback logic * fit operator.run() * fix xpu compile * fit for new_exec * add REGISTER_OP_WITHOUT_GRADIENT * un-cache pt_kernel_context * fix compile * fix cudnn * fix compiling with on_infer * fix mkldnn * fix isfinite_v2 * fix xpu problem * fix op_device * refine fallback for xpu * fix xpu compile * merge develop * refine code format * fix compile * fix compile * add data_transfer * fix PreparePtenData * fix cpu context * merge develop * fix compile * fix error device context * fix xpu * fix dev_ctx
-
由 Allen Guo 提交于
* sync misc changes * apply comments 01 * fix compile error * remove is_ipu_place check * add authors Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * sync changes * restore cmake * update ir cmake and setup.py * update inference_lib cmake * restore for split PR Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
- 24 1月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
* migration of functors in paddle/fluid/operators/eigen and paddle/fluid/platform/eigen_ext.h * update path of data types like float16.h in includes in extensions.h
-
- 21 1月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
-
- 28 12月, 2021 1 次提交
-
-
由 huangxu96 提交于
* add API and op for take_along_axis * fix compile dependency problem and add example code and doc * add unitest * delete some code for CI coverage * fix code style problem * fix as review
-
- 20 12月, 2021 1 次提交
-
-
由 fwenguang 提交于
-
- 08 12月, 2021 1 次提交
-
-
由 CtfGo 提交于
1. add a subdirectory named `cinn` in `paddle/fluid/operators` directory and move releated files into it 2. seperate CinnLaunchContext class from `cinn_launch_op.h` and put it in a new independent file named `cinn_launch_context.h`, so that it can be included by others clearly.
-
- 06 12月, 2021 1 次提交
-
-
由 Huihuang Zheng 提交于
1. Modify git tag for CINN 2. Support compile option "-DWITH_CINN=ON, -DWITH_TESTING=OFF"
-
- 01 12月, 2021 1 次提交
-
-
由 sneaxiy 提交于
* fix inplace addto pass * update * fix ut * improve ci coverage * fix musl ci compile error
-
- 27 11月, 2021 1 次提交
-
-
由 Aganlengzi 提交于
* [NPU] reorganization for device API abstraction * [NPU] delete old files * [NPU] fix npu_collective_helper * [NPU] fix collective_helper * [NPU] fix ut * [NPU] mod memory allocation and hccl_helper * [NPU] fix place_type * [NPU] split enfoce.h * move acl* call into npu_info * merge conflict * fix merge * merge conflict * merge conflict
-
- 19 11月, 2021 1 次提交
-
-
由 LiYuRio 提交于
-
- 13 11月, 2021 1 次提交
-
-
由 CtfGo 提交于
Modify serveral implements on CinnLaunchOp: 1. Skip checking input variables must be used 2. Move current helper functions to a CinnlaunchContext
-
- 03 11月, 2021 1 次提交
-
-
由 CtfGo 提交于
1. 功能不变,简化CinnLaunchOpKernel实现:将原先直接从Scope获取变量信息的方式改为借助参数ExecutionContext标准接口获取,简化了实现逻辑,相应地也简化了辅助函数的实现,原先cinn_launch_op_helper较为冗余,删除不必要的接口并迁移至cinn_launch_op.cc中定义。 2. 修复CinnLaunchOp InferShape判断是否有指定输出:HasOutput->HasOutputs 3. 添加详细的注释和debug信息,方便问题排查和代码维护
-
- 02 11月, 2021 1 次提交
-
-
由 Liu-xiandong 提交于
-