- 28 3月, 2023 12 次提交
-
-
由 Wang Xinyu 提交于
* add squeeze, unsqueeze, transpose fp16 unitest * Update test_transpose_op.py
-
由 caozhou 提交于
* add tune o1 level * add unittest
-
由 YuhangLi 提交于
-
由 Ainavo 提交于
* [Test Mv] move collective/multinode to test dir * add CMakeList.txt to test/collective * add bash_test_modules * adjust the order * recover bash_test_modules * add_subdirectory(collective) * resolve conflicts * resolve conflicts
-
由 jiangcheng 提交于
* [AMP] add fp16&bf16 support for flatten op * fix ci bug * fix inpute should astype self.dtype bug and fix zerodim test name * remove 0D-tensor bf16 test for window-inference-ci pass * remove flatten from op_accuracy_white_list
-
由 Zheng-Bicheng 提交于
* update * update
-
由 gouzil 提交于
* [CodeStyle][B015] delete unused * [CodeStyle][B015] add assert
-
由 张春乔 提交于
-
由 Infinity_lee 提交于
-
由 Zheng-Bicheng 提交于
-
由 Jiabin Yang 提交于
* optimize composite rule by making scalar shape as []1 * fix shape usage for 0D * fix rules * fix 0D error * fix flatten 0D error * fix bn eval mode * fix bn test * fix flatten
-
由 Young-Flash 提交于
* add bitwise_not trt converter * run pre-commit * modify neg_one_tensor_dims init way * fix BOOL type support requires TensorRT 8.4 * fix int8 & uint8 type * improve data type readability * modify filter logic * fix coverage CI
-
- 27 3月, 2023 9 次提交
-
-
由 Charles-hit 提交于
* add tanh and cast prim test * fix tanh test * fix 0-d test * add sqrt fp16 prim test * add public_python_api in prim test * fix test_squeeze2_op * add tanh prim test * add dropout prim test * [Dy2St]Fix clone for test state problem * clean code * modify test_cumsum_op * modify test_cumsum_op * fix dropout test * add dropout in cmake * fix dropout test --------- Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
-
由 cyber-pioneer 提交于
-
由 zhouweiwei2014 提交于
-
由 Xinyu Chen 提交于
-
由 HongyuJia 提交于
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output * delete dtype,shape func of multi_inplace op * [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output
-
由 Leo Chen 提交于
* unbind support bool dtype * replace np.array_equal
-
由 Leo Guo 提交于
instance_norm_grad kernel. Fix bugs that the data type of input is different from output in reduce_sum kernel. test=kunlun
-
由 Sławomir Siwek 提交于
* extract Op and OPMaker to .h * extend pattern for fused_op * set "with_residual" default to false * adjust fuse passes * remove fc+eltwise flag * fused_output_scale * activation attrs * remove extra attrs * fix int8/bf16 unit tests * simplify RecomputeOutputDims * remove unused method * Add description for attributes * add extra check * adjust op compats * update quantize test * fix protobuf parsing error * fix int8 performance * fused elementwises * merge develop * remove activation * restore activation for existing add/sub ops
-
由 houj04 提交于
-
- 25 3月, 2023 5 次提交
-
-
由 张春乔 提交于
-
由 Infinity_lee 提交于
* codestyle up027 * add to pyproject.toml
-
由 张春乔 提交于
-
由 张春乔 提交于
-
由 jjyaoao 提交于
-
- 24 3月, 2023 9 次提交
-
-
由 Zheng-Bicheng 提交于
-
由 TaoTao Li 提交于
* add all_reduce, reduce kernel and api * fix all_reduce reduce ut fix reduce op maker conflict fix merge conflicts * fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops rename allreduce op, to remove * fix code format fix comments * modify test_collective_reduce_api ut timeout * fix PR-CI-Build fix comments: format phi operator
-
由 wanghuancoder 提交于
* delete old dygraph op test
-
由 wanghuancoder 提交于
* delete old dygraph, mlu npu do not use dygraph
-
由 Zheng-Bicheng 提交于
update
-
由 Zheng-Bicheng 提交于
-
由 ZhangDY-6483 提交于
* first version, notest * return final rst, notest * use infinity() instead of max * ut structure * start up of ut * generate lse * update * add depense * reconstruct cmake * move file * add memory efficient attention and fix blasimpl * update * update cmake * add namespace * update cmake * use .cu * update for pad3d * bug fix * bug fix * update * bug fix * update enforce * add test case * merge the lse pad * fix kernel_fn of backward * fix PADDLE_ENFORCE_EQ and phi_api * fix PADDLE_ENFORCE * fix PADDLE_ENFORCE * rerun coverage * fix memory efficient attention test * rerun ci * add cuda version condition * add cuda version condition * delete WIP test * replace PADDLE_ENFORCE * edit the namespace of datatype in multiple.cc * rerun * rerun --------- Co-authored-by: Nliuyuang <liuyuang@baidu.com>
-
由 wanghuancoder 提交于
* xpu do not test dygraph in dygraph
-
由 Yuang Liu 提交于
-
- 23 3月, 2023 5 次提交
-
-
由 HongyuJia 提交于
-
由 Wangzheee 提交于
-
由 xiaoguoguo626807 提交于
* delete prim flag for matmul_2_grad * delete prim flag for matmul_2_grad * add new setgradoutmeta for matmul_double_grad_node * modify test and delete log * deal with review
-
由 chenjian 提交于
* add meshgrid composite rule * add meshgrid composite rule * update * add into CMakeLists * fix * update * update * optimize code * fix meshgrid op * update test
-
由 wanghuancoder 提交于
* delete old dygraph xpu op test
-