- 29 8月, 2022 16 次提交
-
-
由 Chen Weihang 提交于
* migrate assign xpu kernel, test=kunlun * remove assign_value xpu, test=kunlun
-
由 Charles-hit 提交于
* support refuse forward dygraph * modify backward api exponential__grad yaml * remove print code * 当反向复用前向时进行需不需要更高阶的反向判断,如果不需要调用c++ api,需要的话则调用前向动态图生成反向节点 * fix some backward bugs * modify the generated dygraph function name
-
由 wanghuancoder 提交于
* gather gather_grad gather_nd gaussian_random xpu to phi
-
由 zhangbo9674 提交于
-
由 Zhang Ting 提交于
* fix hard_swish inf * skip_check_grad for mkldnn op * 'fix code style' * fix unittest
-
由 cambriconhsq 提交于
-
由 wanghuancoder 提交于
* pylayer set has grad with create_graph
-
由 Aurelius84 提交于
* [OpAttr]num_rows/num_colums of eye support Tensor type * fix attr cast with long type
-
由 Siming Dai 提交于
* move incubate to geometric * add paddle.geometric * fix unittest bug * add float16 support for segment op * change reindex and sample neighbors flag name * add heter graph reindex * move sample_neighbors.py to neighbors.py * delete khop_sampler in geometric * delete unused code * change sample_neighbors api input order * fix en doc * fix unittest * fix unittest * change reindex * fix division by 0 * delete unnecessary input argument * delete final_state
-
由 Allen Guo 提交于
* support depthwise_conv2d ops Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> * fix duplicate name Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
-
由 Allen Guo 提交于
-
由 HongyuJia 提交于
* add legacy_api.yaml * set merged_momentum inplace only * support inplace optional<vector<tensor>> * add dygraph_mode api * add merged_adam yaml * add merged_adam python api * change testcase of merged_adam and adam * fix import of test_merged_adam_op
-
由 zhangbo9674 提交于
-
由 zhangbo9674 提交于
-
由 zyfncg 提交于
* move matmul_v2 kernel of xpu from fluid to phi, test=kunlun * fix complie bug, test=kunlun * fix complie bug, test=kunlun * fix complie bug, test=kunlun
-
由 Weilong Wu 提交于
-
- 26 8月, 2022 14 次提交
-
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
由 zyfncg 提交于
* delete fill xpu op in fluid * delete fill_constant header, test=kunlun * fix npu header, test=kunlun
-
由 Wang Bojun 提交于
* fix bug fix * add shape size check * polish code * multi -1 shape fix * code style improve * bug fix * code style fix
-
由 wanghuancoder 提交于
-
由 Wangzheee 提交于
-
由 danleifeng 提交于
-
由 houj04 提交于
-
由 kangguangli 提交于
* remove fluid kernel and activate phi kernel * fix parameter error * transfer mkldnn part * modify header file path * fix compile error * transfer special case * fix lod setting and special case for layout setting * add testcase and refine code
-
由 Hui Zhang 提交于
-
由 haosicheng 提交于
* add temporal shift and grad *test=kunlun * fix reduce mean grad bug *test=kunlun
-
由 xiongkun 提交于
* while support for python container. It is convenient to convert more dynamic graph codes into static graphs. * cond support python container * 1. make select_input output shape = input[1] 2. add warning in while_loop risky assign * fix 2 problem in GPT export: 1. a bug in while_op no_need_copy_var, which causes gpu memory leakage 2. a bug in undefined_var where the stop_gradient should be False. * change name by code review * format
-
由 王明冬 提交于
-
- 25 8月, 2022 10 次提交
-
-
由 Feiyu Chan 提交于
-
由 piotrekobi 提交于
* Add omp parallel for directives * Revert "Add omp parallel for directives" This reverts commit f4e4f8ddb12454018d9c1e49c074af2543659de6. * Add #pragma omp parallel for to correct file * Add check for _OPENMP definition * Disable omp on gpu * Trigger CI * Readd check for _OPENMP definition * Change macro disabling changes on GPU * Improve macro readability
-
由 Aurelius84 提交于
* [OpAttr]axis of Reverse Support Tensor type * fix coverage * fix unittest
-
由 danleifeng 提交于
* update brpc version;test=develop
-
由 Aurelius84 提交于
* [OpAttr]min/max of Uniform_rand support Tensor type * fix typo
-
由 chenjian 提交于
* fix segment fault * fix
-
由 kangguangli 提交于
* transfer memcpy_d2h from fluid to phi * refine arg check and add comment * fix cannot fallback to phi kernel * fix gpu_context host alloc when tensor size = 0 * add kernel for std::vector<DenseTensor> args * fix bugs in MemcpyD2HMultiIOKernel * remove useless header file * polish format * fix typo * add testcase for cudapinned place * refine check condition in test * polish error message * polish error message * remove header in fluid directory * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation * fix code style check
-
由 ronnywang 提交于
* [NPU] add run_program_op_npu * add run_program_op_npu ut
-
由 Sing_chan 提交于
* make full_like support double_max in dygraph * fix bug
-
由 wanghuancoder 提交于
* sync_batch_norm_grad delete mean and variance
-