- 26 Aug, 2022 22 commits
-
-
Committed by Ruibiao Chen
-
Committed by Yuang Liu
-
Committed by Roc
* add simple reformated ci files
* update
* add radme for new unitetsts
* add radme for new unitetsts
* add radme for new unitetsts
* reset mlu
* update for samples
* add base api
* reset some dist unit tests
* add warning in grenerated cmakelists file
* update readme for new dist unit tests
* add all collective tests
* remain base file and launcher file
* Update README.md
* Update README.md
* fix env PYTHONPATH
* Update gen_ut_cmakelists.py
* add all collective tests
* add docs for gen_ut_cmakelists.py
* pretify codes
* commont name == "name"
* update for comments
* update function's help
* update for run type
* update readme
* add all collective tests
* add all collective tests
* mv collective test files
* update for all collective tests
* update
* update
* update
* update for all tests
* update for checking name
* Update Cmakelists.txt
* update testlist.csv
* remain test_parallel_dygraph_dataparallel in unittests
* set broadcast op all platforms
* update
* remain test_broadcast_tensors_op
* fix
* rm some collective files
* update more colective tests
* update
* update
* update gen_ut_supports recursion
* update
* update
* update
* update
* fix nccl version
* update
* update
* update
* update
* fix a bug and try to pass
* update
* add csv
* update for timeout
* remove tcp store
* fix
* fix
* update
* update
* update for more dist tests
* move multi node tests
* update
* update
* update
* fix for auto parallele
* update
* update path in python file
* update
* reset some test in unittests
* fix
* update readme
* fix
* update
* fix port
-
Committed by Ruibiao Chen
-
Committed by Ruibiao Chen
-
Committed by zyfncg
* delete fill xpu op in fluid
* delete fill_constant header, test=kunlun
* fix npu header, test=kunlun
-
Committed by tianshuo78520a
-
Committed by Wang Bojun
* fix bug fix
* add shape size check
* polish code
* multi -1 shape fix
* code style improve
* bug fix
* code style fix
-
Committed by Guanghua Yu
-
Committed by wanghuancoder
-
Committed by Liyulingyue
* irfftn; test = docutment_fix
* fft; test=document_fix
* fft; test=document_fix
* fft; test=document_fix
* subdata; test=document_fix
* adaptive_avg_pool2d; test=document_fix
* adaptive_avg_pool3d; test = document_fix
* ftt; test=document_fix
* ftt; test=document_fix
* AvgPool1D; test=document_fix
* avg_pool1d; test=document_fix
* test=document_fix
* test=document_fix
* test=document_fix
* test=document_fix
* fft; test=document_fix
* emb; test=document_fix
* emb; test=document_fix
* emb;test=document_fix
* fold; test=document_fix
* fold; test=document_fix
* fold; test=document_fix
* fold;test=document_fix
* GELU;test=document_fix
* update irfftn docs;test=document_fix
* Update fft.py
* Update fft.py
* Update common.py
* Update common.py
* Update fft.py
* Update input.py
* Update pooling.py
* dropout2d; test=document_fix
* Fold; test=document_fix
* update fold math;test=document_fix
* Update common.py

Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
-
Committed by Wangzheee
-
Committed by danleifeng
-
Committed by houj04
-
Committed by whs
-
Committed by ceci3
-
Committed by kangguangli
* remove fluid kernel and activate phi kernel
* fix parameter error
* transfer mkldnn part
* modify header file path
* fix compile error
* transfer special case
* fix lod setting and special case for layout setting
* add testcase and refine code
-
Committed by Hui Zhang
-
Committed by Yuang Liu
-
Committed by haosicheng
* add temporal shift and grad *test=kunlun
* fix reduce mean grad bug *test=kunlun
-
Committed by xiongkun
* while support for python container. It is convenient to convert more dynamic graph codes into static graphs.
* cond support python container
* 1. make select_input output shape = input[1]
  2. add warning in while_loop risky assign
* fix 2 problem in GPT export:
  1. a bug in while_op no_need_copy_var, which causes gpu memory leakage
  2. a bug in undefined_var where the stop_gradient should be False.
* change name by code review
* format
-
Committed by 王明冬
-
- 25 Aug, 2022 18 commits
-
-
Committed by Feiyu Chan
-
Committed by piotrekobi
* Add omp parallel for directives
* Revert "Add omp parallel for directives" This reverts commit f4e4f8ddb12454018d9c1e49c074af2543659de6.
* Add #pragma omp parallel for to correct file
* Add check for _OPENMP definition
* Disable omp on gpu
* Trigger CI
* Readd check for _OPENMP definition
* Change macro disabling changes on GPU
* Improve macro readability
-
Committed by Aurelius84
* [OpAttr]axis of Reverse Support Tensor type
* fix coverage
* fix unittest
-
Committed by danleifeng
* update brpc version;test=develop
-
Committed by hong
-
Committed by Aurelius84
* [OpAttr]min/max of Uniform_rand support Tensor type
* fix typo
-
Committed by chenjian
* fix segment fault
* fix
-
Committed by kangguangli
* transfer memcpy_d2h from fluid to phi
* refine arg check and add comment
* fix cannot fallback to phi kernel
* fix gpu_context host alloc when tensor size = 0
* add kernel for std::vector<DenseTensor> args
* fix bugs in MemcpyD2HMultiIOKernel
* remove useless header file
* polish format
* fix typo
* add testcase for cudapinned place
* refine check condition in test
* polish error message
* polish error message
* remove header in fluid directory
* merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation
* fix code style check
-
Committed by ronnywang
* [NPU] add run_program_op_npu
* add run_program_op_npu ut
-
Committed by Sing_chan
* make full_like support double_max in dygraph
* fix bug
-
Committed by wanghuancoder
* sync_batch_norm_grad delete mean and variance
-
Committed by hong
* optimizer conv alog speed
* code polish
* remove useless code
* fix compile error
* fix cpu compile error
* not use cudnn alog t
* add search cache max number
* polish code
* fix cache test bug
* add groups data format to conv args
* fix cache test bug
* fix cudnn_deterministic bug
* fix test switch auto tune bug
* fix test swith autotune bug;
* fix conv cache bug
* fix cache test error
* fix cache test bug
* fix windows mac compile error
* fix workspace search error
* update cudnn cache
* fix cache test bug; test=develop
* fix autotune swith test error
* polish code
* oplish code
-
Committed by ziyoujiyi
* back fl
* delete ssl cert
* .
* make warning
* .
* unittest paral degree
* solve unittest
* heter & multi cloud commm ready
* .
* .
* fl-ps v1.0
* .
* support N + N mode
* .
* .
* .
* .
* delete print
* .
* .
* .
* .
* fix bug
* .
* .
* .
* fl-ps with coordinator ready
* merge dev
* update message parse only
* update fl client scheduler
* fix bug
* update multithreads sync
* fix ci errors
* update role_maker.py
* update role_maker.py
* fix ci error: windows py import error
* fix ci error: windows py import error
* fix windows ci pylib import error
* add dump fields & params
* try to fix windows import fleet error
* fix ps FLAGS error
* fix logging risk
* fix logging possible risk
* write trainer_desc file
* support split sparse params in local & remote
* fix import paddle.fluid.core.PSGPU
* fix import paddle.fluid.core.PSGPU
* add remote_sparse & local_sparse config
* fix unittest
* fix test_dist_fleet_geo table error
* fix PADDLE_ENFORCE error
* fix other's pr conflict
* forbidden ssd table
* .
* recover ssd table code
* recover file mode
-
Committed by Rayman
-
Committed by Wilber
-
Committed by handiz
-
Committed by JZ-LIANG
* support high order differential with data parallel overlap
* update unitest
-
Committed by USTCKAY
-