- 06 2月, 2023 4 次提交
-
-
由 zmxdream 提交于
* add dump_walk_path (#193) * add dump_walk_path; test=develop * add dump_walk_path; test=develop * add dump_walk_path; test=develop * Add multiple CPU communication, parameter query and merging functions, support batch alignment between multiple cards (#194) * compatible with edge_type of src2dst and src2etype2dst (#195) * do not merge_feature_shard when using metapath_split_opt (#198) * support only load reverse_edge (#199) * refactor GraphTable (#201) * fix * fix * fix code style * fix code style * fix test_dataset * fix hogwild worker * fix code style * fix code style * fix code style * fix code style * fix code style. * fix code style. --------- Co-authored-by: Ndanleifeng <52735331+danleifeng@users.noreply.github.com> Co-authored-by: Nqingshui <qshuihu@gmail.com> Co-authored-by: NWebbley <liwb5@foxmail.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com>
-
由 RedContritio 提交于
* check tensor numel in PyObject_CheckLongOrToLong * add unittest
-
由 houj04 提交于
-
由 Siming Dai 提交于
* fix to_dlpack for loop * fix reference count
-
- 03 2月, 2023 1 次提交
-
-
由 Jiabin Yang 提交于
* refactor dir for prim * support blacklist for bwd comp * fix type error * remove additional file * fix git ignore * add more test * merge develop
-
- 02 2月, 2023 1 次提交
-
-
由 Xiaoxu Chen 提交于
【PRIM】Support use operator's output metadata info in constructing static backward composite (#50043) * [prim] support custom target_gradients * support infershape after append one gradop * [prim] add simple net test * fix test_loop segment fault bug * [prim] fix infer shape segment fault bug when output of grad_op_desc is empty
-
- 31 1月, 2023 5 次提交
-
-
由 niuliling123 提交于
-
由 Charles-hit 提交于
* polish static grad op maker gen * fix some bugs * fix static code gen * solve conflict * modify composite grad maker name * integrate phi and fluid info in static code gen * rename some composite maker * modify static code gen format
-
由 PuQing 提交于
* add FP16 dtype for CastNumpy2Scalar * fix throw message * add test * fix SyntaxWarning * test skip for float16 * fix dtype mistakes
-
由 ronnywang 提交于
* [CustomDevice] add custom device api * update * update * test=document_fix * update * update * add examples
-
由 姜永久 提交于
* rm flags_retain grad in pybind * retain grads for xpu test * set retain grad for xpu * rm flag * lint --------- Co-authored-by: Nwanghuancoder <wanghuan29@baidu.com>
-
- 30 1月, 2023 2 次提交
-
-
由 Ruibiao Chen 提交于
* Support stream priority for standalone executor * Fix compile error * Fix compile error * Fix compile error * Fix compile error * Fix compile error
-
由 zmxdream 提交于
* add set slot_num for psgpuwraper (#177) * add set slot_num_for_pull_feature for psgpuwarper * Add get_epoch_finish python interface (#182) * add get_epoch_finish interface * add return * delete return * add unzip op (#183) * fix miss key for error dataset (#186) * fix miss key for error dataset * fix miss key for error dataset Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * add excluded_train_pair and infer_node_type (#187) * support return of degree (#188) * fix task stuck in barrier (#189) Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * check node/feature format when loading (#190) * check node&feature format when loading * check node&feature format when loading (2£ (2) * degrade log (#191) * [PGLBOX]fix conflict * [PGLBOX]fix conflict * [PGLBOX]replace LodTensor with phi::DenseTensor * [PGLBOX]fix gpu_primitives.h include path * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip ut * [PGLBOX]fix unzip ut * [PGLBOX]fix code style * [PGLBOX]fix code style * [PGLBOX]fix code style * fix code style * fix code style * fix unzip ut * fix unzip ut * fix unzip ut * fix unzip * fix code stype * add ut * add c++ ut & fix train_mode_ set * fix load into memory * fix c++ ut * fix c++ ut * fix c++ ut * fix c++ ut * fix code style * fix collective * fix unzip_op.cc * fix barrier * fix code style * fix barrier * fix barrier * fix code styple * fix unzip * add unzip.py * add unzip.py * fix unzip.py --------- Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: NSiming Dai <908660116@qq.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
- 29 1月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* remove max_slot_num * fix test case
-
- 28 1月, 2023 1 次提交
-
-
由 LiYuRio 提交于
-
- 20 1月, 2023 1 次提交
-
-
由 Jiabin Yang 提交于
-
- 17 1月, 2023 3 次提交
-
-
由 zhangbo9674 提交于
* refine munmap freq for ref_cnt_mmap_allocator * add shm reuse logic * fix compile bug * fix compile bug * fix bug of file refcount * fix compile bug * fix compile bug * refine code for delete shm case * polish code * refine shm cache pool size setting logic * set buffer is 2 * refine shm cache size logic * refine max shm cache * refine shm cache size
-
由 YuanRisheng 提交于
* change feed_op to phi kernel * fix ci bugs * fix build bugs * fix ci bugs * fix compile bugs * fix ci bugs * perfect code * perfect comment code * fix install bugs * modify code according comment * remove visitor in feed_op * modify according comment * perfect code according comment * add infershape * fix py3 bugs * fix getexpected kernel type * fix getexpected kernel type * fix ci bugs * add registry for custom device * fix py3 bugs * fix floating point error * fix py3 test bugs
-
由 WangZhen 提交于
* Support call backward() without params in dy2st
-
- 13 1月, 2023 4 次提交
-
-
由 wanghuancoder 提交于
-
由 duanyanhui 提交于
* clear ProcessGroupCustom manually * fix bug * fix bug * move destroy ProcessGroup to ProcessGroupIdMap * enable destroy to all device * remove unused comments * change to internal api * Update process_group.cc * Update process_group.cc
-
由 duanyanhui 提交于
* update get_device to custom * add custom_device api * rm is_compiled_with_custom_device from framework * add todo comments
-
由 HongyuJia 提交于
-
- 12 1月, 2023 1 次提交
-
-
由 jameszhang 提交于
* Fix reduce func bug in process_group_bkcl Also catch up with a recent process_group PR that failed to add XPU branch. Note that reduce is still accomplished by allreduce for xpu. Fix this should xccl lib be updated. * fix compile issue for non-XPU
-
- 11 1月, 2023 1 次提交
-
-
由 niuliling123 提交于
-
- 10 1月, 2023 2 次提交
- 09 1月, 2023 3 次提交
-
-
由 Jiabin Yang 提交于
* proto type of composite grad in paddle * proto type of composite grad in paddle * refactor composite api with phi * fix compile error * support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * fix compile error * add extra output in op_compat * remove debug log * fix clang compile error * support prim switch flag * support prim switch flag * fix dygraph error * merge develop * add code_gen * add necessary files without codegen * fix code_gen bug * add deps * modify igmnore * add ignore * delete std cout * add composite logic for backward.py * add tanh first order grad composite * support enable_prim flag for static graph * throw expection when both GrapOpMaker and GradCompOpMaker not been registered * reorganize the directory of prim api tests * fix windows error * add eager_utils * add eager_utils * modify code gen * add composite parse * add unittest for get_grad_op_desc * code optimize * fix static test on windows * support generate static graph code for imag and real op * fix windows compile error in test_static_prim * merge develop * disable test eager in inference * prim code gen * disable eager compile in inference * rm other file * rm gitignore file * code_style * add eager test * code_style * merge develop * remove useless files * modify static test * support bool flag from singlton * merge develop * recover git ignore * fix conflict * recover git ignore for generated op * fix test compile error * remove some tests * add python test * fix some name issue * add composite code gen * modify backward yaml * fix static composite grad maker code gen * remove addtional files * add some static funcs unit test * fix some bugs * fix composite grad maker register code gen * optimize some functions Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com> Co-authored-by: Nwangruting <wangruting@baidu.com> Co-authored-by: Ncxxly <chenxx_id@163.com> Co-authored-by: Ncharles-hit <wanghao107@baidu.com> Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
-
由 骑马小猫 提交于
* remove dumplicate element_size * remove tensor_methon_* method
-
由 LiYuRio 提交于
* comm_context and static init * refactor: move to phi/core/distributed * refactor: avoid mutable_data usage * fix: windows sock * fix: device without nccl Co-authored-by: Wen Sun <syl1887415157@126.com>
-
- 06 1月, 2023 3 次提交
-
-
由 Yuanle Liu 提交于
-
由 niuliling123 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Merge dist attrs of Python into C++ * [Auto Parallel] Add back deleted importing * [Auto Parallel] Add back removed unittest * [Auto Parallel] Remove type qualifiers of return types * [Auto Parallel] Fix some bugs * [Auto Parallel] Fix a bug of the quant pass * [Auto Parallel] Fix the code style
-
- 05 1月, 2023 2 次提交
-
-
由 姜永久 提交于
* rm op_function_generator * rm op_func_generator.h * rm op_function * modify cmake * rm op_function.h * rm check for op_function_generator.cc * reset imperative * rm python part * fix imperative * lint * lint * modify legacy_c * review * modify * modify legacy * rm gen op_functions code * reset framework * rm core.ops for test * core.ops->core.eager.ops.legacy * not raiseError for xpu
-
由 Wen Sun 提交于
* refactor: use base class * fix: incorrect deps * fix: add missing header * refactor: update class structures * fix: bkcl typo * fix: remove redundant def
-
- 04 1月, 2023 1 次提交
-
-
由 Aurelius84 提交于
-
- 03 1月, 2023 2 次提交
-
-
由 Yuanle Liu 提交于
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-
- 30 12月, 2022 1 次提交
-
-
由 Sanbu 提交于
* 1219 * temporarily change the num_diff_files limit, test=document_fix * Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20. * for codestyle * remove duplicate license * `static mode` -> `static graph mode` * Update hybrid_parallel_inference.py * Update layer_function_generator.py * Update manipulation.py * reset Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 28 12月, 2022 1 次提交
-
-
由 WangZhen 提交于
-