- 31 10月, 2019 2 次提交
-
-
由 Chen Weihang 提交于
-
由 Yiqun Liu 提交于
* Refine the cache of program, context and scope in executor. test=develop * Refine the unittest test_executor_and_use_program_cache. * Add the test the PaddingRNN with use_program_cache=True. test=develop * Remove a check. test=develop * Refine the unittest to check whether it is correct when setting use_program_cache=True. test=develop
-
- 30 10月, 2019 5 次提交
-
-
由 Wilber 提交于
* fix jit_matmul bug * update jit matmul and add test
-
由 Yiqun Liu 提交于
* Move the codes of fused operators to operators/fused directory. test=develop * Correct the op name in cmake. * Change the use of PADDLE_ENFORCE. test=develop
-
由 Leo Chen 提交于
-
由 zhang wenhui 提交于
-
由 Zeng Jinle 提交于
-
- 29 10月, 2019 9 次提交
-
-
由 hong 提交于
* fix persistable; * fix save load bugs; test=develop * fix bug; test=develop * add example for new io api; test=develop * addd example; test=develop
-
由 liym27 提交于
support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780) * improve split and concat op: 1. support Tensor for argument 'dim' in split op. 2. support Tensor for argument 'axis' in concat op. test=develop * redefine function GetDataFromTensor and set unknown output shape to - 1. test=develop * add check: Attr(sections) match Input(X). test=develop * support Tensor for attr(sections) and attr(sections) can contain -1. add check for attr(sections). test=develop * modify error message for concat and call Resize only when necessary. test=develop
-
由 wangchaochaohu 提交于
-
由 Yiqun Liu 提交于
* Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime. test=develop * Add comment for ReorderLoDTensorByRank op. * Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time. test=develop * ShrinkRNNMemory op should call ShareLoD for compile time. test=develop * Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool. test=develop * Refine the unittest of DynamicRNN. test=develop * Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE. test=develop
-
由 Yiqun Liu 提交于
* Add fusion_group_pass and elementwise pattern. * Rewrite the detector of elementwise group. test=develop * Add a comment in codegen. * Add more unittest cases. test=develop * Move code_generator related code to fusion_group directory. * Correct the including path. * Add the definition of SubGraph and finish the insert of fusion_group op in pass. * Insert graph_vis_pass in tester to visualize the graph for debug.
-
由 liym27 提交于
* improve unsqueeze op to support int, Tensor and Tensor list for argument axes. test=develop * call Resize only when necessary. test=develop
-
由 silingtong123 提交于
* Make shape tensor support int32
-
由 Huihuang Zheng 提交于
-
由 Zeng Jinle 提交于
-
- 28 10月, 2019 5 次提交
-
-
由 Aurelius84 提交于
-
由 Zeng Jinle 提交于
-
由 Chen Weihang 提交于
-
由 whs 提交于
-
由 Chen Weihang 提交于
* replace part of the old implementation, test=develop * restore concat op, test=develop * update all ops implemention & delete GetDataTypeOfVar func, test=develop
-
- 25 10月, 2019 3 次提交
-
-
由 xujiaqi01 提交于
* no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto. * add find_distributed_lookup_table_grads instead of hard code GRAD * support embedding stop gradient. push sparse has error before fix this.* * fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this. * fix pull sparse, skip slots which do not have embedding. * fix collect feasign label info, skip slots which do not have embedding. * support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables. * test=develop
-
由 Yamei-Lee 提交于
consider the situation that shape of input can contain more than one -1. test=develop
-
由 Chen Weihang 提交于
* Make formatted ENFORCE stack adapt to more situations and polish details, test=develop * restore template message position, test=develop
-
- 24 10月, 2019 9 次提交
-
-
由 Zeng Jinle 提交于
-
由 Zhang Ting 提交于
* All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview * fix the bug that attr(offsets) should be initialized, test=develop
-
由 danleifeng 提交于
-
由 Zeng Jinle 提交于
-
由 Zeng Jinle 提交于
* add more err msg, test=develop * add more unittests, test=develop
-
由 Tao Luo 提交于
* make search_compute support avx only * clean search_compute.h * rename sse_axpy to avx_axpy test=develop * update CMakeLists.txt test=develop
-
由 zhongpu 提交于
* add PADDLE_ENFORCE for dygraph to optimize error throw, test=develop * fix some error, test=develop * delete PADDLE_ENFORCE_EQ in VarBase::NewVarBase, test=develop
-
由 WangXi 提交于
-
由 wangchaochaohu 提交于
-
- 23 10月, 2019 7 次提交
-
-
由 Zeng Jinle 提交于
* jit/trace 1st version, test=develop * add more unittests, test=develop
-
由 zhaoyuchen2018 提交于
The op should handle k=1024 test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
由 whs 提交于
-
由 Pei Yang 提交于
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733) * fix pool2d trt converter, test=develop * add fix for split op converter, test=develop
-
由 tianshuo78520a 提交于
-
由 Tao Luo 提交于
* mv sampcd_processor.py to tools test=develop test=document_fix * update example script test=develop test=document_fix
-