1. 28 Sep, 2019 6 commits
    • Q
      Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02
      qingqing01 committed
      * Custom ops must be written according to the framework OP spec.
      * Package fluid_framework.so and headers into whl.
      * Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
      * Export some C-APIs to merge OpInfo between core.so and custom_op.so.
      * Add unit testing.
      * Update API.spec.
    • S
      Improve op uniform_random: the shape argument supports a Tensor or a list containing Tensors (#19786) · f1eebf75
      silingtong123 committed
      * test=develop, the shape argument supports a Tensor or a list containing Tensors
      
      * test=develop,Increasing the coverage of CI tests
      
      * test=develop, modify the document and update API.spec
      
      * test=develop, modify the doc and update API.spec
      
      * test=develop, modify the doc and update API.spec
      
      * test=develop, modify the interface of UniformInitializer
      
      * test=develop, modify the interface of XavierInitializer and MSRAInitializer
      
      * test=develop, modify based on review's comments
      
      * test=develop, modify based on review's comments
      
      *  test=develop, modify based on review's comments
    • L
      fix pool2d pool3d,support asymmetric padding and channel_last (#19739) · 24010472
      liym27 committed
      * fix pool2d pool3d:
      1. support asymmetric padding;
      2. support padding algorithm:"SAME" and "VALID";
      3. support channel_last: data_format NHWC and NDHWC;
      4. support inferring the shape when the input has negative dims at compile time;
      5. change doc of python API and c++;
      6. fix bug in cuda kernel when Attr(adaptive) is true.
      
      test=develop,test=document_preview
      
      * fix 'tensors' to 'Tensors'. test=develop,test=document_preview
      
      * add test to cover ValueError. test=develop,test=document_preview
      
      * resolve conflict in test_pool2d. test=develop
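
The padding options listed in this commit ("SAME", "VALID", and asymmetric explicit padding) can be illustrated for one spatial dimension. This is a minimal sketch assuming the common convention in which "SAME" yields ceil(in / stride) outputs and puts any odd leftover padding at the end; the function name `pool_output_size` and the "EXPLICIT" label are hypothetical, not the framework's API, and the actual pool2d/pool3d kernels may differ in detail.

```python
import math


def pool_output_size(in_size, ksize, stride, padding="VALID", explicit_pad=(0, 0)):
    """Return (out_size, pad_before, pad_after) for one spatial dimension.

    Assumed convention (not taken from the commit itself):
      VALID    -> no padding at all
      SAME     -> out_size == ceil(in_size / stride); extra padding goes last
      EXPLICIT -> use explicit_pad as (pad_before, pad_after), may be asymmetric
    """
    if padding == "VALID":
        pad_before, pad_after = 0, 0
    elif padding == "SAME":
        target = math.ceil(in_size / stride)
        total = max((target - 1) * stride + ksize - in_size, 0)
        pad_before = total // 2
        pad_after = total - pad_before
    elif padding == "EXPLICIT":
        pad_before, pad_after = explicit_pad
    else:
        raise ValueError("unknown padding mode: %r" % (padding,))
    # Floor-mode output size with the (possibly asymmetric) padding applied.
    out_size = (in_size + pad_before + pad_after - ksize) // stride + 1
    return out_size, pad_before, pad_after


if __name__ == "__main__":
    for mode in ("VALID", "SAME"):
        print(mode, pool_output_size(8, 3, 2, mode))
    print("EXPLICIT", pool_output_size(7, 3, 2, "EXPLICIT", (0, 1)))
```

With an input of size 8, kernel 3 and stride 2, "SAME" needs one padding element in total, so the padding is necessarily asymmetric (0 before, 1 after).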
    • Y
      Dygraph Layer kwargs & param getter setter (#19901) · 54e07994
      Youwei Song committed
      * opt FC
      
      * opt rest of dygraph.nn
      
      * new param shape check and unittest
      
      * add kwargs for Layer
      
      * add new set_value api
      
      * use property decorator
      
      * update API.spec, test=develop
      
      * use UserList, separate gettersetters, test=develop
      
      * update test_custom_layer_with_kwargs, test=develop
      
      * fix UserList compatibility, test=develop
      
      * fix UserList compatibility, test=develop
      
      * keep FC._w, test=develop
      
      * add unittests, Conv3D bug fix, test=develop
      
      * clean code, test=develop
      
      * fix dygraph guard in unittest, test=develop
      
      * add property setters, remove unused param in tracer, test=develop
      
      * tracer none check, test=develop
      
      * merge, test=develop
      
      * refine, test=develop
      
      * bug fix in  prelu and conv3d_transpose, test=develop
      
      * rm __set__, test=develop
      
      * set tensor value instead of assign op
      
      * fix property setter call, test=develop
      
      * fix api.spec, test=develop
      
      * fix doc sample, test=develop
    • H
      Add shape check in load model (#19936) · 4e99c2af
      hong committed
      * add parameter shape check when loading parameters from file; test=develop
      
      * fix test error; test=develop
      
      * add wrong shape check; test=develop
      
      * remove useless code; test=develop
      
      * add testcase setup
      
      * add teardown, remove temp model_path; test=develop
      
      * add clean process; test=develop
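
The idea of checking parameter shapes at load time can be sketched independently of the framework. The example below is only an illustration under assumptions: parameters are modelled as a plain dict from name to shape tuple, and `check_param_shapes` is a hypothetical helper, not the function added by this commit.

```python
def check_param_shapes(expected, loaded):
    """Compare loaded parameter shapes against the shapes the model expects.

    Both arguments are dicts mapping a parameter name to a shape tuple
    (a hypothetical layout chosen for this sketch). Raises KeyError when a
    parameter is missing and ValueError when a shape does not match.
    """
    for name, shape in expected.items():
        if name not in loaded:
            raise KeyError("parameter %r not found in the loaded file" % name)
        if tuple(loaded[name]) != tuple(shape):
            raise ValueError(
                "shape mismatch for %r: model expects %s, file has %s"
                % (name, tuple(shape), tuple(loaded[name]))
            )


if __name__ == "__main__":
    model = {"fc.w": (784, 10), "fc.b": (10,)}
    check_param_shapes(model, {"fc.w": (784, 10), "fc.b": (10,)})  # passes
    try:
        check_param_shapes(model, {"fc.w": (784, 20), "fc.b": (10,)})
    except ValueError as err:
        print(err)
```

Failing early with the parameter name and both shapes in the message is usually easier to debug than a later error inside an operator.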
    • G
      change dist tests to serial test=develop (#20051) · d4bca811
      gongweibao committed
  2. 27 Sep, 2019 5 commits
    • J
      Optimze/optimize dygraph api (#19999) · 39ff0f9c
      Jiabin Yang committed
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, Add Variable api and refine dygraph related API
      
      * test=develop, Add Variable api and refine dygraph related API
      
      * test=develop, refine test for new api and error info
      
      * test=develop, refine error info and test_layers
      
      * test=develop, add API.spec
      
      * test=develop, fix to_string python2 and python3 compat error and refine doc

      * test=develop, add API spec

      * test=develop, update API spec

      * test=develop, update API spec
      
      * test=develop, invoke ci
      
      * test=develop, fix example code
      
      * test=develop, update API spec
      
      * test=develop, add compat test and fix inplace compat dict error
    • K
      polish pool infer shape (#20038) · e7a6567b
      Kaipeng Deng committed
      * fix pool infershape. test=develop
      
      * fix unittest coverage. test=develop
      
      * fix format. test=develop
    • C
      Add fp16 support for pad and split (#19881) · fb2a9cdf
      chengduo committed
      * make pad and split support fp16
      test=develop
    • T
      the integrated communicator (#19849) · 8f0b3c05
      tangwei12 committed
      * add a base class for the Communicator
      * add AsyncCommunicator Impl for async distributed training
    • Z
      Fix name_scope test case bug (#20034) · 5a2ecdea
      zhaoyuchen2018 committed
      test=develop
      Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
  3. 26 Sep, 2019 11 commits
  4. 25 Sep, 2019 8 commits
    • Z
      add kernel for fill_op, test=develop (#19719) · b1bb2384
      zhongpu committed
      * add kernel for fill_op, test=develop
      
      * modify PADDLE_ENFORCE to PADDLE_ENFORCE_EQ, test=develop
      
      * add op test for fill_op, test=develop
      
      * REGISTER COP CUDA KERNEL, test=develop
      
      * update test_fill_op.py, test=develop
      
      * change FillConstantOpVarTypeInference to FillOpVarTypeInference, test=develop
      
      * fix op test, test=develop
      
      * add header file, test=develop
    • W
      add support tensor and tensorlist for strided_slice OP (#19929) · 382d099d
      wangchaochaohu committed
      * add support tensor and tensorlist for strided_slice OP test=develop
      
      * fix the comment test=develop
      
      * fix test=develop
      
      * fix the bug test=develop
      
      * delete log test=develop
      
      * fix API.spec test=develop
      
      * fix test=develop
    • L
      Fix OpTest of bn (#19062) · 619a241b
      lvmengsi committed
      * fix bn
    • S
      Avoid treating broadcast as initialization operation (#19857) · 5920d69d
      ShenLiang committed
      * treat broadcast as non-initial, test=develop
      
      * rename the class name
      
      * rename the class name, test=develop
    • B
      add support of matmul with multiple head even different width and height (#19708) · c670058a
      Bob Zhu committed
      * add support of matmul with multiple head even different width and height
      
      The original multi-head matmul supports only mat_a.width == mat_b.height;
      in that case, mat_b is split horizontally. This patch extends the
      support to the case where mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height;
      in this case, mat_b is split vertically.

      For example, if A is [3, 8], B is [2, 16], and head_number is 4, then
      A is split into pieces of [3, 2] and B is (vertically) split into pieces of
      [2, 4]. The final result is 4 matrices of [3, 4], i.e. [3, 16].
      
      test=develop
      
      * add support of matmul with multiple head even different width and height
      
      The original multi-head matmul supports only mat_a.width == mat_b.height;
      in that case, mat_b is split horizontally. This patch extends the
      support to the case where mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height;
      in this case, mat_b is split vertically.

      For example, if A is [3, 8], B is [2, 16], and head_number is 4, then
      A is split into pieces of [3, 2] and B is (vertically) split into pieces of
      [2, 4]. The final result is 4 matrices of [3, 4], i.e. [3, 16].
      
      test=develop
      
      * refactor the code of matmul with multiple head even different width and height
      
      test=develop
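
The shape arithmetic in the commit message above can be checked with a small pure-Python sketch. It assumes, based only on the example shapes, that both A and B are cut into head_number equal column blocks, each pair of blocks is multiplied, and the per-head results are concatenated along the columns. The helper names (`matmul`, `split_cols`, `multi_head_matmul`) are hypothetical and are not the kernel's actual code.

```python
def matmul(a, b):
    """Plain matrix product of two row-major lists of lists."""
    inner = len(b)
    return [
        [sum(row[k] * b[k][j] for k in range(inner)) for j in range(len(b[0]))]
        for row in a
    ]


def split_cols(m, parts):
    """Cut a matrix into `parts` equally wide column blocks."""
    width = len(m[0]) // parts
    return [[row[p * width:(p + 1) * width] for row in m] for p in range(parts)]


def multi_head_matmul(a, b, head_number):
    """Multiply per head when a.width / head_number == b.height."""
    assert len(a[0]) % head_number == 0 and len(b[0]) % head_number == 0
    assert len(a[0]) // head_number == len(b), "a.width/head_number must equal b.height"
    a_heads = split_cols(a, head_number)   # e.g. 4 blocks of [3, 2]
    b_heads = split_cols(b, head_number)   # e.g. 4 blocks of [2, 4]
    outs = [matmul(x, y) for x, y in zip(a_heads, b_heads)]  # 4 blocks of [3, 4]
    # Concatenate the per-head results along the columns: [3, 16].
    return [[v for out in outs for v in out[i]] for i in range(len(a))]


if __name__ == "__main__":
    A = [[1] * 8 for _ in range(3)]    # shape [3, 8]
    B = [[1] * 16 for _ in range(2)]   # shape [2, 16]
    C = multi_head_matmul(A, B, 4)
    print(len(C), len(C[0]))
```

For the [3, 8] by [2, 16] example with head_number 4, the result has 3 rows and 16 columns, matching the shapes stated in the commit message.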
    • L
      refine ctc align op with padding (#19926) · 6884dc80
      Liufang Sang committed
      * refine ctc align op with padding
      * refine api sample code
    • T
      add input type and dtype check for softmax_op (#19975) · 65a02fc1
      Tao Luo committed
      * add input type and dtype check for softmax_op
      
      test=develop
      
      * refine error message
      
      test=develop
    • A
      Removing length dims constraints of seq_pad and seq_unpad (#19497) · 99a9615a
      Aurelius84 committed
      * Removing last dims constraints of seq_pad and seq_unpad test=develop
      
      * fix test_layer api code test=develop
      
      * fix sequence_pad_op.cc conflict test=develop
      
      * remove test_analyzer_mm_dnn test=develop
      
      * fix vectorize bug test=develop
      
      * fix vectorize<int> test=develop
  5. 24 Sep, 2019 8 commits
  6. 23 Sep, 2019 2 commits
    • Z
    • M
      Forward recompute3 (#19913) · 9901f696
      mapingshuo committed
      * add recompute based checkpoints methods for large batch training
      test=develop
      
      * add append_backward_with_forward_recomputation
      test=develop
      
      * refine optimizer
      test=develop
      
      * update backward and optimizer
      test=develop
      
      * make Variable usable
      test=develop
      
      * add recompute code
      
      * refine optimizer
      test=develop
      
      * refine addup _append_backward_ops_with_checkpoints_
      1) for recompute part, just cache the grad_op_desc without appending to block
      2) before appending grad_op_desc to backward part, addup_repetitive_vars, remove unused branch
      test=develop
      
      * make method private
      
      * add recompute strategy into DistributedStrategy
      test=develop
      
      * checkpoint version3
      test=develop
      
      * remove some print information
      test=develop
      
      * remove unused sumop
      test=develop
      
      * try to fix recompute with graph building modules
      
      * add input names to vars should be held
      
      * add memory debug tool
      
      * backup backward
      
      * Fix bugs
      
      * add backward desc for op not in any segments
      
      * add exception info for sub_block
      
      test=develop
      
      * modify code style
      
      test=develop
      
      * modify code style
      
      test=develop
      
      * remove print functions
      
      test=develop
      
      * add API spec
      
      test=develop
      test=document_preview
      
      * make Recompute a child class of Optimizer
      
      test=develop
      test=document_preview
      
      * add API spec
      
      test=develop
      test=document_preview
      
      * modify API spec
      
      test=develop
      test=document_preview
      
      * add document for Recompute
      
      test=develop
      test=document_preview
      
      * change API doc of Recompute
      
      test=develop
      test=document_preview
      
      * code cleaning
      
      test=develop
      test=document_preview
      
      * modify API spec
      
      * fix bugs when segments hold no element
      
      * add testcase for Recompute Optimizer
      
      test=develop
      test=document_preview
      
      * add test for apply_gradient, and code cleaning
      
      test=develop
      test=document_preview
      
      * add test case for load function
      
      * enable CI
      
      test=develop
      test=document
      
      * add test case
      
      test=develop
      test=document_preview
      
      * add sample code for 4 functions of recompute optimizer
      
      test=develop
      test=document_preview
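
The general idea behind checkpoint-based forward recomputation, as named in this commit, can be shown on a toy chain of scalar functions. This is a sketch under assumptions, not the optimizer added here: only the inputs at checkpoint positions are kept during the forward pass, and each segment between checkpoints is re-run during the backward pass to recover the activations it needs. `LAYERS`, `grad_full`, and `grad_recompute` are hypothetical names.

```python
import math

# Each layer is a pair (function, derivative with respect to its input).
LAYERS = [
    (lambda x: x * x, lambda x: 2 * x),
    (math.sin, math.cos),
    (lambda x: 3 * x + 1, lambda x: 3.0),
    (math.tanh, lambda x: 1 - math.tanh(x) ** 2),
]


def grad_full(x):
    """Baseline: keep every layer input alive for the backward pass."""
    inputs = []
    for f, _ in LAYERS:
        inputs.append(x)
        x = f(x)
    g = 1.0
    for (_, df), inp in zip(reversed(LAYERS), reversed(inputs)):
        g *= df(inp)
    return g


def grad_recompute(x, checkpoints):
    """Keep only the inputs at checkpoint indices; recompute the rest."""
    bounds = sorted(set([0] + list(checkpoints)))  # segment start indices
    saved = {}
    for i, (f, _) in enumerate(LAYERS):
        if i in bounds:
            saved[i] = x  # the only activations stored in the forward pass
        x = f(x)
    g = 1.0
    ends = bounds[1:] + [len(LAYERS)]
    for start, end in reversed(list(zip(bounds, ends))):
        # Re-run this segment's forward pass from its checkpoint.
        h = saved[start]
        seg_inputs = []
        for f, _ in LAYERS[start:end]:
            seg_inputs.append(h)
            h = f(h)
        # Backpropagate through the segment using the recomputed inputs.
        for (_, df), inp in zip(reversed(LAYERS[start:end]), reversed(seg_inputs)):
            g *= df(inp)
    return g


if __name__ == "__main__":
    print(grad_full(0.7), grad_recompute(0.7, checkpoints=[2]))
```

Both functions return the same gradient; the recompute variant trades one extra forward pass per segment for holding fewer activations at once, which is the memory saving that makes larger batches possible.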