1. 12 Jun, 2019 9 commits
    • Fix GetExpectedKernelType of add_position_encoding_op (#17935) · a06b316b
      Guo Sheng committed
      * Fix the GetExpectedKernelType of add_position_encoding_op.
      test=develop
      
      * Fix the doc of lstm_unit outputs in nn.py.
      test=develop
    • combine noavx and avx package (#17889) · 5c06bff2
      tensor-tang committed
      * support avx and noavx core
      
      * add catch and give some log
      
      test=develop
      
      * fix build
      
      test=develop
      
      * add missing package
      
      test=develop
      
      * fix pybind name
      
      test=develop
      
      * fix import error
      
      test=develop
      
      * combine noavx core
      
      test=develop
      
      * add requirements
      
      test=develop
      
      * fix unknown message
      
      test=develop
      
      * fix api spec
      
      test=develop
      
      * refine and clean
      
      test=develop
      
      * update
      
      * pass dist ut
      
      * follow comments
      
      test=develop
      
      * refine scripts
      
      test=develop
    • Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3
      wawltor committed
      * test=develop
      The scatter op computed wrong results when the indices contained duplicates, because it used overwrite mode for repeated indices. This is fixed by using accumulate mode for repeated indices. The gather op had the same bug when computing its gradient. The OpenBLAS and Eigen libraries are used to reduce the time cost of accumulate mode.
      
      * test=develop
      Fix some code formatting problems and add test cases for the gather and scatter ops.
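The difference between overwrite and accumulate mode described in this commit can be illustrated with a minimal NumPy sketch of a gather gradient. This is not the code from the commit: the function names `gather_grad_overwrite` and `gather_grad_accumulate` are hypothetical, and NumPy stands in for the OpenBLAS/Eigen kernels mentioned above.

```python
import numpy as np

def gather_grad_overwrite(num_rows, index, grad_out):
    """Buggy variant (illustrative): plain fancy assignment.

    When `index` contains duplicates, only one of the competing
    rows of `grad_out` survives in the result.
    """
    grad_in = np.zeros((num_rows,) + grad_out.shape[1:], dtype=grad_out.dtype)
    grad_in[index] = grad_out
    return grad_in

def gather_grad_accumulate(num_rows, index, grad_out):
    """Fixed variant (illustrative): unbuffered in-place addition.

    np.add.at applies every update, so rows of `grad_out` that share
    an index are summed instead of overwriting each other.
    """
    grad_in = np.zeros((num_rows,) + grad_out.shape[1:], dtype=grad_out.dtype)
    np.add.at(grad_in, index, grad_out)
    return grad_in

# Row 1 of the input was gathered twice, so it must receive two gradients.
index = np.array([1, 1, 2])
grad_out = np.array([[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]])

overwritten = gather_grad_overwrite(4, index, grad_out)
accumulated = gather_grad_accumulate(4, index, grad_out)
```

In the accumulated result, row 1 holds the sum of the first two gradient rows, while the overwritten result keeps only one of them.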
    • update load_error_info, test=develop (#18000) · 75fcd292
      lujun committed
      Improve the error message: when loading parameters fails, users are prompted to check whether the model or parameter files are damaged.
    • modify the access level of anakin engine (#18015) · 04ea7cb0
      石晓伟 committed
      test=develop
    • test=develop (#17984) · 2ae8decc
      wawltor committed
      Fix a bug in the sequence_unpad op: the output memory was allocated with a size that did not match the actual memory, so the memory check failed. The fix allocates the output memory at the correct position in the code.
    • Fix edit distance doc (#17947) · 9d6640ff
      ruri committed
      * fix im2sequence padding bug, test=develop
      
      * fix edit_distance, test=develop
      
      * add API.spec,test=develop
    • Add shape not match doc to data layer (#17936) · a1bdf25e
      Zeng Jinle committed
      * add shape not match doc to data layer, test=develop
      
      * fix API.spec md5
      test=develop
    • add deformable psroi pooling (#17827) · 871af28d
      cjt222 committed
      * add deformable psroi pooling
      
      * test=develop
      
      * test=develop
      
      * test=develop
      modify format
      
      * fix bug
      
      * test=develop run ci
      
      * test=develop
      add API.spec
      
      * add test_layers.py
      
      * run ci again
      
      * test=develop
      run ci again
      
      * run ci again
      
      * test=develop
      run ci again
      
      * test=develop
      run ci again
      
      * test=develop
      run ci again
      
      * add space between two lines
      
      * test=develop
      add space between two lines
      
      * test=develop
      add space between lines
      
      * test=develop
      modify comment in nn.py
      
      * test=develop
      add space between two lines
      
      * test=develop
      add space between two lines
      
      * update API.spec
      
      * run ci again
      
      * test=develop
      run ci again
      
      * rerun ci
      
      * test=develop
      rerun ci
      
      * change input shape
      
      * run ci
      
      * test=develop
      run ci
      
      * modify format of nn.py
      
      * test=develop
      
      * test=develop
      
      * test=develop
      update API.spec
      
      * test=develop
      fix API doc
      
      * modify API comment
      
      * modify API comment
      
      * test=develop
      update API.spec
      
      * test=develop
      modify comment
      
      * test=develop
      modify comment
      
      * test=develop
      modify comment
      
      * test=develop
      update API.spec
      
      * test=develop
      modify comment
      
      * test=develop
      add inference in nn.py
      
      * test=develop
      update API.spec
      
      * test=develop
      resolve conflict
      
      * test=develop
      update API.spec
  2. 11 Jun, 2019 7 commits
    • add unfold op (new op),test=develop (#17944) · 40885c22
      SunGaofeng committed
      * add unfold op
      test=develop
      
      * fix divide bug in python3 when calculating output width and height
      test=develop
      
      * add name=None in python api, move redundant code into inline function
      
      * try to trigger ci for this code
      test=develop
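The Python 3 division bug mentioned in this commit is of a common kind: `/` returns a float in Python 3, so an output size computed with it is no longer an integer. The sketch below uses the standard sliding-window output-size formula with symmetric padding, which is an assumption and not taken from the commit; `unfold_output_size` is a hypothetical helper name.

```python
def unfold_output_size(in_size, kernel, stride=1, padding=0, dilation=1):
    """Number of sliding-window positions along one spatial dimension.

    Assumes the same padding on both sides. Floor division (//) keeps
    the result an int under both Python 2 and Python 3.
    """
    effective_kernel = dilation * (kernel - 1) + 1
    return (in_size + 2 * padding - effective_kernel) // stride + 1

# With `/` instead of `//`, this case would evaluate to 3.5 in Python 3.
out_h = unfold_output_size(8, kernel=3, stride=2)
out_w = unfold_output_size(7, kernel=3, stride=2, padding=1)
```

Using `//` makes the rounding explicit, so the computed height and width can be passed directly as shape values.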
    • [MKL-DNN] Thread-Safety for MKL-DNN reusing Part 1 (#17965) · 84bb45c0
      Jacek Czaja committed
      * - removed is_reusing_
      
      * - Added TID to keys for reusing apart from softmax PD
      
      * - compilation fix
      
      * - Yet another compilation fix
      
      * - Batch Norm and Conv adapted
      
      * - Fix to softmax MT
      
      * - Fixes to MT code of MKL-DNN
      
      * - Lint fixes
      
      test=develop
    • Polish codes of old prs. (#17938) · da9143c1
      gongweibao committed
    • Update the Anakin interfaces for content-dnn and MLU (#17890) · bce259e5
      石晓伟 committed
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
    • 410907f6
    • Pipeline Concurrency (#17402) · 969e6378
      hutuxian committed
      Add Pipeline Concurrency Train Mode:
      - Cpp: pipeline_trainer & section_worker
      - Python: PipelineOptimizer
      - Add a new data_feed type: PrivateInstantDataFeed
      - Add a test demo of pipeline trainer and the test model is gnn
      - Do not support win32 now
    • Light mem reuse strategy for inference. (#17925) · 4e8d5a03
      Zhaolong Xing committed
      * fix: RAM usage is high when the model is loaded from memory
      
      test=develop
      
      * light mem reuse
      test=develop
      
      * fix cpplint
      test=develop
  3. 10 Jun, 2019 10 commits
  4. 08 Jun, 2019 1 commit
  5. 07 Jun, 2019 3 commits
    • Add Light-NAS for PaddleSlim (#17679) · 5df65e50
      whs committed
      * Add auto pruning strategy.
      1. Fix compressor.
      2. Enhance graph executor.
      3. Add SAController
      4. Add auto pruning strategy.
      5. Add unit test for auto pruning strategy.
      test=develop
      
      * Init light-nas
      
      * Add light nas.
      
      * Some fix.
      test=develop
      
      * Fix sa controller.
      test=develop
      
      * Fix unit test of light nas.
      test=develop
      
      * Fix setup.py.in and API.spec.
      test=develop
      
      * Fix unit tests.
      1. Fix unit tests on Windows.
      2. Fix package importing in tests directory.
      
      * 1. Remove unused comments.
      2. Expose eval_epoch option.
      3. Remove unused function in search_agent.
      4. Expose max_client_num to yaml file.
      5. Move flops constraint to on_epoch_begin function
      test=develop
      
      * Fix light nas strategy.
      test=develop
      
      * Make controller server stable.
      test=develop
      
      * 1. Add try exception to compressor.
      2. Remove the light-nas unit test for Windows.
      test=develop
      
      * Add comments
      Enhance controller
      test=develop
      
      * Fix comments.
      test=develop
    • Fix cuda/cudnn version detection error (#17853) · 3925bd81
      Zeng Jinle committed
      * fix cuda/cudnn version detection error, test=develop
      
      * fix again, test=develop
    • 14a32bf0
  6. 06 Jun, 2019 10 commits