Commits · d90cb7763c49385838a62e274f8b3b5cf3fe8e48 · MegEngine 天元 / MegEngine

15 Nov, 2021 2 commits
- refactor(sdk): refactor load and run with new framework · 37c1726f
  Committed by Megvii Engine Team on Oct 28, 2021
```
GitOrigin-RevId: b092699dee49eab068e262327b078ce157e36f26
```
- fix(build): use official build script to build nccl · b75658c8
  Committed by Megvii Engine Team on Nov 05, 2021
```
GitOrigin-RevId: 99e2951a6e48488e4841b497496b0b8147852b5d
```
05 Nov, 2021 1 commit
- fix(mgb): correct version of fbs serialization · 7fa5f6f4
  Committed by Megvii Engine Team on Oct 29, 2021
```
GitOrigin-RevId: 2a71d9afc29befae4619edbc146f1bad038d89c2
```
03 Nov, 2021 1 commit

feat(whl/api/lar): enable megengine dll on Windows · 25ec2530

Committed by Megvii Engine Team on Oct 19, 2021

1: reduce the Python whl package size
2: unify the API link logic on all OSes
3: add option MGE_WINDOWS_BUILD_WITH_STATIC_CRT
    --- default: OFF
    --- linking the CRT (VC runtime) statically while building megengine.dll
        makes some CRT APIs crash, for example flush. If you
        build MegEngine statically and do not want to install the CRT,
        you can set MGE_WINDOWS_BUILD_WITH_STATIC_CRT to TRUE
    --- how to install the CRT:
        https://docs.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=msvc-160
        install VC_redist.x64.exe
4: rename megengine_export to megengine_shared (exports only the needed symbols),
   because of runtime symbol conflicts with pytorch

GitOrigin-RevId: 93d8d80f29dcf03c00b18e8b708b26df9232e3df

12 Oct, 2021 1 commit
- fix(cmake/midout): fix cmake midout · a29f1c8c
  Committed by Megvii Engine Team on Oct 09, 2021
```
GitOrigin-RevId: 0fb93deb191ba235cbe312f7e7deaa979e810c9f
```
11 Oct, 2021 1 commit
- feat(dnn/src/x86/matmul): add matmul_6x16 for x86 · d2184af3
  Committed by 泥点无名哥 on Sep 05, 2021
24 Sep, 2021 1 commit

fix(cmake/bazel/clang): remove finite-math-only opt from Ofast, · 5e345043

Committed by Megvii Engine Team on Sep 18, 2021

to keep the build behavior the same between gcc and clang

more detail:
```
 printf("%d\n", std::isnan(std::numeric_limits<float>::quiet_NaN()));
 printf("%d\n", std::isnan(std::nan("1")));
```

linux clang and Android NDK clang build this differently from gcc and
from macos/windows clang/clang-cl:
* clang++-12 -Ofast 1.cc
output is:
0
0

* clang++-12 -Ofast 1.cc -fno-finite-math-only
output is:
1
1

* g++ -Ofast 1.cc
output is:
1
1

GitOrigin-RevId: f0622e2ca019b0072fd6798990a8aba156c8022f

17 Sep, 2021 1 commit
- feat(src): add MGB_CUSTOM_OP macro · 93f70a95
  Committed by Megvii Engine Team on Sep 13, 2021
```
GitOrigin-RevId: beddcf974576af51da24b34b5d8bba4753a2e4f6
```
15 Sep, 2021 1 commit
- fix(mgb): use link.exe when compile windows · bc9c47e7
  Committed by Megvii Engine Team on Sep 14, 2021
```
GitOrigin-RevId: 0b9073acce66bd8d5b6350a732e1b7d8b3605b3c
```
11 Sep, 2021 1 commit
- fix(cmake/env): fix a bug about env set in cmake build mode · 8110bb21
  Committed by Megvii Engine Team on Sep 06, 2021
```
GitOrigin-RevId: 492eaf428a1ab1f8c671730af139fa12cedae3f7
```
18 Aug, 2021 1 commit
- feat(lite): add header licence · f9719828
  Committed by Megvii Engine Team on Aug 04, 2021
```
GitOrigin-RevId: bcd6860e8c8e8cdaaaeae5340284650547fb6a4b
```
07 Aug, 2021 2 commits

feat(cmake/windows/xp/sp2/inference): implement inference on windows xp · ce2f620e

Committed by Megvii Engine Team on Aug 04, 2021

(os version >= sp2) build with cmake

* cmake build support(xp sp2):
(dbg)EXTRA_CMAKE_ARGS="-DMGE_DEPLOY_INFERENCE_ON_WINDOWS_XP_SP2=ON"
./scripts/cmake-build/host_build.sh -m -d

(opt)EXTRA_CMAKE_ARGS="-DMGE_DEPLOY_INFERENCE_ON_WINDOWS_XP_SP2=ON"
./scripts/cmake-build/host_build.sh -m

* cmake build support(xp sp3):
(dbg)EXTRA_CMAKE_ARGS="-DMGE_DEPLOY_INFERENCE_ON_WINDOWS_XP=ON"
./scripts/cmake-build/host_build.sh -m -d

(opt)EXTRA_CMAKE_ARGS="-DMGE_DEPLOY_INFERENCE_ON_WINDOWS_XP=ON"
./scripts/cmake-build/host_build.sh -m

* internal behavior:
MGB_HAVE_THREAD=0 is defined when
-DMGE_DEPLOY_INFERENCE_ON_WINDOWS_XP_SP2=ON is enabled

* refer to
https://docs.microsoft.com/en-us/cpp/build/configuring-programs-for-windows-xp?view=msvc-160
xp sp2 (x86) does not fully support the VC runtime, because KERNEL32.dll does
not implement some base APIs needed by C++ std facilities, for example
std::mutex/std::thread/std::condition_variable. As a workaround, we
disable some MegEngine features in the xp sp2 env, for example multi-threading.

* about DNN_MUTEX/MGB_MUTEX/LITE_MUTEX: if your code will be built into inference
code (even CPU backends), please replace std::mutex with DNN_MUTEX/MGB_MUTEX

* about multi-thread: if your code needs multi-thread support, please
enable it only when MGB_HAVE_THREAD=1

* about test build env status
1: Visual Studio 2019 (MSVC version <= 14.26.28801) ---- pass
2: Visual Studio 2019 (MSVC version > 14.26.28801) ---- failed
   because this 'new' version makes the VCR depend on the win7
   KERNEL32.DLL; this may be fixed in a later Visual Studio 2019 version,
   but we did not test that at this MR merge point
3: Visual Studio 2017   ---------- pass
4: Visual Studio 2014   ---------- pass
GitOrigin-RevId: ea6e1f8b4fea9aa03594e3af8d59708b4cdf7bdc

feat(imperative): add support for quantized conv transpose2d · 3eb0505f
Committed by Megvii Engine Team on Jul 23, 2021
```
GitOrigin-RevId: ffd6431299b2ae008fbdd1eed6458437e6b6a45f
```
19 Jul, 2021 4 commits
- feat(mge/distributed): enable pt shm allreduce · 809d5056
  Committed by Megvii Engine Team on Mar 10, 2021
```
GitOrigin-RevId: 1dd5a02a512b210f2c75afd0062e4bfad1fcdddc
```
- fix(cmake/debug): fix asan do not work at windows env · fb8f1534
  Committed by Megvii Engine Team on Jun 25, 2021
```
GitOrigin-RevId: 6638194fd724f9d3455d2587080142479683cdb1
```
- feat(cmake/debug): misc for build · 62bd6c82
  Committed by Megvii Engine Team on Jun 09, 2021
```
* add asan build option
* fix cpuinfo build opt level
* fix host release build without debug info
* opt "fix lite bazel/cmake symbols MR"
* other misc build opt

GitOrigin-RevId: 6ca286e195cbe7ecf36e34317a2dab81b7b13a5e
```
- chore(bazel/cmake): fix lite bazel/cmake symbols · c7a5c21a
  Committed by Megvii Engine Team on Apr 02, 2021
```
GitOrigin-RevId: 95f2666817d49d10d295e8bfc1b8b61379e17712
```
21 Jun, 2021 1 commit
- feat(aarch64/whl): support aarch64 whl · b07f3728
  Committed by Megvii Engine Team on Jun 03, 2021
```
GitOrigin-RevId: 656a27d62be10321250652904c544fac32504e7c
```
03 Jun, 2021 2 commits

fix(build/windows): fix windows build: · ec65e1f9

Committed by Megvii Engine Team on May 25, 2021

* make the build compatible with clang-cl 11 in the windows env
* fix the cuda/cudnn/trt copy env build failure on windows

GitOrigin-RevId: 7fe2d2c0dcaabf56df1725994a41a15dd09692b6

feat(build): change make to ninja · baa30400
Committed by Megvii Engine Team on May 11, 2021
```
GitOrigin-RevId: ccff72e39a4cd79f4cad0fbb474680a300204301
```
25 May, 2021 2 commits

fix(cmake): fix cmake depends · e4af4225

Committed by Megvii Engine Team on May 10, 2021

reason: some targets call python3 to generate headers.
If PYTHON_EXECUTABLE changes, those headers are invalidated, which
forces a rebuild of most objects, because the headers are
included in most cpp/c files. What's more, these targets do not depend on
the python3 version (python API): the output is the same with python3.5
or python3.x. So switch them from PYTHON_EXECUTABLE to
PYTHON3_EXECUTABLE_WITHOUT_VERSION; then, when
PYTHON_EXECUTABLE/PYTHON_LIBRARY/PYTHON_INCLUDE_DIR change,
the built objects can be reused

GitOrigin-RevId: c2ffe3ead3a295377a691587ab097636562f571e

feat(distributed): support distributed opr for rocm · 1bec737d
Committed by Megvii Engine Team on Nov 18, 2020
```
GitOrigin-RevId: 4840100d07dbaa2b7d8e3e113b444ddf81eeea51
```
08 Apr, 2021 3 commits

fix(dnn): fix cudnn crash when finalize called after cudnn dtor · 04b1a45a
Committed by Megvii Engine Team on Apr 06, 2021
```
GitOrigin-RevId: b0ad639921e8ba1e370696f16a9d87024a83f4c9
```
chore(dotprod): dotprod is enabled by default on the android platform · 94401ce4
Committed by Megvii Engine Team on Mar 10, 2021
```
GitOrigin-RevId: d412108732ca56586da3029f3e27543eb0638e4b
```
feat(cmake): fix android cmake build issue · 610fecc9

Committed by Megvii Engine Team on Apr 01, 2021

detail: the NDK does not support find_package(Threads),
but the NDK does support threads!

FIXME: compnode support for NON-cpu compnodes in envs without thread support

FIXME: may have issues if ANDROID supports CUDA; fix it
if this case needs to be supported
GitOrigin-RevId: 19af74a6af07c541503fc46031d84676cc948b29

01 Apr, 2021 2 commits
- feat(cmake): update for enflame cmake compile · 2fd3fa88
  Committed by Megvii Engine Team on Oct 26, 2020
```
GitOrigin-RevId: 3c3c6b3462da5bcc1116a3416d3ae6b4f2cfd1f6
```
- fix(windows): remove windows build path warning · 63cc5743
  Committed by Megvii Engine Team on Mar 23, 2021
```
GitOrigin-RevId: d6e255b7dc3c82b5389bded2b1dcf6d07f9d2da2
```
23 Mar, 2021 1 commit

fix(build/lite): fix lite rc4_encryptor build failed at new ld env · 89e52281

Committed by Megvii Engine Team on Mar 12, 2021

when cuda is enabled:

error log:
nn-8.0.4-trt-7.2.2.3-libs/TensorRT-7.2.2.3/lib/libnvinfer_static.a(profile.o):
undefined reference to symbol 'dlclose@@GLIBC_2.2.5'
/usr/bin/ld:
/usr/lib/gcc/x86_64-linux-gnu/9/../../../x86_64-linux-gnu/libdl.so:
error adding symbols: DSO missing from command line

ld version:
GNU ld (GNU Binutils for Ubuntu) 2.34, for example on ubuntu20.04

error reason: at new ld, import flags with active zone

libnvinfer_static.a depends on dlclose@@GLIBC_2.2.5, which needs -ldl

ld -o rc4_encryptor xxx.o -ldl libnvinfer_static.a fails

so we add -ldl to the CUDA_LIBS env, which changes the command to
ld -o rc4_encryptor xxx.o  libnvinfer_static.a -ldl

GitOrigin-RevId: adda3acb2d5c0952013d59948717060747dd2f8f

22 Mar, 2021 3 commits

feat(lite): add cmake build for lite · af60ff97
Committed by Megvii Engine Team on Jan 18, 2021
```
GitOrigin-RevId: 2424eb2681fddcdfa0d48dd7ceb41cf7e5e7bee3
```
feat(cmake/arm/cuda): support cmake arm with cuda, · 914af286

Committed by Megvii Engine Team on Feb 03, 2021

* support arm-host and x86-cross-arm
* fix the trt myelib cmake build issue in the 'trt copy env' (about 'copy env', please refer to cb92123f)

about the x86-cross-arm CUDA env:
1: run ./scripts/cmake-build/create_cuda_build_libs.py to prepare the
   cuda/cudnn/trt env (for deb package download info, please refer to create_cuda_build_libs.py)
2: export TRT_ROOT_DIR=xxxxx, which may be created by step 1
3: export CUDNN_ROOT_DIR=xxxx, which may be created by step 1
4: export PATH=xxxx:$PATH, where xxxx needs to be created by step 1 and nvcc has the
   relative path dir: ../targets/sbsa-linux/
GitOrigin-RevId: 440c76052aabe5b07a4b64d126e759f919c257a8

build(cuda): support cu111 build · 0e3a6329
Committed by Megvii Engine Team on Feb 05, 2021
```
GitOrigin-RevId: b3067ba4d5f1225048838e4b2460d858c475b75e
```
23 Jan, 2021 1 commit

fix(cmake/whl): reduce wheel size · ae8b38f6

Committed by Megvii Engine Team on Jan 19, 2021

reason: some versions of trt/cudnn need the
-Wl,--whole-archive attribute to fix a cuda issue:
target A (CMAKE OBJECT, e.g. megbrain/megdnn): depends on trt/cudnn with -Wl,--whole-archive
target B (SHARED LIBS, e.g. libmegengine.so) --> A with `PUBLIC` depends
target C (SHARED LIBS, e.g. _impertive.so) --> B with `PUBLIC` depends,
then ld force-links the fatbin section into C, which is undesired

what's more, the PUBLIC/PRIVATE attribute does not take effect
on OBJECT libraries (megbrain/megdnn)

what's more, megengine/megengine_export have to be PUBLIC for the
mgb/imperative targets, because of SRC/include depends

so we move the cudalib dependency from megbrain/megdnn to
megengine/megengine_export on linux os, and to the final
target on windows os, for example lar on windows

GitOrigin-RevId: b278a69e1cdcc17cfd22da80247af3fe30f6d4e8

15 Jan, 2021 1 commit
- build(mgb/jit): disable halide by default · 495b2003
  Committed by Megvii Engine Team on Jan 08, 2021
```
GitOrigin-RevId: ff7d94ea0a007d4e25d99bf59696a02d3c342a8b
```
11 Jan, 2021 3 commits
- feat(dnn/atlas): add atlas stub · d1fbec4f
  Committed by Megvii Engine Team on Nov 27, 2020
```
GitOrigin-RevId: c63294378e49732a7a41b0b6672a070a518c7882
```
- fix(mgb/core): fix memory management release cambricon var issue · 15dd5e1a
  Committed by Megvii Engine Team on Nov 04, 2020
```
GitOrigin-RevId: abf881978c8fd85b3cfa823fea13045ff06fb88e
```
- build(imperative): clean develop before build imperative and mgb · 4cfedc16
  Committed by Megvii Engine Team on Dec 15, 2020
```
GitOrigin-RevId: 4b56e148d864bfe41d399fd766525acdcd96fbad
```
18 Dec, 2020 1 commit
- feat(imperative): auto generated opdef header and python binding · 69e3e322
  Committed by Megvii Engine Team on Dec 18, 2020
```
GitOrigin-RevId: d2f22ad5fe0b15f45afa1ea31af8874e8b18fef9
```
27 Nov, 2020 1 commit
- chore(cmake): normalize some cmake message level · 61c5c9cf
  Committed by Megvii Engine Team on Oct 20, 2020
```
GitOrigin-RevId: 4550cbfaa6b4e6e4a22daa8217ab12a1482c1e88
```
23 Nov, 2020 1 commit
- feat(mgb/jit): adapt jit mlir backend to new mgb dialect and add typecvt · 404ef808
  Committed by Megvii Engine Team on Nov 13, 2020
```
GitOrigin-RevId: bd1b80c84f5629a4dde5302d775cd8a333ea3cf2
```
20 Nov, 2020 1 commit
- refactor(mlir/dialect): redefine mgb dialect · d793c87c
  Committed by Megvii Engine Team on Nov 11, 2020
```
GitOrigin-RevId: ac0653474655a2b5b89002e888592415fd472190
```

MegEngine 天元 / MegEngine, last synced successfully more than 1 year ago