diff --git a/lite/tutorials/source_en/deploy.md b/lite/tutorials/source_en/deploy.md
index 86496baa68c949ee53c8c6aa5beb15ddd5f5c5bb..b6d16cd30f3beee3342f91a1acab25b951536533 100644
--- a/lite/tutorials/source_en/deploy.md
+++ b/lite/tutorials/source_en/deploy.md
@@ -66,22 +66,25 @@ After the compilation is complete, go to the `mindspore/output` directory of the
 tar -xvf mindspore-lite-{version}-{function}-{OS}.tar.gz
 ```

-Generally, the compiled output files include the following types. The architecture selection affects the types of output files.
-
-> For the x86 architecture, you can obtain the output of the conversion tool; for the Arm 64-bit architecture, you can obtain the output of the `arm64-cpu` inference framework. If `-e gpu` is added, you can obtain the output of the `arm64-cpu` inference framework. The compilation for arm 64-bit is the same as that for arm 32-bit.
+For the x86 architecture, you can obtain the output of both the conversion tool and the inference framework; for the ARM architecture, you only obtain the output of the inference framework.

-| Directory | Description | x86_64 | Arm 64-bit | Arm 32-bit |
-| --- | --- | --- | --- | --- |
-| include | Inference framework header file | No | Yes | Yes |
-| lib | Inference framework dynamic library | Yes | Yes | Yes |
-| benchmark | Benchmark test tool | Yes | Yes | Yes |
-| time_profiler | Time consumption analysis tool at the model network layer | Yes | Yes | Yes |
-| converter | Model conversion tool | Yes | No | No |
-| third_party | Header file and library of the third-party library | Yes | Yes | Yes |
+Generally, the compiled output files include the following types. The selected function affects which of them are packaged.

-The contents of `third party` vary depending on the architecture as follows:
-- x86_64: `protobuf` (Protobuf dynamic library).
-- arm: `flatbuffers` (FlatBuffers header file).
+> For the Arm 64-bit architecture, you can obtain the output of the `arm64-cpu` inference framework. If `-e gpu` is added, you can obtain the output of the `arm64-gpu` inference framework. The same applies to the Arm 32-bit compilation.
+
+| Directory | Description | converter | runtime |
+| --- | --- | --- | --- |
+| include | Inference framework header file | No | Yes |
+| lib | Inference framework dynamic library | No | Yes |
+| benchmark | Benchmark test tool | No | Yes |
+| time_profiler | Time consumption analysis tool at the model network layer | No | Yes |
+| converter | Model conversion tool | Yes | No |
+| third_party | Header file and library of the third-party library | Yes | Yes |
+
+Take the 0.7.0-beta version and CPU builds as an example. The contents of `third party` and `lib` vary with the package as follows:
+- `mindspore-lite-0.7.0-converter-ubuntu`: includes `protobuf` (Protobuf dynamic library).
+- `mindspore-lite-0.7.0-runtime-x86-cpu`: `third party` includes `flatbuffers` (FlatBuffers header files), and `lib` includes `libmindspore-lite.so` (MindSpore Lite dynamic library).
+- `mindspore-lite-0.7.0-runtime-arm64-cpu`: `third party` includes `flatbuffers` (FlatBuffers header files), and `lib` includes `libmindspore-lite.so` (MindSpore Lite dynamic library) and `liboptimize.so`.

 > Before running the tools in the `converter`, `benchmark`, or `time_profiler` directory, you need to configure environment variables and set the paths of the dynamic libraries of MindSpore Lite and Protobuf to the paths of the system dynamic libraries. The following uses the 0.7.0-beta version as an example: `export LD_LIBRARY_PATH=./mindspore-lite-0.7.0/lib:./mindspore-lite-0.7.0/third_party/protobuf/lib:${LD_LIBRARY_PATH}`.
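+
+Putting this together, a minimal setup session might look as follows. This is only a sketch: it assumes the 0.7.0-beta x86 packages named above were produced as `.tar.gz` files following the `mindspore-lite-{version}-{function}-{OS}.tar.gz` pattern; substitute the names of your own build.
+
+```bash
+# Unpack the converter and runtime packages produced by the build.
+tar -xvf mindspore-lite-0.7.0-converter-ubuntu.tar.gz
+tar -xvf mindspore-lite-0.7.0-runtime-x86-cpu.tar.gz
+# Make the MindSpore Lite and Protobuf dynamic libraries visible to the loader.
+export LD_LIBRARY_PATH=./mindspore-lite-0.7.0/lib:./mindspore-lite-0.7.0/third_party/protobuf/lib:${LD_LIBRARY_PATH}
+```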
diff --git a/lite/tutorials/source_en/use/runtime.md b/lite/tutorials/source_en/use/runtime.md
new file mode 100644
index 0000000000000000000000000000000000000000..fe1fa8694aeb3750f199f251f86e68839128dafe
--- /dev/null
+++ b/lite/tutorials/source_en/use/runtime.md
@@ -0,0 +1,3 @@
+# Runtime
+
+<a href="https://gitee.com/mindspore/docs/blob/master/lite/tutorials/source_en/use/runtime.md" target="_blank"><img src="../_static/logo_source.png"></a>
diff --git a/lite/tutorials/source_en/use/runtime_lite.md b/lite/tutorials/source_en/use/runtime_lite.md
deleted file mode 100644
index 834347308ff5fe95b14d70c2720c0c161522414c..0000000000000000000000000000000000000000
--- a/lite/tutorials/source_en/use/runtime_lite.md
+++ /dev/null
@@ -1,3 +0,0 @@
-# Runtime (Lite)
-
-<a href="https://gitee.com/mindspore/docs/blob/master/lite/tutorials/source_en/use/runtime_lite.md" target="_blank"><img src="../_static/logo_source.png"></a>
diff --git a/lite/tutorials/source_zh_cn/deploy.md b/lite/tutorials/source_zh_cn/deploy.md
index a12ccea3f77987e653114203583ce065d8bb7979..dec4cbe357e1bf40809dac285234bc70ef6b6784 100644
--- a/lite/tutorials/source_zh_cn/deploy.md
+++ b/lite/tutorials/source_zh_cn/deploy.md
@@ -73,25 +73,27 @@ MindSpore Lite provides multiple compilation options, and users can choose the
 ```bash
 tar -xvf mindspore-lite-{version}-{function}-{OS}.tar.gz
 ```
+Compiling for x86 yields the output of both the conversion tool `converter` and the inference framework `runtime`; compiling for ARM yields only the inference framework `runtime`.

-The compiled output files generally include the following types; the architecture selection affects which types are produced.
+The output contains the following sub-items, and their contents differ with the function of the package.

-> Compiling for x86 yields the output of the conversion tool. Compiling for ARM64 yields the `arm64-cpu` inference framework output by default; if `-e gpu` is added, the `arm64-gpu` inference framework output is produced. The same applies to ARM32.
+> Compiling for ARM64 yields the `arm64-cpu` inference framework output by default; if `-e gpu` is added, the `arm64-gpu` inference framework output is produced. The same applies to ARM32.

-The compiled output files generally include the following types; the architecture selection affects which types are produced.
+| Directory | Description | converter | runtime |
+| --- | --- | --- | --- |
+| include | Inference framework header files | No | Yes |
+| lib | Inference framework dynamic library | No | Yes |
+| benchmark | Benchmark test tool | No | Yes |
+| time_profiler | Time consumption analysis tool at the model network layer | No | Yes |
+| converter | Model conversion tool | Yes | No |
+| third_party | Header files and libraries of third-party libraries | Yes | Yes |

-| Directory | Description | x86_64 | arm64 | arm32 |
-| --- | --- | --- | --- | --- |
-| include | Inference framework header files | No | Yes | Yes |
-| lib | Inference framework dynamic library | Yes | Yes | Yes |
-| benchmark | Benchmark test tool | Yes | Yes | Yes |
-| time_profiler | Time consumption analysis tool at the model network layer | Yes | Yes | Yes |
-| converter | Model conversion tool | Yes | No | No |
-| third_party | Header files and libraries of third-party libraries | Yes | Yes | Yes |
-
-The contents of `third party` differ between the x86_64 and ARM architectures:
-- x86_64: `protobuf` (Protobuf dynamic library).
-- ARM: `flatbuffers` (FlatBuffers header files).
+Take the 0.7.0-beta version and CPU builds as an example. The contents of `third party` and `lib` vary with the package as follows:
+
+- `mindspore-lite-0.7.0-converter-ubuntu`: includes `protobuf` (Protobuf dynamic library).
+- `mindspore-lite-0.7.0-runtime-x86-cpu`: `third party` includes `flatbuffers` (FlatBuffers header files), and `lib` includes `libmindspore-lite.so` (MindSpore Lite dynamic library).
+- `mindspore-lite-0.7.0-runtime-arm64-cpu`: `third party` includes `flatbuffers` (FlatBuffers header files), and `lib` includes `libmindspore-lite.so` (MindSpore Lite dynamic library) and `liboptimize.so`.

 > Before running the tools in the `converter`, `benchmark`, or `time_profiler` directory, you need to configure environment variables and add the paths of the MindSpore Lite and Protobuf dynamic libraries to the system dynamic-library search path. The following uses the 0.7.0-beta version as an example: `export LD_LIBRARY_PATH=./mindspore-lite-0.7.0/lib:./mindspore-lite-0.7.0/third_party/protobuf/lib:${LD_LIBRARY_PATH}`.
diff --git a/lite/tutorials/source_zh_cn/use/runtime.md b/lite/tutorials/source_zh_cn/use/runtime.md
new file mode 100644
index 0000000000000000000000000000000000000000..f1615bf27713c1f5cfc8236251e4e5e44353d51f
--- /dev/null
+++ b/lite/tutorials/source_zh_cn/use/runtime.md
@@ -0,0 +1,354 @@
+# Runtime Usage Guide
+
+<!-- TOC -->
+
+- [Runtime Usage Guide](#runtime-usage-guide)
+    - [Overview](#overview)
+    - [Reading Models](#reading-models)
+    - [Creating Sessions](#creating-sessions)
+        - [Creating Contexts](#creating-contexts)
+        - [Creating Sessions](#creating-sessions-1)
+        - [Usage Example](#usage-example)
+    - [Graph Compilation](#graph-compilation)
+        - [Variable Dimensions](#variable-dimensions)
+        - [Graph Compilation](#graph-compilation-1)
+    - [Input Data](#input-data)
+        - [Obtaining Input Tensors](#obtaining-input-tensors)
+        - [Copying Data](#copying-data)
+        - [Usage Example](#usage-example-1)
+    - [Graph Execution](#graph-execution)
+        - [Running Sessions](#running-sessions)
+        - [Core Binding](#core-binding)
+        - [Running with Callbacks](#running-with-callbacks)
+        - [Usage Example](#usage-example-2)
+    - [Obtaining Outputs](#obtaining-outputs)
+        - [Obtaining Output Tensors](#obtaining-output-tensors)
+        - [Usage Example](#usage-example-3)
+    - [Obtaining the Version](#obtaining-the-version)
+        - [Usage Example](#usage-example-4)
+
+<!-- /TOC -->
+
+<a href="https://gitee.com/mindspore/docs/blob/master/lite/tutorials/source_zh_cn/use/runtime.md" target="_blank"><img src="../_static/logo_source.png"></a>
+
+## Overview
+
+After a model is converted by MindSpore Lite, the inference execution flow of the model is completed in Runtime.
+
+The overall flow of using Runtime is shown in the following figure:
+
+![img](../images/side_infer_process.png)
+
+Its components and their functions are as follows:
+- `Model`: the model used by MindSpore Lite, which instantiates the list of operator prototypes through user graph construction or direct network loading.
+- `Lite Session`: provides graph compilation and calls the graph executor for inference.
+- `Scheduler`: operator heterogeneous scheduler. It selects a suitable kernel for each operator according to the heterogeneous scheduling policy, constructs the kernel list, and partitions the subgraphs.
+- `Executor`: graph executor. It executes the kernel list and dynamically allocates and releases tensors.
+- `Operator`: operator prototype, which contains the operator attributes and the methods for inferring the shape, data type, and format.
+- `Kernel`: the operator library, which provides the concrete implementation of each operator and its forward computation.
+- `Tensor`: the tensor used by MindSpore Lite, which provides the functions and interfaces for tensor memory operations.
+
+## Reading Models
+
+In MindSpore Lite, a model file is the `.ms` file produced by the model conversion tool. For model inference, the model needs to be loaded from the file system and parsed; these operations are mainly implemented in Model. Model holds model data such as weights and operator attributes.
+
+A model is created from in-memory data through the static `Import` method of the Model class. The `Model` instance returned by the function is a pointer created by `new`; when it is no longer needed, the user needs to release it with `delete`.
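+
+As a minimal sketch of this step, the following reads a converted model from disk and imports it. The file path, the includes, and the error handling are illustrative assumptions, not part of the API description; it is also assumed here that the buffer can be freed once `Import` has parsed it:
+
+```cpp
+#include <fstream>
+#include <iostream>
+#include "include/model.h"
+
+// Read the .ms file from disk into a memory buffer.
+std::ifstream ifs("model.ms", std::ios::in | std::ios::binary);
+if (!ifs.good()) {
+  std::cerr << "Open model file failed" << std::endl;
+  return -1;
+}
+ifs.seekg(0, std::ios::end);
+auto size = static_cast<size_t>(ifs.tellg());
+ifs.seekg(0, std::ios::beg);
+auto *model_buf = new char[size];
+ifs.read(model_buf, size);
+ifs.close();
+// Parse the buffer into a Model. The returned instance is created by new.
+auto *model = mindspore::lite::Model::Import(model_buf, size);
+delete[] model_buf;  // Assumption: the buffer is no longer needed after Import.
+if (model == nullptr) {
+  std::cerr << "Import model failed" << std::endl;
+  return -1;
+}
+// ... compile and run the model ...
+delete model;  // Release the model when it is no longer needed.
+```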
+
+## Creating Sessions
+
+When MindSpore Lite performs inference, the session is the main entry of inference: through a session we can compile and execute graphs.
+
+### Creating Contexts
+
+A context stores the basic configuration parameters required by a session and guides graph compilation and graph execution.
+
+MindSpore Lite supports heterogeneous inference. The preferred backend for inference is specified by `device_ctx_` in `Context` and defaults to CPU. During graph compilation, operator selection and scheduling are performed according to the preferred backend.
+
+MindSpore Lite has a built-in thread pool shared within the process. During inference, `thread_num_` specifies the maximum number of threads in the pool; the default is 2, and at most 4 threads are recommended, otherwise performance may be affected.
+
+MindSpore Lite supports dynamic memory allocation and release. If no `allocator` is specified, a default `allocator` is created at inference time; the memory allocator can also be shared among multiple `Context` instances through the `Context` method.
+
+If the user creates the `Context` with `new`, it needs to be released with `delete` when it is no longer needed. Generally, the `Context` can be released right after the session is created.
+
+### Creating Sessions
+
+With the `Context` created in the previous step, call the static `CreateSession` method of LiteSession to create a `LiteSession`. The `LiteSession` instance returned by the function is a pointer created by `new`; when it is no longer needed, the user needs to release it with `delete`.
+
+### Usage Example
+
+The following sample code demonstrates how to create a `Context` and how to share the memory pool between two `LiteSession` instances:
+
+```cpp
+auto context = new (std::nothrow) lite::Context;
+if (context == nullptr) {
+  MS_LOG(ERROR) << "New context failed while running " << modelName;
+  return RET_ERROR;
+}
+// The preferred backend is GPU, which means, if there is a GPU operator, it will run on the GPU first, otherwise it will run on the CPU.
+context->device_ctx_.type = lite::DT_GPU;
+// The medium core takes priority in thread and core binding methods. This parameter will work in the BindThread interface. For specific binding effect, see the "Run Graph" section.
+context->cpu_bind_mode_ = MID_CPU;
+// Configure the number of worker threads in the thread pool to 2, including the main thread.
+context->thread_num_ = 2;
+// Allocators can be shared across multiple Contexts.
+auto *context2 = new Context(context->thread_num_, context->allocator, context->device_ctx_);
+context2->cpu_bind_mode_ = context->cpu_bind_mode_;
+// Use Context to create Session.
+auto session1 = session::LiteSession::CreateSession(context);
+// After the LiteSession is created, the Context can be released.
+delete (context);
+if (session1 == nullptr) {
+  MS_LOG(ERROR) << "CreateSession failed while running " << modelName;
+  return RET_ERROR;
+}
+// session1 and session2 can share one memory pool.
+auto session2 = session::LiteSession::CreateSession(context2);
+delete (context2);
+if (session2 == nullptr) {
+  MS_LOG(ERROR) << "CreateSession failed while running " << modelName;
+  return RET_ERROR;
+}
+```
+
+## Graph Compilation
+
+### Variable Dimensions
+
+TODO: This feature is under development.
+
+### Graph Compilation
+
+Before graph execution, the `CompileGraph` interface of `LiteSession` needs to be called to compile the graph and further parse the Model instance loaded from the file, mainly performing subgraph partitioning and operator selection and scheduling. This step takes a relatively long time, so it is recommended to create the `LiteSession` once, compile once, and execute multiple times.
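+
+In isolation, the call is a one-liner plus error handling. A minimal sketch, assuming `session` and `model` were created as described in the previous sections (the same pattern appears in the callback example below):
+
+```cpp
+// Compile the graph once; afterwards RunGraph can be called many times.
+auto ret = session->CompileGraph(model);
+if (ret != mindspore::lite::RET_OK) {
+  std::cerr << "CompileGraph failed" << std::endl;
+  // session and model need to be released by users manually.
+  delete session;
+  delete model;
+  return ret;
+}
+```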
+
+## Input Data
+
+### Obtaining Input Tensors
+
+Before graph execution, the input data needs to be copied into the model's input tensors.
+
+MindSpore Lite provides two methods to obtain the model's input tensors:
+
+1. Use the `GetInputsByName` method to obtain, by the name of a model input node, the vector of tensors among the model inputs that are connected to that node (a sketch is given after the notes below).
+2. Use the `GetInputs` method to obtain the vector of all model input tensors directly.
+
+### Copying Data
+
+After the model inputs are obtained, the data needs to be filled into the tensors. Use the `Size` method of `MSTensor` to obtain the size of the data that should be filled in, the `data_type` method to obtain the data type of the tensor, and the `MutableData` method of `MSTensor` to obtain a writable pointer.
+
+### Usage Example
+
+The following sample code demonstrates how to obtain the whole-graph input `MSTensor` from a `LiteSession` and fill model input data into it:
+
+```cpp
+// Assume we have created a LiteSession instance named session.
+auto inputs = session->GetInputs();
+// Assume that the model has only one input tensor.
+auto in_tensor = inputs.front();
+if (in_tensor == nullptr) {
+  std::cerr << "Input tensor is nullptr" << std::endl;
+  return -1;
+}
+// It is omitted that users have read the model input file and generated a section of memory buffer: input_buf, as well as the byte size of input_buf: data_size.
+if (in_tensor->Size() != data_size) {
+  std::cerr << "Input data size does not match model input" << std::endl;
+  return -1;
+}
+auto *in_data = in_tensor->MutableData();
+if (in_data == nullptr) {
+  std::cerr << "Data of in_tensor is nullptr" << std::endl;
+  return -1;
+}
+memcpy(in_data, input_buf, data_size);
+// Users need to free input_buf.
+// The elements in the inputs are managed by MindSpore Lite so that users do not need to free inputs.
+```
+
+Note the following:
+- The data layout in the model input tensors of MindSpore Lite must be NHWC.
+- The model input `input_buf` is read from disk by the user; after it is copied into the model input tensor, the user needs to free `input_buf`.
+- The vectors returned by the `GetInputs` and `GetInputsByName` methods do not need to be released by the user.
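+
+The name-based lookup of method 1 works analogously. A sketch, in which `input_node_name_0` is a placeholder for a real input node name in your model:
+
+```cpp
+// Obtain the input tensors connected to a specific input node by its name.
+auto input_vec = session->GetInputsByName("input_node_name_0");
+if (input_vec.empty()) {
+  std::cerr << "No input tensor found for this node name" << std::endl;
+  return -1;
+}
+// Fill the tensor as shown above; the returned vector must not be freed by the user.
+auto in_tensor = input_vec.front();
+```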
+
+## Graph Execution
+
+### Running Sessions
+
+After graph compilation, a MindSpore Lite session can run model inference with the `RunGraph` method of `LiteSession`.
+
+### Core Binding
+
+The built-in thread pool of MindSpore Lite supports core binding and unbinding. By calling the `BindThread` interface, the worker threads in the thread pool can be bound to specified CPU cores for performance analysis. Core binding is related to the context specified when the `LiteSession` is created: it sets the affinity between threads and CPUs according to the binding policy in the context.
+
+Note that core binding is an affinity operation; it is affected by system scheduling and does not guarantee that the threads are bound to the specified CPU cores. Moreover, after binding, the unbinding operation needs to be performed when the code finishes executing. An example is as follows:
+
+```cpp
+// Assume we have created a LiteSession instance named session.
+session->BindThread(true);
+auto ret = session->RunGraph();
+if (ret != mindspore::lite::RET_OK) {
+  std::cerr << "RunGraph failed" << std::endl;
+  delete session;
+  return -1;
+}
+session->BindThread(false);
+```
+
+> There are two options for the binding parameter: big cores first and middle cores first.
+> Big and middle cores are determined by CPU core frequency rather than by CPU architecture, so even on architectures without a big/middle/little core distinction, cores can be classified under this rule.
+> Big cores first means that the threads in the thread pool are bound starting from the highest-frequency core: the first thread is bound to the highest-frequency core, the second thread to the second-highest-frequency core, and so on.
+> For middle cores first, the definition of middle cores is empirical: by default, the cores with the third- and fourth-highest frequencies are middle cores. With this policy, threads are bound to the middle cores first; when there are not enough middle cores, the remaining threads are bound to the little cores.
+
+### Running with Callbacks
+
+MindSpore Lite can pass two `KernelCallBack` function pointers to `RunGraph` to run the model with callbacks. Compared with ordinary graph execution, running with callbacks can obtain extra information during the run to help developers with performance analysis, bug debugging, and so on. The extra information includes:
+- the name of the node currently being run
+- the input and output tensors before the current node is inferred
+- the input and output tensors after the current node is inferred
+
+### Usage Example
+
+The following sample code demonstrates compiling a graph with `LiteSession` and defining two callback functions as the pre-callback pointer and the post-callback pointer, which are passed to the `RunGraph` interface for callback inference. It also demonstrates the scenario of compiling a graph once and executing it multiple times:
+
+```cpp
+// Assume we have created a LiteSession instance named session and a Model instance named model before.
+// The methods of creating model and session can refer to the "Reading Models" and "Creating Sessions" sections.
+auto ret = session->CompileGraph(model);
+if (ret != RET_OK) {
+  std::cerr << "CompileGraph failed" << std::endl;
+  // session and model need to be released by users manually.
+  delete (session);
+  delete (model);
+  return ret;
+}
+// Copy input data into the input tensor. Users can refer to the "Input Data" section. We use random data here.
+auto inputs = session->GetInputs();
+for (auto in_tensor : inputs) {
+  if (in_tensor == nullptr) {
+    std::cerr << "Input tensor is nullptr" << std::endl;
+    return -1;
+  }
+  // When calling the MutableData method, if the data in MSTensor is not allocated, it will be malloced. After allocation, the data in MSTensor can be considered as random data.
+  (void) in_tensor->MutableData();
+}
+// Definition of callback function before forwarding operator.
+auto before_call_back_ = [&](const std::vector<mindspore::tensor::MSTensor *> &before_inputs,
+                             const std::vector<mindspore::tensor::MSTensor *> &before_outputs,
+                             const session::CallBackParam &call_param) {
+  std::cout << "Before forwarding " << call_param.name_callback_param << std::endl;
+  return true;
+};
+// Definition of callback function after forwarding operator.
+auto after_call_back_ = [&](const std::vector<mindspore::tensor::MSTensor *> &after_inputs,
+                            const std::vector<mindspore::tensor::MSTensor *> &after_outputs,
+                            const session::CallBackParam &call_param) {
+  std::cout << "After forwarding " << call_param.name_callback_param << std::endl;
+  return true;
+};
+// Call the callback functions when performing the model inference process.
+ret = session->RunGraph(before_call_back_, after_call_back_);
+if (ret != RET_OK) {
+  std::cerr << "Run graph failed." << std::endl;
+  return RET_ERROR;
+}
+// CompileGraph costs much time, so a better solution is to call CompileGraph only once and RunGraph many times.
+for (size_t i = 0; i < 10; i++) {
+  ret = session->RunGraph();
+  if (ret != RET_OK) {
+    std::cerr << "Run graph failed." << std::endl;
+    return RET_ERROR;
+  }
+}
+// session and model need to be released by users manually.
+delete (session);
+delete (model);
+```
+
+## Obtaining Outputs
+
+### Obtaining Output Tensors
+
+After inference is executed, MindSpore Lite can obtain the inference results of the model.
+
+MindSpore Lite provides four methods to obtain the model's output `MSTensor`:
+1. Use the `GetOutputsByNodeName` method to obtain, by the name of a model output node, the vector of tensors among the model output `MSTensor` that are connected to that node.
+2. Use the `GetOutputMapByNode` method to directly obtain a map from the names of all model output nodes to the model output `MSTensor` connected to each node.
+3. Use the `GetOutputByTensorName` method to obtain the corresponding model output `MSTensor` by the name of a model output tensor.
+4. Use the `GetOutputMapByTensor` method to directly obtain a map from the names of all model output `MSTensor` to the `MSTensor` pointers.
+
+After the model output tensors are obtained, the data needs to be read from them. Use the `Size` method of `MSTensor` to obtain the size of the data, the `data_type` method to obtain the data type of the `MSTensor`, and the `MutableData` method of `MSTensor` to obtain the readable and writable memory pointer.
+
+### Usage Example
+
+The following sample code demonstrates obtaining the output `MSTensor` with the `GetOutputMapByNode` interface and printing the first ten values, or all values, of each output `MSTensor`:
+
+```cpp
+// Assume we have created a LiteSession instance named session before.
+auto output_map = session->GetOutputMapByNode();
+// Assume that the model has only one output node.
+auto out_node_iter = output_map.begin();
+std::string name = out_node_iter->first;
+// Assume that the unique output node has only one output tensor.
+auto out_tensor = out_node_iter->second.front();
+if (out_tensor == nullptr) {
+  std::cerr << "Output tensor is nullptr" << std::endl;
+  return -1;
+}
+// Assume that the data format of output data is float 32.
+if (out_tensor->data_type() != mindspore::TypeId::kNumberTypeFloat32) {
+  std::cerr << "Output of lenet should be in float32" << std::endl;
+  return -1;
+}
+auto *out_data = reinterpret_cast<float *>(out_tensor->MutableData());
+if (out_data == nullptr) {
+  std::cerr << "Data of out_tensor is nullptr" << std::endl;
+  return -1;
+}
+// Print the first 10 float data or all output data of the output tensor.
+std::cout << "Output data: ";
+for (size_t i = 0; i < 10 && i < static_cast<size_t>(out_tensor->ElementsNum()); i++) {
+  std::cout << " " << out_data[i];
+}
+std::cout << std::endl;
+// The elements in outputs do not need to be freed by users, because outputs are managed by MindSpore Lite.
+```
+
+Note that the vectors or maps returned by the `GetOutputsByNodeName`, `GetOutputMapByNode`, `GetOutputByTensorName`, and `GetOutputMapByTensor` methods do not need to be released by the user.
+
+The following sample code demonstrates obtaining the output `MSTensor` with the `GetOutputsByNodeName` interface:
+
+```cpp
+// Assume we have created a LiteSession instance named session before.
+// Assume that the model has an output node named output_node_name_0.
+auto output_vec = session->GetOutputsByNodeName("output_node_name_0");
+// Assume that the output node named output_node_name_0 has only one output tensor.
+auto out_tensor = output_vec.front();
+if (out_tensor == nullptr) {
+  std::cerr << "Output tensor is nullptr" << std::endl;
+  return -1;
+}
+```
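+
+Method 3 looks up a single output tensor directly by its tensor name. A sketch, in which `output_tensor_name_0` is a placeholder for a real output tensor name in your model:
+
+```cpp
+// Assume we have created a LiteSession instance named session before.
+auto out_tensor = session->GetOutputByTensorName("output_tensor_name_0");
+if (out_tensor == nullptr) {
+  std::cerr << "Output tensor is nullptr" << std::endl;
+  return -1;
+}
+```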
+
+The following sample code demonstrates obtaining the output `MSTensor` with the `GetOutputMapByTensor` interface:
+
+```cpp
+// Assume we have created a LiteSession instance named session before.
+auto output_map = session->GetOutputMapByTensor();
+// Assume that the model has only one output tensor.
+auto out_tensor = output_map.begin()->second;
+if (out_tensor == nullptr) {
+  std::cerr << "Output tensor is nullptr" << std::endl;
+  return -1;
+}
+```
+
+## Obtaining the Version
+
+MindSpore Lite provides the `Version` method, declared in the `include/version.h` header file, to obtain the version; calling this method returns the version string.
+
+### Usage Example
+
+The following code demonstrates how to obtain the version of MindSpore Lite:
+```cpp
+#include "include/version.h"
+std::string version = mindspore::lite::Version();
+```
diff --git a/lite/tutorials/source_zh_cn/use/runtime_lite.md b/lite/tutorials/source_zh_cn/use/runtime_lite.md
deleted file mode 100644
index 3c5162b116c3bc64c3b136abf21453d5c934855c..0000000000000000000000000000000000000000
--- a/lite/tutorials/source_zh_cn/use/runtime_lite.md
+++ /dev/null
@@ -1,11 +0,0 @@
-# Runtime Usage Guide (Lite)
-
-<!-- TOC -->
-
-- [Runtime Usage Guide (Lite)](#runtime-usage-guide-lite)
-
-<!-- /TOC -->
-
-<a href="https://gitee.com/mindspore/docs/blob/master/lite/tutorials/source_zh_cn/use/runtime_lite.md" target="_blank"><img src="../_static/logo_source.png"></a>
-
-