init

e128a2ed · feilong · a6b22761
17 changed files
--- a/.gitignore
+++ b/.gitignore
--- a/README.md
+++ b/README.md
 # skill_tree_neo4j
 Neo4J graph database skill tree
\ No newline at end of file
+# skill_tree_neo4j
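+This project is the open editing repository for the Neo4J track of the [CSDN Skill Forest](https://codechina.csdn.net/csdn/skill_tree).
+The repository is aimed at learners and organizes Neo4J knowledge points as a tree.
+For convenience, the repository ships a small set of Python scripts that process and maintain the skill tree; most users can ignore them. Project contributors
+mainly maintain the skill tree's directory structure and the associated exercises.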
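+## Directory structure
+The `data` directory contains a 3-level structure of `difficulty node`/`chapter node`/`knowledge node` directories. Anything deeper than 3 levels is configured through the `children` field of the `config.json` in the level-3 directory.
+* Skill tree `skeleton file`:
+    * Location: `data/tree.json`
+    * Note: this file is generated by running `python main.py`; do not edit it by hand
+* Skill tree `root node` configuration file:
+    * Location: `data/config.json`
+    * Note: fields such as the keywords can be edited; the `node_id` field is generated, do not edit it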
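+* Skill tree `difficulty node`:
+    * Location: `data/xxx`, for example `data/1.Neo4J初阶`
+    * Notes:
+        * Each skill tree has 3 difficulty levels. The numeric prefix on the directory name is required; it keeps the directories in order
+        * Each directory has a `config.json` where keywords can be configured; the `node_id` field is generated, do not edit it
+* Skill tree `chapter node`:
+    * Location: `data/xxx/xxx`, for example `data/1.Neo4J初阶/1.预备知识`
+    * Notes:
+        * Each difficulty level of a skill tree has n chapters. The numeric prefix on the directory name is required; it keeps the directories in order
+        * Each directory has a `config.json` where keywords can be configured; the `node_id` field is generated, do not edit it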
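+* Skill tree `knowledge node`:
+    * Location: `data/xxx/xxx/xxx`, for example `data/1.Neo4J初阶/1.预备知识/1.Neo4J简介`
+    * Notes:
+        * Each chapter of a skill tree has `n` knowledge nodes. The numeric prefix on the directory name is required; it keeps the directories in order
+        * Each directory has a `config.json`
+            * `node_id` is generated, do not edit it
+            * `keywords` holds the configurable keywords
+            * `children` configures the subtree below this `knowledge node`, described later
+            * `export` configures the exercises exported by this `knowledge node`, described later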
+## `knowledge node` subtree structure
+For example, the subtree of this knowledge node can be configured in `data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/config.json`:
+```json
+{
+    "children": [
+        {
+            "什么是图数据库": {
+                "keywords": [
+                    "图数据库"
+                ],
+                "children": [
+                    {
+                        "图论": {
+                            "keywords": [
+                                "节点",
+                                "边",
+                                "关系"
+                            ],
+                            "children": []
+                        }
+                    },
+                    {
+                        "RDF": {
+                            "keywords": [],
+                            "children": []
+                        }
+                    },
+                    {
+                        "属性图": {
+                            "keywords": [],
+                            "children": []
+                        }
+                    },
+                    {
+                        "原生图": {
+                            "keywords": [
+                            ],
+                            "children": []
+                        }
+                    }
+                ]
+            },
+            "什么时候需要图数据库": {
+                "keywords": [
+                    "图数据库"
+                ],
+                "children": []
+            },
+            "Neo4J图数据库概览": {
+                "keywords": [
+                    "图数据库"
+                ],
+                "children": []
+            }
+        }
+    ]
+}
+```
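+During later processing, this content is attached to the corresponding nodes.
+Normally only the fixed-depth directory structure needs to be maintained; most knowledge points do not require maintaining `children`.
+## Editing the exported exercises of a `knowledge node`
+For example, the exercises exported by this knowledge node are configured in `data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/config.json`: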
+```json
+{
+    "export": [
+        "helloworld.json"
+    ]
+}
+```
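+Each file name points to the corresponding exercise definition JSON file.
+## Editing the option configuration of a `knowledge node`'s exported exercises
+First, add the exercise configuration named in `export` above, for example by adding a `helloworld.json` file under `data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/`: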
+```json
+{
+    "type": "code_options",
+    "author": "幻灰龙",
+    "source": "helloworld.md",
+    "notebook_enable": false
+}
+```
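+Where:
+* The `type` field is currently always `code_options`.
+* `notebook_enable` is always `false` for the Neo4J skill tree.
+* The `source` field names the `markdown` file in which the exercise is written.
+Now create a `helloworld.md` with the following content: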
+````markdown
+# Hello World
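+Neo4J queries can be written directly in the shell or issued over a connection from languages such as Java and Python. Which of the following is not a Neo4J graph database query?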
+## 答案
+```sql
+SELECT name FROM Person
+LEFT JOIN Person_Department
+  ON Person.Id = Person_Department.PersonId
+LEFT JOIN Department
+  ON Department.Id = Person_Department.DepartmentId
+WHERE Department.name = "IT Department"
+```
+## 选项
+### Cypher query
+```sql
+MATCH (p:Person)-[:WORKS_AT]->(d:Dept)
+WHERE d.name = "IT Department"
+RETURN p.name
+```
+### Querying Neo4J over JDBC
+```java
+Connection con = DriverManager.getConnection("jdbc:neo4j://localhost:7474/");
+String query =
+    "MATCH (:Person {name:{1}})-[:EMPLOYEE]-(d:Department) RETURN d.name as dept";
+try (PreparedStatement stmt = con.prepareStatement(query)) {
+    stmt.setString(1,"John");
+    ResultSet rs = stmt.executeQuery();
+    while(rs.next()) {
+        String department = rs.getString("dept");
+        // ....
+    }
+}
+```
+### Querying from Python
+```python
+from neo4j import GraphDatabase
+class HelloWorldExample:
+    def __init__(self, uri, user, password):
+        self.driver = GraphDatabase.driver(uri, auth=(user, password))
+    def close(self):
+        self.driver.close()
+    def print_greeting(self, message):
+        with self.driver.session() as session:
+            greeting = session.write_transaction(self._create_and_return_greeting, message)
+            print(greeting)
+    @staticmethod
+    def _create_and_return_greeting(tx, message):
+        result = tx.run("CREATE (a:Greeting) "
+                        "SET a.message = $message "
+                        "RETURN a.message + ', from node ' + id(a)", message=message)
+        return result.single()[0]
+if __name__ == "__main__":
+    greeter = HelloWorldExample("bolt://localhost:7687", "neo4j", "password")
+    greeter.print_greeting("hello, world")
+    greeter.close()
+```
+````
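+This is the most basic exercise structure: a title, an answer, and options. The level-1 title and the level-2 headings `答案` (answer) and `选项` (options) must be written exactly, because the interpreter reads them. The level-3 option headings are ignored and do not appear in the generated exercise, so they can carry editorial notes such as "this option does not close the file connection" or "type error".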
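+## Skill tree synthesis
+Running `python main.py` from the repository root synthesizes the skill tree file `data/tree.json`.
+* During synthesis, each directory is checked automatically:
+  * whether it has a `config.json`; if not, one is created
+  * whether `node_id` exists in `config.json`; if not, it is generated
+  * whether the directory numbers are consecutive; if not, the directories are renumbered
+* During synthesis, the exercise configurations listed under `export` in each knowledge-node `config.json` are also checked for an `exercise_id` field, which is generated if missing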
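The renumbering check listed above can be illustrated without touching the file system. The sketch below is an illustration under stated assumptions, not the repository's code: `plan_renames` is a hypothetical helper that takes directory names of the form `<number>.<title>` and returns the renames needed to make the prefixes run 1..n while keeping the existing order.

```python
import re


def plan_renames(dir_names):
    """Return (old_name, new_name) pairs that make the prefixes run 1..n.

    Hypothetical helper for illustration; it only plans renames and never
    touches the file system.
    """
    parsed = []
    for name in dir_names:
        match = re.fullmatch(r"(\d+)\.(.+)", name)
        if match is None:
            raise ValueError(f"missing numeric prefix: {name!r}")
        parsed.append((int(match.group(1)), match.group(2), name))
    # Sort by the existing number so the relative order is preserved.
    parsed.sort(key=lambda item: item[0])
    renames = []
    for position, (number, title, name) in enumerate(parsed, start=1):
        if number != position:
            renames.append((name, f"{position}.{title}"))
    return renames


if __name__ == "__main__":
    for old, new in plan_renames(["1.Neo4J初阶", "3.Neo4J高阶"]):
        print(f"rename [{old}] to [{new}]")
```

Planning the renames separately from applying them keeps the check easy to test; a caller could then pass each pair to `os.rename`.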
--- a/data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/config.json
+++ b/data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/config.json
+{
+    "node_id": "neo4j-0261ccb903994df281a2ec606b5d8c9e",
+    "keywords": [],
+    "children": [
+        {
+            "什么是图数据库": {
+                "keywords": [
+                    "图数据库"
+                ],
+                "children": [
+                    {
+                        "图论": {
+                            "keywords": [
+                                "节点",
+                                "边",
+                                "关系"
+                            ],
+                            "children": []
+                        }
+                    },
+                    {
+                        "RDF": {
+                            "keywords": [],
+                            "children": []
+                        }
+                    },
+                    {
+                        "属性图": {
+                            "keywords": [],
+                            "children": []
+                        }
+                    },
+                    {
+                        "原生图": {
+                            "keywords": [
+                            ],
+                            "children": []
+                        }
+                    }
+                ]
+            },
+            "什么时候需要图数据库": {
+                "keywords": [
+                    "图数据库"
+                ],
+                "children": []
+            },
+            "Neo4J图数据库概览": {
+                "keywords": [
+                    "图数据库"
+                ],
+                "children": []
+            }
+        }
+    ],
+    "export": [
+        "helloworld.json"
+    ]
+}
\ No newline at end of file
--- a/data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/helloworld.json
+++ b/data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/helloworld.json
+{
+    "type": "code_options",
+    "author": "幻灰龙",
+    "source": "helloworld.md",
+    "notebook_enable": false
+}
\ No newline at end of file
--- a/data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/helloworld.md
+++ b/data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/helloworld.md
+# Hello World
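+Neo4J queries can be written directly in the shell or issued over a connection from languages such as Java and Python. Which of the following is <span style="color:red">not</span> a Neo4J graph database query?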
+## 答案
+```sql
+SELECT name FROM Person
+LEFT JOIN Person_Department
+  ON Person.Id = Person_Department.PersonId
+LEFT JOIN Department
+  ON Department.Id = Person_Department.DepartmentId
+WHERE Department.name = "IT Department"
+```
+## 选项
+### Cypher query
+```sql
+MATCH (p:Person)-[:WORKS_AT]->(d:Dept)
+WHERE d.name = "IT Department"
+RETURN p.name
+```
+### Querying Neo4J over JDBC
+```java
+Connection con = DriverManager.getConnection("jdbc:neo4j://localhost:7474/");
+String query =
+    "MATCH (:Person {name:{1}})-[:EMPLOYEE]-(d:Department) RETURN d.name as dept";
+try (PreparedStatement stmt = con.prepareStatement(query)) {
+    stmt.setString(1,"John");
+    ResultSet rs = stmt.executeQuery();
+    while(rs.next()) {
+        String department = rs.getString("dept");
+        // ....
+    }
+}
+```
+### Querying from Python
+```python
+from neo4j import GraphDatabase
+class HelloWorldExample:
+    def __init__(self, uri, user, password):
+        self.driver = GraphDatabase.driver(uri, auth=(user, password))
+    def close(self):
+        self.driver.close()
+    def print_greeting(self, message):
+        with self.driver.session() as session:
+            greeting = session.write_transaction(self._create_and_return_greeting, message)
+            print(greeting)
+    @staticmethod
+    def _create_and_return_greeting(tx, message):
+        result = tx.run("CREATE (a:Greeting) "
+                        "SET a.message = $message "
+                        "RETURN a.message + ', from node ' + id(a)", message=message)
+        return result.single()[0]
+if __name__ == "__main__":
+    greeter = HelloWorldExample("bolt://localhost:7687", "neo4j", "password")
+    greeter.print_greeting("hello, world")
+    greeter.close()
+```
\ No newline at end of file
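The exercise file above follows the layout the README describes: a level-1 title, a `## 答案` section, and a `## 选项` section whose `###` headings are ignored. As a rough sketch of how such a file could be split into its parts (the function name `parse_exercise` and the returned dictionary shape are assumptions for illustration, not the project's actual interpreter):

```python
FENCE = "`" * 3  # built at runtime so this example contains no literal fence
ANSWER_HEADING = "## 答案"
OPTIONS_HEADING = "## 选项"


def parse_exercise(markdown_text):
    """Split an exercise into title, question, answer and a list of options."""
    title = None
    sections = {"question": [], "answer": []}
    options = []
    current = "question"
    in_fence = False
    for line in markdown_text.splitlines():
        if line.startswith(FENCE):
            in_fence = not in_fence
        # Lines inside a code fence are never treated as headings.
        is_heading = not in_fence and line.startswith("#")
        if is_heading and title is None and line.startswith("# "):
            title = line[2:].strip()
        elif is_heading and line.strip() == ANSWER_HEADING:
            current = "answer"
        elif is_heading and line.strip() == OPTIONS_HEADING:
            current = "options"
        elif is_heading and current == "options" and line.startswith("### "):
            options.append([])  # new option; the heading text is dropped
        elif current == "options":
            if options:
                options[-1].append(line)
        else:
            sections[current].append(line)
    return {
        "title": title,
        "question": "\n".join(sections["question"]).strip(),
        "answer": "\n".join(sections["answer"]).strip(),
        "options": ["\n".join(lines).strip() for lines in options],
    }


sample = "\n".join([
    "# Hello World",
    "Which query is not a Neo4J query?",
    ANSWER_HEADING,
    FENCE + "sql",
    "SELECT name FROM Person",
    FENCE,
    OPTIONS_HEADING,
    "### editor note, ignored",
    FENCE + "sql",
    "MATCH (p:Person) RETURN p.name",
    FENCE,
])
exercise = parse_exercise(sample)
print(exercise["title"], len(exercise["options"]))
```

Tracking whether the parser is inside a code fence matters here, because the Python option contains lines that would otherwise look like headings or comments.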
--- a/data/1.Neo4J初阶/1.预备知识/2.安装和启动/config.json
+++ b/data/1.Neo4J初阶/1.预备知识/2.安装和启动/config.json
+{
+  "node_id": "neo4j-0ee8cb8ccd6f4a59bc20f9ccbf7d627e",
+  "keywords": [],
+  "children": [],
+  "export": []
+}
\ No newline at end of file
--- a/data/1.Neo4J初阶/1.预备知识/config.json
+++ b/data/1.Neo4J初阶/1.预备知识/config.json
+{
+  "node_id": "neo4j-a42252d5f8c24548bde127a385850a76",
+  "keywords": []
+}
\ No newline at end of file
--- a/data/1.Neo4J初阶/2.Cypher查询语言/config.json
+++ b/data/1.Neo4J初阶/2.Cypher查询语言/config.json
+{
+  "node_id": "neo4j-9ec466f015f9422dab2b6b05f0581a8b",
+  "keywords": []
+}
\ No newline at end of file
--- a/data/1.Neo4J初阶/config.json
+++ b/data/1.Neo4J初阶/config.json
+{
+  "node_id": "neo4j-5e171793d38e49e784f544a9f80d09cb",
+  "keywords": []
+}
\ No newline at end of file
--- a/data/2.Neo4J中阶/config.json
+++ b/data/2.Neo4J中阶/config.json
+{
+  "node_id": "neo4j-298b201de8044453a2d6e8d02e64962d",
+  "keywords": []
+}
\ No newline at end of file
--- a/data/3.Neo4J高阶/config.json
+++ b/data/3.Neo4J高阶/config.json
+{
+  "node_id": "neo4j-b05c040c3bfe49e29f17397e9e16c7d4",
+  "keywords": []
+}
\ No newline at end of file
--- a/data/config.json
+++ b/data/config.json
+{
+  "tree_name": "neo4j",
+  "keywords": [],
+  "node_id": "neo4j-50ecfa9d2d0f4012ae80a3656c0756ab"
+}
\ No newline at end of file
--- a/data/tree.json
+++ b/data/tree.json
+{
+  "neo4j": {
+    "node_id": "neo4j-50ecfa9d2d0f4012ae80a3656c0756ab",
+    "keywords": [],
+    "children": [
+      {
+        "Neo4J初阶": {
+          "node_id": "neo4j-5e171793d38e49e784f544a9f80d09cb",
+          "keywords": [],
+          "children": [
+            {
+              "预备知识": {
+                "node_id": "neo4j-a42252d5f8c24548bde127a385850a76",
+                "keywords": [],
+                "children": [
+                  {
+                    "Neo4J简介": {
+                      "node_id": "neo4j-0261ccb903994df281a2ec606b5d8c9e",
+                      "keywords": [],
+                      "children": []
+                    }
+                  },
+                  {
+                    "安装和启动": {
+                      "node_id": "neo4j-0ee8cb8ccd6f4a59bc20f9ccbf7d627e",
+                      "keywords": [],
+                      "children": []
+                    }
+                  }
+                ]
+              }
+            },
+            {
+              "Cypher查询语言": {
+                "node_id": "neo4j-9ec466f015f9422dab2b6b05f0581a8b",
+                "keywords": [],
+                "children": []
+              }
+            }
+          ]
+        }
+      },
+      {
+        "Neo4J中阶": {
+          "node_id": "neo4j-298b201de8044453a2d6e8d02e64962d",
+          "keywords": [],
+          "children": []
+        }
+      },
+      {
+        "Neo4J高阶": {
+          "node_id": "neo4j-b05c040c3bfe49e29f17397e9e16c7d4",
+          "keywords": [],
+          "children": []
+        }
+      }
+    ]
+  }
+}
\ No newline at end of file
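The generated `tree.json` nests every node as a single-key object inside its parent's `children` list. A minimal sketch of walking that shape and checking that no `node_id` is reused is shown below; `iter_nodes` and `duplicate_node_ids` are hypothetical helper names, and the embedded sample uses shortened placeholder ids rather than the real ones.

```python
import json


def iter_nodes(tree, path=()):
    """Yield (path, node) for every node in a tree.json-style mapping."""
    for title, node in tree.items():
        node_path = path + (title,)
        yield node_path, node
        for child in node.get("children", []):
            yield from iter_nodes(child, node_path)


def duplicate_node_ids(tree):
    """Return the set of node_id values that occur more than once."""
    seen, duplicates = set(), set()
    for _, node in iter_nodes(tree):
        node_id = node.get("node_id")
        if node_id is None:
            continue  # subtree entries from `children` configs carry no id
        if node_id in seen:
            duplicates.add(node_id)
        seen.add(node_id)
    return duplicates


# Placeholder ids for illustration only.
tree = json.loads("""
{
  "neo4j": {
    "node_id": "neo4j-root",
    "keywords": [],
    "children": [
      {"Neo4J初阶": {"node_id": "neo4j-a", "keywords": [], "children": []}},
      {"Neo4J中阶": {"node_id": "neo4j-b", "keywords": [], "children": []}}
    ]
  }
}
""")

for node_path, node in iter_nodes(tree):
    print("/".join(node_path), node["node_id"])
print(duplicate_node_ids(tree))
```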
--- a/main.py
+++ b/main.py
+# -*- coding: utf-8 -*-
+from src.tree import TreeWalker
+if __name__ == '__main__':
+    walker = TreeWalker("data", "neo4j", "Neo4J")
+    walker.walk()
--- a/requirement.txt
+++ b/requirement.txt
+uuid==1.30
\ No newline at end of file
--- a/src/__pycache__/tree.cpython-38.pyc
+++ b/src/__pycache__/tree.cpython-38.pyc
--- a/src/tree.py
+++ b/src/tree.py
+# -*- coding: utf-8 -*-
+import logging
+from genericpath import exists
+import json
+import os
+import uuid
+import sys
+import re
+id_set = set()
+logger = logging.getLogger(__name__)
+logger.setLevel(logging.INFO)
+handler = logging.StreamHandler(sys.stdout)
+formatter = logging.Formatter('%(asctime)s - %(levelname)s - %(message)s')
+handler.setFormatter(formatter)
+logger.addHandler(handler)
+def load_json(p):
+    with open(p, 'r', encoding='utf-8') as f:
+        return json.loads(f.read())
+def dump_json(p, j, exist_ok=False, override=False):
+    if os.path.exists(p):
+        if exist_ok:
+            if not override:
+                return
+        else:
+            logger.error(f"{p} already exists")
+            sys.exit(1)
+    with open(p, 'w+', encoding='utf-8') as f:
+        f.write(json.dumps(j, indent=2, ensure_ascii=False))
+def ensure_config(path):
+    config_path = os.path.join(path, "config.json")
+    if not os.path.exists(config_path):
+        node = {"keywords": []}
+        dump_json(config_path, node, exist_ok=True, override=False)
+        return node
+    else:
+        return load_json(config_path)
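+def parse_no_name(d):
+    # Split a "<number>.<name>" directory name into its two parts.
+    p = r'(\d+)\.(.*)'
+    m = re.search(p, d)
+    if m is None:
+        logger.error(f"{d} has no numeric prefix")
+        sys.exit(1)
+    no = int(m.group(1))
+    dir_name = m.group(2)
+    return no, dir_name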
+def check_export(base, cfg):
+    flag = False
+    exports = []
+    for export in cfg.get('export', []):
+        ecfg_path = os.path.join(base, export)
+        if os.path.exists(ecfg_path):
+            exports.append(export)
+        else:
+            flag = True
+    if flag:
+        cfg["export"] = exports
+    return flag
+class TreeWalker:
+    def __init__(self, root, tree_name, title=None, log=None):
+        self.name = tree_name
+        self.root = root
+        self.title = tree_name if title is None else title
+        self.tree = {}
+        self.logger = logger if log is None else log
+    def walk(self):
+        root = self.load_root()
+        root_node = {
+            "node_id": root["node_id"],
+            "keywords": root["keywords"],
+            "children": []
+        }
+        self.tree[root["tree_name"]] = root_node
+        self.load_levels(root_node)
+        self.load_chapters(self.root, root_node)
+        for index, level in enumerate(root_node["children"]):
+            level_title = list(level.keys())[0]
+            level_node = list(level.values())[0]
+            level_path = os.path.join(self.root, f"{index+1}.{level_title}")
+            self.load_chapters(level_path, level_node)
+            for index, chapter in enumerate(level_node["children"]):
+                chapter_title = list(chapter.keys())[0]
+                chapter_node = list(chapter.values())[0]
+                chapter_path = os.path.join(
+                    level_path, f"{index+1}.{chapter_title}")
+                self.load_sections(chapter_path, chapter_node)
+                for index, section_node in enumerate(chapter_node["children"]):
+                    section_title = list(section_node.keys())[0]
+                    full_path = os.path.join(
+                        chapter_path, f"{index+1}.{section_title}")
+                    if os.path.isdir(full_path):
+                        self.ensure_exercises(full_path)
+        tree_path = os.path.join(self.root, "tree.json")
+        dump_json(tree_path, self.tree, exist_ok=True, override=True)
+        return self.tree
+    def load_levels(self, root_node):
+        levels = []
+        for level in os.listdir(self.root):
+            level_path = os.path.join(self.root, level)
+            # Test the joined path; the bare name is relative to the CWD.
+            if not os.path.isdir(level_path):
+                continue
+            num, config = self.load_level_node(level_path)
+            levels.append((num, config))
+        levels = self.resort_children(self.root, levels)
+        root_node["children"] = [item[1] for item in levels]
+        return root_node
+    def load_level_node(self, level_path):
+        config = self.ensure_level_config(level_path)
+        num, name = self.extract_node_env(level_path)
+        result = {
+            name: {
+                "node_id": config["node_id"],
+                "keywords": config.get("keywords", []),
+                "children": [],
+            }
+        }
+        return num, result
+    def load_chapters(self, base, level_node):
+        chapters = []
+        for name in os.listdir(base):
+            full_name = os.path.join(base, name)
+            if os.path.isdir(full_name):
+                num, chapter = self.load_chapter_node(full_name)
+                chapters.append((num, chapter))
+        chapters = self.resort_children(base, chapters)
+        level_node["children"] = [item[1] for item in chapters]
+        return level_node
+    def load_sections(self, base, chapter_node):
+        sections = []
+        for name in os.listdir(base):
+            full_name = os.path.join(base, name)
+            if os.path.isdir(full_name):
+                num, section = self.load_section_node(full_name)
+                sections.append((num, section))
+        sections = self.resort_children(base, sections)
+        chapter_node["children"] = [item[1] for item in sections]
+        return chapter_node
+    def resort_children(self, base, children):
+        children.sort(key=lambda item: item[0])
+        for index, [number, element] in enumerate(children):
+            title = list(element.keys())[0]
+            origin = os.path.join(base, f"{number}.{title}")
+            posted = os.path.join(base, f"{index+1}.{title}")
+            if origin != posted:
+                self.logger.info(f"rename [{origin}] to [{posted}]")
+                os.rename(origin, posted)
+        return children
+    def ensure_chapters(self):
+        for subdir in os.listdir(self.root):
+            subdir_path = os.path.join(self.root, subdir)
+            if os.path.isdir(subdir_path):
+                self.ensure_level_config(subdir_path)
+    def load_root(self):
+        config_path = os.path.join(self.root, "config.json")
+        if not os.path.exists(config_path):
+            config = {
+                "tree_name": self.name,
+                "keywords": [],
+                "node_id": self.gen_node_id(),
+            }
+            dump_json(config_path, config, exist_ok=True, override=True)
+        else:
+            config = load_json(config_path)
+            flag, result = self.ensure_node_id(config)
+            if flag:
+                dump_json(config_path, result, exist_ok=True, override=True)
+        return config
+    def ensure_level_config(self, path):
+        config_path = os.path.join(path, "config.json")
+        if not os.path.exists(config_path):
+            config = {
+                "node_id": self.gen_node_id(),
+                "keywords": []
+            }
+            dump_json(config_path, config, exist_ok=True, override=True)
+        else:
+            config = load_json(config_path)
+            flag, result = self.ensure_node_id(config)
+            if flag:
+                dump_json(config_path, config, exist_ok=True, override=True)
+        return config
+    def ensure_chapter_config(self, path):
+        config_path = os.path.join(path, "config.json")
+        if not os.path.exists(config_path):
+            config = {
+                "node_id": self.gen_node_id(),
+                "keywords": []
+            }
+            dump_json(config_path, config, exist_ok=True, override=True)
+        else:
+            config = load_json(config_path)
+            flag, result = self.ensure_node_id(config)
+            if flag:
+                dump_json(config_path, config, exist_ok=True, override=True)
+        return config
+    def ensure_section_config(self, path):
+        config_path = os.path.join(path, "config.json")
+        if not os.path.exists(config_path):
+            config = {
+                "node_id": self.gen_node_id(),
+                "keywords": [],
+                "children": [],
+                "export": []
+            }
+            dump_json(config_path, config, exist_ok=True, override=True)
+        else:
+            config = load_json(config_path)
+            flag, result = self.ensure_node_id(config)
+            if flag:
+                dump_json(config_path, config, exist_ok=True, override=True)
+        return config
+    def ensure_node_id(self, config):
+        if "node_id" not in config:
+            config["node_id"] = self.gen_node_id()
+            return True, config
+        else:
+            return False, config
+    def gen_node_id(self):
+        return f"{self.name}-{uuid.uuid4().hex}"
+    def extract_node_env(self, path):
+        try:
+            _, dir = os.path.split(path)
+            self.logger.info(path)
+            number, title = dir.split(".", 1)
+            return int(number), title
+        except Exception:
+            self.logger.error(f"Failed to parse directory [{path}]: invalid structure, possibly a missing number prefix")
+            sys.exit(1)
+    def load_chapter_node(self, full_name):
+        config = self.ensure_chapter_config(full_name)
+        num, name = self.extract_node_env(full_name)
+        result = {
+            name: {
+                "node_id": config["node_id"],
+                "keywords": config["keywords"],
+                "children": [],
+            }
+        }
+        return num, result
+    def load_section_node(self, full_name):
+        config = self.ensure_section_config(full_name)
+        num, name = self.extract_node_env(full_name)
+        result = {
+            name: {
+                "node_id": config["node_id"],
+                "keywords": config["keywords"],
+                "children": config.get("children", [])
+            }
+        }
+        # if "children" in config:
+        #     result["children"] = config["children"]
+        return num, result
+    def ensure_exercises(self, section_path):
+        config = self.ensure_section_config(section_path)
+        for e in config.get("export", []):
+            full_name = os.path.join(section_path, e)
+            exercise = load_json(full_name)
+            if "exercise_id" not in exercise:
+                exercise["exercise_id"] = uuid.uuid4().hex
+                # The file already exists, so overwriting must be allowed.
+                dump_json(full_name, exercise, exist_ok=True, override=True)
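The `exercise_id` step can be tried in isolation. The following is a self-contained sketch under assumptions, not the module above: `ensure_exercise_id` is a hypothetical stand-alone function, and it works on a temporary directory so nothing in `data/` is touched. It shows the intended behavior that an id is generated once and then reused on later runs.

```python
import json
import os
import tempfile
import uuid


def ensure_exercise_id(exercise_path):
    """Add an exercise_id to the JSON file if it has none, and return the id."""
    with open(exercise_path, "r", encoding="utf-8") as f:
        exercise = json.load(f)
    if "exercise_id" in exercise:
        return exercise["exercise_id"]
    exercise["exercise_id"] = uuid.uuid4().hex
    # Rewrite the existing file in place with the new field.
    with open(exercise_path, "w", encoding="utf-8") as f:
        json.dump(exercise, f, indent=2, ensure_ascii=False)
    return exercise["exercise_id"]


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "helloworld.json")
    with open(path, "w", encoding="utf-8") as f:
        json.dump({"type": "code_options", "source": "helloworld.md"}, f)
    first = ensure_exercise_id(path)
    second = ensure_exercise_id(path)
    with open(path, "r", encoding="utf-8") as f:
        stored = json.load(f)

print(first == second)  # True: the second call reuses the stored id
```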