Commit e128a2ed · Author: feilong

init

Parent: a6b22761
# skill_tree_neo4j
Neo4J graph database skill tree
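This project is the open-editing repository for the Neo4J track of the [CSDN Skill Forest](https://codechina.csdn.net/csdn/skill_tree).
The repository is aimed at learners and organizes Neo4J knowledge points as a tree.
For convenience, the repository includes a small set of Python utility scripts that process and maintain the skill tree; most readers can ignore them. Project contributors mainly maintain the skill tree's directory structure and the associated exercises.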
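## Directory structure
The `data` directory uses a three-level layout of `difficulty node`/`chapter node`/`knowledge node`. Anything deeper than three levels is configured through the `children` field of the `config.json` in the third-level directory.
* Skill tree `skeleton file`
  * Location: `data/tree.json`
  * Note: this file is generated by running `python main.py`; do not edit it by hand
* Skill tree `root node` configuration file
  * Location: `data/config.json`
  * Note: fields such as the keywords can be edited; the `node_id` field is generated, do not edit it
* Skill tree `difficulty node`
  * Location: `data/xxx`, for example `data/1.Neo4J初阶`
  * Notes:
    * Each skill tree has 3 difficulty levels. The numeric prefix of the directory name is required; it keeps the folders in order
    * Each directory has a `config.json` in which keywords can be configured; the `node_id` field is generated, do not edit it
* Skill tree `chapter node`
  * Location: `data/xxx/xxx`, for example `data/1.Neo4J初阶/1.预备知识`
  * Notes:
    * Each difficulty level of a skill tree has n chapters. The numeric prefix of the directory name is required; it keeps the folders in order
    * Each directory has a `config.json` in which keywords can be configured; the `node_id` field is generated, do not edit it
* Skill tree `knowledge node`
  * Location: `data/xxx/xxx/xxx`, for example `data/1.Neo4J初阶/1.预备知识/1.Neo4J简介`
  * Notes:
    * Each chapter of a skill tree has `n` knowledge nodes. The numeric prefix of the directory name is required; it keeps the folders in order
    * Each directory has a `config.json`
      * `node_id` is generated, do not edit it
      * `keywords` configures the keywords
      * `children` configures the subtree under this `knowledge node`, as described below
      * `export` configures the exercises exported by this `knowledge node`, as described below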
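The three-level layout above can be read back from disk with a few lines of Python. The following is only a minimal sketch, not the repository's own tooling: the helper names `split_numbered_name`, `list_nodes`, and `outline` are made up for illustration, and it assumes every subdirectory follows the `<number>.<title>` naming rule.

```python
import os


def split_numbered_name(dir_name):
    """Split a name such as '1.Neo4J初阶' into (1, 'Neo4J初阶')."""
    number, title = dir_name.split(".", 1)
    return int(number), title


def list_nodes(base):
    """Return (number, title, path) for each subdirectory of base, ordered by number."""
    nodes = []
    for name in os.listdir(base):
        path = os.path.join(base, name)
        if os.path.isdir(path):
            number, title = split_numbered_name(name)
            nodes.append((number, title, path))
    return sorted(nodes)


def outline(data_dir):
    """Collect 'difficulty/chapter/knowledge' titles for the three directory levels."""
    lines = []
    for _, level, level_path in list_nodes(data_dir):
        for _, chapter, chapter_path in list_nodes(level_path):
            for _, section, _ in list_nodes(chapter_path):
                lines.append(f"{level}/{chapter}/{section}")
    return lines
```

Calling `outline("data")` from the repository root would list one line per knowledge node; files such as `config.json` and `tree.json` are skipped because only directories are considered.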
## Subtree structure of a `knowledge node`
For example, the subtree of a knowledge node can be configured in `data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/config.json` as follows:
```json
{
"children": [
{
"什么是图数据库": {
"keywords": [
"图数据库"
],
"children": [
{
"图论": {
"keywords": [
"节点",
"边",
"关系"
],
"children": []
}
},
{
"RDF": {
"keywords": [],
"children": []
}
},
{
"属性图": {
"keywords": [],
"children": []
}
},
{
"原生图": {
"keywords": [
],
"children": []
}
}
]
},
"什么时候需要图数据库": {
"keywords": [
"图数据库"
],
"children": []
},
"Neo4J图数据库概览": {
"keywords": [
"图数据库"
],
"children": []
}
}
]
}
```
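During later processing, these entries are attached to the corresponding nodes.
In most cases only the fixed-depth directory structure needs maintaining; most knowledge points do not need any `children`.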
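To make the nesting concrete, here is a small sketch that walks a `children` list of the shape shown above and yields every node with its depth. The function name `iter_subtree` is hypothetical, and the sample data is a shortened copy of the example configuration.

```python
def iter_subtree(children, depth=1):
    """Yield (depth, title, keywords) for every node in a `children` list."""
    for entry in children:
        # Each list entry maps one or more titles to their node definitions.
        for title, node in entry.items():
            yield depth, title, node.get("keywords", [])
            yield from iter_subtree(node.get("children", []), depth + 1)


config = {
    "children": [
        {
            "什么是图数据库": {
                "keywords": ["图数据库"],
                "children": [
                    {"图论": {"keywords": ["节点", "边", "关系"], "children": []}},
                    {"RDF": {"keywords": [], "children": []}},
                ],
            },
            "Neo4J图数据库概览": {"keywords": ["图数据库"], "children": []},
        }
    ]
}

for depth, title, keywords in iter_subtree(config["children"]):
    print("  " * (depth - 1) + title, keywords)
```

The traversal is depth-first, so nested nodes are printed directly under their parent.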
## Editing the exported exercises of a `knowledge node`
For example, the exercises exported by this knowledge node are configured in `data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/config.json`:
```json
{
"export": [
"helloworld.json"
]
}
```
Each file name points to the corresponding exercise definition JSON file.
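Because every `export` entry must name a file in the same directory, a quick consistency check can catch typos before the tree is built. This is a minimal sketch under that assumption; `missing_exports` is an illustrative helper, not part of the repository's scripts.

```python
import json
import os


def missing_exports(section_dir):
    """Return the `export` entries in config.json that have no matching file."""
    with open(os.path.join(section_dir, "config.json"), encoding="utf-8") as f:
        config = json.load(f)
    return [
        name
        for name in config.get("export", [])
        if not os.path.exists(os.path.join(section_dir, name))
    ]
```

An empty result means every listed exercise definition is present next to `config.json`.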
## Editing the exercise definition of a `knowledge node`
First, add the exercise definition named in `export` above. For example, add a `helloworld.json` under `data/1.Neo4J初阶/1.预备知识/1.Neo4J简介/`:
```json
{
"type": "code_options",
"author": "幻灰龙",
"source": "helloworld.md",
"notebook_enable": false
}
```
Where:
* `type` is currently always `code_options`
* `notebook_enable` is always `false` for the Neo4J skill tree
* `source` names the `markdown` file in which the exercise is written
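The three field rules above are easy to check mechanically. The sketch below is illustrative only: `check_exercise_definition` is a made-up helper, and treating a `.md` suffix as the test for "a markdown file" is an assumption rather than a rule stated by the project.

```python
def check_exercise_definition(definition):
    """Return a list of problems found in an exercise definition dict."""
    problems = []
    if definition.get("type") != "code_options":
        problems.append("type must be 'code_options'")
    if definition.get("notebook_enable") is not False:
        problems.append("notebook_enable must be false")
    # Assumption: a markdown source is recognised by its .md suffix.
    if not str(definition.get("source", "")).endswith(".md"):
        problems.append("source must name a markdown file")
    return problems
```

For the `helloworld.json` shown earlier the function returns an empty list.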
Now create a `helloworld.md` with the following content:
````markdown
# Hello World
Neo4J queries can be written directly in a shell, or sent over a connection from languages such as Java and Python. Which of the following is not a Neo4J graph database query?
## 答案
```sql
SELECT name FROM Person
LEFT JOIN Person_Department
  ON Person.Id = Person_Department.PersonId
LEFT JOIN Department
  ON Department.Id = Person_Department.DepartmentId
WHERE Department.name = "IT Department"
```
## 选项
### Cypher query
```sql
MATCH (p:Person)-[:WORKS_AT]->(d:Dept)
WHERE d.name = "IT Department"
RETURN p.name
```
### Querying Neo4J over JDBC
```java
Connection con = DriverManager.getConnection("jdbc:neo4j://localhost:7474/");
String query =
    "MATCH (:Person {name:{1}})-[:EMPLOYEE]-(d:Department) RETURN d.name as dept";
try (PreparedStatement stmt = con.prepareStatement(query)) {
    stmt.setString(1, "John");
    ResultSet rs = stmt.executeQuery();
    while (rs.next()) {
        String department = rs.getString("dept");
        ....
    }
}
```
### Querying from Python
```python
from neo4j import GraphDatabase


class HelloWorldExample:
    def __init__(self, uri, user, password):
        self.driver = GraphDatabase.driver(uri, auth=(user, password))

    def close(self):
        self.driver.close()

    def print_greeting(self, message):
        with self.driver.session() as session:
            greeting = session.write_transaction(self._create_and_return_greeting, message)
            print(greeting)

    @staticmethod
    def _create_and_return_greeting(tx, message):
        result = tx.run("CREATE (a:Greeting) "
                        "SET a.message = $message "
                        "RETURN a.message + ', from node ' + id(a)", message=message)
        return result.single()[0]


if __name__ == "__main__":
    greeter = HelloWorldExample("bolt://localhost:7687", "neo4j", "password")
    greeter.print_greeting("hello, world")
    greeter.close()
```
````
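This is the most basic exercise structure: a title, an answer, and options. The level-1 and level-2 headings must be written exactly as shown, with `## 答案` for the answer and `## 选项` for the options, because the interpreter reads them. The option headings, by contrast, are ignored and do not appear in the generated exercise, so they can carry editing notes such as "this option does not close the file connection" or "type error".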
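How such a file might be split into its parts can be sketched as follows. This is not the project's actual interpreter, whose code is not part of this repository; `parse_exercise` is a hypothetical function that only assumes the heading layout described above, and it would be confused by a code line that itself starts with `## ` or `### `.

```python
import re


def parse_exercise(markdown_text):
    """Split an exercise into title, question, answer and options."""
    # Split on level-2 headings; the first part holds the title and question.
    head, *sections = re.split(r"^## +", markdown_text, flags=re.MULTILINE)
    title_line, _, question = head.strip().partition("\n")
    result = {
        "title": title_line.lstrip("# ").strip(),
        "question": question.strip(),
        "answer": "",
        "options": [],
    }
    for section in sections:
        name, _, body = section.partition("\n")
        if name.strip() == "答案":
            result["answer"] = body.strip()
        elif name.strip() == "选项":
            # Option headings are editing notes only, so they are discarded.
            parts = re.split(r"^### .*$", body, flags=re.MULTILINE)
            result["options"] = [p.strip() for p in parts if p.strip()]
    return result


sample = (
    "# Hello World\n"
    "Which query is not a Neo4J query?\n"
    "## 答案\n"
    "SELECT name FROM Person\n"
    "## 选项\n"
    "### editing note, ignored\n"
    "MATCH (p:Person) RETURN p.name\n"
)
print(parse_exercise(sample))
```

The option heading text never reaches the result, which mirrors the rule that those headings are free for editing notes.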
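## Building the skill tree
Running `python main.py` from the repository root (the script imports `TreeWalker` from the `src` package) builds the skill tree file `data/tree.json`.
* During the build, every directory is checked automatically:
  * whether it has a `config.json`; if not, one is created
  * whether `config.json` contains a `node_id`; if not, one is generated
  * whether the directory numbers are contiguous; if not, the directories are renumbered
* During the build, the exercise definitions listed under `export` in each knowledge-point directory's `config.json` are also checked for an `exercise_id` field; if it is missing, one is generated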
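The renumbering and ID checks listed above can be illustrated without touching the file system. The sketch below is a simplified stand-in for what the build does: `renumber_plan` and `new_node_id` are illustrative names, and the plan is only computed here, whereas the real build renames the directories.

```python
import uuid


def renumber_plan(dir_names):
    """Return (old_name, new_name) pairs that make the numbering contiguous from 1."""
    parsed = []
    for name in dir_names:
        number, title = name.split(".", 1)
        parsed.append((int(number), title, name))
    parsed.sort(key=lambda item: item[0])
    plan = []
    for index, (number, title, name) in enumerate(parsed, start=1):
        if number != index:
            plan.append((name, f"{index}.{title}"))
    return plan


def new_node_id(tree_name):
    """Build an ID in the `<tree name>-<32 hex digits>` form used by `node_id`."""
    return f"{tree_name}-{uuid.uuid4().hex}"


print(renumber_plan(["1.预备知识", "3.Cypher查询语言"]))
print(new_node_id("neo4j"))
```

Directories whose number already matches their position are left out of the plan, so a correctly numbered tree yields an empty list.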
{
"node_id": "neo4j-0261ccb903994df281a2ec606b5d8c9e",
"keywords": [],
"children": [
{
"什么是图数据库": {
"keywords": [
"图数据库"
],
"children": [
{
"图论": {
"keywords": [
"节点",
"边",
"关系"
],
"children": []
}
},
{
"RDF": {
"keywords": [],
"children": []
}
},
{
"属性图": {
"keywords": [],
"children": []
}
},
{
"原生图": {
"keywords": [
],
"children": []
}
}
]
},
"什么时候需要图数据库": {
"keywords": [
"图数据库"
],
"children": []
},
"Neo4J图数据库概览": {
"keywords": [
"图数据库"
],
"children": []
}
}
],
"export": [
"helloworld.json"
]
}
{
"type": "code_options",
"author": "幻灰龙",
"source": "helloworld.md",
"notebook_enable": false
}
# Hello World
Neo4J queries can be written directly in a shell, or sent over a connection from languages such as Java and Python. Which of the following is <span style="color:red">not</span> a Neo4J graph database query?
## 答案
```sql
SELECT name FROM Person
LEFT JOIN Person_Department
  ON Person.Id = Person_Department.PersonId
LEFT JOIN Department
  ON Department.Id = Person_Department.DepartmentId
WHERE Department.name = "IT Department"
```
## 选项
### Cypher query
```sql
MATCH (p:Person)-[:WORKS_AT]->(d:Dept)
WHERE d.name = "IT Department"
RETURN p.name
```
### Querying Neo4J over JDBC
```java
Connection con = DriverManager.getConnection("jdbc:neo4j://localhost:7474/");
String query =
    "MATCH (:Person {name:{1}})-[:EMPLOYEE]-(d:Department) RETURN d.name as dept";
try (PreparedStatement stmt = con.prepareStatement(query)) {
    stmt.setString(1, "John");
    ResultSet rs = stmt.executeQuery();
    while (rs.next()) {
        String department = rs.getString("dept");
        ....
    }
}
```
### Querying from Python
```python
from neo4j import GraphDatabase


class HelloWorldExample:
    def __init__(self, uri, user, password):
        self.driver = GraphDatabase.driver(uri, auth=(user, password))

    def close(self):
        self.driver.close()

    def print_greeting(self, message):
        with self.driver.session() as session:
            greeting = session.write_transaction(self._create_and_return_greeting, message)
            print(greeting)

    @staticmethod
    def _create_and_return_greeting(tx, message):
        result = tx.run("CREATE (a:Greeting) "
                        "SET a.message = $message "
                        "RETURN a.message + ', from node ' + id(a)", message=message)
        return result.single()[0]


if __name__ == "__main__":
    greeter = HelloWorldExample("bolt://localhost:7687", "neo4j", "password")
    greeter.print_greeting("hello, world")
    greeter.close()
```
{
"node_id": "neo4j-0ee8cb8ccd6f4a59bc20f9ccbf7d627e",
"keywords": [],
"children": [],
"export": []
}
{
"node_id": "neo4j-a42252d5f8c24548bde127a385850a76",
"keywords": []
}
{
"node_id": "neo4j-9ec466f015f9422dab2b6b05f0581a8b",
"keywords": []
}
{
"node_id": "neo4j-5e171793d38e49e784f544a9f80d09cb",
"keywords": []
}
{
"node_id": "neo4j-298b201de8044453a2d6e8d02e64962d",
"keywords": []
}
{
"node_id": "neo4j-b05c040c3bfe49e29f17397e9e16c7d4",
"keywords": []
}
{
"tree_name": "neo4j",
"keywords": [],
"node_id": "neo4j-50ecfa9d2d0f4012ae80a3656c0756ab"
}
{
"neo4j": {
"node_id": "neo4j-50ecfa9d2d0f4012ae80a3656c0756ab",
"keywords": [],
"children": [
{
"Neo4J初阶": {
"node_id": "neo4j-5e171793d38e49e784f544a9f80d09cb",
"keywords": [],
"children": [
{
"预备知识": {
"node_id": "neo4j-a42252d5f8c24548bde127a385850a76",
"keywords": [],
"children": [
{
"Neo4J简介": {
"node_id": "neo4j-0261ccb903994df281a2ec606b5d8c9e",
"keywords": [],
"children": []
}
},
{
"安装和启动": {
"node_id": "neo4j-0ee8cb8ccd6f4a59bc20f9ccbf7d627e",
"keywords": [],
"children": []
}
}
]
}
},
{
"Cypher查询语言": {
"node_id": "neo4j-9ec466f015f9422dab2b6b05f0581a8b",
"keywords": [],
"children": []
}
}
]
}
},
{
"Neo4J中阶": {
"node_id": "neo4j-298b201de8044453a2d6e8d02e64962d",
"keywords": [],
"children": []
}
},
{
"Neo4J高阶": {
"node_id": "neo4j-b05c040c3bfe49e29f17397e9e16c7d4",
"keywords": [],
"children": []
}
}
]
}
}
# -*- coding: utf-8 -*-
from src.tree import TreeWalker

if __name__ == '__main__':
    walker = TreeWalker("data", "neo4j", "Neo4J")
    walker.walk()
uuid==1.30
# -*- coding: utf-8 -*-
import json
import logging
import os
import re
import sys
import uuid

id_set = set()

logger = logging.getLogger(__name__)
logger.setLevel(logging.INFO)
handler = logging.StreamHandler(sys.stdout)
formatter = logging.Formatter('%(asctime)s - %(levelname)s - %(message)s')
handler.setFormatter(formatter)
logger.addHandler(handler)


def load_json(p):
    """Read a UTF-8 JSON file and return the parsed object."""
    with open(p, 'r', encoding='utf-8') as f:
        return json.loads(f.read())


def dump_json(p, j, exist_ok=False, override=False):
    """Write j to p as JSON.

    If p exists: abort unless exist_ok is set, and keep the existing
    file unless override is also set.
    """
    if os.path.exists(p):
        if exist_ok:
            if not override:
                return
        else:
            logger.error(f"{p} already exist")
            sys.exit(1)
    with open(p, 'w+', encoding='utf-8') as f:
        f.write(json.dumps(j, indent=2, ensure_ascii=False))


def ensure_config(path):
    """Return the directory's config.json, creating a minimal one if absent."""
    config_path = os.path.join(path, "config.json")
    if not os.path.exists(config_path):
        node = {"keywords": []}
        dump_json(config_path, node, exist_ok=True, override=False)
        return node
    else:
        return load_json(config_path)


def parse_no_name(d):
    """Split a directory name such as '1.title' into (1, 'title')."""
    p = r'(\d+)\.(.*)'
    m = re.search(p, d)
    if m is None:
        logger.error(f"{d} does not match '<number>.<name>'")
        sys.exit(1)
    return int(m.group(1)), m.group(2)


def check_export(base, cfg):
    """Drop export entries whose file is missing; return True if any were dropped."""
    flag = False
    exports = []
    for export in cfg.get('export', []):
        ecfg_path = os.path.join(base, export)
        if os.path.exists(ecfg_path):
            exports.append(export)
        else:
            flag = True
    if flag:
        cfg["export"] = exports
    return flag


class TreeWalker:
    def __init__(self, root, tree_name, title=None, log=None):
        self.name = tree_name
        self.root = root
        self.title = tree_name if title is None else title
        self.tree = {}
        self.logger = logger if log is None else log

    def walk(self):
        """Scan the data directory and write the merged tree to tree.json."""
        root = self.load_root()
        root_node = {
            "node_id": root["node_id"],
            "keywords": root["keywords"],
            "children": []
        }
        self.tree[root["tree_name"]] = root_node
        self.load_levels(root_node)
        self.load_chapters(self.root, root_node)
        for level_index, level in enumerate(root_node["children"]):
            level_title = list(level.keys())[0]
            level_node = list(level.values())[0]
            level_path = os.path.join(
                self.root, f"{level_index+1}.{level_title}")
            self.load_chapters(level_path, level_node)
            for chapter_index, chapter in enumerate(level_node["children"]):
                chapter_title = list(chapter.keys())[0]
                chapter_node = list(chapter.values())[0]
                chapter_path = os.path.join(
                    level_path, f"{chapter_index+1}.{chapter_title}")
                self.load_sections(chapter_path, chapter_node)
                for section_index, section_node in enumerate(
                        chapter_node["children"]):
                    section_title = list(section_node.keys())[0]
                    # Directory numbers start at 1, as for levels and chapters.
                    full_path = os.path.join(
                        chapter_path, f"{section_index+1}.{section_title}")
                    if os.path.isdir(full_path):
                        self.ensure_exercises(full_path)
        tree_path = os.path.join(self.root, "tree.json")
        dump_json(tree_path, self.tree, exist_ok=True, override=True)
        return self.tree

    def load_levels(self, root_node):
        levels = []
        for level in os.listdir(self.root):
            level_path = os.path.join(self.root, level)
            # Test the joined path; the bare name would be resolved
            # against the current working directory.
            if not os.path.isdir(level_path):
                continue
            num, config = self.load_level_node(level_path)
            levels.append((num, config))
        levels = self.resort_children(self.root, levels)
        root_node["children"] = [item[1] for item in levels]
        return root_node

    def load_level_node(self, level_path):
        config = self.ensure_level_config(level_path)
        num, name = self.extract_node_env(level_path)
        result = {
            name: {
                "node_id": config["node_id"],
                # A freshly created level config may not have keywords yet.
                "keywords": config.get("keywords", []),
                "children": [],
            }
        }
        return num, result

    def load_chapters(self, base, level_node):
        chapters = []
        for name in os.listdir(base):
            full_name = os.path.join(base, name)
            if os.path.isdir(full_name):
                num, chapter = self.load_chapter_node(full_name)
                chapters.append((num, chapter))
        chapters = self.resort_children(base, chapters)
        level_node["children"] = [item[1] for item in chapters]
        return level_node

    def load_sections(self, base, chapter_node):
        sections = []
        for name in os.listdir(base):
            full_name = os.path.join(base, name)
            if os.path.isdir(full_name):
                num, section = self.load_section_node(full_name)
                sections.append((num, section))
        sections = self.resort_children(base, sections)
        chapter_node["children"] = [item[1] for item in sections]
        return chapter_node

    def resort_children(self, base, children):
        """Sort children by number and rename directories so numbers run 1..n."""
        children.sort(key=lambda item: item[0])
        for index, (number, element) in enumerate(children):
            title = list(element.keys())[0]
            origin = os.path.join(base, f"{number}.{title}")
            posted = os.path.join(base, f"{index+1}.{title}")
            if origin != posted:
                self.logger.info(f"rename [{origin}] to [{posted}]")
                os.rename(origin, posted)
        return children

    def ensure_chapters(self):
        for subdir in os.listdir(self.root):
            # Only directories under the root can hold a level config.
            full_path = os.path.join(self.root, subdir)
            if os.path.isdir(full_path):
                self.ensure_level_config(full_path)

    def load_root(self):
        config_path = os.path.join(self.root, "config.json")
        if not os.path.exists(config_path):
            config = {
                "tree_name": self.name,
                "keywords": [],
                "node_id": self.gen_node_id(),
            }
            dump_json(config_path, config, exist_ok=True, override=True)
        else:
            config = load_json(config_path)
            flag, result = self.ensure_node_id(config)
            if flag:
                dump_json(config_path, result, exist_ok=True, override=True)
        return config

    def ensure_level_config(self, path):
        config_path = os.path.join(path, "config.json")
        if not os.path.exists(config_path):
            config = {
                "node_id": self.gen_node_id(),
                # Callers read "keywords", so a new config needs the field.
                "keywords": []
            }
            dump_json(config_path, config, exist_ok=True, override=True)
        else:
            config = load_json(config_path)
            flag, result = self.ensure_node_id(config)
            if flag:
                dump_json(config_path, config, exist_ok=True, override=True)
        return config

    def ensure_chapter_config(self, path):
        config_path = os.path.join(path, "config.json")
        if not os.path.exists(config_path):
            config = {
                "node_id": self.gen_node_id(),
                "keywords": []
            }
            dump_json(config_path, config, exist_ok=True, override=True)
        else:
            config = load_json(config_path)
            flag, result = self.ensure_node_id(config)
            if flag:
                dump_json(config_path, config, exist_ok=True, override=True)
        return config

    def ensure_section_config(self, path):
        config_path = os.path.join(path, "config.json")
        if not os.path.exists(config_path):
            config = {
                "node_id": self.gen_node_id(),
                "keywords": [],
                "children": [],
                "export": []
            }
            dump_json(config_path, config, exist_ok=True, override=True)
        else:
            config = load_json(config_path)
            flag, result = self.ensure_node_id(config)
            if flag:
                dump_json(config_path, config, exist_ok=True, override=True)
        return config

    def ensure_node_id(self, config):
        """Add a node_id if missing; return (changed, config)."""
        if "node_id" not in config:
            config["node_id"] = self.gen_node_id()
            return True, config
        else:
            return False, config

    def gen_node_id(self):
        return f"{self.name}-{uuid.uuid4().hex}"

    def extract_node_env(self, path):
        """Split the last path component '<number>.<title>' into (int, str)."""
        try:
            _, dir_name = os.path.split(path)
            self.logger.info(path)
            number, title = dir_name.split(".", 1)
            return int(number), title
        except Exception:
            self.logger.error(
                f"Failed to parse directory [{path}]: invalid structure, "
                "possibly a missing numeric prefix")
            sys.exit(1)

    def load_chapter_node(self, full_name):
        config = self.ensure_chapter_config(full_name)
        num, name = self.extract_node_env(full_name)
        result = {
            name: {
                "node_id": config["node_id"],
                "keywords": config["keywords"],
                "children": [],
            }
        }
        return num, result

    def load_section_node(self, full_name):
        config = self.ensure_section_config(full_name)
        num, name = self.extract_node_env(full_name)
        result = {
            name: {
                "node_id": config["node_id"],
                "keywords": config["keywords"],
                "children": config.get("children", [])
            }
        }
        return num, result

    def ensure_exercises(self, section_path):
        """Give every exported exercise definition an exercise_id."""
        config = self.ensure_section_config(section_path)
        for e in config.get("export", []):
            full_name = os.path.join(section_path, e)
            exercise = load_json(full_name)
            if "exercise_id" not in exercise:
                exercise["exercise_id"] = uuid.uuid4().hex
                # The file already exists, so it must be overwritten explicitly.
                dump_json(full_name, exercise, exist_ok=True, override=True)