# MindSpore Data Special Interest Group (SIG)

This is the working repo for the Data special interest group (SIG). It contains all the artifacts, materials, meeting notes, and proposals regarding **dataset - data processing** and **mindrecord - data format** in MindSpore. Feedback and contributions are welcome.
1. **Data Processing**: This is the Dataset component. It reads the user's data into a Dataset, applies data augmentation and processing operations (such as resize, onehot, rotate, shuffle, and batch), and finally provides the Dataset to the training process.
2. **Data Format**: This component normalizes the user's training data into a unified format (MindRecord). The user defines a schema for the training data and calls the Python API to convert the data into MindRecord files. These files are then read into a Dataset through MindDataset and provided to the training process.
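
The two areas above can be pictured with a small, standard-library-only sketch. It is a conceptual stand-in, not MindSpore's API: the JSON-lines file plays the role of a MindRecord file, and `SCHEMA`, `write_records`, `read_records`, `one_hot`, and `pipeline` are hypothetical names invented for illustration. In MindSpore itself, the corresponding roles are played by the writer in `mindspore.mindrecord` and by `MindDataset` in `mindspore.dataset`; consult their documentation for the real signatures.

```python
import json
import os
import random
import tempfile

# Hypothetical schema (illustration only): field name -> expected Python type.
SCHEMA = {"label": int, "text": str}


def write_records(path, schema, samples):
    """Data format step: validate samples against the schema, write one JSON object per line."""
    with open(path, "w", encoding="utf-8") as f:
        for sample in samples:
            if set(sample) != set(schema):
                raise ValueError(f"fields {sorted(sample)} do not match the schema")
            for name, expected in schema.items():
                if not isinstance(sample[name], expected):
                    raise TypeError(f"field {name!r} must be {expected.__name__}")
            f.write(json.dumps(sample) + "\n")


def read_records(path):
    """Read the unified-format file back, one record at a time."""
    with open(path, encoding="utf-8") as f:
        for line in f:
            yield json.loads(line)


def one_hot(label, num_classes):
    """Encode an integer label as a one-hot list of length num_classes."""
    return [1 if i == label else 0 for i in range(num_classes)]


def pipeline(path, num_classes, batch_size, seed=0):
    """Data processing step: read, shuffle, one-hot encode the labels, then batch."""
    rows = list(read_records(path))
    random.Random(seed).shuffle(rows)
    rows = [{**row, "label": one_hot(row["label"], num_classes)} for row in rows]
    for start in range(0, len(rows), batch_size):
        yield rows[start:start + batch_size]


def demo():
    """Write five toy samples to a temporary file and return the resulting batches."""
    samples = [{"label": i % 3, "text": f"sample-{i}"} for i in range(5)]
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "train.jsonl")
        write_records(path, SCHEMA, samples)
        return list(pipeline(path, num_classes=3, batch_size=2))


if __name__ == "__main__":
    for batch in demo():
        print(batch)
```

The sketch keeps the two concerns separate on purpose: the format step only cares that every sample matches the declared schema, while the processing step only cares about transforming and grouping records it reads back.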

# SIG Leads

* Liu Cunwei (Huawei)

# Logistics

* SIG leads will drive the meeting.
* Meeting announcements will be posted on our Gitee channel: https://gitee.com/mindspore/community/tree/master/sigs/data
* Feedback and topic requests are welcome from everyone.

# Discussion

* Slack channel: https://app.slack.com/client/TUKCY4QDR/C010RPN6QNP?cdn_fallback=2
* Documents and artifacts: https://gitee.com/mindspore/community/tree/master/sigs/data

# Meeting notes

* [Thursday April 2, 2020](./meetings/001-20200402.md)