In the training process, AutoDLa Design agent generates tokens and adjacency matrices used for training, and the trainer uses these tokens and matrices to construct and train convolutional neural networks. The validation accuracy after 20 epochs are used as feed back for the agent, and the agent updates its policy accordingly. After several iterations, the agent learns to find a quality deep neural network.
![Picture](./AutoDLDesign/img/cnn_net.png)
![Picture](./AutoDL%20Design/img/cnn_net.png)
Here we provide the following test on the method.
### Test on the convergence of the number of tokens produced
The x-axis is the number of steps, and the y-axis is validation accuracy of the sampled models. The average performance of the sampled models improves over time.