diff --git a/tutorials/notebook/customized_debugging_information.ipynb b/tutorials/notebook/customized_debugging_information.ipynb new file mode 100644 index 0000000000000000000000000000000000000000..ee24b878115b48f5cb9b89011220188a47e99dd0 --- /dev/null +++ b/tutorials/notebook/customized_debugging_information.ipynb @@ -0,0 +1,617 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#
自定义调试体验文档" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 概述" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "本文将使用[快速入门](https://gitee.com/mindspore/docs/blob/master/tutorials/tutorial_code/lenet.py)作为样例,并通过构建自定义调试函数:`Callback`、`metrics`、`Print算子`、日志打印等,同时将构建的自定义调试函数添加进代码中,通过运行效果来展示具体如何使用MindSpore提供给我们的自定义调试能力,帮助快速调试训练网络。\n", + "体验过程如下:\n", + "1. 数据集准备。\n", + "2. 定义深度学习网络LeNet5。\n", + "3. 使用Callback回调函数构建StopAtTime类来控制训练停止时间。\n", + "4. 设置日志环境变量。\n", + "5. 定义模型并执行训练。\n", + "6. 执行测试。" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 数据集准备" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### 数据集的下载" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "这里我们需要将MNIST数据集中随机取出一张图片,并增强成适合LeNet网络的数据格式(如何处理请参考[quick_start.ipynb](https://gitee.com/mindspore/docs/blob/master/tutorials/notebook/quick_start.ipynb)),训练数据集下载地址:{\"http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz\", \"http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz\"} 。\n", + "
数据集放在----`Jupyter工作目录+\\MNIST_Data\\train\\`,如下图结构:" + ] + }, + { + "cell_type": "raw", + "metadata": {}, + "source": [ + "MNIST\n", + "├── test\n", + "│   ├── t10k-images-idx3-ubyte\n", + "│   └── t10k-labels-idx1-ubyte\n", + "└── train\n", + " ├── train-images-idx3-ubyte\n", + " └── train-labels-idx1-ubyte" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### 数据集的增强操作" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "下载的数据集,需要通过`mindspore.dataset`处理成适用于MindSpore框架的数据,再使用一系列框架中提供的工具进行数据增强操作来适应LeNet网络的数据处理需求。" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": {}, + "outputs": [], + "source": [ + "import mindspore.dataset as ds\n", + "import mindspore.dataset.transforms.vision.c_transforms as CV\n", + "import mindspore.dataset.transforms.c_transforms as C\n", + "from mindspore.dataset.transforms.vision import Inter\n", + "from mindspore.common import dtype as mstype\n", + "\n", + "def create_dataset(data_path, batch_size=32, repeat_size=1,\n", + " num_parallel_workers=1):\n", + " \"\"\" create dataset for train or test\n", + " Args:\n", + " data_path (str): Data path\n", + " batch_size (int): The number of data records in each group\n", + " repeat_size (int): The number of replicated data records\n", + " num_parallel_workers (int): The number of parallel workers\n", + " \"\"\"\n", + " # define dataset\n", + " mnist_ds = ds.MnistDataset(data_path)\n", + "\n", + " # define operation parameters\n", + " resize_height, resize_width = 32, 32\n", + " rescale = 1.0 / 255.0\n", + " shift = 0.0\n", + " rescale_nml = 1 / 0.3081\n", + " shift_nml = -1 * 0.1307 / 0.3081\n", + "\n", + " # define map operations\n", + " resize_op = CV.Resize((resize_height, resize_width), interpolation=Inter.LINEAR)\n", + " rescale_nml_op = CV.Rescale(rescale_nml, shift_nml)\n", + " rescale_op = CV.Rescale(rescale, shift)\n", + " hwc2chw_op = CV.HWC2CHW() \n", + " type_cast_op = C.TypeCast(mstype.int32)\n", + "\n", + " # 
apply map operations on images\n", + " mnist_ds = mnist_ds.map(input_columns=\"label\", operations=type_cast_op, num_parallel_workers=num_parallel_workers)\n", + " mnist_ds = mnist_ds.map(input_columns=\"image\", operations=resize_op, num_parallel_workers=num_parallel_workers)\n", + " mnist_ds = mnist_ds.map(input_columns=\"image\", operations=rescale_op, num_parallel_workers=num_parallel_workers)\n", + " mnist_ds = mnist_ds.map(input_columns=\"image\", operations=rescale_nml_op, num_parallel_workers=num_parallel_workers)\n", + " mnist_ds = mnist_ds.map(input_columns=\"image\", operations=hwc2chw_op, num_parallel_workers=num_parallel_workers)\n", + "\n", + " # apply DatasetOps\n", + " buffer_size = 10000\n", + " mnist_ds = mnist_ds.shuffle(buffer_size=buffer_size)\n", + " mnist_ds = mnist_ds.batch(batch_size, drop_remainder=True)\n", + " mnist_ds = mnist_ds.repeat(repeat_size)\n", + "\n", + " return mnist_ds" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 定义深度学习网络LeNet" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "针对MNIST数据集我们采用的是LeNet5网络,先对卷积函数和全连接函数初始化,然后`construct`构建神经网络。" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": {}, + "outputs": [], + "source": [ + "from mindspore.common.initializer import TruncatedNormal\n", + "import mindspore.nn as nn\n", + "from mindspore.ops import operations as P\n", + "\n", + "def conv(in_channels, out_channels, kernel_size, stride=1, padding=0):\n", + " \"\"\"Conv layer weight initial.\"\"\"\n", + " weight = weight_variable()\n", + " return nn.Conv2d(in_channels, out_channels,\n", + " kernel_size=kernel_size, stride=stride, padding=padding,\n", + " weight_init=weight, has_bias=False, pad_mode=\"valid\")\n", + "\n", + "\n", + "def fc_with_initialize(input_channels, out_channels):\n", + " \"\"\"Fc layer weight initial.\"\"\"\n", + " weight = weight_variable()\n", + " bias = weight_variable()\n", + " return nn.Dense(input_channels, out_channels, 
weight, bias)\n", + "\n", + "\n", + "def weight_variable():\n", + " \"\"\"Weight initial.\"\"\"\n", + " return TruncatedNormal(0.02)\n", + "\n", + "\n", + "class LeNet5(nn.Cell):\n", + " \"\"\"Lenet network structure.\"\"\"\n", + " def __init__(self):\n", + " super(LeNet5, self).__init__()\n", + " self.batch_size = 32\n", + " self.conv1 = conv(1, 6, 5)\n", + " self.conv2 = conv(6, 16, 5)\n", + " self.fc1 = fc_with_initialize(16 * 5 * 5, 120)\n", + " self.fc2 = fc_with_initialize(120, 84)\n", + " self.fc3 = fc_with_initialize(84, 10)\n", + " self.relu = nn.ReLU()\n", + " self.max_pool2d = nn.MaxPool2d(kernel_size=2, stride=2)\n", + " self.flatten = nn.Flatten()\n", + " \n", + " def construct(self, x):\n", + " x = self.conv1(x)\n", + " x = self.relu(x)\n", + " x = self.max_pool2d(x)\n", + " x = self.conv2(x)\n", + " x = self.relu(x)\n", + " x = self.max_pool2d(x)\n", + " x = self.flatten(x)\n", + " x = self.fc1(x)\n", + " x = self.relu(x)\n", + " x = self.fc2(x)\n", + " x = self.relu(x)\n", + " x = self.fc3(x)\n", + " return x" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 构建自定义回调函数StopAtTime" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "使用回调函数的基类Callback,构建训练定时器`StopAtTime`,其基类(可在源码中找到位置在`/mindspore/train/callback`)为:" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "```python\n", + "class Callback():\n", + " def begin(self, run_context):\n", + " pass\n", + " def epoch_begin(self, run_context):\n", + " pass\n", + " def epoch_end(self, run_context):\n", + " pass\n", + " def step_begin(self, run_context): \n", + " pass\n", + " def step_end(self, run_context):\n", + " pass\n", + " def end(self, run_context):\n", + " pass\n", + "```" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "- `begin`:表示训练开始时执行。\n", + "- `epoch_begin`:表示每个epoch开始时执行。\n", + "- `epoch_end`:表示每个epoch结束时执行。\n", + "- `step_begin`:表示每个step刚开始时执行。\n", + "- `step_end`:表示每个step结束时执行。\n", + "- 
`end`:表示训练结束时执行。\n", + "\n", + "了解上述基类的用法后,还有一个参数`run_context`,这是一个类,存储了模型训练中的各种参数,我们在这里使用`print(cb_params.list_callback)`将其放在`end`中打印(当然也可以使用`print(cb_params)`打印所有参数信息,由于参数信息太多,我们这里只选了一个参数举例),后续在执行完训练后,根据打印信息,会简单介绍`run_context`类中各参数的意义,我们开始构建训练定时器,如下:" + ] + }, + { + "cell_type": "code", + "execution_count": 3, + "metadata": {}, + "outputs": [], + "source": [ + "from mindspore.train.callback import Callback\n", + "import time\n", + "\n", + "class StopAtTime(Callback):\n", + " def __init__(self, run_time):\n", + " super(StopAtTime, self).__init__()\n", + " self.run_time = run_time*60\n", + "\n", + " def begin(self, run_context):\n", + " cb_params = run_context.original_args()\n", + " cb_params.init_time = time.time()\n", + " \n", + " def step_end(self, run_context):\n", + " cb_params = run_context.original_args()\n", + " epoch_num = cb_params.cur_epoch_num\n", + " step_num = cb_params.cur_step_num\n", + " loss = cb_params.net_outputs\n", + " cur_time = time.time()\n", + " if (cur_time - cb_params.init_time) > self.run_time:\n", + " print(\"epoch: \", epoch_num, \" step: \", step_num, \" loss: \", loss)\n", + " run_context.request_stop()\n", + " def end(self, run_context):\n", + " cb_params = run_context.original_args()\n", + " print(cb_params.list_callback)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 设置日志环境变量" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "MindSpore采用`glog`来输出日志,我们这里将日志输出到屏幕:\n", + "\n", + "`GLOG_v`:控制日志的级别,默认值为2,即WARNING级别,对应关系如下:0-DEBUG、1-INFO、2-WARNING、3-ERROR。本次设置为1。\n", + "\n", + "`GLOG_logtostderr`:控制日志输出方式,设置为`1`时,日志输出到屏幕;值设置为`0`时,日志输出到文件。设置输出屏幕时,日志部分的信息会显示成红色,设置成输出到文件时,会在`GLOG_log_dir`路径下生成`mindspore.log`文件。\n", + "\n", + "> 更多设置请参考官网:https://www.mindspore.cn/tutorial/zh-CN/master/advanced_use/customized_debugging_information.html" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + 
"text": [ + "{'GLOG_v': '1', 'GLOG_logtostderr': '1'}\n" + ] + } + ], + "source": [ + "import os\n", + "from mindspore import log as logger\n", + "\n", + "os.environ['GLOG_v'] = '1'\n", + "os.environ['GLOG_logtostderr'] = '1'\n", + "os.environ['GLOG_log_dir'] = 'D:/' if os.name==\"nt\" else '/var/log/mindspore'\n", + "os.environ['logger_maxBytes'] = '5242880'\n", + "os.environ['logger_backupCount'] = '10'\n", + "print(logger.get_log_config())" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "打印信息为`GLOG_v`的等级:`INFO`级别。\n", + "\n", + "输出方式`GLOG_logtostderr`:`1`表示屏幕输出。" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 定义网络模型并执行训练" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### 定义网络模型" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "此过程中先将之前生成的模型文件`.ckpt`和`.meta`的数据删除,并将模型需要用到的参数配置到`Model`。" + ] + }, + { + "cell_type": "code", + "execution_count": 5, + "metadata": {}, + "outputs": [], + "source": [ + "from mindspore import context\n", + "from mindspore.train import Model\n", + "from mindspore.nn.metrics import Accuracy\n", + "from mindspore.nn.loss import SoftmaxCrossEntropyWithLogits\n", + "from mindspore.train.callback import ModelCheckpoint, CheckpointConfig, LossMonitor\n", + "\n", + "# clean files\n", + "if os.name == \"nt\":\n", + " os.system('del/f/s/q *.ckpt *.meta')\n", + "else:\n", + " os.system('rm -f *.ckpt *.meta *.pb')\n", + "\n", + "context.set_context(mode=context.GRAPH_MODE, device_target=\"CPU\")\n", + "lr = 0.01\n", + "momentum = 0.9 \n", + "epoch_size = 3\n", + "train_data_path = \"./MNIST_Data/train\"\n", + "eval_data_path = \"./MNIST_Data/train\"\n", + "\n", + "net_loss = SoftmaxCrossEntropyWithLogits(is_grad=False, sparse=True, reduction='mean')\n", + "repeat_size = epoch_size\n", + "network = LeNet5()\n", + "\n", + "metrics = {\n", + " 'accuracy': nn.Accuracy(),\n", + " 'loss': nn.Loss(),\n", + " 'precision': nn.Precision(),\n", + " 
'recall': nn.Recall(),\n", + " 'f1_score': nn.F1()\n", + " }\n", + "net_opt = nn.Momentum(network.trainable_params(), lr, momentum)\n", + "\n", + "config_ck = CheckpointConfig(save_checkpoint_steps=1875, keep_checkpoint_max=10)\n", + "\n", + "ckpoint_cb = ModelCheckpoint(prefix=\"checkpoint_lenet\", config=config_ck)\n", + "\n", + "model = Model(network, net_loss, net_opt, metrics=metrics)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### 执行训练" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "在构建训练网络中,给`model.train`传入了三个回调函数,分别是`ckpoint_cb`,`LossMonitor`,`stop_cb`;其分别代表如下:\n", + "\n", + "`ckpoint_cb`:即是`ModelCheckpoint`,设置模型保存的回调函数。\n", + "\n", + "`LossMonitor`:loss值监视器,打印训练过程每步的loss值。\n", + "\n", + "`stop_cb`:即是`StopAtTime`,上面刚构建的训练定时器。" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "我们将训练定时器`StopAtTime`设置成18秒,即`run_time=0.3`。" + ] + }, + { + "cell_type": "code", + "execution_count": 6, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "============== Starting Training ==============\n" + ] + }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "[INFO] ME(10004:11540,MainProcess):2020-07-22-16:52:22.904.779 [mindspore\\train\\serialization.py:308] Execute save the graph process.\n" + ] + }, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "epoch: 1 step 375, loss is 2.3015153408050537\n", + "epoch: 1 step 750, loss is 2.2981557846069336\n", + "epoch: 1 step 1125, loss is 2.304901361465454\n", + "epoch: 1 step 1500, loss is 0.27651622891426086\n" + ] + }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "[INFO] ME(10004:11540,MainProcess):2020-07-22-16:52:33.315.965 [mindspore\\train\\serialization.py:119] Execute save checkpoint process.\n", + "[INFO] ME(10004:11540,MainProcess):2020-07-22-16:52:33.325.978 [mindspore\\train\\serialization.py:147] Save checkpoint process finish.\n" + ] + 
}, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "epoch: 1 step 1875, loss is 0.263612300157547\n", + "Epoch time: 11051.060, per step time: 5.894, avg loss: 1.702\n", + "************************************************************\n", + "epoch: 2 step 375, loss is 0.22589832544326782\n", + "epoch: 2 step 750, loss is 0.12003941088914871\n" + ] + }, + { + "name": "stderr", + "output_type": "stream", + "text": [ + "[INFO] ME(10004:11540,MainProcess):2020-07-22-16:52:40.282.209 [mindspore\\train\\serialization.py:119] Execute save checkpoint process.\n", + "[INFO] ME(10004:11540,MainProcess):2020-07-22-16:52:40.297.275 [mindspore\\train\\serialization.py:147] Save checkpoint process finish.\n" + ] + }, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "epoch: 2 step: 2927 loss: 0.26415202\n", + "Epoch time: 6953.909, per step time: 3.709, avg loss: 0.130\n", + "************************************************************\n", + "[, , <__main__.StopAtTime object at 0x000001ACBD016148>]\n" + ] + } + ], + "source": [ + "print(\"============== Starting Training ==============\")\n", + "ds_train = create_dataset(train_data_path, repeat_size = repeat_size)\n", + "stop_cb = StopAtTime(run_time=0.3)\n", + "model.train(epoch_size, ds_train, callbacks=[ckpoint_cb, LossMonitor(375), stop_cb], dataset_sink_mode=False)" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "以上打印信息中,主要分为两部分:\n", + "- 日志信息部分:\n", + " - `[INFO]`部分信息即为日志输出的信息,由于没有Warning信息,目前主要记录的是训练的几个重要步骤。\n", + " \n", + "- 回调函数信息部分:\n", + " - `LossMonitor`:每步的loss值。\n", + " - `StopAtTime`:在每个epoch结束及训练时间结束时,打印当前epoch的训练总时间(单位为毫秒),每步训练花费的时间以及平均loss值,另外在训练结束时还打印了`run_context.list_callback`的信息,这条信息表示本次训练过程中使用的回调函数;另外`run_context.original_args`中还包含以下参数:\n", + " - `train_network`:网络的各类参数。\n", + " - `epoch_num`:训练的epoch数。\n", + " - `batch_num`:一个epoch的step数。\n", + " - `mode`:MODEL的模式。\n", + " - `loss_fn`:使用的损失函数。\n", + " - `optimizer`:使用的优化器。\n", + " - 
`parallel_mode`:并行模式。\n", + " - `device_number`:训练卡的数量。\n", + " - `train_dataset`:训练的数据集。\n", + " - `list_callback`:使用的回调函数。\n", + " - `train_dataset_element`:打印当前batch的数据集。\n", + " - `cur_step_num`:当前训练的step数。\n", + " - `cur_epoch_num`:当前的epoch。\n", + " - `net_outputs`:网络返回值。\n", + "\n", + " 几乎在训练中的所有重要数据,都可以从Callback中取得,所以Callback也是在自定义调试中比较常用的功能。" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 执行测试" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "测试网络中我们的自定义函数`metrics`将在`model.eval`中被调用,除了模型的预测正确率外`recall`,`F1`等不同的检验标准下的预测正确率也会打印出来:" + ] + }, + { + "cell_type": "code", + "execution_count": 7, + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "============== Starting Testing ==============\n", + "============== Accuracy:{'accuracy': 0.9712666666666666, 'loss': 0.0918103571044902, 'precision': array([0.979007 , 0.9815034 , 0.9695254 , 0.99006449, 0.97207177,\n", + " 0.9750469 , 0.97660037, 0.9832609 , 0.91311292, 0.97466828]), 'recall': array([0.99206483, 0.98383269, 0.97717355, 0.92643941, 0.98305375,\n", + " 0.95867921, 0.9873268 , 0.97509976, 0.97709793, 0.95074802]), 'f1_score': array([0.98549266, 0.98266667, 0.97333445, 0.95719582, 0.97753191,\n", + " 0.96679379, 0.98193429, 0.97916333, 0.94402246, 0.96255956])} ==============\n" + ] + } + ], + "source": [ + "print(\"============== Starting Testing ==============\")\n", + "ds_eval = create_dataset(eval_data_path, repeat_size=repeat_size)\n", + "acc = model.eval(ds_eval,dataset_sink_mode = False)\n", + "print(\"============== Accuracy:{} ==============\".format(acc))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "`[INFO]`部分为日志信息。\n", + "\n", + "`Accuracy`部分的信息即为`metric`控制输出的信息,模型的预测值正确率和其他标准下验证(0-9)的正确率值,至于不同的验证标准计算方法,大家可以去官网搜索`mindspore.nn`查找,这里就不多介绍了。" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 总结\n", + "\n", + 
"我们使用了MNIST数据集,通过LeNet5神经网络进行训练,将自定义调试函数结合进其代码中进行调试,展示了使用方法和部分功能,并在过程中展示了训练过程中我们能够通过自定义调试函数输出的数据,来更好的认识自定义调试函数的方便性,以上就是本次的体验内容。" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.7.6" + } + }, + "nbformat": 4, + "nbformat_minor": 4 +} diff --git a/tutorials/notebook/quick_start.ipynb b/tutorials/notebook/quick_start.ipynb index 558fd5bc0b20632de3236e16cebec89127e90ab4..8f425891f317ab34f53d1164e076343d890f39a1 100644 --- a/tutorials/notebook/quick_start.ipynb +++ b/tutorials/notebook/quick_start.ipynb @@ -204,7 +204,7 @@ }, { "data": { - "image/png": "iVBORw0KGgoAAAANSUhEUgAAAPsAAAEICAYAAACZA4KlAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4xLjMsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+AADFEAAANdklEQVR4nO3df6wlZX3H8fendF0i0pSVAitQoRRTraGruUUTmpaGiohNgD8kbhqyJKZLIyQ1wUZCm0iampCmQm2sP5ayZWkVIVUCbWiFYhOCNoQLwWURKpSgrLuyNUBFq+uC3/5xh/Zyub/2nDk/7j7vV3Jy5sycOfPdyX7uM2eemfOkqpB06PuZSRcgaTwMu9QIwy41wrBLjTDsUiMMu9QIw65XSfJUkt+ZdB3ql2HX2CV5c5KvJPnvJE8kuWDSNbXAsGtkkvzsEvNuA/4J2ABsBf4+yZvGXF5zDPsa0h1efzjJzq5VvDnJ4UkuTnLvgvdWkl/upm9I8qkk/5zkB0m+muS4JH+Z5LkkjyV524LN/XqSb3TL/zbJ4fM++3eTPJTk+SRfS3Lagho/kmQn8MNFAv8rwBuAa6vqpar6CvBV4KIed5UWYdjXnguBc4CTgdOAiw9ivT8Bjgb2A/8OPNi9/gfgmgXv/z3g3cApwJu6dUnydmA7cAnweuCzwO1J1s9bdzPwXuDnq+rF7g/Np7plWaS2AG9d5b9DAzLsa89fVdWeqnoW+Edg0yrXu7WqHqiqHwO3Aj+uqhur6iXgZmBhy/7Jqnq6287HmAswwO8Dn62q+7qWeQdzfzzeuaDGp6vqRwBV9cGq+mC37DFgH/BHSdYlORv4LeC1B7MTdPAM+9rz3XnT/wO8bpXrPTNv+keLvF74OU/Pm/4Wc4feAG8ELu8O4Z9P8jxw4rzlC9d9hao6AJzPXMv/XeBy4BZg9yr/HRrQq06gaE36IfNaxiTH9fCZJ86b/kVgTzf9NPCxqvrYMusueytlVe1krjUHIMnXgB0D1qlVsmU/NHwd+NUkm7oTaVf18JmXJjkhyQbgSuYO9QGuA/4gyTsy54gk701y5Go/OMlp3YnF1yb5MLARuKGHmrUMw34IqK
pvAn8K/CvwOHDv8musyueBO4Enu8efdduaZe57+yeB54AnWOEkYZLPJPnMvFkXAXuZ++5+FvCuqtrfQ81aRvzxCqkNtuxSIwy71AjDLjXCsEuNGGs/+2uyvg7niHFuUmrKj/khP6n9i12SPFzYk5wDfAI4DPibqrp6ufcfzhG8I2cNs0lJy7iv7l5y2cCH8UkOA/4aeA/wFmBzkrcM+nmSRmuY7+ynA09U1ZNV9RPgC8B5/ZQlqW/DhP14XnnDw+5u3isk2ZpkNsnsAbxISpqUYcK+2EmAV12OV1XbqmqmqmbWsX6RVSSNwzBh380r74w6gf+/M0rSlBkm7PcDpyY5OclrgPcDt/dTlqS+Ddz11v3c0GXAl5nretteVY/0VpmkXg3Vz15VdwB39FSLpBHyclmpEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdasRQQzYneQp4AXgJeLGqZvooSlL/hgp757er6ns9fI6kEfIwXmrEsGEv4M4kDyTZutgbkmxNMptk9gD7h9ycpEENexh/RlXtSXIMcFeSx6rqnvlvqKptwDaAn8uGGnJ7kgY0VMteVXu6533ArcDpfRQlqX8Dhz3JEUmOfHkaOBvY1Vdhkvo1zGH8scCtSV7+nM9X1b/0UpWk3g0c9qp6Evi1HmuRNEJ2vUmNMOxSIwy71AjDLjXCsEuN6ONGGB3CvrznoUmXMLB3v2HTpEuYKrbsUiMMu9QIwy41wrBLjTDsUiMMu9QIwy41wn72Q9xa7idXv2zZpUYYdqkRhl1qhGGXGmHYpUYYdqkRhl1qhP3shwD70rUatuxSIwy71AjDLjXCsEuNMOxSIwy71AjDLjXCfvY1wH509WHFlj3J9iT7kuyaN29DkruSPN49HzXaMiUNazWH8TcA5yyYdwVwd1WdCtzdvZY0xVYMe1XdAzy7YPZ5wI5uegdwfs91SerZoCfojq2qvQDd8zFLvTHJ1iSzSWYPsH/AzUka1sjPxlfVtqqaqaqZdawf9eYkLWHQsD+TZCNA97yvv5IkjcKgYb8d2NJNbwFu66ccSaOyYj97kpuAM4Gjk+wGPgpcDdyS5APAt4H3jbLIQ90k+9EnOYa51w+M14phr6rNSyw6q+daJI2Ql8tKjTDsUiMMu9QIwy41wrBLjfAW1zFotWsN7F6bJrbsUiMMu9QIwy41wrBLjTDsUiMMu9QIwy41wn72Q8Ck+9K1NtiyS40w7FIjDLvUCMMuNcKwS40w7FIjDLvUCPvZe+A921oLbNmlRhh2qRGGXWqEYZcaYdilRhh2qRGGXWqE/exrgPerL879cnBWbNmTbE+yL8muefOuSvKdJA91j3NHW6akYa3mMP4G4JxF5l9bVZu6xx39liWpbyuGvaruAZ4dQy2SRmiYE3SXJdnZHeYftdSbkmxNMptk9gD7h9icpGEMGvZPA6cAm4C9wMeXemNVbauqmaqaWcf6ATcnaVgDhb2qnqmql6rqp8B1wOn9liWpbwOFPcnGeS8vAHYt9V5J02HFfvYkNwFnAkcn2Q18FDgzySaggKeAS0ZYo6aY9/KvHSuGvao2LzL7+hHUImmEvFxWaoRhlxph2KVGGHapEYZdaoS3uK4BK3VvDXOrp11n7bBllxph2KVGGHapEYZdaoRhlxph2KVGGHapEfaz92Clfu5R92Ufqn3l/lR0v2zZpUYYdqkRhl1qhGGXGmHYpUYYdqkRhl1qhP3sYzDpfngJbNmlZhh2qRGGXWqEYZcaYdilRhh2qRGGXWrEaoZsPhG4ETgO+Cmwrao+kWQDcDNwEnPDNl9YVc+NrtRD11q+b9trBNaO1bTsLwKXV9WbgXcClyZ5C3AFcHdVnQrc3b2WNKVWDHtV7a2qB7vpF4BHgeOB84
Ad3dt2AOePqkhJwzuo7+xJTgLeBtwHHFtVe2HuDwJwTN/FSerPqsOe5HXAF4EPVdX3D2K9rUlmk8weYP8gNUrqwarCnmQdc0H/XFV9qZv9TJKN3fKNwL7F1q2qbVU1U1Uz61jfR82SBrBi2JMEuB54tKqumbfodmBLN70FuK3/8iT1ZTW3uJ4BXAQ8nOTlfpYrgauBW5J8APg28L7RlCipDyuGvaruBbLE4rP6LUfSqHgFndQIwy41wrBLjTDsUiMMu9QIwy41wrBLjTDsUiMMu9QIwy41wrBLjTDsUiMMu9QIwy41wiGbNTFr+Se01yJbdqkRhl1qhGGXGmHYpUYYdqkRhl1qhGGXGmHYpUYYdqkRhl1qhGGXGmHYpUYYdqkRhl1qhGGXGrFi2JOcmOTfkjya5JEkf9jNvyrJd5I81D3OHX25kga1mh+veBG4vKoeTHIk8ECSu7pl11bVX4yuPEl9WTHsVbUX2NtNv5DkUeD4URcmqV8H9Z09yUnA24D7ulmXJdmZZHuSo5ZYZ2uS2SSzB9g/VLGSBrfqsCd5HfBF4ENV9X3g08ApwCbmWv6PL7ZeVW2rqpmqmlnH+h5KljSIVYU9yTrmgv65qvoSQFU9U1UvVdVPgeuA00dXpqRhreZsfIDrgUer6pp58zfOe9sFwK7+y5PUl9WcjT8DuAh4OMlD3bwrgc1JNgEFPAVcMpIKJfViNWfj7wWyyKI7+i9H0qh4BZ3UCMMuNcKwS40w7FIjDLvUCMMuNcIhmzUUh11eO2zZpUYYdqkRhl1qhGGXGmHYpUYYdqkRhl1qRKpqfBtL/gv41rxZRwPfG1sBB2daa5vWusDaBtVnbW+sql9YbMFYw/6qjSezVTUzsQKWMa21TWtdYG2DGldtHsZLjTDsUiMmHfZtE97+cqa1tmmtC6xtUGOpbaLf2SWNz6RbdkljYtilRkwk7EnOSfIfSZ5IcsUkalhKkqeSPNwNQz074Vq2J9mXZNe8eRuS3JXk8e550TH2JlTbVAzjvcww4xPdd5Me/nzs39mTHAZ8E3gXsBu4H9hcVd8YayFLSPIUMFNVE78AI8lvAj8Abqyqt3bz/hx4tqqu7v5QHlVVH5mS2q4CfjDpYby70Yo2zh9mHDgfuJgJ7rtl6rqQMey3SbTspwNPVNWTVfUT4AvAeROoY+pV1T3Aswtmnwfs6KZ3MPefZeyWqG0qVNXeqnqwm34BeHmY8Ynuu2XqGotJhP144Ol5r3czXeO9F3BnkgeSbJ10MYs4tqr2wtx/HuCYCdez0IrDeI/TgmHGp2bfDTL8+bAmEfbFhpKapv6/M6rq7cB7gEu7w1WtzqqG8R6XRYYZnwqDDn8+rEmEfTdw4rzXJwB7JlDHoqpqT/e8D7iV6RuK+pmXR9DtnvdNuJ7/M03DeC82zDhTsO8mOfz5JMJ+P3BqkpOTvAZ4P3D7BOp4lSRHdCdOSHIEcDbTNxT17cCWbnoLcNsEa3mFaRnGe6lhxpnwvpv48OdVNfYHcC5zZ+T/E/jjSdSwRF2/BHy9ezwy6dqAm5g7rDvA3BHRB4DXA3cDj3fPG6aotr8DHgZ2MhesjROq7TeY+2q4E3ioe5w76X23TF1j2W9eLis1wivopEYYdqkRhl1qhGGXGmHYpUYYdqkRhl1qxP8CnmgwF585kCIAAAAASUVORK5CYII=\n", + "image/png": 
"iVBORw0KGgoAAAANSUhEUgAAAPsAAAEICAYAAACZA4KlAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4xLjMsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+AADFEAAANnklEQVR4nO3df6xkZX3H8fendF0i2pSVAitSoRRTqbGruUUTmtaGiohNgD8kkoZgYlwaNakJNhraRNLUhDRVa2P9sRTK0ipCqgTa0CrFJgRtCBeyLiBUKEFZd2U1SAWr6wLf/nEP7eVyf+3MmTlzed6vZDJnzo8533tyP/c5c54590lVIemF7+eGLkDSdBh2qRGGXWqEYZcaYdilRhh2qRGGXc+T5OEkvzd0HeqXYdfUJXl1kq8m+e8kDyY5d+iaWmDYNTFJfn6FeTcA/wxsAbYD/5DkVVMurzmGfQPpTq8/kGR31ypem+TwJO9MctuSdSvJr3bTVyX5VJJ/SfJkkq8lOTbJXyX5YZL7k7xuye5+M8k3u+V/l+TwRe/9+0l2JXk8ydeTvHZJjR9Mshv48TKB/zXg5cDHq+rpqvoq8DXggh4PlZZh2Dee84AzgROB1wLvPITt/hQ4CjgA/AdwV/f6H4GPLVn/D4C3ACcBr+q2JcnrgSuBi4CXAZ8FbkyyedG25wNvA36xqp7q/tB8qluWZWoL8Jp1/hwakWHfeP66qvZW1WPAPwHb1rnd9VV1Z1X9FLge+GlVXV1VTwPXAktb9k9W1SPdfj7CQoAB3g18tqpu71rmnSz88XjjkhofqaqfAFTVe6rqPd2y+4H9wB8n2ZTkDOB3gBcfykHQoTPsG8/3Fk3/D/CSdW736KLpnyzzeun7PLJo+tssnHoDvBK4uDuFfzzJ48Dxi5Yv3fY5quogcA4LLf/3gIuB64A96/w5NKLnXUDRhvRjFrWMSY7t4T2PXzT9y8DebvoR4CNV9ZFVtl31Vsqq2s1Caw5Akq8DO0esU+tky/7C8A3g15Ns6y6kXdrDe743ySuSbAEuYeFUH+By4A+TvCELjkjytiQvXe8bJ3ltd2HxxUk+AGwFruqhZq3CsL8AVNW3gD8D/g14ALht9S3W5fPAV4CHusefd/uaZ+Fz+yeBHwIPssZFwiSfSfKZRbMuAPax8Nn9dODNVXWgh5q1ivjPK6Q22LJLjTDsUiMMu9QIwy41Yqr97C/K5jqcI6a5S6kpP+XH/KwOLPeV5PHCnuRM4BPAYcDfVtVlq61/OEfwhpw+zi4lreL2umXFZSOfxic5DPgb4K3AKcD5SU4Z9f0kTdY4n9lPBR6sqoeq6mfAF4Cz+ylLUt/GCftxPPeGhz3dvOdIsj3JfJL5g/glKWko44R9uYsAz/s6XlXtqKq5qprbxOZlNpE0DeOEfQ/PvTPqFfz/nVGSZsw4Yb8DODnJiUleBLwDuLGfsiT1beSut+7fDb0P+DILXW9XVtW9vVUmqVdj9bNX1U3ATT3VImmC/Lqs1AjDLjXCsEuNMOxSIwy71AjDLjXCsEuNMOxSIwy71AjDLjXCsEuNMOxSIwy71AjDLjXCsEuNMOxSIwy71AjDLjXCsEuNMOxSIwy71AjDLjXCsEuNMOxSIwy71AjDLjXCsEuNMOxSIwy71IixRnHVdHx5766hS5iIt7x829AlNGWssCd5GHgCeBp4qqrm+ihKUv/6aNl/t6p+0MP7SJogP7NLjRg37AV8JcmdSbYvt0KS7Unmk8wf5MCYu5M0qnFP40+rqr1JjgZuTnJ/Vd26eIWq2gHsAPiFbKkx9ydpRGO17FW1t3veD1wPnNpHUZL6N3LYkxyR5KXPTgNnAPf0VZikfo1zGn8McH2SZ9/n81X1r71U1ZgXaj/6Wtb6ue2H79fIYa+qh4Df6LEWSRNk15vUCMMuNcKwS40w7FIjDLvUCG9xbdxa3Vutdgu+ENmyS40w7FIjDLvUCMMuNcK
wS40w7FIjDLvUCPvZp2DIvupxbxMdsh/eW2D7ZcsuNcKwS40w7FIjDLvUCMMuNcKwS40w7FIj7GfvwdD3fA/Z3+z98BuHLbvUCMMuNcKwS40w7FIjDLvUCMMuNcKwS40w7FIj1gx7kiuT7E9yz6J5W5LcnOSB7vnIyZYpaVzradmvAs5cMu9DwC1VdTJwS/da0gxbM+xVdSvw2JLZZwM7u+mdwDk91yWpZ6N+Zj+mqvYBdM9Hr7Riku1J5pPMH+TAiLuTNK6JX6Crqh1VNVdVc5vYPOndSVrBqGF/NMlWgO55f38lSZqEUcN+I3BhN30hcEM/5UialDXvZ09yDfAm4Kgke4APA5cB1yV5F/Ad4O2TLLJ1/n909WHNsFfV+SssOr3nWiRNkN+gkxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxrhkM3asMYZDrrF24Zt2aVGGHapEYZdaoRhlxph2KVGGHapEYZdaoT97Jqocfqzx+lH1/PZskuNMOxSIwy71AjDLjXCsEuNMOxSIwy71Aj72TeAtfqbN+q92fajT9eaLXuSK5PsT3LPonmXJvlukl3d46zJlilpXOs5jb8KOHOZ+R+vqm3d46Z+y5LUtzXDXlW3Ao9NoRZJEzTOBbr3JdndneYfudJKSbYnmU8yf5ADY+xO0jhGDfungZOAbcA+4KMrrVhVO6pqrqrmNrF5xN1JGtdIYa+qR6vq6ap6BrgcOLXfsiT1baSwJ9m66OW5wD0rrStpNqzZz57kGuBNwFFJ9gAfBt6UZBtQwMPARROsUVIP1gx7VZ2/zOwrJlCLpAny67JSIwy71AjDLjXCsEuNMOxSI7zFtQdr3WI66Vs5V3v/oW9/HfI21qF/9lljyy41wrBLjTDsUiMMu9QIwy41wrBLjTDsUiPsZ5+CIfvh/XfNepYtu9QIwy41wrBLjTDsUiMMu9QIwy41wrBLjbCffQYMfT+82mDLLjXCsEuNMOxSIwy71AjDLjXCsEuNMOxSI9YzZPPxwNXAscAzwI6q+kSSLcC1wAksDNt8XlX9cHKltst+ePVhPS37U8DFVfVq4I3Ae5OcAnwIuKWqTgZu6V5LmlFrhr2q9lXVXd30E8B9wHHA2cDObrWdwDmTKlLS+A7pM3uSE4DXAbcDx1TVPlj4gwAc3Xdxkvqz7rAneQnwReD9VfWjQ9hue5L5JPMHOTBKjZJ6sK6wJ9nEQtA/V1Vf6mY/mmRrt3wrsH+5batqR1XNVdXcJjb3UbOkEawZ9iQBrgDuq6qPLVp0I3BhN30hcEP/5Unqy3pucT0NuAC4O8mzfTyXAJcB1yV5F/Ad4O2TKVFr2ahDE9tlOF1rhr2qbgOywuLT+y1H0qT4DTqpEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdaoRhlxph2KVGGHapEYZdasSaQzYnOR64GjgWeAbYUVWfSHIp8G7g+92ql1TVTZMqVC88G3Vc+Y1qzbADTwEXV9VdSV4K3Jnk5m7Zx6vqLydXnqS+rBn2qtoH7Oumn0hyH3DcpAuT1K9D+sye5ATgdcDt3az3Jdmd5MokR66wzfYk80nmD3JgrGIljW7dYU/yEuCLwPur6kfAp4GTgG0stPwfXW67qtpRVXNVNbeJzT2ULGkU6wp7kk0sBP1zVfUlgKp6tKqerqpngMuBUydXpqRxrRn2JAGuAO6rqo8tmr910WrnAvf0X56kvqznavxpwAXA3Ul2dfMuAc5Psg0o4GHgoolUKKkX67kafxuQZRbZpy5tIH6DTmqEYZcaYdilRhh2qRGGXWqEYZcaYdilRhh2qRGGXWqEYZcaYdilRhh2qRGGXWqEYZcakaqa3s6S7wPfXjTrKOAHUyvg0MxqbbNaF1j
bqPqs7ZVV9UvLLZhq2J+382S+quYGK2AVs1rbrNYF1jaqadXmabzUCMMuNWLosO8YeP+rmdXaZrUusLZRTaW2QT+zS5qeoVt2SVNi2KVGDBL2JGcm+c8kDyb50BA1rCTJw0nuTrIryfzAtVyZZH+SexbN25Lk5iQPdM/LjrE3UG2XJvlud+x2JTlroNqOT/LvSe5Lcm+SP+rmD3rsVqlrKsdt6p/ZkxwGfAt4M7AHuAM4v6q+OdVCVpDkYWCuqgb/AkaS3waeBK6uqtd08/4CeKyqLuv+UB5ZVR+ckdouBZ4cehjvbrSirYuHGQfOAd7JgMdulbrOYwrHbYiW/VTgwap6qKp+BnwBOHuAOmZeVd0KPLZk9tnAzm56Jwu/LFO3Qm0zoar2VdVd3fQTwLPDjA967FapayqGCPtxwCOLXu9htsZ7L+ArSe5Msn3oYpZxTFXtg4VfHuDogetZas1hvKdpyTDjM3PsRhn+fFxDhH25oaRmqf/vtKp6PfBW4L3d6arWZ13DeE/LMsOMz4RRhz8f1xBh3wMcv+j1K4C9A9SxrKra2z3vB65n9oaifvTZEXS75/0D1/N/ZmkY7+WGGWcGjt2Qw58PEfY7gJOTnJjkRcA7gBsHqON5khzRXTghyRHAGczeUNQ3Ahd20xcCNwxYy3PMyjDeKw0zzsDHbvDhz6tq6g/gLBauyP8X8CdD1LBCXb8CfKN73Dt0bcA1LJzWHWThjOhdwMuAW4AHuuctM1Tb3wN3A7tZCNbWgWr7LRY+Gu4GdnWPs4Y+dqvUNZXj5tdlpUb4DTqpEYZdaoRhlxph2KVGGHapEYZdaoRhlxrxv3jNRdG9OXAOAAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] @@ -387,7 +387,7 @@ "text": [ "dict_keys(['label', 'image'])\n", "Tensor of image: (32, 1, 32, 32)\n", - "labels: [6 3 5 8 9 0 1 8 1 6 8 9 1 3 0 9 0 4 0 2 2 6 7 9 4 9 1 4 1 4 3 3]\n" + "labels: [7 4 0 6 6 6 5 8 3 5 4 8 5 3 2 9 3 7 3 0 9 2 0 6 3 3 6 2 5 9 2 0]\n" ] } ], @@ -414,7 +414,7 @@ "outputs": [ { "data": { - "image/png": "\n", + "image/png": "\n", "text/plain": [ "
" ] @@ -637,7 +637,7 @@ "metadata": {}, "source": [ "构建完成神经网络后,就可以着手进行训练网络的构建,模型训练函数为`Model.train`,参数主要包含:\n", - "1. 圈数`epoch size`(每圈需要遍历完成1875组图片);\n", + "1. 每个`epoch`需要遍历完成图片的`batch`数:`epoch_size`;\n", "2. 数据集`ds_train`;\n", "3. 回调函数`callbacks`包含`ModelCheckpoint`、`LossMonitor`和`Callback`模型检测参数;\n", "4. 数据下沉模式`dataset_sink_mode`,此参数默认`True`需设置成`False`,因为此功能不支持CPU模式。" @@ -661,7 +661,7 @@ " print(\"============== Starting Training ==============\")\n", " # load training dataset\n", " ds_train = create_dataset(os.path.join(mnist_path, \"train\"), 32, repeat_size)\n", - " model.train(epoch_size, ds_train, callbacks=[ckpoint_cb, LossMonitor(), step_loss_info], dataset_sink_mode=False)" + " model.train(epoch_size, ds_train, callbacks=[ckpoint_cb, LossMonitor(125), step_loss_info], dataset_sink_mode=False)" ] }, { @@ -717,1942 +717,22 @@ "output_type": "stream", "text": [ "============== Starting Training ==============\n", - "epoch: 1 step 1, loss is 2.3036417961120605\n", - "epoch: 1 step 2, loss is 2.298630714416504\n", - "epoch: 1 step 3, loss is 2.302391529083252\n", - "epoch: 1 step 4, loss is 2.304328441619873\n", - "epoch: 1 step 5, loss is 2.3027987480163574\n", - "epoch: 1 step 6, loss is 2.2987725734710693\n", - "epoch: 1 step 7, loss is 2.298673629760742\n", - "epoch: 1 step 8, loss is 2.30534029006958\n", - "epoch: 1 step 9, loss is 2.303036689758301\n", - "epoch: 1 step 10, loss is 2.2993321418762207\n", - "epoch: 1 step 11, loss is 2.305026054382324\n", - "epoch: 1 step 12, loss is 2.302748918533325\n", - "epoch: 1 step 13, loss is 2.3028924465179443\n", - "epoch: 1 step 14, loss is 2.2981364727020264\n", - "epoch: 1 step 15, loss is 2.3044252395629883\n", - "epoch: 1 step 16, loss is 2.2937963008880615\n", - "epoch: 1 step 17, loss is 2.302947759628296\n", - "epoch: 1 step 18, loss is 2.301767587661743\n", - "epoch: 1 step 19, loss is 2.302520275115967\n", - "epoch: 1 step 20, loss is 2.2997517585754395\n", - "epoch: 1 step 21, loss is 
2.300358533859253\n", - "epoch: 1 step 22, loss is 2.296435832977295\n", - "epoch: 1 step 23, loss is 2.3037216663360596\n", - "epoch: 1 step 24, loss is 2.3002805709838867\n", - "epoch: 1 step 25, loss is 2.3053646087646484\n", - "epoch: 1 step 26, loss is 2.296821355819702\n", - "epoch: 1 step 27, loss is 2.2995078563690186\n", - "epoch: 1 step 28, loss is 2.305546760559082\n", - "epoch: 1 step 29, loss is 2.305027484893799\n", - "epoch: 1 step 30, loss is 2.3053534030914307\n", - "epoch: 1 step 31, loss is 2.3146908283233643\n", - "epoch: 1 step 32, loss is 2.2932584285736084\n", - "epoch: 1 step 33, loss is 2.303548574447632\n", - "epoch: 1 step 34, loss is 2.3062822818756104\n", - "epoch: 1 step 35, loss is 2.307175397872925\n", - "epoch: 1 step 36, loss is 2.3170557022094727\n", - "epoch: 1 step 37, loss is 2.29900860786438\n", - "epoch: 1 step 38, loss is 2.301845073699951\n", - "epoch: 1 step 39, loss is 2.2978885173797607\n", - "epoch: 1 step 40, loss is 2.3042001724243164\n", - "epoch: 1 step 41, loss is 2.303277015686035\n", - "epoch: 1 step 42, loss is 2.2995805740356445\n", - "epoch: 1 step 43, loss is 2.307396411895752\n", - "epoch: 1 step 44, loss is 2.3049159049987793\n", - "epoch: 1 step 45, loss is 2.304211378097534\n", - "epoch: 1 step 46, loss is 2.300304651260376\n", - "epoch: 1 step 47, loss is 2.30509614944458\n", - "epoch: 1 step 48, loss is 2.2977967262268066\n", - "epoch: 1 step 49, loss is 2.2965590953826904\n", - "epoch: 1 step 50, loss is 2.300846815109253\n", - "epoch: 1 step 51, loss is 2.3011419773101807\n", - "epoch: 1 step 52, loss is 2.291372537612915\n", - "epoch: 1 step 53, loss is 2.3017473220825195\n", - "epoch: 1 step 54, loss is 2.302234649658203\n", - "epoch: 1 step 55, loss is 2.3049476146698\n", - "epoch: 1 step 56, loss is 2.296499252319336\n", - "epoch: 1 step 57, loss is 2.3119330406188965\n", - "epoch: 1 step 58, loss is 2.29875111579895\n", - "epoch: 1 step 59, loss is 2.299650192260742\n", - "epoch: 1 step 60, loss 
is 2.295192241668701\n", - "epoch: 1 step 61, loss is 2.2914061546325684\n", - "epoch: 1 step 62, loss is 2.312401294708252\n", - "epoch: 1 step 63, loss is 2.3097002506256104\n", - "epoch: 1 step 64, loss is 2.299806594848633\n", - "epoch: 1 step 65, loss is 2.302612543106079\n", - "epoch: 1 step 66, loss is 2.2979307174682617\n", - "epoch: 1 step 67, loss is 2.31546950340271\n", - "epoch: 1 step 68, loss is 2.3103079795837402\n", - "epoch: 1 step 69, loss is 2.309143304824829\n", - "epoch: 1 step 70, loss is 2.305349349975586\n", - "epoch: 1 step 71, loss is 2.3021116256713867\n", - "epoch: 1 step 72, loss is 2.302931547164917\n", - "epoch: 1 step 73, loss is 2.299171209335327\n", - "epoch: 1 step 74, loss is 2.3023552894592285\n", - "epoch: 1 step 75, loss is 2.297666311264038\n", - "epoch: 1 step 76, loss is 2.2955446243286133\n", - "epoch: 1 step 77, loss is 2.303368091583252\n", - "epoch: 1 step 78, loss is 2.2970433235168457\n", - "epoch: 1 step 79, loss is 2.3118627071380615\n", - "epoch: 1 step 80, loss is 2.2917704582214355\n", - "epoch: 1 step 81, loss is 2.3150627613067627\n", - "epoch: 1 step 82, loss is 2.3017776012420654\n", - "epoch: 1 step 83, loss is 2.3065297603607178\n", - "epoch: 1 step 84, loss is 2.318979024887085\n", - "epoch: 1 step 85, loss is 2.3001444339752197\n", - "epoch: 1 step 86, loss is 2.3014917373657227\n", - "epoch: 1 step 87, loss is 2.309880018234253\n", - "epoch: 1 step 88, loss is 2.294804096221924\n", - "epoch: 1 step 89, loss is 2.3006980419158936\n", - "epoch: 1 step 90, loss is 2.308440685272217\n", - "epoch: 1 step 91, loss is 2.3000504970550537\n", - "epoch: 1 step 92, loss is 2.3011293411254883\n", - "epoch: 1 step 93, loss is 2.3041226863861084\n", - "epoch: 1 step 94, loss is 2.289018392562866\n", - "epoch: 1 step 95, loss is 2.3014912605285645\n", - "epoch: 1 step 96, loss is 2.302461862564087\n", - "epoch: 1 step 97, loss is 2.3085408210754395\n", - "epoch: 1 step 98, loss is 2.296156167984009\n", - "epoch: 1 step 
99, loss is 2.29742169380188\n", - "epoch: 1 step 100, loss is 2.304044246673584\n", - "epoch: 1 step 101, loss is 2.3033947944641113\n", - "epoch: 1 step 102, loss is 2.304230213165283\n", - "epoch: 1 step 103, loss is 2.306795358657837\n", - "epoch: 1 step 104, loss is 2.3009331226348877\n", - "epoch: 1 step 105, loss is 2.2976226806640625\n", - "epoch: 1 step 106, loss is 2.294156551361084\n", - "epoch: 1 step 107, loss is 2.299952507019043\n", - "epoch: 1 step 108, loss is 2.309349298477173\n", - "epoch: 1 step 109, loss is 2.300881862640381\n", - "epoch: 1 step 110, loss is 2.311763048171997\n", - "epoch: 1 step 111, loss is 2.3041746616363525\n", - "epoch: 1 step 112, loss is 2.296386241912842\n", - "epoch: 1 step 113, loss is 2.3057456016540527\n", - "epoch: 1 step 114, loss is 2.2932939529418945\n", - "epoch: 1 step 115, loss is 2.3116936683654785\n", - "epoch: 1 step 116, loss is 2.2911524772644043\n", - "epoch: 1 step 117, loss is 2.3195571899414062\n", - "epoch: 1 step 118, loss is 2.3009743690490723\n", - "epoch: 1 step 119, loss is 2.2873072624206543\n", - "epoch: 1 step 120, loss is 2.3133697509765625\n", - "epoch: 1 step 121, loss is 2.30419659614563\n", - "epoch: 1 step 122, loss is 2.2948532104492188\n", - "epoch: 1 step 123, loss is 2.3005154132843018\n", - "epoch: 1 step 124, loss is 2.302123785018921\n", - "epoch: 1 step 125, loss is 2.3159165382385254\n", - "epoch: 1 step 126, loss is 2.299440383911133\n", - "epoch: 1 step 127, loss is 2.294107437133789\n", - "epoch: 1 step 128, loss is 2.3011081218719482\n", - "epoch: 1 step 129, loss is 2.2969794273376465\n", - "epoch: 1 step 130, loss is 2.3207995891571045\n", - "epoch: 1 step 131, loss is 2.312513589859009\n", - "epoch: 1 step 132, loss is 2.3004298210144043\n", - "epoch: 1 step 133, loss is 2.3079676628112793\n", - "epoch: 1 step 134, loss is 2.291200637817383\n", - "epoch: 1 step 135, loss is 2.3006200790405273\n", - "epoch: 1 step 136, loss is 2.310053586959839\n", - "epoch: 1 step 137, 
loss is 2.305415630340576\n", - "epoch: 1 step 138, loss is 2.2974250316619873\n", - "epoch: 1 step 139, loss is 2.3018136024475098\n", - "epoch: 1 step 140, loss is 2.2954070568084717\n", - "epoch: 1 step 141, loss is 2.3027243614196777\n", - "epoch: 1 step 142, loss is 2.3045427799224854\n", - "epoch: 1 step 143, loss is 2.2927889823913574\n", - "epoch: 1 step 144, loss is 2.3079490661621094\n", - "epoch: 1 step 145, loss is 2.308187484741211\n", - "epoch: 1 step 146, loss is 2.2903213500976562\n", - "epoch: 1 step 147, loss is 2.3045201301574707\n", - "epoch: 1 step 148, loss is 2.2987563610076904\n", - "epoch: 1 step 149, loss is 2.3156399726867676\n", - "epoch: 1 step 150, loss is 2.298570394515991\n", - "epoch: 1 step 151, loss is 2.328263521194458\n", - "epoch: 1 step 152, loss is 2.3073980808258057\n", - "epoch: 1 step 153, loss is 2.29105281829834\n", - "epoch: 1 step 154, loss is 2.3087775707244873\n", - "epoch: 1 step 155, loss is 2.308013439178467\n", - "epoch: 1 step 156, loss is 2.3050355911254883\n", - "epoch: 1 step 157, loss is 2.292358636856079\n", - "epoch: 1 step 158, loss is 2.3048012256622314\n", - "epoch: 1 step 159, loss is 2.3068573474884033\n", - "epoch: 1 step 160, loss is 2.3111112117767334\n", - "epoch: 1 step 161, loss is 2.2875633239746094\n", - "epoch: 1 step 162, loss is 2.2821505069732666\n", - "epoch: 1 step 163, loss is 2.2968549728393555\n", - "epoch: 1 step 164, loss is 2.283661127090454\n", - "epoch: 1 step 165, loss is 2.305659532546997\n", - "epoch: 1 step 166, loss is 2.302140712738037\n", - "epoch: 1 step 167, loss is 2.295464038848877\n", - "epoch: 1 step 168, loss is 2.279547691345215\n", - "epoch: 1 step 169, loss is 2.2998762130737305\n", - "epoch: 1 step 170, loss is 2.2972824573516846\n", - "epoch: 1 step 171, loss is 2.306971549987793\n", - "epoch: 1 step 172, loss is 2.3028149604797363\n", - "epoch: 1 step 173, loss is 2.2961080074310303\n", - "epoch: 1 step 174, loss is 2.306591033935547\n", - "epoch: 1 step 175, 
loss is 2.294856309890747\n", - "epoch: 1 step 176, loss is 2.3139917850494385\n", - "epoch: 1 step 177, loss is 2.3160483837127686\n", - "epoch: 1 step 178, loss is 2.288738250732422\n", - "epoch: 1 step 179, loss is 2.3248589038848877\n", - "epoch: 1 step 180, loss is 2.2906830310821533\n", - "epoch: 1 step 181, loss is 2.295781373977661\n", - "epoch: 1 step 182, loss is 2.2896437644958496\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 1 step 183, loss is 2.309624433517456\n", - "epoch: 1 step 184, loss is 2.323194742202759\n", - "epoch: 1 step 185, loss is 2.311706781387329\n", - "epoch: 1 step 186, loss is 2.3111326694488525\n", - "epoch: 1 step 187, loss is 2.3091180324554443\n", - "epoch: 1 step 188, loss is 2.2965402603149414\n", - "epoch: 1 step 189, loss is 2.295259475708008\n", - "epoch: 1 step 190, loss is 2.296231269836426\n", - "epoch: 1 step 191, loss is 2.303983211517334\n", - "epoch: 1 step 192, loss is 2.306814432144165\n", - "epoch: 1 step 193, loss is 2.2905185222625732\n", - "epoch: 1 step 194, loss is 2.308838367462158\n", - "epoch: 1 step 195, loss is 2.2948405742645264\n", - "epoch: 1 step 196, loss is 2.298696994781494\n", - "epoch: 1 step 197, loss is 2.2964284420013428\n", - "epoch: 1 step 198, loss is 2.3125040531158447\n", - "epoch: 1 step 199, loss is 2.2885444164276123\n", - "epoch: 1 step 200, loss is 2.306826114654541\n", - "epoch: 1 step 201, loss is 2.309328079223633\n", - "epoch: 1 step 202, loss is 2.2883524894714355\n", - "epoch: 1 step 203, loss is 2.304533004760742\n", - "epoch: 1 step 204, loss is 2.301948308944702\n", - "epoch: 1 step 205, loss is 2.3106493949890137\n", - "epoch: 1 step 206, loss is 2.305791139602661\n", - "epoch: 1 step 207, loss is 2.3050265312194824\n", - "epoch: 1 step 208, loss is 2.2933125495910645\n", - "epoch: 1 step 209, loss is 2.30745792388916\n", - "epoch: 1 step 210, loss is 2.312147855758667\n", - "epoch: 1 step 211, loss is 2.3021814823150635\n", - "epoch: 
1 step 212, loss is 2.282095432281494\n", - "epoch: 1 step 213, loss is 2.2988312244415283\n", - "epoch: 1 step 214, loss is 2.299859046936035\n", - "epoch: 1 step 215, loss is 2.2955031394958496\n", - "epoch: 1 step 216, loss is 2.288876533508301\n", - "epoch: 1 step 217, loss is 2.2997374534606934\n", - "epoch: 1 step 218, loss is 2.2835824489593506\n", - "epoch: 1 step 219, loss is 2.3038642406463623\n", - "epoch: 1 step 220, loss is 2.298015594482422\n", - "epoch: 1 step 221, loss is 2.2944860458374023\n", - "epoch: 1 step 222, loss is 2.2981929779052734\n", - "epoch: 1 step 223, loss is 2.3046724796295166\n", - "epoch: 1 step 224, loss is 2.3166990280151367\n", - "epoch: 1 step 225, loss is 2.2997946739196777\n", - "epoch: 1 step 226, loss is 2.2982776165008545\n", - "epoch: 1 step 227, loss is 2.304516553878784\n", - "epoch: 1 step 228, loss is 2.294159173965454\n", - "epoch: 1 step 229, loss is 2.2979888916015625\n", - "epoch: 1 step 230, loss is 2.3228976726531982\n", - "epoch: 1 step 231, loss is 2.300828695297241\n", - "epoch: 1 step 232, loss is 2.3062961101531982\n", - "epoch: 1 step 233, loss is 2.3068461418151855\n", - "epoch: 1 step 234, loss is 2.2954905033111572\n", - "epoch: 1 step 235, loss is 2.315176486968994\n", - "epoch: 1 step 236, loss is 2.2987992763519287\n", - "epoch: 1 step 237, loss is 2.3124282360076904\n", - "epoch: 1 step 238, loss is 2.298109292984009\n", - "epoch: 1 step 239, loss is 2.305518865585327\n", - "epoch: 1 step 240, loss is 2.302560329437256\n", - "epoch: 1 step 241, loss is 2.303983211517334\n", - "epoch: 1 step 242, loss is 2.320657253265381\n", - "epoch: 1 step 243, loss is 2.3005549907684326\n", - "epoch: 1 step 244, loss is 2.2985103130340576\n", - "epoch: 1 step 245, loss is 2.304802894592285\n", - "epoch: 1 step 246, loss is 2.300941228866577\n", - "epoch: 1 step 247, loss is 2.323657989501953\n", - "epoch: 1 step 248, loss is 2.286917209625244\n", - "epoch: 1 step 249, loss is 2.3042843341827393\n", - "epoch: 1 
step 250, loss is 2.3197810649871826\n", - "epoch: 1 step 251, loss is 2.2893900871276855\n", - "epoch: 1 step 252, loss is 2.301969289779663\n", - "epoch: 1 step 253, loss is 2.311122417449951\n", - "epoch: 1 step 254, loss is 2.2994332313537598\n", - "epoch: 1 step 255, loss is 2.306812047958374\n", - "epoch: 1 step 256, loss is 2.303797721862793\n", - "epoch: 1 step 257, loss is 2.3182919025421143\n", - "epoch: 1 step 258, loss is 2.2994771003723145\n", - "epoch: 1 step 259, loss is 2.2922940254211426\n", - "epoch: 1 step 260, loss is 2.2942943572998047\n", - "epoch: 1 step 261, loss is 2.311457395553589\n", - "epoch: 1 step 262, loss is 2.3056118488311768\n", - "epoch: 1 step 263, loss is 2.2893776893615723\n", - "epoch: 1 step 264, loss is 2.3056020736694336\n", - "epoch: 1 step 265, loss is 2.300929307937622\n", - "epoch: 1 step 266, loss is 2.2821691036224365\n", - "epoch: 1 step 267, loss is 2.3025898933410645\n", - "epoch: 1 step 268, loss is 2.3109312057495117\n", - "epoch: 1 step 269, loss is 2.3037710189819336\n", - "epoch: 1 step 270, loss is 2.300994873046875\n", - "epoch: 1 step 271, loss is 2.3064684867858887\n", - "epoch: 1 step 272, loss is 2.306129217147827\n", - "epoch: 1 step 273, loss is 2.302043914794922\n", - "epoch: 1 step 274, loss is 2.2923431396484375\n", - "epoch: 1 step 275, loss is 2.301884889602661\n", - "epoch: 1 step 276, loss is 2.291098117828369\n", - "epoch: 1 step 277, loss is 2.3073880672454834\n", - "epoch: 1 step 278, loss is 2.3108322620391846\n", - "epoch: 1 step 279, loss is 2.3005025386810303\n", - "epoch: 1 step 280, loss is 2.309103012084961\n", - "epoch: 1 step 281, loss is 2.31179141998291\n", - "epoch: 1 step 282, loss is 2.277594566345215\n", - "epoch: 1 step 283, loss is 2.31227707862854\n", - "epoch: 1 step 284, loss is 2.290104627609253\n", - "epoch: 1 step 285, loss is 2.3200223445892334\n", - "epoch: 1 step 286, loss is 2.297940254211426\n", - "epoch: 1 step 287, loss is 2.2998781204223633\n", - "epoch: 1 step 
288, loss is 2.3022446632385254\n", - "epoch: 1 step 289, loss is 2.32137131690979\n", - "epoch: 1 step 290, loss is 2.303280830383301\n", - "epoch: 1 step 291, loss is 2.298297166824341\n", - "epoch: 1 step 292, loss is 2.306844472885132\n", - "epoch: 1 step 293, loss is 2.3117358684539795\n", - "epoch: 1 step 294, loss is 2.299665927886963\n", - "epoch: 1 step 295, loss is 2.305180788040161\n", - "epoch: 1 step 296, loss is 2.3082194328308105\n", - "epoch: 1 step 297, loss is 2.3148446083068848\n", - "epoch: 1 step 298, loss is 2.304473400115967\n", - "epoch: 1 step 299, loss is 2.2927048206329346\n", - "epoch: 1 step 300, loss is 2.297117233276367\n", - "epoch: 1 step 301, loss is 2.301973581314087\n", - "epoch: 1 step 302, loss is 2.299013614654541\n", - "epoch: 1 step 303, loss is 2.3000245094299316\n", - "epoch: 1 step 304, loss is 2.311347484588623\n", - "epoch: 1 step 305, loss is 2.3053359985351562\n", - "epoch: 1 step 306, loss is 2.3081910610198975\n", - "epoch: 1 step 307, loss is 2.3169445991516113\n", - "epoch: 1 step 308, loss is 2.297750949859619\n", - "epoch: 1 step 309, loss is 2.30489444732666\n", - "epoch: 1 step 310, loss is 2.29947829246521\n", - "epoch: 1 step 311, loss is 2.301111936569214\n", - "epoch: 1 step 312, loss is 2.322146415710449\n", - "epoch: 1 step 313, loss is 2.3110451698303223\n", - "epoch: 1 step 314, loss is 2.3123786449432373\n", - "epoch: 1 step 315, loss is 2.3374645709991455\n", - "epoch: 1 step 316, loss is 2.3028552532196045\n", - "epoch: 1 step 317, loss is 2.2909250259399414\n", - "epoch: 1 step 318, loss is 2.3044238090515137\n", - "epoch: 1 step 319, loss is 2.295652389526367\n", - "epoch: 1 step 320, loss is 2.3132457733154297\n", - "epoch: 1 step 321, loss is 2.299982786178589\n", - "epoch: 1 step 322, loss is 2.2876060009002686\n", - "epoch: 1 step 323, loss is 2.305941343307495\n", - "epoch: 1 step 324, loss is 2.287879705429077\n", - "epoch: 1 step 325, loss is 2.2931158542633057\n", - "epoch: 1 step 326, 
loss is 2.3060152530670166\n", - "epoch: 1 step 327, loss is 2.3094165325164795\n", - "epoch: 1 step 328, loss is 2.2952308654785156\n", - "epoch: 1 step 329, loss is 2.301609516143799\n", - "epoch: 1 step 330, loss is 2.3078248500823975\n", - "epoch: 1 step 331, loss is 2.307288408279419\n", - "epoch: 1 step 332, loss is 2.3012943267822266\n", - "epoch: 1 step 333, loss is 2.2988827228546143\n", - "epoch: 1 step 334, loss is 2.293768882751465\n", - "epoch: 1 step 335, loss is 2.3158555030822754\n", - "epoch: 1 step 336, loss is 2.323178768157959\n", - "epoch: 1 step 337, loss is 2.309025526046753\n", - "epoch: 1 step 338, loss is 2.3028969764709473\n", - "epoch: 1 step 339, loss is 2.2952795028686523\n", - "epoch: 1 step 340, loss is 2.307999610900879\n", - "epoch: 1 step 341, loss is 2.297755479812622\n", - "epoch: 1 step 342, loss is 2.3044004440307617\n", - "epoch: 1 step 343, loss is 2.288539409637451\n", - "epoch: 1 step 344, loss is 2.2831764221191406\n", - "epoch: 1 step 345, loss is 2.2872886657714844\n", - "epoch: 1 step 346, loss is 2.306870460510254\n", - "epoch: 1 step 347, loss is 2.312478542327881\n", - "epoch: 1 step 348, loss is 2.2926058769226074\n", - "epoch: 1 step 349, loss is 2.309343099594116\n", - "epoch: 1 step 350, loss is 2.291848659515381\n", - "epoch: 1 step 351, loss is 2.3049778938293457\n", - "epoch: 1 step 352, loss is 2.305189371109009\n", - "epoch: 1 step 353, loss is 2.294496774673462\n", - "epoch: 1 step 354, loss is 2.308093309402466\n", - "epoch: 1 step 355, loss is 2.3015782833099365\n", - "epoch: 1 step 356, loss is 2.2906839847564697\n", - "epoch: 1 step 357, loss is 2.305084705352783\n", - "epoch: 1 step 358, loss is 2.310594081878662\n", - "epoch: 1 step 359, loss is 2.291069507598877\n", - "epoch: 1 step 360, loss is 2.299825668334961\n", - "epoch: 1 step 361, loss is 2.3010685443878174\n", - "epoch: 1 step 362, loss is 2.299398183822632\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 
1 step 363, loss is 2.316545009613037\n", - "epoch: 1 step 364, loss is 2.2973861694335938\n", - "epoch: 1 step 365, loss is 2.298579454421997\n", - "epoch: 1 step 366, loss is 2.299386978149414\n", - "epoch: 1 step 367, loss is 2.28235125541687\n", - "epoch: 1 step 368, loss is 2.312197208404541\n", - "epoch: 1 step 369, loss is 2.3015496730804443\n", - "epoch: 1 step 370, loss is 2.306972026824951\n", - "epoch: 1 step 371, loss is 2.300973892211914\n", - "epoch: 1 step 372, loss is 2.2951574325561523\n", - "epoch: 1 step 373, loss is 2.297781467437744\n", - "epoch: 1 step 374, loss is 2.2964324951171875\n", - "epoch: 1 step 375, loss is 2.3094630241394043\n", - "epoch: 1 step 376, loss is 2.3110594749450684\n", - "epoch: 1 step 377, loss is 2.316398859024048\n", - "epoch: 1 step 378, loss is 2.2969913482666016\n", - "epoch: 1 step 379, loss is 2.2993545532226562\n", - "epoch: 1 step 380, loss is 2.2793471813201904\n", - "epoch: 1 step 381, loss is 2.321284294128418\n", - "epoch: 1 step 382, loss is 2.308730363845825\n", - "epoch: 1 step 383, loss is 2.311279773712158\n", - "epoch: 1 step 384, loss is 2.2807345390319824\n", - "epoch: 1 step 385, loss is 2.298273801803589\n", - "epoch: 1 step 386, loss is 2.2840373516082764\n", - "epoch: 1 step 387, loss is 2.302661657333374\n", - "epoch: 1 step 388, loss is 2.2877776622772217\n", - "epoch: 1 step 389, loss is 2.2820465564727783\n", - "epoch: 1 step 390, loss is 2.3102872371673584\n", - "epoch: 1 step 391, loss is 2.3036928176879883\n", - "epoch: 1 step 392, loss is 2.3042521476745605\n", - "epoch: 1 step 393, loss is 2.3074333667755127\n", - "epoch: 1 step 394, loss is 2.3111190795898438\n", - "epoch: 1 step 395, loss is 2.294706106185913\n", - "epoch: 1 step 396, loss is 2.2674827575683594\n", - "epoch: 1 step 397, loss is 2.317141056060791\n", - "epoch: 1 step 398, loss is 2.307194471359253\n", - "epoch: 1 step 399, loss is 2.3064684867858887\n", - "epoch: 1 step 400, loss is 2.29667067527771\n", - "epoch: 1 
step 401, loss is 2.316262722015381\n", - "epoch: 1 step 402, loss is 2.2922985553741455\n", - "epoch: 1 step 403, loss is 2.30635666847229\n", - "epoch: 1 step 404, loss is 2.293581008911133\n", - "epoch: 1 step 405, loss is 2.3163130283355713\n", - "epoch: 1 step 406, loss is 2.2864110469818115\n", - "epoch: 1 step 407, loss is 2.292649269104004\n", - "epoch: 1 step 408, loss is 2.3108510971069336\n", - "epoch: 1 step 409, loss is 2.3138904571533203\n", - "epoch: 1 step 410, loss is 2.3124492168426514\n", - "epoch: 1 step 411, loss is 2.286383867263794\n", - "epoch: 1 step 412, loss is 2.305741310119629\n", - "epoch: 1 step 413, loss is 2.328089952468872\n", - "epoch: 1 step 414, loss is 2.2838659286499023\n", - "epoch: 1 step 415, loss is 2.297269582748413\n", - "epoch: 1 step 416, loss is 2.3035154342651367\n", - "epoch: 1 step 417, loss is 2.327326536178589\n", - "epoch: 1 step 418, loss is 2.310253381729126\n", - "epoch: 1 step 419, loss is 2.3029470443725586\n", - "epoch: 1 step 420, loss is 2.319157123565674\n", - "epoch: 1 step 421, loss is 2.280348777770996\n", - "epoch: 1 step 422, loss is 2.3144636154174805\n", - "epoch: 1 step 423, loss is 2.309483766555786\n", - "epoch: 1 step 424, loss is 2.305119752883911\n", - "epoch: 1 step 425, loss is 2.293011426925659\n", - "epoch: 1 step 426, loss is 2.3073065280914307\n", - "epoch: 1 step 427, loss is 2.2898974418640137\n", - "epoch: 1 step 428, loss is 2.2863576412200928\n", - "epoch: 1 step 429, loss is 2.3259050846099854\n", - "epoch: 1 step 430, loss is 2.293931245803833\n", - "epoch: 1 step 431, loss is 2.3190293312072754\n", - "epoch: 1 step 432, loss is 2.3019421100616455\n", - "epoch: 1 step 433, loss is 2.29414963722229\n", - "epoch: 1 step 434, loss is 2.3056304454803467\n", - "epoch: 1 step 435, loss is 2.297004461288452\n", - "epoch: 1 step 436, loss is 2.3015122413635254\n", - "epoch: 1 step 437, loss is 2.3112151622772217\n", - "epoch: 1 step 438, loss is 2.3200738430023193\n", - "epoch: 1 step 
439, loss is 2.2920126914978027\n", - "epoch: 1 step 440, loss is 2.3166749477386475\n", - "epoch: 1 step 441, loss is 2.3098225593566895\n", - "epoch: 1 step 442, loss is 2.3176872730255127\n", - "epoch: 1 step 443, loss is 2.2913010120391846\n", - "epoch: 1 step 444, loss is 2.304748296737671\n", - "epoch: 1 step 445, loss is 2.30777645111084\n", - "epoch: 1 step 446, loss is 2.301826238632202\n", - "epoch: 1 step 447, loss is 2.3115742206573486\n", - "epoch: 1 step 448, loss is 2.2836520671844482\n", - "epoch: 1 step 449, loss is 2.3101956844329834\n", - "epoch: 1 step 450, loss is 2.3099701404571533\n", - "epoch: 1 step 451, loss is 2.2856247425079346\n", - "epoch: 1 step 452, loss is 2.301503896713257\n", - "epoch: 1 step 453, loss is 2.30840802192688\n", - "epoch: 1 step 454, loss is 2.294261932373047\n", - "epoch: 1 step 455, loss is 2.306784152984619\n", - "epoch: 1 step 456, loss is 2.312567949295044\n", - "epoch: 1 step 457, loss is 2.301384687423706\n", - "epoch: 1 step 458, loss is 2.3095791339874268\n", - "epoch: 1 step 459, loss is 2.30721378326416\n", - "epoch: 1 step 460, loss is 2.2918553352355957\n", - "epoch: 1 step 461, loss is 2.297912836074829\n", - "epoch: 1 step 462, loss is 2.293703317642212\n", - "epoch: 1 step 463, loss is 2.3068270683288574\n", - "epoch: 1 step 464, loss is 2.308913469314575\n", - "epoch: 1 step 465, loss is 2.2927842140197754\n", - "epoch: 1 step 466, loss is 2.298229217529297\n", - "epoch: 1 step 467, loss is 2.3053324222564697\n", - "epoch: 1 step 468, loss is 2.311330795288086\n", - "epoch: 1 step 469, loss is 2.3048853874206543\n", - "epoch: 1 step 470, loss is 2.3125648498535156\n", - "epoch: 1 step 471, loss is 2.291192054748535\n", - "epoch: 1 step 472, loss is 2.304387092590332\n", - "epoch: 1 step 473, loss is 2.2949419021606445\n", - "epoch: 1 step 474, loss is 2.292856454849243\n", - "epoch: 1 step 475, loss is 2.3045504093170166\n", - "epoch: 1 step 476, loss is 2.3035783767700195\n", - "epoch: 1 step 477, 
loss is 2.3078746795654297\n", - "epoch: 1 step 478, loss is 2.2975947856903076\n", - "epoch: 1 step 479, loss is 2.2870256900787354\n", - "epoch: 1 step 480, loss is 2.3047573566436768\n", - "epoch: 1 step 481, loss is 2.288184881210327\n", - "epoch: 1 step 482, loss is 2.29794979095459\n", - "epoch: 1 step 483, loss is 2.3161563873291016\n", - "epoch: 1 step 484, loss is 2.3217427730560303\n", - "epoch: 1 step 485, loss is 2.31135892868042\n", - "epoch: 1 step 486, loss is 2.2933225631713867\n", - "epoch: 1 step 487, loss is 2.2972028255462646\n", - "epoch: 1 step 488, loss is 2.3297030925750732\n", - "epoch: 1 step 489, loss is 2.3047773838043213\n", - "epoch: 1 step 490, loss is 2.2950005531311035\n", - "epoch: 1 step 491, loss is 2.301220655441284\n", - "epoch: 1 step 492, loss is 2.294806480407715\n", - "epoch: 1 step 493, loss is 2.3060131072998047\n", - "epoch: 1 step 494, loss is 2.2895150184631348\n", - "epoch: 1 step 495, loss is 2.3064186573028564\n", - "epoch: 1 step 496, loss is 2.2900142669677734\n", - "epoch: 1 step 497, loss is 2.3137834072113037\n", - "epoch: 1 step 498, loss is 2.2951085567474365\n", - "epoch: 1 step 499, loss is 2.2993836402893066\n", - "epoch: 1 step 500, loss is 2.292905330657959\n", - "epoch: 1 step 501, loss is 2.3023834228515625\n", - "epoch: 1 step 502, loss is 2.3147432804107666\n", - "epoch: 1 step 503, loss is 2.299447536468506\n", - "epoch: 1 step 504, loss is 2.306694984436035\n", - "epoch: 1 step 505, loss is 2.3031160831451416\n", - "epoch: 1 step 506, loss is 2.2977864742279053\n", - "epoch: 1 step 507, loss is 2.308394432067871\n", - "epoch: 1 step 508, loss is 2.3073339462280273\n", - "epoch: 1 step 509, loss is 2.3020577430725098\n", - "epoch: 1 step 510, loss is 2.3010921478271484\n", - "epoch: 1 step 511, loss is 2.300612688064575\n", - "epoch: 1 step 512, loss is 2.299712657928467\n", - "epoch: 1 step 513, loss is 2.302595853805542\n", - "epoch: 1 step 514, loss is 2.2898383140563965\n", - "epoch: 1 step 515, 
loss is 2.3039684295654297\n", - "epoch: 1 step 516, loss is 2.292846202850342\n", - "epoch: 1 step 517, loss is 2.291675329208374\n", - "epoch: 1 step 518, loss is 2.309509038925171\n", - "epoch: 1 step 519, loss is 2.31687068939209\n", - "epoch: 1 step 520, loss is 2.3104724884033203\n", - "epoch: 1 step 521, loss is 2.307936191558838\n", - "epoch: 1 step 522, loss is 2.3007776737213135\n", - "epoch: 1 step 523, loss is 2.293004035949707\n", - "epoch: 1 step 524, loss is 2.2929527759552\n", - "epoch: 1 step 525, loss is 2.302025079727173\n", - "epoch: 1 step 526, loss is 2.305783271789551\n", - "epoch: 1 step 527, loss is 2.31724214553833\n", - "epoch: 1 step 528, loss is 2.284543514251709\n", - "epoch: 1 step 529, loss is 2.3059749603271484\n", - "epoch: 1 step 530, loss is 2.294578790664673\n", - "epoch: 1 step 531, loss is 2.2919039726257324\n", - "epoch: 1 step 532, loss is 2.308917760848999\n", - "epoch: 1 step 533, loss is 2.293203115463257\n", - "epoch: 1 step 534, loss is 2.2920308113098145\n", - "epoch: 1 step 535, loss is 2.2818307876586914\n", - "epoch: 1 step 536, loss is 2.311978816986084\n", - "epoch: 1 step 537, loss is 2.3023197650909424\n", - "epoch: 1 step 538, loss is 2.3081576824188232\n", - "epoch: 1 step 539, loss is 2.291484832763672\n", - "epoch: 1 step 540, loss is 2.316416025161743\n", - "epoch: 1 step 541, loss is 2.30479097366333\n", - "epoch: 1 step 542, loss is 2.3058624267578125\n", - "epoch: 1 step 543, loss is 2.3109488487243652\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 1 step 544, loss is 2.2846484184265137\n", - "epoch: 1 step 545, loss is 2.303199052810669\n", - "epoch: 1 step 546, loss is 2.2937982082366943\n", - "epoch: 1 step 547, loss is 2.290144443511963\n", - "epoch: 1 step 548, loss is 2.292776346206665\n", - "epoch: 1 step 549, loss is 2.3142309188842773\n", - "epoch: 1 step 550, loss is 2.302577257156372\n", - "epoch: 1 step 551, loss is 2.2937214374542236\n", - "epoch: 1 step 
552, loss is 2.298457145690918\n", - "epoch: 1 step 553, loss is 2.3043863773345947\n", - "epoch: 1 step 554, loss is 2.3041136264801025\n", - "epoch: 1 step 555, loss is 2.304441213607788\n", - "epoch: 1 step 556, loss is 2.2959136962890625\n", - "epoch: 1 step 557, loss is 2.287644624710083\n", - "epoch: 1 step 558, loss is 2.2996888160705566\n", - "epoch: 1 step 559, loss is 2.303114652633667\n", - "epoch: 1 step 560, loss is 2.3127856254577637\n", - "epoch: 1 step 561, loss is 2.2931501865386963\n", - "epoch: 1 step 562, loss is 2.3102290630340576\n", - "epoch: 1 step 563, loss is 2.3031444549560547\n", - "epoch: 1 step 564, loss is 2.3256449699401855\n", - "epoch: 1 step 565, loss is 2.3089632987976074\n", - "epoch: 1 step 566, loss is 2.296027898788452\n", - "epoch: 1 step 567, loss is 2.3051443099975586\n", - "epoch: 1 step 568, loss is 2.314303159713745\n", - "epoch: 1 step 569, loss is 2.298060178756714\n", - "epoch: 1 step 570, loss is 2.3200125694274902\n", - "epoch: 1 step 571, loss is 2.3035120964050293\n", - "epoch: 1 step 572, loss is 2.301438093185425\n", - "epoch: 1 step 573, loss is 2.309084177017212\n", - "epoch: 1 step 574, loss is 2.2914998531341553\n", - "epoch: 1 step 575, loss is 2.3038980960845947\n", - "epoch: 1 step 576, loss is 2.2964913845062256\n", - "epoch: 1 step 577, loss is 2.3185367584228516\n", - "epoch: 1 step 578, loss is 2.294168472290039\n", - "epoch: 1 step 579, loss is 2.2987618446350098\n", - "epoch: 1 step 580, loss is 2.307744026184082\n", - "epoch: 1 step 581, loss is 2.3152453899383545\n", - "epoch: 1 step 582, loss is 2.2944529056549072\n", - "epoch: 1 step 583, loss is 2.2912003993988037\n", - "epoch: 1 step 584, loss is 2.3099448680877686\n", - "epoch: 1 step 585, loss is 2.2975735664367676\n", - "epoch: 1 step 586, loss is 2.303675889968872\n", - "epoch: 1 step 587, loss is 2.304849624633789\n", - "epoch: 1 step 588, loss is 2.2880008220672607\n", - "epoch: 1 step 589, loss is 2.2954723834991455\n", - "epoch: 1 
step 590, loss is 2.3061351776123047\n", - "epoch: 1 step 591, loss is 2.2987239360809326\n", - "epoch: 1 step 592, loss is 2.2963337898254395\n", - "epoch: 1 step 593, loss is 2.302501678466797\n", - "epoch: 1 step 594, loss is 2.306896686553955\n", - "epoch: 1 step 595, loss is 2.2986130714416504\n", - "epoch: 1 step 596, loss is 2.296274423599243\n", - "epoch: 1 step 597, loss is 2.285633087158203\n", - "epoch: 1 step 598, loss is 2.31473970413208\n", - "epoch: 1 step 599, loss is 2.3005778789520264\n", - "epoch: 1 step 600, loss is 2.302191734313965\n", - "epoch: 1 step 601, loss is 2.304725408554077\n", - "epoch: 1 step 602, loss is 2.3023934364318848\n", - "epoch: 1 step 603, loss is 2.318779468536377\n", - "epoch: 1 step 604, loss is 2.278282880783081\n", - "epoch: 1 step 605, loss is 2.296987533569336\n", - "epoch: 1 step 606, loss is 2.287036657333374\n", - "epoch: 1 step 607, loss is 2.296092987060547\n", - "epoch: 1 step 608, loss is 2.285728931427002\n", - "epoch: 1 step 609, loss is 2.277315139770508\n", - "epoch: 1 step 610, loss is 2.310730218887329\n", - "epoch: 1 step 611, loss is 2.2932069301605225\n", - "epoch: 1 step 612, loss is 2.2998502254486084\n", - "epoch: 1 step 613, loss is 2.33337140083313\n", - "epoch: 1 step 614, loss is 2.2876906394958496\n", - "epoch: 1 step 615, loss is 2.3134958744049072\n", - "epoch: 1 step 616, loss is 2.298297882080078\n", - "epoch: 1 step 617, loss is 2.2938692569732666\n", - "epoch: 1 step 618, loss is 2.2808151245117188\n", - "epoch: 1 step 619, loss is 2.3081531524658203\n", - "epoch: 1 step 620, loss is 2.2881011962890625\n", - "epoch: 1 step 621, loss is 2.288402795791626\n", - "epoch: 1 step 622, loss is 2.2903871536254883\n", - "epoch: 1 step 623, loss is 2.310781478881836\n", - "epoch: 1 step 624, loss is 2.3198704719543457\n", - "epoch: 1 step 625, loss is 2.291807174682617\n", - "epoch: 1 step 626, loss is 2.2885937690734863\n", - "epoch: 1 step 627, loss is 2.3097026348114014\n", - "epoch: 1 step 
628, loss is 2.2967135906219482\n", - "epoch: 1 step 629, loss is 2.291393995285034\n", - "epoch: 1 step 630, loss is 2.300400733947754\n", - "epoch: 1 step 631, loss is 2.3118598461151123\n", - "epoch: 1 step 632, loss is 2.3007006645202637\n", - "epoch: 1 step 633, loss is 2.300248622894287\n", - "epoch: 1 step 634, loss is 2.320274829864502\n", - "epoch: 1 step 635, loss is 2.2868380546569824\n", - "epoch: 1 step 636, loss is 2.3002429008483887\n", - "epoch: 1 step 637, loss is 2.2873823642730713\n", - "epoch: 1 step 638, loss is 2.307615041732788\n", - "epoch: 1 step 639, loss is 2.3079922199249268\n", - "epoch: 1 step 640, loss is 2.2916922569274902\n", - "epoch: 1 step 641, loss is 2.279951572418213\n", - "epoch: 1 step 642, loss is 2.321791172027588\n", - "epoch: 1 step 643, loss is 2.2927591800689697\n", - "epoch: 1 step 644, loss is 2.3093457221984863\n", - "epoch: 1 step 645, loss is 2.305617570877075\n", - "epoch: 1 step 646, loss is 2.3011748790740967\n", - "epoch: 1 step 647, loss is 2.2950925827026367\n", - "epoch: 1 step 648, loss is 2.2976534366607666\n", - "epoch: 1 step 649, loss is 2.305446147918701\n", - "epoch: 1 step 650, loss is 2.3264527320861816\n", - "epoch: 1 step 651, loss is 2.2816526889801025\n", - "epoch: 1 step 652, loss is 2.2895913124084473\n", - "epoch: 1 step 653, loss is 2.2928690910339355\n", - "epoch: 1 step 654, loss is 2.2872660160064697\n", - "epoch: 1 step 655, loss is 2.2978932857513428\n", - "epoch: 1 step 656, loss is 2.298518657684326\n", - "epoch: 1 step 657, loss is 2.2940309047698975\n", - "epoch: 1 step 658, loss is 2.3062477111816406\n", - "epoch: 1 step 659, loss is 2.2833542823791504\n", - "epoch: 1 step 660, loss is 2.297724723815918\n", - "epoch: 1 step 661, loss is 2.290584087371826\n", - "epoch: 1 step 662, loss is 2.3093016147613525\n", - "epoch: 1 step 663, loss is 2.2913601398468018\n", - "epoch: 1 step 664, loss is 2.2878904342651367\n", - "epoch: 1 step 665, loss is 2.2982962131500244\n", - "epoch: 1 
step 666, loss is 2.30412220954895\n", - "epoch: 1 step 667, loss is 2.2898168563842773\n", - "epoch: 1 step 668, loss is 2.3147521018981934\n", - "epoch: 1 step 669, loss is 2.308183193206787\n", - "epoch: 1 step 670, loss is 2.2994282245635986\n", - "epoch: 1 step 671, loss is 2.312607765197754\n", - "epoch: 1 step 672, loss is 2.3106467723846436\n", - "epoch: 1 step 673, loss is 2.292339563369751\n", - "epoch: 1 step 674, loss is 2.2949345111846924\n", - "epoch: 1 step 675, loss is 2.300790309906006\n", - "epoch: 1 step 676, loss is 2.304965019226074\n", - "epoch: 1 step 677, loss is 2.2852847576141357\n", - "epoch: 1 step 678, loss is 2.3012306690216064\n", - "epoch: 1 step 679, loss is 2.312943458557129\n", - "epoch: 1 step 680, loss is 2.2971394062042236\n", - "epoch: 1 step 681, loss is 2.2854537963867188\n", - "epoch: 1 step 682, loss is 2.2959823608398438\n", - "epoch: 1 step 683, loss is 2.3153486251831055\n", - "epoch: 1 step 684, loss is 2.2996137142181396\n", - "epoch: 1 step 685, loss is 2.2879552841186523\n", - "epoch: 1 step 686, loss is 2.292052745819092\n", - "epoch: 1 step 687, loss is 2.28926157951355\n", - "epoch: 1 step 688, loss is 2.302530527114868\n", - "epoch: 1 step 689, loss is 2.2990405559539795\n", - "epoch: 1 step 690, loss is 2.3113174438476562\n", - "epoch: 1 step 691, loss is 2.311800003051758\n", - "epoch: 1 step 692, loss is 2.2938122749328613\n", - "epoch: 1 step 693, loss is 2.281641721725464\n", - "epoch: 1 step 694, loss is 2.315920114517212\n", - "epoch: 1 step 695, loss is 2.3154196739196777\n", - "epoch: 1 step 696, loss is 2.29622220993042\n", - "epoch: 1 step 697, loss is 2.3036105632781982\n", - "epoch: 1 step 698, loss is 2.291242837905884\n", - "epoch: 1 step 699, loss is 2.2951271533966064\n", - "epoch: 1 step 700, loss is 2.299222230911255\n", - "epoch: 1 step 701, loss is 2.290109157562256\n", - "epoch: 1 step 702, loss is 2.2930047512054443\n", - "epoch: 1 step 703, loss is 2.280167579650879\n", - "epoch: 1 step 
704, loss is 2.315762996673584\n", - "epoch: 1 step 705, loss is 2.3222908973693848\n", - "epoch: 1 step 706, loss is 2.2864747047424316\n", - "epoch: 1 step 707, loss is 2.267042636871338\n", - "epoch: 1 step 708, loss is 2.3018813133239746\n", - "epoch: 1 step 709, loss is 2.281364917755127\n", - "epoch: 1 step 710, loss is 2.307040214538574\n", - "epoch: 1 step 711, loss is 2.308441162109375\n", - "epoch: 1 step 712, loss is 2.322699785232544\n", - "epoch: 1 step 713, loss is 2.2905800342559814\n", - "epoch: 1 step 714, loss is 2.284076690673828\n", - "epoch: 1 step 715, loss is 2.3076560497283936\n", - "epoch: 1 step 716, loss is 2.29257869720459\n", - "epoch: 1 step 717, loss is 2.282029628753662\n", - "epoch: 1 step 718, loss is 2.312410831451416\n", - "epoch: 1 step 719, loss is 2.298229932785034\n", - "epoch: 1 step 720, loss is 2.259955883026123\n", - "epoch: 1 step 721, loss is 2.310192584991455\n", - "epoch: 1 step 722, loss is 2.317457675933838\n", - "epoch: 1 step 723, loss is 2.2887001037597656\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 1 step 724, loss is 2.311528205871582\n", - "epoch: 1 step 725, loss is 2.2796037197113037\n", - "epoch: 1 step 726, loss is 2.2791337966918945\n", - "epoch: 1 step 727, loss is 2.2870848178863525\n", - "epoch: 1 step 728, loss is 2.2893407344818115\n", - "epoch: 1 step 729, loss is 2.3080923557281494\n", - "epoch: 1 step 730, loss is 2.281989574432373\n", - "epoch: 1 step 731, loss is 2.287876844406128\n", - "epoch: 1 step 732, loss is 2.2949070930480957\n", - "epoch: 1 step 733, loss is 2.3051536083221436\n", - "epoch: 1 step 734, loss is 2.2933919429779053\n", - "epoch: 1 step 735, loss is 2.2848522663116455\n", - "epoch: 1 step 736, loss is 2.296999216079712\n", - "epoch: 1 step 737, loss is 2.283834934234619\n", - "epoch: 1 step 738, loss is 2.313833475112915\n", - "epoch: 1 step 739, loss is 2.289297580718994\n", - "epoch: 1 step 740, loss is 2.3047103881835938\n", - 
"epoch: 1 step 741, loss is 2.310509443283081\n", - "epoch: 1 step 742, loss is 2.300405502319336\n", - "epoch: 1 step 743, loss is 2.2882189750671387\n", - "epoch: 1 step 744, loss is 2.2621119022369385\n", - "epoch: 1 step 745, loss is 2.298179864883423\n", - "epoch: 1 step 746, loss is 2.293515920639038\n", - "epoch: 1 step 747, loss is 2.3081891536712646\n", - "epoch: 1 step 748, loss is 2.289299726486206\n", - "epoch: 1 step 749, loss is 2.294929027557373\n", - "epoch: 1 step 750, loss is 2.2733545303344727\n", - "epoch: 1 step 751, loss is 2.27662992477417\n", - "epoch: 1 step 752, loss is 2.291785955429077\n", - "epoch: 1 step 753, loss is 2.2820146083831787\n", - "epoch: 1 step 754, loss is 2.2796335220336914\n", - "epoch: 1 step 755, loss is 2.300360918045044\n", - "epoch: 1 step 756, loss is 2.285766839981079\n", - "epoch: 1 step 757, loss is 2.275282382965088\n", - "epoch: 1 step 758, loss is 2.289869785308838\n", - "epoch: 1 step 759, loss is 2.288311243057251\n", - "epoch: 1 step 760, loss is 2.2835910320281982\n", - "epoch: 1 step 761, loss is 2.2774996757507324\n", - "epoch: 1 step 762, loss is 2.2717692852020264\n", - "epoch: 1 step 763, loss is 2.2631680965423584\n", - "epoch: 1 step 764, loss is 2.2731502056121826\n", - "epoch: 1 step 765, loss is 2.281693458557129\n", - "epoch: 1 step 766, loss is 2.2641055583953857\n", - "epoch: 1 step 767, loss is 2.271340847015381\n", - "epoch: 1 step 768, loss is 2.275526285171509\n", - "epoch: 1 step 769, loss is 2.2647759914398193\n", - "epoch: 1 step 770, loss is 2.2564775943756104\n", - "epoch: 1 step 771, loss is 2.2807374000549316\n", - "epoch: 1 step 772, loss is 2.2322680950164795\n", - "epoch: 1 step 773, loss is 2.2390637397766113\n", - "epoch: 1 step 774, loss is 2.254652261734009\n", - "epoch: 1 step 775, loss is 2.2351579666137695\n", - "epoch: 1 step 776, loss is 2.2300965785980225\n", - "epoch: 1 step 777, loss is 2.263421058654785\n", - "epoch: 1 step 778, loss is 2.2214722633361816\n", - 
"epoch: 1 step 779, loss is 2.220160484313965\n", - "epoch: 1 step 780, loss is 2.224086284637451\n", - "epoch: 1 step 781, loss is 2.1890642642974854\n", - "epoch: 1 step 782, loss is 2.212411880493164\n", - "epoch: 1 step 783, loss is 2.2310309410095215\n", - "epoch: 1 step 784, loss is 2.215427875518799\n", - "epoch: 1 step 785, loss is 2.181779623031616\n", - "epoch: 1 step 786, loss is 2.1368837356567383\n", - "epoch: 1 step 787, loss is 2.200275421142578\n", - "epoch: 1 step 788, loss is 2.0769481658935547\n", - "epoch: 1 step 789, loss is 2.0798652172088623\n", - "epoch: 1 step 790, loss is 1.9962787628173828\n", - "epoch: 1 step 791, loss is 2.085353374481201\n", - "epoch: 1 step 792, loss is 2.023948907852173\n", - "epoch: 1 step 793, loss is 1.9775162935256958\n", - "epoch: 1 step 794, loss is 2.0264859199523926\n", - "epoch: 1 step 795, loss is 1.8990428447723389\n", - "epoch: 1 step 796, loss is 1.8507258892059326\n", - "epoch: 1 step 797, loss is 1.8836371898651123\n", - "epoch: 1 step 798, loss is 1.8537689447402954\n", - "epoch: 1 step 799, loss is 1.5868067741394043\n", - "epoch: 1 step 800, loss is 1.6315590143203735\n", - "epoch: 1 step 801, loss is 1.4959402084350586\n", - "epoch: 1 step 802, loss is 1.581732988357544\n", - "epoch: 1 step 803, loss is 1.3930209875106812\n", - "epoch: 1 step 804, loss is 1.8094263076782227\n", - "epoch: 1 step 805, loss is 1.2004598379135132\n", - "epoch: 1 step 806, loss is 1.5204395055770874\n", - "epoch: 1 step 807, loss is 1.0562299489974976\n", - "epoch: 1 step 808, loss is 1.1175340414047241\n", - "epoch: 1 step 809, loss is 1.110710620880127\n", - "epoch: 1 step 810, loss is 1.22267746925354\n", - "epoch: 1 step 811, loss is 0.9916217923164368\n", - "epoch: 1 step 812, loss is 1.3536423444747925\n", - "epoch: 1 step 813, loss is 1.3294962644577026\n", - "epoch: 1 step 814, loss is 1.222076416015625\n", - "epoch: 1 step 815, loss is 1.1926804780960083\n", - "epoch: 1 step 816, loss is 1.2677249908447266\n", 
- "epoch: 1 step 817, loss is 0.7866724729537964\n", - "epoch: 1 step 818, loss is 1.2224595546722412\n", - "epoch: 1 step 819, loss is 1.3521312475204468\n", - "epoch: 1 step 820, loss is 1.3983004093170166\n", - "epoch: 1 step 821, loss is 1.2548807859420776\n", - "epoch: 1 step 822, loss is 1.1634089946746826\n", - "epoch: 1 step 823, loss is 1.2329490184783936\n", - "epoch: 1 step 824, loss is 0.927302896976471\n", - "epoch: 1 step 825, loss is 1.3058924674987793\n", - "epoch: 1 step 826, loss is 0.88958340883255\n", - "epoch: 1 step 827, loss is 1.2157689332962036\n", - "epoch: 1 step 828, loss is 0.7676456570625305\n", - "epoch: 1 step 829, loss is 1.3338539600372314\n", - "epoch: 1 step 830, loss is 1.210726022720337\n", - "epoch: 1 step 831, loss is 1.4619954824447632\n", - "epoch: 1 step 832, loss is 1.070967674255371\n", - "epoch: 1 step 833, loss is 1.2459098100662231\n", - "epoch: 1 step 834, loss is 1.0285723209381104\n", - "epoch: 1 step 835, loss is 1.5987755060195923\n", - "epoch: 1 step 836, loss is 1.3421015739440918\n", - "epoch: 1 step 837, loss is 1.0287829637527466\n", - "epoch: 1 step 838, loss is 0.8404675722122192\n", - "epoch: 1 step 839, loss is 0.8053593635559082\n", - "epoch: 1 step 840, loss is 1.134331226348877\n", - "epoch: 1 step 841, loss is 0.8719921708106995\n", - "epoch: 1 step 842, loss is 0.981346607208252\n", - "epoch: 1 step 843, loss is 0.8902426958084106\n", - "epoch: 1 step 844, loss is 1.1104727983474731\n", - "epoch: 1 step 845, loss is 0.9911269545555115\n", - "epoch: 1 step 846, loss is 1.0488018989562988\n", - "epoch: 1 step 847, loss is 0.7992566227912903\n", - "epoch: 1 step 848, loss is 0.7287769317626953\n", - "epoch: 1 step 849, loss is 0.577411413192749\n", - "epoch: 1 step 850, loss is 0.8648281097412109\n", - "epoch: 1 step 851, loss is 1.8254376649856567\n", - "epoch: 1 step 852, loss is 0.9343538880348206\n", - "epoch: 1 step 853, loss is 1.118343472480774\n", - "epoch: 1 step 854, loss is 
0.7372920513153076\n", - "epoch: 1 step 855, loss is 0.7415173053741455\n", - "epoch: 1 step 856, loss is 0.733338475227356\n", - "epoch: 1 step 857, loss is 0.7452120780944824\n", - "epoch: 1 step 858, loss is 0.7284213304519653\n", - "epoch: 1 step 859, loss is 0.7050371170043945\n", - "epoch: 1 step 860, loss is 0.6951356530189514\n", - "epoch: 1 step 861, loss is 0.9011646509170532\n", - "epoch: 1 step 862, loss is 0.5518014430999756\n", - "epoch: 1 step 863, loss is 0.5539737343788147\n", - "epoch: 1 step 864, loss is 0.8515357971191406\n", - "epoch: 1 step 865, loss is 0.8021255731582642\n", - "epoch: 1 step 866, loss is 0.8317680358886719\n", - "epoch: 1 step 867, loss is 0.7628864049911499\n", - "epoch: 1 step 868, loss is 1.076028823852539\n", - "epoch: 1 step 869, loss is 0.8407242298126221\n", - "epoch: 1 step 870, loss is 0.9949799180030823\n", - "epoch: 1 step 871, loss is 0.740688681602478\n", - "epoch: 1 step 872, loss is 0.8933119773864746\n", - "epoch: 1 step 873, loss is 0.606465220451355\n", - "epoch: 1 step 874, loss is 0.7506833672523499\n", - "epoch: 1 step 875, loss is 0.683668315410614\n", - "epoch: 1 step 876, loss is 0.40669941902160645\n", - "epoch: 1 step 877, loss is 0.8291460275650024\n", - "epoch: 1 step 878, loss is 0.3711823523044586\n", - "epoch: 1 step 879, loss is 0.7083078622817993\n", - "epoch: 1 step 880, loss is 0.6440669894218445\n", - "epoch: 1 step 881, loss is 1.1711574792861938\n", - "epoch: 1 step 882, loss is 0.8168731331825256\n", - "epoch: 1 step 883, loss is 0.7596681714057922\n", - "epoch: 1 step 884, loss is 0.6600638628005981\n", - "epoch: 1 step 885, loss is 0.711455225944519\n", - "epoch: 1 step 886, loss is 0.4803568124771118\n", - "epoch: 1 step 887, loss is 0.5074321627616882\n", - "epoch: 1 step 888, loss is 0.5216010212898254\n", - "epoch: 1 step 889, loss is 0.5154662132263184\n", - "epoch: 1 step 890, loss is 0.8748500943183899\n", - "epoch: 1 step 891, loss is 0.6415078043937683\n", - "epoch: 1 step 
892, loss is 0.331193208694458\n", - "epoch: 1 step 893, loss is 0.48383283615112305\n", - "epoch: 1 step 894, loss is 0.3206727206707001\n", - "epoch: 1 step 895, loss is 0.33423829078674316\n", - "epoch: 1 step 896, loss is 0.5230504274368286\n", - "epoch: 1 step 897, loss is 0.5705544352531433\n", - "epoch: 1 step 898, loss is 0.6121129393577576\n", - "epoch: 1 step 899, loss is 0.3681733310222626\n", - "epoch: 1 step 900, loss is 0.32715505361557007\n", - "epoch: 1 step 901, loss is 0.7740625739097595\n", - "epoch: 1 step 902, loss is 0.37605494260787964\n", - "epoch: 1 step 903, loss is 0.2818998396396637\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 1 step 904, loss is 0.8959805369377136\n", - "epoch: 1 step 905, loss is 0.35969066619873047\n", - "epoch: 1 step 906, loss is 0.5569093823432922\n", - "epoch: 1 step 907, loss is 0.6490380167961121\n", - "epoch: 1 step 908, loss is 0.4854198098182678\n", - "epoch: 1 step 909, loss is 0.681684136390686\n", - "epoch: 1 step 910, loss is 0.5594454407691956\n", - "epoch: 1 step 911, loss is 0.28246811032295227\n", - "epoch: 1 step 912, loss is 0.564134955406189\n", - "epoch: 1 step 913, loss is 0.3480125367641449\n", - "epoch: 1 step 914, loss is 0.30245739221572876\n", - "epoch: 1 step 915, loss is 0.3057793080806732\n", - "epoch: 1 step 916, loss is 0.40296462178230286\n", - "epoch: 1 step 917, loss is 0.5620078444480896\n", - "epoch: 1 step 918, loss is 0.3885918855667114\n", - "epoch: 1 step 919, loss is 0.3405517637729645\n", - "epoch: 1 step 920, loss is 0.5268466472625732\n", - "epoch: 1 step 921, loss is 0.11871110647916794\n", - "epoch: 1 step 922, loss is 0.4030132293701172\n", - "epoch: 1 step 923, loss is 0.44585901498794556\n", - "epoch: 1 step 924, loss is 0.42256829142570496\n", - "epoch: 1 step 925, loss is 0.05742168426513672\n", - "epoch: 1 step 926, loss is 0.43707379698753357\n", - "epoch: 1 step 927, loss is 0.31248682737350464\n", - "epoch: 1 step 928, loss 
is 0.48477858304977417\n", - "epoch: 1 step 929, loss is 0.22904114425182343\n", - "epoch: 1 step 930, loss is 0.6515914797782898\n", - "epoch: 1 step 931, loss is 0.38605231046676636\n", - "epoch: 1 step 932, loss is 0.5625531673431396\n", - "epoch: 1 step 933, loss is 0.39168664813041687\n", - "epoch: 1 step 934, loss is 0.20879001915454865\n", - "epoch: 1 step 935, loss is 0.2718164324760437\n", - "epoch: 1 step 936, loss is 0.5753874778747559\n", - "epoch: 1 step 937, loss is 0.4230019152164459\n", - "epoch: 1 step 938, loss is 0.4174725115299225\n", - "epoch: 1 step 939, loss is 0.8505776524543762\n", - "epoch: 1 step 940, loss is 0.25025784969329834\n", - "epoch: 1 step 941, loss is 0.711872398853302\n", - "epoch: 1 step 942, loss is 0.1983395218849182\n", - "epoch: 1 step 943, loss is 0.3040161728858948\n", - "epoch: 1 step 944, loss is 0.4821297824382782\n", - "epoch: 1 step 945, loss is 0.5496278405189514\n", - "epoch: 1 step 946, loss is 0.3190930187702179\n", - "epoch: 1 step 947, loss is 0.3668420910835266\n", - "epoch: 1 step 948, loss is 0.3687497675418854\n", - "epoch: 1 step 949, loss is 0.15128424763679504\n", - "epoch: 1 step 950, loss is 0.474439412355423\n", - "epoch: 1 step 951, loss is 0.4503163993358612\n", - "epoch: 1 step 952, loss is 0.3093399405479431\n", - "epoch: 1 step 953, loss is 0.2081325352191925\n", - "epoch: 1 step 954, loss is 0.564822256565094\n", - "epoch: 1 step 955, loss is 0.6963821053504944\n", - "epoch: 1 step 956, loss is 0.516822874546051\n", - "epoch: 1 step 957, loss is 0.08782825618982315\n", - "epoch: 1 step 958, loss is 0.7355473041534424\n", - "epoch: 1 step 959, loss is 0.3020493984222412\n", - "epoch: 1 step 960, loss is 0.17054177820682526\n", - "epoch: 1 step 961, loss is 0.29689866304397583\n", - "epoch: 1 step 962, loss is 0.2564084529876709\n", - "epoch: 1 step 963, loss is 0.32002487778663635\n", - "epoch: 1 step 964, loss is 0.6085982322692871\n", - "epoch: 1 step 965, loss is 0.2022058367729187\n", - 
"epoch: 1 step 966, loss is 0.28568506240844727\n", - "epoch: 1 step 967, loss is 0.40296539664268494\n", - "epoch: 1 step 968, loss is 0.7496739029884338\n", - "epoch: 1 step 969, loss is 0.20458398759365082\n", - "epoch: 1 step 970, loss is 0.2953733801841736\n", - "epoch: 1 step 971, loss is 0.5422435998916626\n", - "epoch: 1 step 972, loss is 0.29642194509506226\n", - "epoch: 1 step 973, loss is 0.3858273923397064\n", - "epoch: 1 step 974, loss is 0.3633837401866913\n", - "epoch: 1 step 975, loss is 0.25124484300613403\n", - "epoch: 1 step 976, loss is 0.20992040634155273\n", - "epoch: 1 step 977, loss is 0.13602547347545624\n", - "epoch: 1 step 978, loss is 0.1998327523469925\n", - "epoch: 1 step 979, loss is 0.30320289731025696\n", - "epoch: 1 step 980, loss is 0.33806726336479187\n", - "epoch: 1 step 981, loss is 0.6388981938362122\n", - "epoch: 1 step 982, loss is 0.36718547344207764\n", - "epoch: 1 step 983, loss is 0.2681766450405121\n", - "epoch: 1 step 984, loss is 0.26537206768989563\n", - "epoch: 1 step 985, loss is 0.3361125588417053\n", - "epoch: 1 step 986, loss is 0.6573469638824463\n", - "epoch: 1 step 987, loss is 0.24379801750183105\n", - "epoch: 1 step 988, loss is 0.31784507632255554\n", - "epoch: 1 step 989, loss is 0.27742987871170044\n", - "epoch: 1 step 990, loss is 0.5038543343544006\n", - "epoch: 1 step 991, loss is 0.09328068047761917\n", - "epoch: 1 step 992, loss is 0.40782731771469116\n", - "epoch: 1 step 993, loss is 0.08224949240684509\n", - "epoch: 1 step 994, loss is 0.2283739447593689\n", - "epoch: 1 step 995, loss is 0.2058914601802826\n", - "epoch: 1 step 996, loss is 0.5321757793426514\n", - "epoch: 1 step 997, loss is 0.15863986313343048\n", - "epoch: 1 step 998, loss is 0.2732018828392029\n", - "epoch: 1 step 999, loss is 0.21776258945465088\n", - "epoch: 1 step 1000, loss is 0.3760163187980652\n", - "epoch: 1 step 1001, loss is 0.2683853507041931\n", - "epoch: 1 step 1002, loss is 0.2589399218559265\n", - "epoch: 1 step 
1003, loss is 0.04534674063324928\n", - "epoch: 1 step 1004, loss is 0.5600655674934387\n", - "epoch: 1 step 1005, loss is 0.2572353780269623\n", - "epoch: 1 step 1006, loss is 0.7334855198860168\n", - "epoch: 1 step 1007, loss is 0.2793818712234497\n", - "epoch: 1 step 1008, loss is 0.11112275719642639\n", - "epoch: 1 step 1009, loss is 0.4475588798522949\n", - "epoch: 1 step 1010, loss is 0.27338021993637085\n", - "epoch: 1 step 1011, loss is 0.20580638945102692\n", - "epoch: 1 step 1012, loss is 0.40580126643180847\n", - "epoch: 1 step 1013, loss is 0.07276472449302673\n", - "epoch: 1 step 1014, loss is 0.3398958146572113\n", - "epoch: 1 step 1015, loss is 0.6265392303466797\n", - "epoch: 1 step 1016, loss is 0.08471273630857468\n", - "epoch: 1 step 1017, loss is 0.31604471802711487\n", - "epoch: 1 step 1018, loss is 0.338906854391098\n", - "epoch: 1 step 1019, loss is 0.49559280276298523\n", - "epoch: 1 step 1020, loss is 0.27650532126426697\n", - "epoch: 1 step 1021, loss is 0.6406198740005493\n", - "epoch: 1 step 1022, loss is 0.3085279166698456\n", - "epoch: 1 step 1023, loss is 0.4372611939907074\n", - "epoch: 1 step 1024, loss is 0.5810808539390564\n", - "epoch: 1 step 1025, loss is 0.4246627688407898\n", - "epoch: 1 step 1026, loss is 0.3712407648563385\n", - "epoch: 1 step 1027, loss is 0.3081851303577423\n", - "epoch: 1 step 1028, loss is 0.21821783483028412\n", - "epoch: 1 step 1029, loss is 0.5841119885444641\n", - "epoch: 1 step 1030, loss is 0.3688592314720154\n", - "epoch: 1 step 1031, loss is 0.31897640228271484\n", - "epoch: 1 step 1032, loss is 0.2769280970096588\n", - "epoch: 1 step 1033, loss is 0.5379514098167419\n", - "epoch: 1 step 1034, loss is 0.25380346179008484\n", - "epoch: 1 step 1035, loss is 0.34640347957611084\n", - "epoch: 1 step 1036, loss is 0.16768336296081543\n", - "epoch: 1 step 1037, loss is 0.3480660319328308\n", - "epoch: 1 step 1038, loss is 0.38453879952430725\n", - "epoch: 1 step 1039, loss is 0.3001658022403717\n", - 
"epoch: 1 step 1040, loss is 0.7822790145874023\n", - "epoch: 1 step 1041, loss is 0.2834794819355011\n", - "epoch: 1 step 1042, loss is 0.3137734830379486\n", - "epoch: 1 step 1043, loss is 0.5421685576438904\n", - "epoch: 1 step 1044, loss is 0.18315750360488892\n", - "epoch: 1 step 1045, loss is 0.05776865780353546\n", - "epoch: 1 step 1046, loss is 0.27788466215133667\n", - "epoch: 1 step 1047, loss is 0.3679546117782593\n", - "epoch: 1 step 1048, loss is 0.4025610387325287\n", - "epoch: 1 step 1049, loss is 0.4029242694377899\n", - "epoch: 1 step 1050, loss is 0.16425970196723938\n", - "epoch: 1 step 1051, loss is 0.14769382774829865\n", - "epoch: 1 step 1052, loss is 0.1314668357372284\n", - "epoch: 1 step 1053, loss is 0.2774055004119873\n", - "epoch: 1 step 1054, loss is 0.6040791869163513\n", - "epoch: 1 step 1055, loss is 0.15977488458156586\n", - "epoch: 1 step 1056, loss is 0.3691514730453491\n", - "epoch: 1 step 1057, loss is 0.28412625193595886\n", - "epoch: 1 step 1058, loss is 0.5344435572624207\n", - "epoch: 1 step 1059, loss is 0.4580501317977905\n", - "epoch: 1 step 1060, loss is 0.12012836337089539\n", - "epoch: 1 step 1061, loss is 0.18099641799926758\n", - "epoch: 1 step 1062, loss is 0.08387185633182526\n", - "epoch: 1 step 1063, loss is 0.4229600131511688\n", - "epoch: 1 step 1064, loss is 0.23241564631462097\n", - "epoch: 1 step 1065, loss is 0.6064302921295166\n", - "epoch: 1 step 1066, loss is 0.617723286151886\n", - "epoch: 1 step 1067, loss is 0.08103369176387787\n", - "epoch: 1 step 1068, loss is 0.2632162868976593\n", - "epoch: 1 step 1069, loss is 0.09751598536968231\n", - "epoch: 1 step 1070, loss is 0.09698432683944702\n", - "epoch: 1 step 1071, loss is 0.2307710200548172\n", - "epoch: 1 step 1072, loss is 0.24673278629779816\n", - "epoch: 1 step 1073, loss is 0.47342631220817566\n", - "epoch: 1 step 1074, loss is 0.26532411575317383\n", - "epoch: 1 step 1075, loss is 0.17981591820716858\n", - "epoch: 1 step 1076, loss is 
0.21233554184436798\n", - "epoch: 1 step 1077, loss is 0.1286160945892334\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 1 step 1078, loss is 0.22428953647613525\n", - "epoch: 1 step 1079, loss is 0.19300851225852966\n", - "epoch: 1 step 1080, loss is 0.13316313922405243\n", - "epoch: 1 step 1081, loss is 0.33551234006881714\n", - "epoch: 1 step 1082, loss is 0.6692869067192078\n", - "epoch: 1 step 1083, loss is 0.18172471225261688\n", - "epoch: 1 step 1084, loss is 0.44903361797332764\n", - "epoch: 1 step 1085, loss is 0.22222338616847992\n", - "epoch: 1 step 1086, loss is 0.23797328770160675\n", - "epoch: 1 step 1087, loss is 0.540205717086792\n", - "epoch: 1 step 1088, loss is 0.3685106635093689\n", - "epoch: 1 step 1089, loss is 0.34796908497810364\n", - "epoch: 1 step 1090, loss is 0.06477059423923492\n", - "epoch: 1 step 1091, loss is 0.2196572870016098\n", - "epoch: 1 step 1092, loss is 0.31783750653266907\n", - "epoch: 1 step 1093, loss is 0.4156820774078369\n", - "epoch: 1 step 1094, loss is 0.5367379784584045\n", - "epoch: 1 step 1095, loss is 0.2490120530128479\n", - "epoch: 1 step 1096, loss is 0.2063281387090683\n", - "epoch: 1 step 1097, loss is 0.3062203824520111\n", - "epoch: 1 step 1098, loss is 0.5498185753822327\n", - "epoch: 1 step 1099, loss is 0.24774104356765747\n", - "epoch: 1 step 1100, loss is 0.17051738500595093\n", - "epoch: 1 step 1101, loss is 0.23986303806304932\n", - "epoch: 1 step 1102, loss is 0.4491817057132721\n", - "epoch: 1 step 1103, loss is 0.3186136484146118\n", - "epoch: 1 step 1104, loss is 0.3498857617378235\n", - "epoch: 1 step 1105, loss is 0.28111812472343445\n", - "epoch: 1 step 1106, loss is 0.32438984513282776\n", - "epoch: 1 step 1107, loss is 0.073697529733181\n", - "epoch: 1 step 1108, loss is 0.6178018450737\n", - "epoch: 1 step 1109, loss is 0.3223627805709839\n", - "epoch: 1 step 1110, loss is 0.4303666353225708\n", - "epoch: 1 step 1111, loss is 0.35319793224334717\n", - 
"epoch: 1 step 1112, loss is 0.15253594517707825\n", - "epoch: 1 step 1113, loss is 0.1582525074481964\n", - "epoch: 1 step 1114, loss is 0.29381710290908813\n", - "epoch: 1 step 1115, loss is 0.40499237179756165\n", - "epoch: 1 step 1116, loss is 0.2567298114299774\n", - "epoch: 1 step 1117, loss is 0.1395392268896103\n", - "epoch: 1 step 1118, loss is 0.1815434694290161\n", - "epoch: 1 step 1119, loss is 0.1104380339384079\n", - "epoch: 1 step 1120, loss is 0.19085386395454407\n", - "epoch: 1 step 1121, loss is 0.2832586467266083\n", - "epoch: 1 step 1122, loss is 0.0957142785191536\n", - "epoch: 1 step 1123, loss is 0.06894905865192413\n", - "epoch: 1 step 1124, loss is 0.09497775882482529\n", - "epoch: 1 step 1125, loss is 0.09656362980604172\n", - "epoch: 1 step 1126, loss is 0.20896317064762115\n", - "epoch: 1 step 1127, loss is 0.07586798071861267\n", - "epoch: 1 step 1128, loss is 0.31554341316223145\n", - "epoch: 1 step 1129, loss is 0.30824175477027893\n", - "epoch: 1 step 1130, loss is 0.32638534903526306\n", - "epoch: 1 step 1131, loss is 0.46965035796165466\n", - "epoch: 1 step 1132, loss is 0.40726661682128906\n", - "epoch: 1 step 1133, loss is 0.23672117292881012\n", - "epoch: 1 step 1134, loss is 0.6298201680183411\n", - "epoch: 1 step 1135, loss is 0.20049166679382324\n", - "epoch: 1 step 1136, loss is 0.4530198574066162\n", - "epoch: 1 step 1137, loss is 0.06658273190259933\n", - "epoch: 1 step 1138, loss is 0.03635825589299202\n", - "epoch: 1 step 1139, loss is 0.4011572003364563\n", - "epoch: 1 step 1140, loss is 0.3976319432258606\n", - "epoch: 1 step 1141, loss is 0.282844603061676\n", - "epoch: 1 step 1142, loss is 0.45697933435440063\n", - "epoch: 1 step 1143, loss is 0.15638460218906403\n", - "epoch: 1 step 1144, loss is 0.28267666697502136\n", - "epoch: 1 step 1145, loss is 0.12315250933170319\n", - "epoch: 1 step 1146, loss is 0.27250251173973083\n", - "epoch: 1 step 1147, loss is 0.2064804881811142\n", - "epoch: 1 step 1148, loss is 
0.29979947209358215\n", - "epoch: 1 step 1149, loss is 0.4557610750198364\n", - "epoch: 1 step 1150, loss is 0.3270760774612427\n", - "epoch: 1 step 1151, loss is 0.41850700974464417\n", - "epoch: 1 step 1152, loss is 0.1860130876302719\n", - "epoch: 1 step 1153, loss is 0.27925026416778564\n", - "epoch: 1 step 1154, loss is 0.4810539484024048\n", - "epoch: 1 step 1155, loss is 0.12464289367198944\n", - "epoch: 1 step 1156, loss is 0.16106286644935608\n", - "epoch: 1 step 1157, loss is 0.16907911002635956\n", - "epoch: 1 step 1158, loss is 0.06749197095632553\n", - "epoch: 1 step 1159, loss is 0.29827752709388733\n", - "epoch: 1 step 1160, loss is 0.24341711401939392\n", - "epoch: 1 step 1161, loss is 0.08036495745182037\n", - "epoch: 1 step 1162, loss is 0.24053184688091278\n", - "epoch: 1 step 1163, loss is 0.11434865742921829\n", - "epoch: 1 step 1164, loss is 0.13994774222373962\n", - "epoch: 1 step 1165, loss is 0.045894984155893326\n", - "epoch: 1 step 1166, loss is 0.18184658885002136\n", - "epoch: 1 step 1167, loss is 0.1025833785533905\n", - "epoch: 1 step 1168, loss is 0.29611027240753174\n", - "epoch: 1 step 1169, loss is 0.14438430964946747\n", - "epoch: 1 step 1170, loss is 0.29945775866508484\n", - "epoch: 1 step 1171, loss is 0.2431691586971283\n", - "epoch: 1 step 1172, loss is 0.2653314769268036\n", - "epoch: 1 step 1173, loss is 0.23848384618759155\n", - "epoch: 1 step 1174, loss is 0.42164602875709534\n", - "epoch: 1 step 1175, loss is 0.11151123046875\n", - "epoch: 1 step 1176, loss is 0.42921751737594604\n", - "epoch: 1 step 1177, loss is 0.12506745755672455\n", - "epoch: 1 step 1178, loss is 0.3728172183036804\n", - "epoch: 1 step 1179, loss is 0.22415082156658173\n", - "epoch: 1 step 1180, loss is 0.3576843738555908\n", - "epoch: 1 step 1181, loss is 0.40304967761039734\n", - "epoch: 1 step 1182, loss is 0.40224263072013855\n", - "epoch: 1 step 1183, loss is 0.1798265278339386\n", - "epoch: 1 step 1184, loss is 0.5039652585983276\n", - 
"epoch: 1 step 1185, loss is 0.13534842431545258\n", - "epoch: 1 step 1186, loss is 0.10237590968608856\n", - "epoch: 1 step 1187, loss is 0.11380477249622345\n", - "epoch: 1 step 1188, loss is 0.33532029390335083\n", - "epoch: 1 step 1189, loss is 0.37075909972190857\n", - "epoch: 1 step 1190, loss is 0.2595348656177521\n", - "epoch: 1 step 1191, loss is 0.5487306714057922\n", - "epoch: 1 step 1192, loss is 0.23060090839862823\n", - "epoch: 1 step 1193, loss is 0.20298916101455688\n", - "epoch: 1 step 1194, loss is 0.167954221367836\n", - "epoch: 1 step 1195, loss is 0.31161487102508545\n", - "epoch: 1 step 1196, loss is 0.294167160987854\n", - "epoch: 1 step 1197, loss is 0.19163778424263\n", - "epoch: 1 step 1198, loss is 0.15200237929821014\n", - "epoch: 1 step 1199, loss is 0.32278865575790405\n", - "epoch: 1 step 1200, loss is 0.7320305705070496\n", - "epoch: 1 step 1201, loss is 0.2634473145008087\n", - "epoch: 1 step 1202, loss is 0.1496431529521942\n", - "epoch: 1 step 1203, loss is 0.33701348304748535\n", - "epoch: 1 step 1204, loss is 0.1080637201666832\n", - "epoch: 1 step 1205, loss is 0.41639846563339233\n", - "epoch: 1 step 1206, loss is 0.11405153572559357\n", - "epoch: 1 step 1207, loss is 0.11913920938968658\n", - "epoch: 1 step 1208, loss is 0.31337475776672363\n", - "epoch: 1 step 1209, loss is 0.08249074965715408\n", - "epoch: 1 step 1210, loss is 0.1016281247138977\n", - "epoch: 1 step 1211, loss is 0.1404246836900711\n", - "epoch: 1 step 1212, loss is 0.34405234456062317\n", - "epoch: 1 step 1213, loss is 0.19854076206684113\n", - "epoch: 1 step 1214, loss is 0.12035379558801651\n", - "epoch: 1 step 1215, loss is 0.13874436914920807\n", - "epoch: 1 step 1216, loss is 0.06980980932712555\n", - "epoch: 1 step 1217, loss is 0.12272334098815918\n", - "epoch: 1 step 1218, loss is 0.11663737148046494\n", - "epoch: 1 step 1219, loss is 0.22052861750125885\n", - "epoch: 1 step 1220, loss is 0.15443752706050873\n", - "epoch: 1 step 1221, loss is 
0.20201179385185242\n", - "epoch: 1 step 1222, loss is 0.2150571197271347\n", - "epoch: 1 step 1223, loss is 0.21844343841075897\n", - "epoch: 1 step 1224, loss is 0.030694296583533287\n", - "epoch: 1 step 1225, loss is 0.04256812483072281\n", - "epoch: 1 step 1226, loss is 0.0408121682703495\n", - "epoch: 1 step 1227, loss is 0.22226330637931824\n", - "epoch: 1 step 1228, loss is 0.0520462691783905\n", - "epoch: 1 step 1229, loss is 0.1642502397298813\n", - "epoch: 1 step 1230, loss is 0.4938809275627136\n", - "epoch: 1 step 1231, loss is 0.07367879152297974\n", - "epoch: 1 step 1232, loss is 0.19636599719524384\n", - "epoch: 1 step 1233, loss is 0.05613357201218605\n", - "epoch: 1 step 1234, loss is 0.18221279978752136\n", - "epoch: 1 step 1235, loss is 0.17399460077285767\n", - "epoch: 1 step 1236, loss is 0.21953904628753662\n", - "epoch: 1 step 1237, loss is 0.12992781400680542\n", - "epoch: 1 step 1238, loss is 0.21823962032794952\n", - "epoch: 1 step 1239, loss is 0.06261825561523438\n", - "epoch: 1 step 1240, loss is 0.059050578624010086\n", - "epoch: 1 step 1241, loss is 0.05838501825928688\n", - "epoch: 1 step 1242, loss is 0.38619691133499146\n", - "epoch: 1 step 1243, loss is 0.07106852531433105\n", - "epoch: 1 step 1244, loss is 0.15589046478271484\n", - "epoch: 1 step 1245, loss is 0.04772113263607025\n", - "epoch: 1 step 1246, loss is 0.21523457765579224\n", - "epoch: 1 step 1247, loss is 0.5234687924385071\n", - "epoch: 1 step 1248, loss is 0.4725988507270813\n", - "epoch: 1 step 1249, loss is 0.10416189581155777\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 1 step 1250, loss is 0.524069607257843\n", - "epoch: 1 step 1251, loss is 0.2652841806411743\n", - "epoch: 1 step 1252, loss is 0.2759210765361786\n", - "epoch: 1 step 1253, loss is 0.18665245175361633\n", - "epoch: 1 step 1254, loss is 0.30893442034721375\n", - "epoch: 1 step 1255, loss is 0.23022381961345673\n", - "epoch: 1 step 1256, loss is 
0.12406717985868454\n", - "epoch: 1 step 1257, loss is 0.35657501220703125\n", - "epoch: 1 step 1258, loss is 0.1813289225101471\n", - "epoch: 1 step 1259, loss is 0.19929412007331848\n", - "epoch: 1 step 1260, loss is 0.3767518401145935\n", - "epoch: 1 step 1261, loss is 0.045815952122211456\n", - "epoch: 1 step 1262, loss is 0.3493342995643616\n", - "epoch: 1 step 1263, loss is 0.2095005214214325\n", - "epoch: 1 step 1264, loss is 0.038586486130952835\n", - "epoch: 1 step 1265, loss is 0.21155467629432678\n", - "epoch: 1 step 1266, loss is 0.1904488503932953\n", - "epoch: 1 step 1267, loss is 0.30973947048187256\n", - "epoch: 1 step 1268, loss is 0.0544302798807621\n", - "epoch: 1 step 1269, loss is 0.06591106206178665\n", - "epoch: 1 step 1270, loss is 0.48256736993789673\n", - "epoch: 1 step 1271, loss is 0.13080522418022156\n", - "epoch: 1 step 1272, loss is 0.19014829397201538\n", - "epoch: 1 step 1273, loss is 0.22884391248226166\n", - "epoch: 1 step 1274, loss is 0.08307930827140808\n", - "epoch: 1 step 1275, loss is 0.3609071373939514\n", - "epoch: 1 step 1276, loss is 0.03559282049536705\n", - "epoch: 1 step 1277, loss is 0.1531498283147812\n", - "epoch: 1 step 1278, loss is 0.18672996759414673\n", - "epoch: 1 step 1279, loss is 0.2806112468242645\n", - "epoch: 1 step 1280, loss is 0.27941325306892395\n", - "epoch: 1 step 1281, loss is 0.3150455355644226\n", - "epoch: 1 step 1282, loss is 0.42465540766716003\n", - "epoch: 1 step 1283, loss is 0.06855327636003494\n", - "epoch: 1 step 1284, loss is 0.07230948656797409\n", - "epoch: 1 step 1285, loss is 0.2690505385398865\n", - "epoch: 1 step 1286, loss is 0.24108488857746124\n", - "epoch: 1 step 1287, loss is 0.15547722578048706\n", - "epoch: 1 step 1288, loss is 0.13465237617492676\n", - "epoch: 1 step 1289, loss is 0.6730027794837952\n", - "epoch: 1 step 1290, loss is 0.13007678091526031\n", - "epoch: 1 step 1291, loss is 0.02121412381529808\n", - "epoch: 1 step 1292, loss is 0.049784980714321136\n", - 
"epoch: 1 step 1293, loss is 0.11070896685123444\n", - "epoch: 1 step 1294, loss is 0.2565460205078125\n", - "epoch: 1 step 1295, loss is 0.05546726658940315\n", - "epoch: 1 step 1296, loss is 0.11611609905958176\n", - "epoch: 1 step 1297, loss is 0.09831753373146057\n", - "epoch: 1 step 1298, loss is 0.05479292571544647\n", - "epoch: 1 step 1299, loss is 0.21358942985534668\n", - "epoch: 1 step 1300, loss is 0.325830340385437\n", - "epoch: 1 step 1301, loss is 0.5012524127960205\n", - "epoch: 1 step 1302, loss is 0.294604629278183\n", - "epoch: 1 step 1303, loss is 0.15517286956310272\n", - "epoch: 1 step 1304, loss is 0.039048220962285995\n", - "epoch: 1 step 1305, loss is 0.0981094092130661\n", - "epoch: 1 step 1306, loss is 0.17601901292800903\n", - "epoch: 1 step 1307, loss is 0.1788097769021988\n", - "epoch: 1 step 1308, loss is 0.27252811193466187\n", - "epoch: 1 step 1309, loss is 0.2025330811738968\n", - "epoch: 1 step 1310, loss is 0.3006960153579712\n", - "epoch: 1 step 1311, loss is 0.2401321530342102\n", - "epoch: 1 step 1312, loss is 0.11384464800357819\n", - "epoch: 1 step 1313, loss is 0.13461089134216309\n", - "epoch: 1 step 1314, loss is 0.12387176603078842\n", - "epoch: 1 step 1315, loss is 0.034613706171512604\n", - "epoch: 1 step 1316, loss is 0.191122904419899\n", - "epoch: 1 step 1317, loss is 0.03321123495697975\n", - "epoch: 1 step 1318, loss is 0.13519324362277985\n", - "epoch: 1 step 1319, loss is 0.23534858226776123\n", - "epoch: 1 step 1320, loss is 0.036745283752679825\n", - "epoch: 1 step 1321, loss is 0.37704044580459595\n", - "epoch: 1 step 1322, loss is 0.4474242031574249\n", - "epoch: 1 step 1323, loss is 0.26421141624450684\n", - "epoch: 1 step 1324, loss is 0.4650835692882538\n", - "epoch: 1 step 1325, loss is 0.1605958789587021\n", - "epoch: 1 step 1326, loss is 0.09721336513757706\n", - "epoch: 1 step 1327, loss is 0.23781290650367737\n", - "epoch: 1 step 1328, loss is 0.2742028534412384\n", - "epoch: 1 step 1329, loss is 
0.2173849642276764\n", - "epoch: 1 step 1330, loss is 0.13021768629550934\n", - "epoch: 1 step 1331, loss is 0.13957762718200684\n", - "epoch: 1 step 1332, loss is 0.05297097936272621\n", - "epoch: 1 step 1333, loss is 0.2608557343482971\n", - "epoch: 1 step 1334, loss is 0.34402093291282654\n", - "epoch: 1 step 1335, loss is 0.11400490254163742\n", - "epoch: 1 step 1336, loss is 0.25951486825942993\n", - "epoch: 1 step 1337, loss is 0.31313174962997437\n", - "epoch: 1 step 1338, loss is 0.19678816199302673\n", - "epoch: 1 step 1339, loss is 0.14140719175338745\n", - "epoch: 1 step 1340, loss is 0.06490781158208847\n", - "epoch: 1 step 1341, loss is 0.2738936245441437\n", - "epoch: 1 step 1342, loss is 0.14719265699386597\n", - "epoch: 1 step 1343, loss is 0.13369572162628174\n", - "epoch: 1 step 1344, loss is 0.18537528812885284\n", - "epoch: 1 step 1345, loss is 0.1548815816640854\n", - "epoch: 1 step 1346, loss is 0.20167259871959686\n", - "epoch: 1 step 1347, loss is 0.04860328882932663\n", - "epoch: 1 step 1348, loss is 0.26139897108078003\n", - "epoch: 1 step 1349, loss is 0.27616557478904724\n", - "epoch: 1 step 1350, loss is 0.09206501394510269\n", - "epoch: 1 step 1351, loss is 0.1818128377199173\n", - "epoch: 1 step 1352, loss is 0.12076954543590546\n", - "epoch: 1 step 1353, loss is 0.21239732205867767\n", - "epoch: 1 step 1354, loss is 0.3705439865589142\n", - "epoch: 1 step 1355, loss is 0.35355713963508606\n", - "epoch: 1 step 1356, loss is 0.231022447347641\n", - "epoch: 1 step 1357, loss is 0.22299233078956604\n", - "epoch: 1 step 1358, loss is 0.014381278306245804\n", - "epoch: 1 step 1359, loss is 0.19613440334796906\n", - "epoch: 1 step 1360, loss is 0.45969653129577637\n", - "epoch: 1 step 1361, loss is 0.07987777143716812\n", - "epoch: 1 step 1362, loss is 0.2917234003543854\n", - "epoch: 1 step 1363, loss is 0.21176476776599884\n", - "epoch: 1 step 1364, loss is 0.15656745433807373\n", - "epoch: 1 step 1365, loss is 0.1705927699804306\n", - 
"epoch: 1 step 1366, loss is 0.28086960315704346\n", - "epoch: 1 step 1367, loss is 0.27005940675735474\n", - "epoch: 1 step 1368, loss is 0.14511409401893616\n", - "epoch: 1 step 1369, loss is 0.09363985061645508\n", - "epoch: 1 step 1370, loss is 0.18109314143657684\n", - "epoch: 1 step 1371, loss is 0.22667500376701355\n", - "epoch: 1 step 1372, loss is 0.4181455969810486\n", - "epoch: 1 step 1373, loss is 0.060120873153209686\n", - "epoch: 1 step 1374, loss is 0.37927189469337463\n", - "epoch: 1 step 1375, loss is 0.21917177736759186\n", - "epoch: 1 step 1376, loss is 0.554511547088623\n", - "epoch: 1 step 1377, loss is 0.430952787399292\n", - "epoch: 1 step 1378, loss is 0.5707189440727234\n", - "epoch: 1 step 1379, loss is 0.304430216550827\n", - "epoch: 1 step 1380, loss is 0.15270961821079254\n", - "epoch: 1 step 1381, loss is 0.12651991844177246\n", - "epoch: 1 step 1382, loss is 0.15000121295452118\n", - "epoch: 1 step 1383, loss is 0.1347784847021103\n", - "epoch: 1 step 1384, loss is 0.13474896550178528\n", - "epoch: 1 step 1385, loss is 0.1895076185464859\n", - "epoch: 1 step 1386, loss is 0.19797468185424805\n", - "epoch: 1 step 1387, loss is 0.378776490688324\n", - "epoch: 1 step 1388, loss is 0.3142922818660736\n", - "epoch: 1 step 1389, loss is 0.18020403385162354\n", - "epoch: 1 step 1390, loss is 0.20521847903728485\n", - "epoch: 1 step 1391, loss is 0.15917985141277313\n", - "epoch: 1 step 1392, loss is 0.12527571618556976\n", - "epoch: 1 step 1393, loss is 0.07082045078277588\n", - "epoch: 1 step 1394, loss is 0.0378243662416935\n", - "epoch: 1 step 1395, loss is 0.3067173957824707\n", - "epoch: 1 step 1396, loss is 0.09532315284013748\n", - "epoch: 1 step 1397, loss is 0.22564195096492767\n", - "epoch: 1 step 1398, loss is 0.06700921058654785\n", - "epoch: 1 step 1399, loss is 0.19249945878982544\n", - "epoch: 1 step 1400, loss is 0.09617947787046432\n", - "epoch: 1 step 1401, loss is 0.39516302943229675\n", - "epoch: 1 step 1402, loss is 
0.09691809862852097\n", - "epoch: 1 step 1403, loss is 0.048507556319236755\n", - "epoch: 1 step 1404, loss is 0.33102044463157654\n", - "epoch: 1 step 1405, loss is 0.07082463055849075\n", - "epoch: 1 step 1406, loss is 0.12862513959407806\n", - "epoch: 1 step 1407, loss is 0.42798691987991333\n", - "epoch: 1 step 1408, loss is 0.20406198501586914\n", - "epoch: 1 step 1409, loss is 0.23511460423469543\n", - "epoch: 1 step 1410, loss is 0.2709139585494995\n", - "epoch: 1 step 1411, loss is 0.19246803224086761\n", - "epoch: 1 step 1412, loss is 0.20838122069835663\n", - "epoch: 1 step 1413, loss is 0.21794021129608154\n", - "epoch: 1 step 1414, loss is 0.08478481322526932\n", - "epoch: 1 step 1415, loss is 0.012655519880354404\n", - "epoch: 1 step 1416, loss is 0.14444799721240997\n", - "epoch: 1 step 1417, loss is 0.037335995584726334\n", - "epoch: 1 step 1418, loss is 0.06604143232107162\n", - "epoch: 1 step 1419, loss is 0.14586074650287628\n", - "epoch: 1 step 1420, loss is 0.1347969025373459\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 1 step 1421, loss is 0.2732638716697693\n", - "epoch: 1 step 1422, loss is 0.28914135694503784\n", - "epoch: 1 step 1423, loss is 0.18879273533821106\n", - "epoch: 1 step 1424, loss is 0.13441845774650574\n", - "epoch: 1 step 1425, loss is 0.09011184424161911\n", - "epoch: 1 step 1426, loss is 0.1790235936641693\n", - "epoch: 1 step 1427, loss is 0.18365859985351562\n", - "epoch: 1 step 1428, loss is 0.030408916994929314\n", - "epoch: 1 step 1429, loss is 0.24467791616916656\n", - "epoch: 1 step 1430, loss is 0.29295310378074646\n", - "epoch: 1 step 1431, loss is 0.07330065965652466\n", - "epoch: 1 step 1432, loss is 0.11970701068639755\n", - "epoch: 1 step 1433, loss is 0.19104799628257751\n", - "epoch: 1 step 1434, loss is 0.22241932153701782\n", - "epoch: 1 step 1435, loss is 0.2240758091211319\n", - "epoch: 1 step 1436, loss is 0.10165629535913467\n", - "epoch: 1 step 1437, loss is 
0.20622384548187256\n", - "epoch: 1 step 1438, loss is 0.22979219257831573\n", - "epoch: 1 step 1439, loss is 0.4825741946697235\n", - "epoch: 1 step 1440, loss is 0.34842449426651\n", - "epoch: 1 step 1441, loss is 0.24044188857078552\n", - "epoch: 1 step 1442, loss is 0.11065179854631424\n", - "epoch: 1 step 1443, loss is 0.11732476949691772\n", - "epoch: 1 step 1444, loss is 0.17063923180103302\n", - "epoch: 1 step 1445, loss is 0.14149747788906097\n", - "epoch: 1 step 1446, loss is 0.17800576984882355\n", - "epoch: 1 step 1447, loss is 0.12914671003818512\n", - "epoch: 1 step 1448, loss is 0.287578821182251\n", - "epoch: 1 step 1449, loss is 0.1374102085828781\n", - "epoch: 1 step 1450, loss is 0.16122445464134216\n", - "epoch: 1 step 1451, loss is 0.09191429615020752\n", - "epoch: 1 step 1452, loss is 0.08258654922246933\n", - "epoch: 1 step 1453, loss is 0.026686370372772217\n", - "epoch: 1 step 1454, loss is 0.14997930824756622\n", - "epoch: 1 step 1455, loss is 0.2664296329021454\n", - "epoch: 1 step 1456, loss is 0.2523840069770813\n", - "epoch: 1 step 1457, loss is 0.26017773151397705\n", - "epoch: 1 step 1458, loss is 0.18211503326892853\n", - "epoch: 1 step 1459, loss is 0.039881281554698944\n", - "epoch: 1 step 1460, loss is 0.1660570502281189\n", - "epoch: 1 step 1461, loss is 0.3116016089916229\n", - "epoch: 1 step 1462, loss is 0.14994260668754578\n", - "epoch: 1 step 1463, loss is 0.3222777545452118\n", - "epoch: 1 step 1464, loss is 0.1977226287126541\n", - "epoch: 1 step 1465, loss is 0.11967132985591888\n", - "epoch: 1 step 1466, loss is 0.10561515390872955\n", - "epoch: 1 step 1467, loss is 0.2444068342447281\n", - "epoch: 1 step 1468, loss is 0.2793606221675873\n", - "epoch: 1 step 1469, loss is 0.031023547053337097\n", - "epoch: 1 step 1470, loss is 0.025633910670876503\n", - "epoch: 1 step 1471, loss is 0.18152564764022827\n", - "epoch: 1 step 1472, loss is 0.06083272397518158\n", - "epoch: 1 step 1473, loss is 0.2525058686733246\n", - 
"epoch: 1 step 1474, loss is 0.15242691338062286\n", - "epoch: 1 step 1475, loss is 0.07130670547485352\n", - "epoch: 1 step 1476, loss is 0.0347457155585289\n", - "epoch: 1 step 1477, loss is 0.147914320230484\n", - "epoch: 1 step 1478, loss is 0.060940198600292206\n", - "epoch: 1 step 1479, loss is 0.4348907768726349\n", - "epoch: 1 step 1480, loss is 0.2367706149816513\n", - "epoch: 1 step 1481, loss is 0.3024718463420868\n", - "epoch: 1 step 1482, loss is 0.11994308233261108\n", - "epoch: 1 step 1483, loss is 0.05357520282268524\n", - "epoch: 1 step 1484, loss is 0.31533414125442505\n", - "epoch: 1 step 1485, loss is 0.3344602882862091\n", - "epoch: 1 step 1486, loss is 0.08471953123807907\n", - "epoch: 1 step 1487, loss is 0.0591905303299427\n", - "epoch: 1 step 1488, loss is 0.12241467088460922\n", - "epoch: 1 step 1489, loss is 0.1028100997209549\n", - "epoch: 1 step 1490, loss is 0.0971469059586525\n", - "epoch: 1 step 1491, loss is 0.17085614800453186\n", - "epoch: 1 step 1492, loss is 0.20307905972003937\n", - "epoch: 1 step 1493, loss is 0.15554949641227722\n", - "epoch: 1 step 1494, loss is 0.152117058634758\n", - "epoch: 1 step 1495, loss is 0.036166153848171234\n", - "epoch: 1 step 1496, loss is 0.0663430467247963\n", - "epoch: 1 step 1497, loss is 0.06434255093336105\n", - "epoch: 1 step 1498, loss is 0.028085848316550255\n", - "epoch: 1 step 1499, loss is 0.10241769999265671\n", - "epoch: 1 step 1500, loss is 0.22307536005973816\n", - "epoch: 1 step 1501, loss is 0.016080163419246674\n", - "epoch: 1 step 1502, loss is 0.2076130211353302\n", - "epoch: 1 step 1503, loss is 0.14079713821411133\n", - "epoch: 1 step 1504, loss is 0.17134781181812286\n", - "epoch: 1 step 1505, loss is 0.24002285301685333\n", - "epoch: 1 step 1506, loss is 0.07507986575365067\n", - "epoch: 1 step 1507, loss is 0.015371493995189667\n", - "epoch: 1 step 1508, loss is 0.35458993911743164\n", - "epoch: 1 step 1509, loss is 0.13178271055221558\n", - "epoch: 1 step 1510, loss is 
0.07642673701047897\n", - "epoch: 1 step 1511, loss is 0.24822889268398285\n", - "epoch: 1 step 1512, loss is 0.16871564090251923\n", - "epoch: 1 step 1513, loss is 0.40745192766189575\n", - "epoch: 1 step 1514, loss is 0.2804504632949829\n", - "epoch: 1 step 1515, loss is 0.1647489070892334\n", - "epoch: 1 step 1516, loss is 0.2085198163986206\n", - "epoch: 1 step 1517, loss is 0.011187231168150902\n", - "epoch: 1 step 1518, loss is 0.06131260469555855\n", - "epoch: 1 step 1519, loss is 0.13793037831783295\n", - "epoch: 1 step 1520, loss is 0.08268798887729645\n", - "epoch: 1 step 1521, loss is 0.2918003499507904\n", - "epoch: 1 step 1522, loss is 0.1475374847650528\n", - "epoch: 1 step 1523, loss is 0.05017763748764992\n", - "epoch: 1 step 1524, loss is 0.4219624102115631\n", - "epoch: 1 step 1525, loss is 0.07047577202320099\n", - "epoch: 1 step 1526, loss is 0.36795493960380554\n", - "epoch: 1 step 1527, loss is 0.205402672290802\n", - "epoch: 1 step 1528, loss is 0.3113693594932556\n", - "epoch: 1 step 1529, loss is 0.018939964473247528\n", - "epoch: 1 step 1530, loss is 0.11963711678981781\n", - "epoch: 1 step 1531, loss is 0.26485997438430786\n", - "epoch: 1 step 1532, loss is 0.11608637869358063\n", - "epoch: 1 step 1533, loss is 0.038664810359478\n", - "epoch: 1 step 1534, loss is 0.16135820746421814\n", - "epoch: 1 step 1535, loss is 0.027460472658276558\n", - "epoch: 1 step 1536, loss is 0.10712457448244095\n", - "epoch: 1 step 1537, loss is 0.2344018816947937\n", - "epoch: 1 step 1538, loss is 0.535405158996582\n", - "epoch: 1 step 1539, loss is 0.22821743786334991\n", - "epoch: 1 step 1540, loss is 0.1968739926815033\n", - "epoch: 1 step 1541, loss is 0.29195213317871094\n", - "epoch: 1 step 1542, loss is 0.025655176490545273\n", - "epoch: 1 step 1543, loss is 0.07978092133998871\n", - "epoch: 1 step 1544, loss is 0.1264479160308838\n", - "epoch: 1 step 1545, loss is 0.22964981198310852\n", - "epoch: 1 step 1546, loss is 0.12184898555278778\n", - 
"epoch: 1 step 1547, loss is 0.4731254279613495\n", - "epoch: 1 step 1548, loss is 0.03339137136936188\n", - "epoch: 1 step 1549, loss is 0.014725659042596817\n", - "epoch: 1 step 1550, loss is 0.34388190507888794\n", - "epoch: 1 step 1551, loss is 0.2549399733543396\n", - "epoch: 1 step 1552, loss is 0.24979887902736664\n", - "epoch: 1 step 1553, loss is 0.19499702751636505\n", - "epoch: 1 step 1554, loss is 0.1645338535308838\n", - "epoch: 1 step 1555, loss is 0.24743524193763733\n", - "epoch: 1 step 1556, loss is 0.03985191136598587\n", - "epoch: 1 step 1557, loss is 0.07091329246759415\n", - "epoch: 1 step 1558, loss is 0.052609167993068695\n", - "epoch: 1 step 1559, loss is 0.07124129682779312\n", - "epoch: 1 step 1560, loss is 0.0961894765496254\n", - "epoch: 1 step 1561, loss is 0.0910254493355751\n", - "epoch: 1 step 1562, loss is 0.11323731392621994\n", - "epoch: 1 step 1563, loss is 0.17447282373905182\n", - "epoch: 1 step 1564, loss is 0.27797597646713257\n", - "epoch: 1 step 1565, loss is 0.12834854423999786\n", - "epoch: 1 step 1566, loss is 0.04837791621685028\n", - "epoch: 1 step 1567, loss is 0.23610715568065643\n", - "epoch: 1 step 1568, loss is 0.00686205830425024\n", - "epoch: 1 step 1569, loss is 0.18006019294261932\n", - "epoch: 1 step 1570, loss is 0.0256658885627985\n", - "epoch: 1 step 1571, loss is 0.15297570824623108\n", - "epoch: 1 step 1572, loss is 0.1414184272289276\n", - "epoch: 1 step 1573, loss is 0.17526394128799438\n", - "epoch: 1 step 1574, loss is 0.08118576556444168\n", - "epoch: 1 step 1575, loss is 0.10771070420742035\n", - "epoch: 1 step 1576, loss is 0.03570283204317093\n", - "epoch: 1 step 1577, loss is 0.11714303493499756\n", - "epoch: 1 step 1578, loss is 0.1816290020942688\n", - "epoch: 1 step 1579, loss is 0.021464530378580093\n", - "epoch: 1 step 1580, loss is 0.11372466385364532\n", - "epoch: 1 step 1581, loss is 0.17419934272766113\n", - "epoch: 1 step 1582, loss is 0.011160505935549736\n", - "epoch: 1 step 1583, 
loss is 0.24291273951530457\n", - "epoch: 1 step 1584, loss is 0.010671941563487053\n", - "epoch: 1 step 1585, loss is 0.017322441563010216\n", - "epoch: 1 step 1586, loss is 0.2335474044084549\n", - "epoch: 1 step 1587, loss is 0.2556104063987732\n", - "epoch: 1 step 1588, loss is 0.0648474395275116\n", - "epoch: 1 step 1589, loss is 0.22502899169921875\n", - "epoch: 1 step 1590, loss is 0.47354650497436523\n", - "epoch: 1 step 1591, loss is 0.2790185511112213\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 1 step 1592, loss is 0.037289269268512726\n", - "epoch: 1 step 1593, loss is 0.6669546961784363\n", - "epoch: 1 step 1594, loss is 0.027973078191280365\n", - "epoch: 1 step 1595, loss is 0.3043333888053894\n", - "epoch: 1 step 1596, loss is 0.26716819405555725\n", - "epoch: 1 step 1597, loss is 0.3676808476448059\n", - "epoch: 1 step 1598, loss is 0.04774334654211998\n", - "epoch: 1 step 1599, loss is 0.1505802571773529\n", - "epoch: 1 step 1600, loss is 0.060209549963474274\n", - "epoch: 1 step 1601, loss is 0.1517183780670166\n", - "epoch: 1 step 1602, loss is 0.03348312899470329\n", - "epoch: 1 step 1603, loss is 0.2172851860523224\n", - "epoch: 1 step 1604, loss is 0.16908381879329681\n", - "epoch: 1 step 1605, loss is 0.4056672155857086\n", - "epoch: 1 step 1606, loss is 0.3545580506324768\n", - "epoch: 1 step 1607, loss is 0.0785195454955101\n", - "epoch: 1 step 1608, loss is 0.06628265231847763\n", - "epoch: 1 step 1609, loss is 0.17911334335803986\n", - "epoch: 1 step 1610, loss is 0.30736130475997925\n", - "epoch: 1 step 1611, loss is 0.22257380187511444\n", - "epoch: 1 step 1612, loss is 0.14600497484207153\n", - "epoch: 1 step 1613, loss is 0.05811571329832077\n", - "epoch: 1 step 1614, loss is 0.2594680190086365\n", - "epoch: 1 step 1615, loss is 0.14777813851833344\n", - "epoch: 1 step 1616, loss is 0.21176908910274506\n", - "epoch: 1 step 1617, loss is 0.08806657046079636\n", - "epoch: 1 step 1618, loss is 
0.08265901356935501\n", - "epoch: 1 step 1619, loss is 0.06283356994390488\n", - "epoch: 1 step 1620, loss is 0.244098961353302\n", - "epoch: 1 step 1621, loss is 0.16140873730182648\n", - "epoch: 1 step 1622, loss is 0.4248126149177551\n", - "epoch: 1 step 1623, loss is 0.5076652765274048\n", - "epoch: 1 step 1624, loss is 0.12746888399124146\n", - "epoch: 1 step 1625, loss is 0.1834188550710678\n", - "epoch: 1 step 1626, loss is 0.14382439851760864\n", - "epoch: 1 step 1627, loss is 0.19014105200767517\n", - "epoch: 1 step 1628, loss is 0.30281925201416016\n", - "epoch: 1 step 1629, loss is 0.09651453047990799\n", - "epoch: 1 step 1630, loss is 0.2950707972049713\n", - "epoch: 1 step 1631, loss is 0.033858783543109894\n", - "epoch: 1 step 1632, loss is 0.08982428908348083\n", - "epoch: 1 step 1633, loss is 0.18717190623283386\n", - "epoch: 1 step 1634, loss is 0.22012434899806976\n", - "epoch: 1 step 1635, loss is 0.23406831920146942\n", - "epoch: 1 step 1636, loss is 0.19017261266708374\n", - "epoch: 1 step 1637, loss is 0.10751032829284668\n", - "epoch: 1 step 1638, loss is 0.020188376307487488\n", - "epoch: 1 step 1639, loss is 0.027881013229489326\n", - "epoch: 1 step 1640, loss is 0.07523792237043381\n", - "epoch: 1 step 1641, loss is 0.29369279742240906\n", - "epoch: 1 step 1642, loss is 0.21822905540466309\n", - "epoch: 1 step 1643, loss is 0.29793232679367065\n", - "epoch: 1 step 1644, loss is 0.043750420212745667\n", - "epoch: 1 step 1645, loss is 0.09380664676427841\n", - "epoch: 1 step 1646, loss is 0.28589677810668945\n", - "epoch: 1 step 1647, loss is 0.026868261396884918\n", - "epoch: 1 step 1648, loss is 0.18957915902137756\n", - "epoch: 1 step 1649, loss is 0.17896448075771332\n", - "epoch: 1 step 1650, loss is 0.10533048212528229\n", - "epoch: 1 step 1651, loss is 0.19312293827533722\n", - "epoch: 1 step 1652, loss is 0.047780029475688934\n", - "epoch: 1 step 1653, loss is 0.043677132576704025\n", - "epoch: 1 step 1654, loss is 
0.12250451743602753\n", - "epoch: 1 step 1655, loss is 0.24486027657985687\n", - "epoch: 1 step 1656, loss is 0.047279682010412216\n", - "epoch: 1 step 1657, loss is 0.06328191608190536\n", - "epoch: 1 step 1658, loss is 0.08435419946908951\n", - "epoch: 1 step 1659, loss is 0.043745022267103195\n", - "epoch: 1 step 1660, loss is 0.2011641561985016\n", - "epoch: 1 step 1661, loss is 0.36145713925361633\n", - "epoch: 1 step 1662, loss is 0.10772320628166199\n", - "epoch: 1 step 1663, loss is 0.08749391883611679\n", - "epoch: 1 step 1664, loss is 0.30133727192878723\n", - "epoch: 1 step 1665, loss is 0.10319367796182632\n", - "epoch: 1 step 1666, loss is 0.09821230173110962\n", - "epoch: 1 step 1667, loss is 0.016956666484475136\n", - "epoch: 1 step 1668, loss is 0.0710502415895462\n", - "epoch: 1 step 1669, loss is 0.007008133456110954\n", - "epoch: 1 step 1670, loss is 0.21395447850227356\n", - "epoch: 1 step 1671, loss is 0.128061443567276\n", - "epoch: 1 step 1672, loss is 0.20678196847438812\n", - "epoch: 1 step 1673, loss is 0.22415968775749207\n", - "epoch: 1 step 1674, loss is 0.08397642523050308\n", - "epoch: 1 step 1675, loss is 0.18893855810165405\n", - "epoch: 1 step 1676, loss is 0.20789159834384918\n", - "epoch: 1 step 1677, loss is 0.3421628773212433\n", - "epoch: 1 step 1678, loss is 0.08329568058252335\n", - "epoch: 1 step 1679, loss is 0.10777005553245544\n", - "epoch: 1 step 1680, loss is 0.28770652413368225\n", - "epoch: 1 step 1681, loss is 0.16173502802848816\n", - "epoch: 1 step 1682, loss is 0.08911692351102829\n", - "epoch: 1 step 1683, loss is 0.09613072127103806\n", - "epoch: 1 step 1684, loss is 0.0196407288312912\n", - "epoch: 1 step 1685, loss is 0.14691004157066345\n", - "epoch: 1 step 1686, loss is 0.16538022458553314\n", - "epoch: 1 step 1687, loss is 0.17081548273563385\n", - "epoch: 1 step 1688, loss is 0.16752029955387115\n", - "epoch: 1 step 1689, loss is 0.07556203007698059\n", - "epoch: 1 step 1690, loss is 
0.05724847689270973\n", - "epoch: 1 step 1691, loss is 0.07723591476678848\n", - "epoch: 1 step 1692, loss is 0.22748827934265137\n", - "epoch: 1 step 1693, loss is 0.20023450255393982\n", - "epoch: 1 step 1694, loss is 0.045707911252975464\n", - "epoch: 1 step 1695, loss is 0.11219199001789093\n", - "epoch: 1 step 1696, loss is 0.17740048468112946\n", - "epoch: 1 step 1697, loss is 0.058840714395046234\n", - "epoch: 1 step 1698, loss is 0.22153080999851227\n", - "epoch: 1 step 1699, loss is 0.07449723035097122\n", - "epoch: 1 step 1700, loss is 0.3810114860534668\n", - "epoch: 1 step 1701, loss is 0.08629638701677322\n", - "epoch: 1 step 1702, loss is 0.2227892279624939\n", - "epoch: 1 step 1703, loss is 0.16094565391540527\n", - "epoch: 1 step 1704, loss is 0.18608491122722626\n", - "epoch: 1 step 1705, loss is 0.12596172094345093\n", - "epoch: 1 step 1706, loss is 0.20060348510742188\n", - "epoch: 1 step 1707, loss is 0.2897909879684448\n", - "epoch: 1 step 1708, loss is 0.14372855424880981\n", - "epoch: 1 step 1709, loss is 0.09452032297849655\n", - "epoch: 1 step 1710, loss is 0.1649545133113861\n", - "epoch: 1 step 1711, loss is 0.13143958151340485\n", - "epoch: 1 step 1712, loss is 0.08895035833120346\n", - "epoch: 1 step 1713, loss is 0.28394749760627747\n", - "epoch: 1 step 1714, loss is 0.1974935084581375\n", - "epoch: 1 step 1715, loss is 0.10568737238645554\n", - "epoch: 1 step 1716, loss is 0.01058405451476574\n", - "epoch: 1 step 1717, loss is 0.1257704347372055\n", - "epoch: 1 step 1718, loss is 0.0896354541182518\n", - "epoch: 1 step 1719, loss is 0.3377484083175659\n", - "epoch: 1 step 1720, loss is 0.17294782400131226\n", - "epoch: 1 step 1721, loss is 0.11671129614114761\n", - "epoch: 1 step 1722, loss is 0.06993158161640167\n", - "epoch: 1 step 1723, loss is 0.08449213206768036\n", - "epoch: 1 step 1724, loss is 0.0490744486451149\n", - "epoch: 1 step 1725, loss is 0.1778782606124878\n", - "epoch: 1 step 1726, loss is 0.016006093472242355\n", - 
"epoch: 1 step 1727, loss is 0.07804140448570251\n", - "epoch: 1 step 1728, loss is 0.09830362349748611\n", - "epoch: 1 step 1729, loss is 0.4913692772388458\n", - "epoch: 1 step 1730, loss is 0.11207643896341324\n", - "epoch: 1 step 1731, loss is 0.044747576117515564\n", - "epoch: 1 step 1732, loss is 0.0635237842798233\n", - "epoch: 1 step 1733, loss is 0.036319926381111145\n", - "epoch: 1 step 1734, loss is 0.22375133633613586\n", - "epoch: 1 step 1735, loss is 0.12693189084529877\n", - "epoch: 1 step 1736, loss is 0.1794988363981247\n", - "epoch: 1 step 1737, loss is 0.0602247416973114\n", - "epoch: 1 step 1738, loss is 0.37853333353996277\n", - "epoch: 1 step 1739, loss is 0.2785823345184326\n", - "epoch: 1 step 1740, loss is 0.11330924183130264\n", - "epoch: 1 step 1741, loss is 0.24572473764419556\n", - "epoch: 1 step 1742, loss is 0.13654503226280212\n", - "epoch: 1 step 1743, loss is 0.08993854373693466\n", - "epoch: 1 step 1744, loss is 0.14939749240875244\n", - "epoch: 1 step 1745, loss is 0.25143638253211975\n", - "epoch: 1 step 1746, loss is 0.19276076555252075\n", - "epoch: 1 step 1747, loss is 0.1520737111568451\n", - "epoch: 1 step 1748, loss is 0.21680326759815216\n", - "epoch: 1 step 1749, loss is 0.2511528730392456\n", - "epoch: 1 step 1750, loss is 0.11513651907444\n", - "epoch: 1 step 1751, loss is 0.10224784165620804\n", - "epoch: 1 step 1752, loss is 0.20663639903068542\n", - "epoch: 1 step 1753, loss is 0.061586134135723114\n", - "epoch: 1 step 1754, loss is 0.039487749338150024\n", - "epoch: 1 step 1755, loss is 0.2036251574754715\n", - "epoch: 1 step 1756, loss is 0.03966877609491348\n", - "epoch: 1 step 1757, loss is 0.027231315150856972\n", - "epoch: 1 step 1758, loss is 0.11806073784828186\n", - "epoch: 1 step 1759, loss is 0.043962445110082626\n", - "epoch: 1 step 1760, loss is 0.20480386912822723\n", - "epoch: 1 step 1761, loss is 0.1087326630949974\n", - "epoch: 1 step 1762, loss is 0.0780656635761261\n", - "epoch: 1 step 1763, loss 
is 0.19735795259475708\n" - ] - }, - { - "name": "stdout", - "output_type": "stream", - "text": [ - "epoch: 1 step 1764, loss is 0.18277199566364288\n", - "epoch: 1 step 1765, loss is 0.1360340714454651\n", - "epoch: 1 step 1766, loss is 0.017330918461084366\n", - "epoch: 1 step 1767, loss is 0.13024410605430603\n", - "epoch: 1 step 1768, loss is 0.29709434509277344\n", - "epoch: 1 step 1769, loss is 0.10498026013374329\n", - "epoch: 1 step 1770, loss is 0.11527147144079208\n", - "epoch: 1 step 1771, loss is 0.345134437084198\n", - "epoch: 1 step 1772, loss is 0.2717594802379608\n", - "epoch: 1 step 1773, loss is 0.2283378392457962\n", - "epoch: 1 step 1774, loss is 0.0384673997759819\n", - "epoch: 1 step 1775, loss is 0.176451176404953\n", - "epoch: 1 step 1776, loss is 0.09406087547540665\n", - "epoch: 1 step 1777, loss is 0.10655457526445389\n", - "epoch: 1 step 1778, loss is 0.11743699014186859\n", - "epoch: 1 step 1779, loss is 0.20475371181964874\n", - "epoch: 1 step 1780, loss is 0.11362048238515854\n", - "epoch: 1 step 1781, loss is 0.16925252974033356\n", - "epoch: 1 step 1782, loss is 0.1613941192626953\n", - "epoch: 1 step 1783, loss is 0.16446731984615326\n", - "epoch: 1 step 1784, loss is 0.2628989517688751\n", - "epoch: 1 step 1785, loss is 0.2599729597568512\n", - "epoch: 1 step 1786, loss is 0.07843288779258728\n", - "epoch: 1 step 1787, loss is 0.00616246834397316\n", - "epoch: 1 step 1788, loss is 0.14303813874721527\n", - "epoch: 1 step 1789, loss is 0.09004691243171692\n", - "epoch: 1 step 1790, loss is 0.0073612225241959095\n", - "epoch: 1 step 1791, loss is 0.11986434459686279\n", - "epoch: 1 step 1792, loss is 0.37537509202957153\n", - "epoch: 1 step 1793, loss is 0.03435497358441353\n", - "epoch: 1 step 1794, loss is 0.17364943027496338\n", - "epoch: 1 step 1795, loss is 0.09953322261571884\n", - "epoch: 1 step 1796, loss is 0.06589128077030182\n", - "epoch: 1 step 1797, loss is 0.06567772477865219\n", - "epoch: 1 step 1798, loss is 
0.030495651066303253\n", - "epoch: 1 step 1799, loss is 0.13174638152122498\n", - "epoch: 1 step 1800, loss is 0.12769488990306854\n", - "epoch: 1 step 1801, loss is 0.1406596601009369\n", - "epoch: 1 step 1802, loss is 0.01889662630856037\n", - "epoch: 1 step 1803, loss is 0.1025170162320137\n", - "epoch: 1 step 1804, loss is 0.1047017052769661\n", - "epoch: 1 step 1805, loss is 0.10653477162122726\n", - "epoch: 1 step 1806, loss is 0.2483469694852829\n", - "epoch: 1 step 1807, loss is 0.4505705237388611\n", - "epoch: 1 step 1808, loss is 0.08267026394605637\n", - "epoch: 1 step 1809, loss is 0.04931456968188286\n", - "epoch: 1 step 1810, loss is 0.18475544452667236\n", - "epoch: 1 step 1811, loss is 0.0898744985461235\n", - "epoch: 1 step 1812, loss is 0.06011299043893814\n", - "epoch: 1 step 1813, loss is 0.0558500699698925\n", - "epoch: 1 step 1814, loss is 0.2750040590763092\n", - "epoch: 1 step 1815, loss is 0.08818687498569489\n", - "epoch: 1 step 1816, loss is 0.1620287448167801\n", - "epoch: 1 step 1817, loss is 0.006809838116168976\n", - "epoch: 1 step 1818, loss is 0.04070911556482315\n", - "epoch: 1 step 1819, loss is 0.029029523953795433\n", - "epoch: 1 step 1820, loss is 0.1900891661643982\n", - "epoch: 1 step 1821, loss is 0.3131425380706787\n", - "epoch: 1 step 1822, loss is 0.12208942323923111\n", - "epoch: 1 step 1823, loss is 0.008195384405553341\n", - "epoch: 1 step 1824, loss is 0.20186489820480347\n", - "epoch: 1 step 1825, loss is 0.10134484618902206\n", - "epoch: 1 step 1826, loss is 0.15988561511039734\n", - "epoch: 1 step 1827, loss is 0.055812690407037735\n", - "epoch: 1 step 1828, loss is 0.14566737413406372\n", - "epoch: 1 step 1829, loss is 0.11692202836275101\n", - "epoch: 1 step 1830, loss is 0.057336632162332535\n", - "epoch: 1 step 1831, loss is 0.13866601884365082\n", - "epoch: 1 step 1832, loss is 0.3332042098045349\n", - "epoch: 1 step 1833, loss is 0.26320505142211914\n", - "epoch: 1 step 1834, loss is 0.06900594383478165\n", - 
"epoch: 1 step 1835, loss is 0.02555154450237751\n", - "epoch: 1 step 1836, loss is 0.03933090716600418\n", - "epoch: 1 step 1837, loss is 0.021902436390519142\n", - "epoch: 1 step 1838, loss is 0.1975776106119156\n", - "epoch: 1 step 1839, loss is 0.2500876486301422\n", - "epoch: 1 step 1840, loss is 0.21521836519241333\n", - "epoch: 1 step 1841, loss is 0.10229639708995819\n", - "epoch: 1 step 1842, loss is 0.04220893234014511\n", - "epoch: 1 step 1843, loss is 0.055768031626939774\n", - "epoch: 1 step 1844, loss is 0.2947942316532135\n", - "epoch: 1 step 1845, loss is 0.12519270181655884\n", - "epoch: 1 step 1846, loss is 0.03666522726416588\n", - "epoch: 1 step 1847, loss is 0.06074002757668495\n", - "epoch: 1 step 1848, loss is 0.0761866495013237\n", - "epoch: 1 step 1849, loss is 0.23314397037029266\n", - "epoch: 1 step 1850, loss is 0.029110223054885864\n", - "epoch: 1 step 1851, loss is 0.31116774678230286\n", - "epoch: 1 step 1852, loss is 0.055939022451639175\n", - "epoch: 1 step 1853, loss is 0.2014581859111786\n", - "epoch: 1 step 1854, loss is 0.019982852041721344\n", - "epoch: 1 step 1855, loss is 0.052759602665901184\n", - "epoch: 1 step 1856, loss is 0.21708421409130096\n", - "epoch: 1 step 1857, loss is 0.03414442390203476\n", - "epoch: 1 step 1858, loss is 0.037223730236291885\n", - "epoch: 1 step 1859, loss is 0.01988343521952629\n", - "epoch: 1 step 1860, loss is 0.03718801960349083\n", - "epoch: 1 step 1861, loss is 0.47103869915008545\n", - "epoch: 1 step 1862, loss is 0.0968848392367363\n", - "epoch: 1 step 1863, loss is 0.06905733793973923\n", - "epoch: 1 step 1864, loss is 0.13268259167671204\n", - "epoch: 1 step 1865, loss is 0.3494003713130951\n", - "epoch: 1 step 1866, loss is 0.0347442626953125\n", - "epoch: 1 step 1867, loss is 0.08115438371896744\n", - "epoch: 1 step 1868, loss is 0.12732252478599548\n", - "epoch: 1 step 1869, loss is 0.08057542145252228\n", - "epoch: 1 step 1870, loss is 0.05422119051218033\n", - "epoch: 1 step 1871, 
loss is 0.09048718959093094\n", - "epoch: 1 step 1872, loss is 0.13636641204357147\n", - "epoch: 1 step 1873, loss is 0.2992154359817505\n", - "epoch: 1 step 1874, loss is 0.30933624505996704\n", - "epoch: 1 step 1875, loss is 0.06503278017044067\n", - "Epoch time: 22519.637, per step time: 12.010, avg loss: 1.140\n", + "epoch: 1 step 125, loss is 2.3136098384857178\n", + "epoch: 1 step 250, loss is 2.303882598876953\n", + "epoch: 1 step 375, loss is 2.3046326637268066\n", + "epoch: 1 step 500, loss is 2.3024802207946777\n", + "epoch: 1 step 625, loss is 2.3106091022491455\n", + "epoch: 1 step 750, loss is 2.298833131790161\n", + "epoch: 1 step 875, loss is 2.3070852756500244\n", + "epoch: 1 step 1000, loss is 2.284291982650757\n", + "epoch: 1 step 1125, loss is 0.7130898237228394\n", + "epoch: 1 step 1250, loss is 0.17307262122631073\n", + "epoch: 1 step 1375, loss is 0.3248927891254425\n", + "epoch: 1 step 1500, loss is 0.09352534264326096\n", + "epoch: 1 step 1625, loss is 0.025928258895874023\n", + "epoch: 1 step 1750, loss is 0.0918595939874649\n", + "epoch: 1 step 1875, loss is 0.20610764622688293\n", + "Epoch time: 15709.893, per step time: 8.379, avg loss: 1.440\n", "************************************************************\n" ] } @@ -2721,7 +801,7 @@ "outputs": [ { "data": { - "image/png": "\n", + "image/png": "\n", "text/plain": [ "
" ] @@ -2788,7 +868,7 @@ "output_type": "stream", "text": [ "============== Starting Testing ==============\n", - "============== Accuracy:{'Accuracy': 0.9613381410256411} ==============\n" + "============== Accuracy:{'Accuracy': 0.9553285256410257} ==============\n" ] } ], @@ -2831,7 +911,7 @@ "outputs": [ { "data": { - "image/png": "\n", + "image/png": "\n", "text/plain": [ "
" ] @@ -2915,19 +995,17 @@ "name": "stdout", "output_type": "stream", "text": [ - "Row 2, column 5 is incorrectly identified as 8, the correct value should be 3 \n", - "\n", - "Row 3, column 3 is incorrectly identified as 5, the correct value should be 8 \n", + "Row 1, column 7 is incorrectly identified as 2, the correct value should be 3 \n", "\n", - "Row 4, column 6 is incorrectly identified as 6, the correct value should be 0 \n", + "Row 4, column 3 is incorrectly identified as 0, the correct value should be 8 \n", "\n", - "[8 0 5 4 9 1 8 7 0 5 2 4 8 2 4 1 2 1 5 3 9 8 5 7 1 9 3 8 2 6 0 1] <--Predicted figures\n", - "[8 0 5 4 9 1 8 7 0 5 2 4 3 2 4 1 2 1 8 3 9 8 5 7 1 9 3 8 2 0 0 1] <--The right number\n" + "[2 7 7 0 6 3 2 3 2 5 2 1 8 7 8 3 0 5 2 1 0 8 2 2 1 8 0 3 6 8 8 2] <--Predicted figures\n", + "[2 7 7 0 6 3 3 3 2 5 2 1 8 7 8 3 0 5 2 1 0 8 2 2 1 8 8 3 6 8 8 2] <--The right number\n" ] }, { "data": { - "image/png": "\n", + "image/png": "\n", "text/plain": [ "
" ] @@ -2959,7 +1037,7 @@ " index = 0\n", " print(\"Row {}, column {} is incorrectly identified as {}, the correct value should be {}\".format(int(i/8)+1, i%8+1, pred[i], labels[i]), '\\n')\n", "if index:\n", - " print(\"All the figures in this group are predicted correctly!\")\n", + " print(\"All the figures in this group are predicted correctly!\")\n", "print(pred, \"<--Predicted figures\") \n", "print(labels, \"<--The right number\")\n", "plt.show()" @@ -2984,13 +1062,13 @@ "output_type": "stream", "text": [ "Figure 1 probability of corresponding numbers [0-9]:\n", - " [-3.4469228 2.0546532 -3.5465317 -1.3858355 -0.28758872 1.1322775\n", - " -0.8353202 -3.6402948 8.3553 1.3656969 ]\n" + " [-0.48477417 2.0016153 11.054499 2.3544474 -2.7436607 -3.630352\n", + " -3.7523592 0.9330094 2.6389365 -6.602851 ]\n" ] }, { "data": { - "image/png": "\n", + "image/png": "\n", "text/plain": [ "
" ] @@ -3003,13 +1081,13 @@ "output_type": "stream", "text": [ "Figure 2 probability of corresponding numbers [0-9]:\n", - " [10.008167 -5.861808 1.4555234 -1.482836 -2.721113 -0.8507886\n", - " 3.199972 -2.2156045 0.32849124 -1.4580953 ]\n" + " [-1.916862 -0.16948226 -0.2352289 -0.5903556 0.8726251 -0.41480547\n", + " -3.0238853 4.210627 -0.70848167 1.8492212 ]\n" ] }, { "data": { - "image/png": "\n", + "image/png": "\n", "text/plain": [ "
" ] diff --git a/tutorials/source_zh_cn/advanced_use/customized_debugging_information.md b/tutorials/source_zh_cn/advanced_use/customized_debugging_information.md index 5df1e2da144f9a23353310778f83d36760d53431..1ad1416adb435c816444792446b600465af63990 100644 --- a/tutorials/source_zh_cn/advanced_use/customized_debugging_information.md +++ b/tutorials/source_zh_cn/advanced_use/customized_debugging_information.md @@ -14,6 +14,8 @@ +   + ## 概述