CRNN-CTC cannot download file data.tar.gz
Created by: sfraczek
Problem description
I cannot download the CRNN-CTC dataset referenced at https://github.com/PaddlePaddle/models/blob/develop/fluid/ocr_recognition/ctc_reader.py#L14. The server resets the connection during the download. I have already retried on different networks and machines, with the same result. Could you upload the file somewhere else? What does this file contain, and is it available from another source?
Command run
python ctc_train.py
Output
λ 2b5c263aa3a5 /dataset/sfraczek/paddle-models/fluid/ocr_recognition {develop} python ctc_train.py
-----------  Configuration Arguments -----------
average_window: 0.15
batch_size: 32
eval_period: 15000
init_model: None
log_period: 1000
max_average_window: 15625
min_average_window: 10000
parallel: False
pass_num: 100
save_model_dir: ./models
save_model_period: 15000
use_gpu: True
------------------------------------------------
/usr/local/lib/python2.7/dist-packages/paddle/fluid/evaluator.py:64: Warning: The EditDistance is deprecated, because maintain a modified program inside evaluator cause bug easily, please use fluid.metrics.EditDistance instead.
  % (self.__class__.__name__, self.__class__.__name__), Warning)
file md5 b5df92d1c0ab0d585f4771e062a065e1 1de60d54d19632022144e4e58c2637b5
Cache file /root/.cache/paddle/dataset/ctc_data/data.tar.gz not found, downloading http://cloud.dlnel.org/filepub/?uuid=df937251-3c0b-480d-9a7b-0080dfeee65c
Traceback (most recent call last):
  File "ctc_train.py", line 150, in <module>
    main()
  File "ctc_train.py", line 146, in main
    train(args, data_reader=ctc_reader)
  File "ctc_train.py", line 52, in train
    train_list_file=train_list)
  File "/dataset/sfraczek/paddle-models/fluid/ocr_recognition/ctc_reader.py", line 166, in train
    data_dir = download_data()
  File "/dataset/sfraczek/paddle-models/fluid/ocr_recognition/ctc_reader.py", line 194, in download_data
    DATA_URL, CACHE_DIR_NAME, DATA_MD5, save_name=SAVED_FILE_NAME)
  File "/usr/local/lib/python2.7/dist-packages/paddle/v2/dataset/common.py", line 90, in download
    shutil.copyfileobj(r.raw, f)
  File "/usr/lib/python2.7/shutil.py", line 49, in copyfileobj
    buf = fsrc.read(length)
  File "/usr/local/lib/python2.7/dist-packages/requests/packages/urllib3/response.py", line 311, in read
    flush_decoder = True
  File "/usr/lib/python2.7/contextlib.py", line 35, in __exit__
    self.gen.throw(type, value, traceback)
  File "/usr/local/lib/python2.7/dist-packages/requests/packages/urllib3/response.py", line 244, in _error_catcher
    raise ProtocolError('Connection broken: %r' % e, e)
requests.packages.urllib3.exceptions.ProtocolError: ('Connection broken: IncompleteRead(15973 bytes read, 411 more expected)', IncompleteRead(15973 bytes read, 411 more expected))
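In case the archive can be obtained some other way, here is a minimal sketch of how a manually placed copy could be checked before rerunning training. The cache path and the two MD5 values are copied from the log above. I am assuming that `~/.cache` resolves to `/root/.cache` for the user running the script, and I do not know which of the two printed values is the expected checksum, so the sketch accepts either. The function names are hypothetical and not part of Paddle.

```python
import hashlib
import os

# Cache location as printed in the log above (assumes the same user/home dir).
CACHE_FILE = os.path.expanduser("~/.cache/paddle/dataset/ctc_data/data.tar.gz")

# Both values from the "file md5" log line; which one is expected is unknown.
CANDIDATE_MD5S = (
    "b5df92d1c0ab0d585f4771e062a065e1",
    "1de60d54d19632022144e4e58c2637b5",
)


def md5_of_file(path, chunk_size=4096):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_cached_archive(path, candidates):
    """Return True if the file exists and its MD5 is one of the candidates."""
    if not os.path.exists(path):
        return False
    return md5_of_file(path) in candidates


if __name__ == "__main__":
    ok = check_cached_archive(CACHE_FILE, CANDIDATE_MD5S)
    print("cached archive looks valid" if ok else "cached archive missing or corrupt")
```

If the check fails for a file that was copied by hand, the copy is probably truncated in the same way as the failed download.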
@wanghaoshuang, could you help with this?