ResNeXt101_32x48d_wsl: checkpoint error when resuming training
Created by: Otis4631
Here is the error message:
----------------------
Error Message Summary:
----------------------
PaddleCheckError: Cannot open file output/ResNeXt101_32x48d_wsl/9/tmp_4 for load op at [/paddle/paddle/fluid/operators/load_op.h:37]
[operator < load > error]
Here is the configuration:
##Training details
export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
export FLAGS_fast_eager_deletion_mode=1
export FLAGS_eager_delete_tensor_gb=0.0
export FLAGS_fraction_of_gpu_memory_to_use=0.98
python train.py \
--model=ResNeXt101_32x48d_wsl \
--batch_size=128 \
--total_images=12000 \
--class_dim=121 \
--image_shape=3,224,224 \
--model_save_dir=output/ \
--lr_strategy=cosine_decay \
--num_epochs=260 \
--lr=0.1 \
--reader_thread=4 \
--l2_decay=1e-4 \
--checkpoint="output/ResNeXt101_32x48d_wsl/9/"
Was the tmp_4 file not written when the checkpoint was saved automatically?
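
One way to narrow this down is to check what is actually on disk in the checkpoint directory before resuming. The following is only a minimal sketch using the Python standard library: the helper name `inspect_checkpoint` is hypothetical and not part of PaddlePaddle, and the directory and file name are simply the ones from the error message above.

```python
import os


def inspect_checkpoint(checkpoint_dir, expected_name):
    """Return (found, files) for a checkpoint directory.

    found is True when expected_name is an entry in checkpoint_dir;
    files is the sorted list of entries, or [] if the directory is missing.
    """
    if not os.path.isdir(checkpoint_dir):
        return False, []
    files = sorted(os.listdir(checkpoint_dir))
    return expected_name in files, files


if __name__ == "__main__":
    # Path and file name taken from the error message; adjust as needed.
    found, files = inspect_checkpoint("output/ResNeXt101_32x48d_wsl/9", "tmp_4")
    print("tmp_4 present:", found)
    print("entries in checkpoint dir:", len(files))
```

If `tmp_4` is absent while other variable files are present, the file was never saved for that epoch; if the directory itself is missing, the `--checkpoint` path is probably being resolved relative to a different working directory.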