#25 ValueError: For 'build_searched_strategy', no such strategy file: . Please check whether the 'strategy_filename' exists.

Open
created 9 months ago by rickyzhu · 0 comments
## 问题 手动修复[前一个bug后](https://openi.pcl.ac.cn/PCL-Platform.Intelligence/pcl_pangu/issues/24),又出现了如下报错: ```python --------------------------------------------------------------------------- ValueError Traceback (most recent call last) <ipython-input-1-fa31d81797ff> in <module> 4 set_context(backend='mindspore') 5 config = alpha.model_config_npu(model='2B6',load='./model') ----> 6 alpha.inference(config,input='四川的省会是?') ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/pcl_pangu/model/alpha/alpha.py in inference(config, top_k, top_p, input, input_file, generate_max_tokens, output_file, oneCardInference) 96 config_dict['output_file'] = output_file 97 config_dict['generate_max_tokens'] = generate_max_tokens ---> 98 run_ms_inference(config_dict) 99 100 elif 'onnx-' in backend_context: ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/pcl_pangu/model/alpha/alpha.py in run_ms_inference(config_dict) 160 from inference_alpha_ms13 import opt, setup_args, main 161 new_opt = setup_args(opt, config_dict) --> 162 main(new_opt) 163 164 def run_ms_finetune_merge_OpenI(config_dict): ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/pcl_pangu/model/panguAlpha_mindspore/inference_alpha_ms13.py in main(opt) 316 """Main process for predict or export model""" 317 --> 318 model_predict, config = load_model(opt) 319 if opt.export: 320 export_mindir(model_predict, config) ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/pcl_pangu/model/panguAlpha_mindspore/inference_alpha_ms13.py in load_model(args_opt) 179 print(f"Loading from path {ckpt_file_list[0]}", flush=True) 180 print(predict_layout) --> 181 load_distributed_checkpoint(eval_net, ckpt_file_list, predict_strategy=predict_layout) 182 print("================load param ok=================", flush=True) 183 else: ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/mindspore/train/serialization.py in load_distributed_checkpoint(network, checkpoint_filenames, predict_strategy, train_strategy_filename, strict_load, dec_key, dec_mode) 1575 if train_strategy_filename is None: 1576 train_strategy_filename = context.get_auto_parallel_context("strategy_ckpt_load_file") -> 1577 _train_strategy = build_searched_strategy(train_strategy_filename) 1578 train_strategy = _convert_to_list(_train_strategy) 1579 ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/mindspore/train/serialization.py in build_searched_strategy(strategy_filename) 1416 1417 if not os.path.isfile(strategy_filename): -> 1418 raise ValueError(f"For 'build_searched_strategy', no such strategy file: {strategy_filename}. " 1419 f"Please check whether the 'strategy_filename' exists.") 1420 ValueError: For 'build_searched_strategy', no such strategy file: . Please check whether the 'strategy_filename' exists. ``` ## 症结 如报错信息所示,`inference_alpha_ms13.py`的[第180行](https://openi.pcl.ac.cn/PCL-Platform.Intelligence/pcl_pangu/src/branch/master/pcl_pangu/model/panguAlpha_mindspore/inference_alpha_ms13.py#L180)需要传入一个不为空的strategy,但由于`args_opt.strategy_load_ckpt_path`为空,所以[第148行](https://openi.pcl.ac.cn/PCL-Platform.Intelligence/pcl_pangu/src/branch/master/pcl_pangu/model/panguAlpha_mindspore/inference_alpha_ms13.py#L147)也将`predict_layout`设为`None`。 推测该项目缺少了关于pangu 2B6 的strategy文件(虽然我也不知道时啥,总之赶紧补少吧)。 尝试过把[第31行](https://openi.pcl.ac.cn/PCL-Platform.Intelligence/pcl_pangu/src/branch/master/pcl_pangu/model/panguAlpha_mindspore/inference_alpha_ms13.py#L31)的注释去掉,结果报了下面的错误。 ```shell In [2]: from pcl_pangu.context import set_context ...: from pcl_pangu.model import alpha, evolution, mPangu ...: ...: set_context(backend='mindspore') ...: config = alpha.model_config_npu(model='2B6',load='./model') ...: alpha.inference(config,input='四川的省会是?') /home/ma-user/anaconda3/envs/MindSpore/lib/python3.7/site-packages/requests/__init__.py:91: RequestsDependencyWarning: urllib3 (1.26.10) or chardet (3.0.4) doesn't match a supported version! RequestsDependencyWarning) --------------------------- inference config -------------------------- > Base Model: [alpha] > Model Size: [2B6] > global batch_size: 8 > generate_max_tokens length: 128 ---------------------------- end of config ---------------------------- --------------------------------------------------------------------------- ImportError Traceback (most recent call last) <ipython-input-2-fa31d81797ff> in <module> 4 set_context(backend='mindspore') 5 config = alpha.model_config_npu(model='2B6',load='./model') ----> 6 alpha.inference(config,input='四川的省会是?') ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/pcl_pangu/model/alpha/alpha.py in inference(config, top_k, top_p, input, input_file, generate_max_tokens, output_file, oneCardInference) 96 config_dict['output_file'] = output_file 97 config_dict['generate_max_tokens'] = generate_max_tokens ---> 98 run_ms_inference(config_dict) 99 100 elif 'onnx-' in backend_context: ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/pcl_pangu/model/alpha/alpha.py in run_ms_inference(config_dict) 158 current_dir = os.path.abspath(os.path.join(os.path.dirname(__file__), os.path.pardir)) 159 sys.path.append(current_dir + '/panguAlpha_mindspore') --> 160 from inference_alpha_ms13 import opt, setup_args, main 161 new_opt = setup_args(opt, config_dict) 162 main(new_opt) ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/pcl_pangu/model/panguAlpha_mindspore/inference_alpha_ms13.py in <module> 29 from mindspore.train.model import Model 30 from mindspore.train.serialization import load_checkpoint, load_param_into_net, load_distributed_checkpoint ---> 31 from src.serialization import load_distributed_checkpoint 32 from src.pangu_alpha import PanguAlpha, EvalNet 33 from src.pangu_alpha_config import PANGUALPHAConfig, set_parse ~/anaconda3/envs/MindSpore/lib/python3.7/site-packages/pcl_pangu/model/panguAlpha_mindspore/src/serialization.py in <module> 38 from mindspore.common.initializer import initializer 39 from mindspore.common.parameter import Parameter ---> 40 from mindspore.common.api import _executor 41 from mindspore.common import dtype as mstype 42 from mindspore._checkparam import check_input_data, Validator ImportError: cannot import name '_executor' from 'mindspore.common.api' (/home/ma-user/anaconda3/envs/MindSpore/lib/python3.7/site-packages/mindspore/common/api.py) ```
Sign in to join this conversation.
No Milestone
No Assignees
1 Participants
Notifications
Due Date

No due date set.

Dependencies

This issue currently doesn't have any dependencies.

Loading…
There is no content yet.