After successfully training your ML model and selecting the best run, you are about to deploy it as a web service to the production environment.
Because you anticipate that the service will have to handle a massive number of requests, you choose AKS as the compute target.
You use the following script to deploy your model:
# deploy model
from azureml.core.model import InferenceConfig, Model
from azureml.core.webservice import AciWebservice

inference_config = InferenceConfig(runtime="python",
                                   entry_script=script_file,
                                   conda_file=env_file)
deployment_config = AciWebservice.deploy_configuration(cpu_cores=1, memory_gb=4)
service_name = "fraud-detection-service"
service = Model.deploy(ws, service_name, [model], inference_config, deployment_config)
service.wait_for_deployment(True)
print(service.state)

Running the deployment script results in the service state "Failed". Looking at your scoring script, you suspect that something is wrong with how data is retrieved in the run() function:
# scoring script
...

def init():
    global model
    # Get the path to the deployed model file and load it
    model_path = Model.get_model_path('fraud_detection_model')
    model = joblib.load(model_path)

# Called when a request is received
def run(raw_data):
    # Get the input data as a numpy array
    data = np.array(json.loads(raw_data)['data'])
    # Get a prediction from the model
    predictions = model.predict(data)
    # Get the class names for the predictions
    classnames = ['non-fraud', 'fraud']
    predicted_classes = []
    for prediction in predictions:
        predicted_classes.append(classnames[prediction])
    # Return the predictions as JSON
    return json.dumps(predicted_classes)

Is this the best way to localize the error?
A. Yes
B. No

Answer: B.
Option A is incorrect because the error occurred while the service was being deployed, which means that the deployment process itself failed.
Such a failure typically occurs when the get_model_path() function, which is called during deployment while the model is being initialized, cannot locate the model's path for some reason.
Therefore, you should look at the init() function first, for example at this line:
model_path = Model.get_model_path('fraud_detection_model')

Option B is CORRECT because the error occurred during the deployment of the service, which means that it is not a data-related problem (if it were, it would occur during the inference phase, while the service is running).
Since it is the deployment that has failed, the problem is not inference-related.
You'd better check the init() function.
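As a first check, a minimal sketch (assuming the SDK v1 workspace object ws from the deployment script above) is to verify that a model is actually registered under the exact name the scoring script passes to get_model_path():

from azureml.core.model import Model

# List the registered models in the workspace so you can confirm
# that 'fraud_detection_model' (the name used in init()) exists
# under that exact spelling.
for registered_model in Model.list(ws):
    print(registered_model.name, registered_model.version)

If the name does not appear in this list, get_model_path() will fail inside init() and the deployment will end up in the "Failed" state.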
No, this is not the best way to localize the error.
The "Failed" state reported by the deployment script is not sufficient to determine the exact cause of the failure. It only indicates that the service failed to deploy, without providing any information about the specific error.
The scoring script provided in the question may contain errors that could cause the deployment to fail, but that is not certain: other factors such as configuration issues, networking problems, or resource constraints could also be at play.
To properly diagnose the error, it is important to investigate the deployment logs and error messages in detail. These logs can provide more specific information about the cause of the failure, such as missing dependencies or incorrect configuration settings.
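For example (a minimal sketch, reusing the service object returned by Model.deploy in the script above), the container logs of the failed service can be pulled directly from the webservice object:

# Print the service state and the container logs; for a failed
# deployment, the logs typically contain the traceback raised by
# init(), e.g. a model-path lookup error from get_model_path().
print(service.state)
print(service.get_logs())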
In addition to examining the logs, it may also be useful to test the scoring script in isolation to ensure that it is functioning correctly. This can be done by running the script locally and passing in sample data to verify that it produces the expected output.
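A minimal local smoke test might look like the sketch below; it assumes the scoring script has been saved as score.py (a hypothetical file name) and that the model can be resolved locally, for example after downloading it from the workspace:

import json
import score  # the scoring script, saved locally as score.py (assumed name)

# init() will fail here in much the same way as in the container if
# the model cannot be located under the registered name.
score.init()

# Call run() with a small, hand-made payload in the expected format.
# The feature values below are placeholders; use your real feature shape.
sample = json.dumps({'data': [[0.1, 0.2, 0.3, 0.4]]})
print(score.run(sample))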
Once the cause of the error has been identified, appropriate measures can be taken to fix the issue and redeploy the service. This may involve modifying the scoring script, updating dependencies, or adjusting the deployment configuration.
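For instance (again a sketch against the SDK v1 API, reusing the names from the deployment script above), an existing service can be refreshed in place once the scoring script or environment has been corrected:

# Push the corrected scoring script/environment to the existing
# service instead of creating a new one.
service.update(models=[model], inference_config=inference_config)
service.wait_for_deployment(show_output=True)
print(service.state)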