Practice Set 35
Questions 341–350 (10 questions)
A car company has dealership locations in multiple cities. The company uses a machine learning (ML) recommendation system to market cars to its customers.An ML engineer trained the ML recommendation model on a dataset that includes multiple attributes about each car. The dataset includes attributes such as car brand, car type, fuel efficiency, and price.The ML engineer uses Amazon SageMaker Data Wrangler to analyze and visualize data. The ML engineer needs to identify the distribution of car prices for a specific type of car.Which type of visualization should the ML engineer use to meet these requirements? [{"voted_answers": "D", "vote_count": 2, "is_most_voted": true}]
A media company is building a computer vision model to analyze images that are on social media. The model consists of CNNs that the company trained by using images that the company stores in Amazon S3. The company used an Amazon SageMaker training job in File mode with a single Amazon EC2 On-Demand Instance.Every day, the company updates the model by using about 10,000 images that the company has collected in the last 24 hours. The company configures training with only one epoch. The company wants to speed up training and lower costs without the need to make any code changes.Which solution will meet these requirements? [{"voted_answers": "D", "vote_count": 7, "is_most_voted": true}, {"voted_answers": "B", "vote_count": 6, "is_most_voted": false}, {"voted_answers": "C", "vote_count": 1, "is_most_voted": false}, {"voted_answers": "A", "vote_count": 1, "is_most_voted": false}]
A telecommunications company has deployed a machine learning model using Amazon SageMaker. The model identifies customers who are likely to cancel their contract when calling customer service. These customers are then directed to a specialist service team. The model has been trained on historical data from multiple years relating to customer contracts and customer service interactions in a single geographic region.The company is planning to launch a new global product that will use this model. Management is concerned that the model might incorrectly direct a large number of calls from customers in regions without historical data to the specialist service team.Which approach would MOST effectively address this issue? [{"voted_answers": "A", "vote_count": 3, "is_most_voted": true}, {"voted_answers": "C", "vote_count": 1, "is_most_voted": false}]
A machine learning (ML) engineer is creating a binary classification model. The ML engineer will use the model in a highly sensitive environment.There is no cost associated with missing a positive label. However, the cost of making a false positive inference is extremely high.What is the most important metric to optimize the model for in this scenario? [{"voted_answers": "B", "vote_count": 2, "is_most_voted": true}]
An ecommerce company discovers that the search tool for the company's website is not presenting the top search results to customers. The company needs to resolve the issue so the search tool will present results that customers are most likely to want to purchase.Which solution will meet this requirement with the LEAST operational effort? [{"voted_answers": "C", "vote_count": 2, "is_most_voted": true}]
A machine learning (ML) specialist collected daily product usage data for a group of customers. The ML specialist appended customer metadata such as age and gender from an external data source.The ML specialist wants to understand product usage patterns for each day of the week for customers in specific age groups. The ML specialist creates two categorical features named dayofweek and binned_age, respectively.Which approach should the ML specialist use discover the relationship between the two new categorical features? [{"voted_answers": "B", "vote_count": 2, "is_most_voted": true}, {"voted_answers": "A", "vote_count": 1, "is_most_voted": false}]
A company needs to develop a model that uses a machine learning (ML) model for risk analysis. An ML engineer needs to evaluate the contribution each feature of a training dataset makes to the prediction of the target variable before the ML engineer selects features.How should the ML engineer predict the contribution of each feature? [{"voted_answers": "B", "vote_count": 3, "is_most_voted": true}, {"voted_answers": "A", "vote_count": 2, "is_most_voted": false}, {"voted_answers": "D", "vote_count": 1, "is_most_voted": false}]
A company is building a predictive maintenance system using real-time data from devices on remote sites. There is no AWS Direct Connect connection or VPN connection between the sites and the company's VPC. The data needs to be ingested in real time from the devices into Amazon S3.Transformation is needed to convert the raw data into clean .csv data to be fed into the machine learning (ML) model. The transformation needs to happen during the ingestion process. When transformation fails, the records need to be stored in a specific location in Amazon S3 for human review. The raw data before transformation also needs to be stored in Amazon S3.How should an ML specialist architect the solution to meet these requirements with the LEAST effort? [{"voted_answers": "A", "vote_count": 5, "is_most_voted": true}, {"voted_answers": "C", "vote_count": 2, "is_most_voted": false}, {"voted_answers": "D", "vote_count": 1, "is_most_voted": false}]
A company wants to use machine learning (ML) to improve its customer churn prediction model. The company stores data in an Amazon Redshift data warehouse.A data science team wants to use Amazon Redshift machine learning (Amazon Redshift ML) to build a model and run predictions for new data directly within the data warehouse.Which combination of steps should the company take to use Amazon Redshift ML to meet these requirements? (Choose three.) [{"voted_answers": "ACF", "vote_count": 4, "is_most_voted": true}]
A company’s machine learning (ML) team needs to build a system that can detect whether people in a collection of images are wearing the company’s logo. The company has a set of labeled training data.Which algorithm should the ML team use to meet this requirement? [{"voted_answers": "D", "vote_count": 2, "is_most_voted": true}]