机器学习专家使用AWS KMS将数据集上传到受服务器端加密保护的Amazon S3存储桶。ML专家应该如何定义Amazon SageMaker笔记本实例,以便它可以从Amazon S3读取相同的数据集?
一家房地产公司希望创建一个基于历史数据集预测房价的机器学习模型。数据集包含32个特征。哪种模式可以满足业务需求?
一家公司希望根据现有的历史销售数据预测房屋的销售价格。公司数据集中的目标变量是销售价格。这些特征包括地块大小、居住面积测量值、非居住面积测量、卧室数量、浴室数量、建造年份和邮政编码等参数。该公司希望使用多变量线性回归来预测房屋销售价格。机器学习专家应该采取哪一步来删除与分析无关的特征并降低模型的复杂性?
一家卡车运输公司正在全球范围内收集其卡车车队的实时图像数据。数据正在快速增长,每天生成大约100 GB的新数据。该公司希望探索机器学习用例,同时确保数据只能由特定IAM用户访问。哪个存储选项提供了最大的处理灵活性,并允许IAM进行访问控制?
A machine learning specialist works for a fruit processing company and needs to build a system that categorizes apples into three types. The specialist has collected a dataset that contains 150 images for each type of apple and applied transfer learning on a neural network that was pretrained on ImageNet with this dataset. The company requires at least 85% accuracy to make use of the model. After an exhaustive grid search, the optimal hyperparameters produced the following: ✑ 68% accuracy on the training set ✑ 67% accuracy on the validation set What can the machine learning specialist do to improve the system's accuracy?