100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Exam (elaborations)

AWS Certified Machine Exam 1 With complete solutions 2024_2025

Rating
-
Sold
-
Pages
22
Grade
A+
Uploaded on
02-10-2024
Written in
2024/2025

AWS Certified Machine Exam 1 With complete solutions 2024_2025

Institution
AWS Database Speciality
Course
AWS Database speciality










Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
AWS Database speciality
Course
AWS Database speciality

Document information

Uploaded on
October 2, 2024
Number of pages
22
Written in
2024/2025
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

Content preview

AWS Certified Machine Exam 1 With
complete solutions 2024/2025




A Data Scientist observes oscillations in training accuracy while doing mini-batch
training on a neural network for a classification task.Which of the following is the
MOST LIKELY CAUSE of this problem?
A. The class distribution in the dataset is imbalanced.
B. Dataset shuffling is disabled.
C. The batch size is too big.
D. The learning rate is very high. - ANSWER-D. The learning rate is very high.

Which common parameters MUST be given when submitting Amazon SageMaker
training tasks that use one of the built-in algorithms? (Select three.)
A. The training channel identifying the location of training data on an Amazon S3
bucket.
B. The validation channel identifying the location of validation data on an Amazon
S3 bucket.
C. The IAM role that Amazon SageMaker can assume to perform tasks on behalf
of the users.
D. Hyperparameters in a JSON array as documented for the algorithm used.
E. The Amazon EC2 instance class specifying whether training will be run using
CPU or GPU.
F. The output path specifying where on an Amazon S3 bucket the trained model
will persist. - ANSWER-C. The IAM role that Amazon SageMaker can assume to
perform tasks on behalf of the users.
E. The Amazon EC2 instance class specifying whether training will be run using
CPU or GPU.
F. The output path specifying where on an Amazon S3 bucket the trained model
will persist.

,What is the real class frequency for Romance and the anticipated class frequency
for Adventure given the following confusion matrix for a movie classification
model?
A. The true class frequency for Romance is 77.56% and the predicted class
frequency for Adventure is 20.85%
B. The true class frequency for Romance is 57.92% and the predicted class
frequency for Adventure is 13.12%
C. The true class frequency for Romance is 0.78 and the predicted class
frequency for Adventure is (0.47-0.32)
D. The true class frequency for Romance is 77.56% * 0.78 and the predicted class
frequency for Adventure is 20.85%*0.32 - ANSWER-B. The true class frequency
for Romance is 57.92% and the predicted class frequency for Adventure is 13.12%

A retail chain has been utilizing Amazon Kinesis Data Firehose to ingest
purchase details from its network of 20,000 outlets into Amazon S3. To facilitate
the training of a more advanced machine learning model, training data will need
additional but straightforward transformations, and certain characteristics will be
merged. Daily retraining of the model is required.
Which update will take the LEAST amount of development work, given the vast
number of stores and historical data ingestion?
A. Require that the stores to switch to capturing their data locally on AWS
Storage Gateway for loading into Amazon S3, then use AWS Glue to do the
transformation.
B. Deploy an Amazon EMR cluster running Apache Spark with the transformation
logic, and have the cluster run each day on the accumulating records in Amazon
S3, outputting new/transformed records to Amazon S3.
C. Spin up a fleet of Amazon EC2 instances with the transformat - ANSWER-D.
Insert an Amazon Kinesis Data Analytics stream downstream of the Kinesis Data
Firehose stream that transforms raw record attributes into simple transformed
values using SQL.

A data scientist conducts data exploration and analysis using an Amazon
SageMaker notebook instance. This involves installing some Python packages on
the notebook instance that are not natively accessible on Amazon SageMaker.
How can a machine learning professional guarantee that the data scientist's
essential packages are automatically accessible on the notebook instance?
A. Install AWS Systems Manager Agent on the underlying Amazon EC2 instance
and use Systems Manager Automation to execute the package installation
commands.

, B. Create a Jupyter notebook file (.ipynb) with cells containing the package
installation commands to execute and place the file under the /etc/init directory of
each Amazon SageMaker notebook instance.
C. Use the conda package manager from within the Jupyter notebook console to
apply the necessary conda packages to the default kernel of the notebook.
D. Create an Amazon SageMaker lifecycle conf - ANSWER-D. Create an Amazon
SageMaker lifecycle configuration with package installation commands and
assign the lifecycle configuration to the notebook instance.

A web-based business wishes to increase conversions on its landing page. The
business developed a multi-class deep learning network algorithm using Amazon
SageMaker regularly using a big historical dataset of client visits. However, there
is an overfitting issue: training data indicates a prediction accuracy of 90%,
whereas test data indicates only a prediction accuracy of 70%.
The organization has to increase the generalizability of its model prior to putting
it in production in order to optimize visit-to-purchase conversions.
Which activity is advised to ensure that the company's test and validation data is
modelled with the HIGHEST degree of accuracy possible?
A. Increase the randomization of training data in the mini-batches used in training
B. Allocate a higher proportion of the overall data to the training dataset
C. Apply L1 or L2 regularization and dropouts to the training
D. Reduce the number of layers and u - ANSWER-C. Apply L1 or L2 regularization
and dropouts to the training

A business evaluates the risk variables associated with a specific energy sector
using a long short-term memory (LSTM) model. The program analyzes multi-page
text documents and categorizes each phrase as either posing a danger or posing
no risk. The model is underperforming, despite the Data Scientist's extensive
experimentation with several network architectures and tuning of the associated
hyperparameters.Which technique will result in the MAXIMUM increase in
performance?
A. Initialize the words by term frequency-inverse document frequency (TF-IDF)
vectors pretrained on a large collection of news articles related to the energy
sector.
B. Use gated recurrent units (GRUs) instead of LSTM and run the training process
until the validation loss stops decreasing.
C. Reduce the learning rate and run the training process until the training loss
stops decreasing.
D. Initialize the words by word2vec embeddings pretrained on - ANSWER-D.
Initialize the words by word2vec embeddings pretrained on a large collection of
news articles related to the energy sector.

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
BRAINSCAPE1 Chamberlain College Nursing
View profile
Follow You need to be logged in order to follow users or courses
Sold
122
Member since
1 year
Number of followers
14
Documents
11141
Last sold
4 days ago
download to pass in your exam

**Profile: Exam and Flashcards Sales**. **Introduction:** Welcome to my profile! I specialize in providing comprehensive exam and flashcard resources tailored to meet your educational needs. With a dedication to quality and effectiveness, I aim to assist students in achieving their academic goals with ease and confide**Services Offered:** 1. **Exam Materials:**- I offer a wide range of exam materials for various subjects and levels, including standardized tests such as SAT, ACT, GRE, GMAT, TOEFL, and more- These materials are meticulously crafted to cover all exam topics comprehensively, ensuring thorough preparation and confidence on test day. 2. **Flashcards:** - My collection of flashcards is designed to facilitate efficient learning and retention of key concepts. - Each set of flashcards is carefully curated to highlight essential information, making studying more manageable and effective. **Why Choose Me:** 1. **Quality Assurance:** - I prioritize quality in all my products, ensuring accuracy, relevance, and reliability. - Every exam material and flashcard set undergoes rigorous review and updating to reflect the latest changes in curriculum and exam formats. 2. **User-Friendly Resources:** - My resources are user-friendly, featuring clear formatting, concise explanations, and intuitive organization to enhance the learning experience. - Whether you're a visual learner or prefer text-based study aids, my materials cater to diverse learning preferences. 3. **Affordability:** - I believe that access to quality educational resources should not be cost-prohibitive. Thus, I offer competitive pricing without compromising on quality.

Read more Read less
4.4

19 reviews

5
12
4
4
3
2
2
0
1
1

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions