If the distribution of the height of American men is approximately normal, with a mean of 69 inches
and a standard deviation of 2.5 inches, then roughly 68 percent of American men have heights
between ____ and ____.
B
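The interval asked for above follows from the empirical rule: about 68 percent of a normal distribution lies within one standard deviation of the mean. A minimal sketch of that arithmetic, using Python's standard-library statistics.NormalDist and the mean and standard deviation stated in the question (the variable names are illustrative, not part of the original):

```python
from statistics import NormalDist

# Parameters taken from the question: mean 69 inches, standard deviation 2.5 inches.
heights = NormalDist(mu=69, sigma=2.5)

# One standard deviation on either side of the mean.
lower = heights.mean - heights.stdev
upper = heights.mean + heights.stdev

# Probability mass between the two bounds.
coverage = heights.cdf(upper) - heights.cdf(lower)

print(lower, upper)        # 66.5 71.5
print(round(coverage, 4))  # 0.6827
```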
Which two properties hold true for standardized variables (also known as z-score normalization)?
(Choose two.)
A. standard deviation = 0.5
B. expected value = 0
C. expected value = 0.5
D. expected value = 1
E. standard deviation = 1
BE
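Standardizing a variable means subtracting its mean and dividing by its standard deviation, so the result has expected value 0 and standard deviation 1. A small sketch that checks this numerically, assuming a made-up sample and a hypothetical helper named standardize:

```python
from statistics import fmean, pstdev

def standardize(values):
    """Return z-scores: (x - mean) / standard deviation.

    Assumes the values are not all identical (standard deviation > 0).
    """
    mu = fmean(values)
    sigma = pstdev(values)  # population standard deviation
    return [(x - mu) / sigma for x in values]

sample = [61.0, 66.5, 69.0, 71.5, 77.0]  # hypothetical data
z = standardize(sample)

print(fmean(z))   # approximately 0
print(pstdev(z))  # approximately 1
```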
What is the main difference between traditional programming and machine learning?
D
What is the name of the design thinking work product that contains a summary description of a
particular person or role?
A
Reference: https://www.interaction-design.org/literature/topics/design-thinking
What are two methods used to detect outliers in structured data? (Choose two.)
BD
Reference: https://www.researchgate.net/post/What-is-the-best-outliers-detection-algorithm-to-used-for-big-data
A classification task has examples that are labeled as belonging to one of two classes:
90% of the examples belong to class-1
10% belong to class-2
Which two techniques are appropriate to deal with the class imbalance? (Choose two.)
BE
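The answer options for this question are not reproduced here, so the following is only an illustration of one widely used way to handle a 90/10 split: random oversampling of the minority class until the classes are the same size. The function name oversample_minority and the toy data are assumptions made for this sketch.

```python
import random

def oversample_minority(examples, labels, seed=0):
    """Duplicate randomly chosen minority examples until all classes match the largest."""
    rng = random.Random(seed)

    # Group examples by class label.
    by_class = {}
    for x, y in zip(examples, labels):
        by_class.setdefault(y, []).append(x)

    # Every class is grown to the size of the largest class.
    target = max(len(items) for items in by_class.values())

    out_x, out_y = [], []
    for y, items in by_class.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        for x in items + extra:
            out_x.append(x)
            out_y.append(y)
    return out_x, out_y

# Toy data set mirroring the question: 90% class-1, 10% class-2.
labels = ["class-1"] * 90 + ["class-2"] * 10
examples = list(range(100))

bal_x, bal_y = oversample_minority(examples, labels)
print(bal_y.count("class-1"), bal_y.count("class-2"))  # 90 90
```

Oversampling should be applied to the training split only; duplicating examples before a train/test split would leak copies into the evaluation data.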
What is meant by the curse of dimensionality?
B
The least squares optimization technique (The Method of Least Squares) is used in which algorithm?
D
Reference: https://arxiv.org/ftp/arxiv/papers/1804/1804.05665.pdf
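The answer options are not reproduced here; the sketch below assumes the textbook case of simple linear regression, where the method of least squares chooses the slope and intercept that minimize the sum of squared residuals. The helper name fit_line and the data points are made up for illustration.

```python
from statistics import fmean

def fit_line(xs, ys):
    """Closed-form least squares fit of y = slope * x + intercept."""
    x_bar, y_bar = fmean(xs), fmean(ys)
    # Sum of cross-deviations and sum of squared x-deviations.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

# Hypothetical points lying exactly on y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(xs, ys)
print(slope, intercept)  # 2.0 1.0
```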
What are three elements that are typically part of a machine learning pipeline in scikit-learn or
pyspark? (Choose three.)
BCF
Reference: https://www.analyticsvidhya.com/blog/2019/11/build-machine-learning-pipelines-pyspark/
Which statement is true for naive Bayes?
C
Reference: http://users.sussex.ac.uk/~christ/crs/ml/lec02b.html
Select the three computing languages that IBM Cloud Object Storage SDK supports. (Choose three.)
ABE
Reference: https://cloud.ibm.com/docs/cloud-object-storage?topic=cloud-object-storage-gs-dev
DRAG DROP
What is the best step-by-step order for a machine learning pipeline?
What are the various components that make up time series data?
D
An ML application is deployed using Kubernetes, and its output depends on data that is constantly
stored in the model. If the system needs to scale based on available CPUs, what feature should be
enabled?
A
Which one is the most appropriate use case for artificial intelligence (AI)?
C