You are creating an Oracle Cloud Infrastructure (OCI) Data Science job that will run on a recurring basis in a production environment. This job will pick up sensitive data from an Object Storage bucket, train a model, and save it to the model catalog. How would you design the authentication mechanism for the job?
Which Oracle Cloud Infrastructure (OCI) service should you use to create and run Spark
applications using ADS?
You are asked to prepare data for a custom-built model that requires transcribing Spanish video recordings into a readable text format with profane words identified. Which Oracle Cloud service would you use?
Six months ago, you created and deployed a model that predicts customer churn for a call
centre. Initially, it was yielding quality predictions. However, over the last two months, users are
questioning the credibility of the predictions.
Which two methods would you employ to verify the accuracy of the model?
You want to use ADSTuner to tune the hyperparameters of a supported model you recently
trained. You have just started your search and want to reduce the computational cost as well as
access the quality of the model class that you are using.
What is the most appropriate search space strategy to choose?
Select two reasons why it is important to rotate encryption keys when using Oracle Cloud
Infrastructure (OCI) Vault to store credentials or other secrets.
You are a data scientist leveraging Oracle Cloud Infrastructure (OCI) Data Science to create a
model and need some additional Python libraries for processing genome sequencing data. Which of
the following THREE statements are correct with respect to installing additional Python libraries to
process the data?
You realize that your model deployment is about to reach its utilization limit. What would you do to avoid the issue before requests start to fail?
After you have created and opened a notebook session, you want to use the Accelerated Data
Science (ADS) SDK to access your data and get started with an exploratory data analysis.
From which two places can you access or install the ADS SDK?
While reviewing your data, you discover that your data set has a class imbalance. You are aware
that the Accelerated Data Science (ADS) SDK provides multiple built-in automatic transformation
tools for data set transformation. Which would be the right tool to correct any imbalance between
the classes?
You have just received a new data set from a colleague. You want to quickly find out summary
information about the data set, such as the types of features, the total number of observations, and
distributions of the data. Which Accelerated Data Science (ADS) SDK method from the ADSDataset
class would you use?
While reviewing your data, you discover that your data set has a class imbalance. You are aware that the Accelerated Data Science (ADS) SDK provides multiple built-in automatic transformation tools for data set transformation. Which would be the right tool to correct any imbalance between the classes?
You train a model to predict housing prices for your city. Which two metrics from the
Accelerated Data Science (ADS) ADSEvaluator class can you use to evaluate the regression model?
Youare a data scientist working for a manufacturing company. You have developed a forecasting
model to predict the sales demand in the upcoming months. You created a model artifact that
contained custom logic requiring third party libraries. When you deployed the model, it failed to run
because you did not include all the third party dependencies in the model artifact. What file should
be modified to include the missing libraries?
You want to ensure that all stdout and stderr from your code are automatically collected and
logged, without implementing additional logging in your code. How would you achieve this with Data
Science Jobs?
You are working as a data scientist for a healthcare company. They decide to analyze the data to
find patterns in a large volume of electronic medical records. You are asked to build a PySpark
solution to analyze these records in a JupyterLab notebook. What is the order of recommended
steps to develop a PySpark application in Oracle Cloud Infrastructure (OCI) Data Science?
You have created a Data Science project in a compartment called Development and shared it
with a group of collaborators. You now need to move the project to a different compartment called
Production after completing the current development iteration.
Which statement is correct?
You have a complex Python code project that could benefit from using Data Science Jobs as it is a
repeatable machine learning model training task. The project contains many subfolders and classes.
What is the best way to run this project as a Job?
Which of the following TWO non-open source JupyterLab extensions has Oracle Cloud In-frastructure (OCI) Data Science developed and added to the notebook session experience?
You have received machine learning model training code, without clear information about the
optimal shape to run the training. How would you proceed to identify the optimal compute shape
for your model training that provides a balanced cost and processing time?
3. When preparing your model artifact to save it to the Oracle Cloud Infrastructure (OCI) Data
Science model catalog, you create a score.py file. What is the purpose of the score.py file?