A00-240 SAS Statistical Business Analysis Using SAS 9: Regression and Modeling Questions and Answers

Question # 4

While building a predictive model, median imputations are performed while preparing the training data.

How should the imputations be addressed in the validation data?

A.

The imputed values are irrelevant to the validation data, and are not used.

B.

The imputed values must be applied directly to the validation data without recalculation.

C.

The imputed values must be recalculated using the validation data.

D.

The imputed values must be recalculated using both the training and the validation data.
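As a rough illustration of the usual honest-assessment practice, the sketch below computes medians on the training rows only and then reuses those stored values to fill gaps in the validation rows. The column name `income`, the helper names, and the sample values are all hypothetical.

```python
from statistics import median

def fit_medians(rows, columns):
    """Compute per-column medians from the non-missing training values."""
    return {c: median(r[c] for r in rows if r[c] is not None) for c in columns}

def apply_medians(rows, medians):
    """Return copies of the rows with missing values replaced by stored medians."""
    return [
        {c: (medians[c] if v is None and c in medians else v) for c, v in r.items()}
        for r in rows
    ]

# Hypothetical data: None marks a missing value.
train = [{"income": 40.0}, {"income": None}, {"income": 60.0}, {"income": 50.0}]
valid = [{"income": None}, {"income": 90.0}]

# Medians come from the training data only.
train_medians = fit_medians(train, ["income"])

# The same stored medians are applied to both partitions; nothing is
# recalculated from the validation rows.
train_imputed = apply_medians(train, train_medians)
valid_imputed = apply_medians(valid, train_medians)
```

Keeping the fit step separate from the apply step means the validation data never influences the imputation values, so it remains a fair stand-in for future data.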

Question # 5

Refer to the exhibit:

SAS output from the RSQUARE selection method, within the REG procedure, is shown. The top two models in each subset are given.

Based on the AIC statistic, which model is the champion model?

A.

Age Weight RunTime RunPulse MaxPulse

B.

Age Weight RunTime RunPulse RestPulse MaxPulse

C.

RestPulse

D.

RunTime

Question # 6

Assume a $10 cost for soliciting a non-responder and a $200 profit for soliciting a responder. The logistic regression model gives a probability score named P_R on a SAS data set called VALID. The VALID data set contains the responder variable Pinch, a 1/0 variable coded as 1 for responder. Customers will be solicited when their probability score is more than 0.05.

Which SAS program computes the profit for each customer in the data set VALID?

A.

Option A

B.

Option B

C.

Option C

D.

Option D
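The four SAS programs are not reproduced here, so the following Python sketch only restates the profit rule described in the question. It assumes that a customer who is not solicited contributes zero profit, which the question does not state explicitly; the function and parameter names are hypothetical.

```python
def customer_profit(p_r, responder, cutoff=0.05,
                    profit_responder=200.0, cost_nonresponder=10.0):
    """Profit for one customer under the solicitation rule in the question.

    p_r        -- model probability score
    responder  -- 1 for responder, 0 for non-responder
    Assumption: customers at or below the cutoff are not solicited
    and therefore contribute 0.
    """
    if p_r > cutoff:  # strictly greater: "more than 0.05"
        return profit_responder if responder == 1 else -cost_nonresponder
    return 0.0

# Hypothetical (score, responder) pairs standing in for rows of VALID.
rows = [(0.10, 1), (0.10, 0), (0.05, 1), (0.01, 0)]
profits = [customer_profit(p, r) for p, r in rows]
```

Note the strict inequality: a score of exactly 0.05 does not trigger a solicitation.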

Question # 7

Refer to the confusion matrix:

Calculate the accuracy and error rate (0 = negative outcome, 1 = positive outcome).

A.

Accuracy = 58/102, Error Rate = 23/48

B.

Accuracy = 83/102, Error Rate = 67/102

C.

Accuracy = 25/150, Error Rate = 44/150

D.

Accuracy = 83/150, Error Rate = 67/150
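The confusion matrix itself is missing from this page, so the sketch below uses made-up cell counts purely to show the arithmetic: accuracy is the share of correct classifications (true negatives plus true positives) out of all observations, and the error rate is its complement.

```python
from fractions import Fraction

def accuracy_and_error(tn, fp, fn, tp):
    """Return (accuracy, error_rate) as exact fractions from the four cell counts."""
    total = tn + fp + fn + tp
    accuracy = Fraction(tn + tp, total)   # correct predictions / all observations
    error_rate = 1 - accuracy             # misclassifications / all observations
    return accuracy, error_rate

# Hypothetical counts, NOT the values from the missing exhibit.
acc, err = accuracy_and_error(tn=50, fp=10, fn=15, tp=25)
```

Both rates share the same denominator (the total number of observations), and the two always sum to 1.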

Question # 8

Identify the correct SAS program for fitting a multiple linear regression model with dependent variable (y) and four predictor variables (x1-x4).

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Question # 9

Refer to the following odds ratio table:

What is a correct interpretation of the estimate?

A.

The odds of the event are 1.142 greater for each one dollar increase in salary.

B.

The odds of the event are 1.142 greater for each one thousand dollar increase in salary.

C.

The probability of the event is 1.142 greater for each one dollar increase in salary.

D.

The probability of the event is 1.142 greater for each one thousand dollar increase in salary.
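The odds ratio table is not shown here, so the unit of the salary variable cannot be read from this page. The sketch below takes the estimate 1.142 from the answer choices and illustrates two general facts: an odds ratio multiplies the odds (not the probability) per one-unit increase in the predictor, and it compounds over several units. The function names and the baseline probability of 0.5 are hypothetical.

```python
import math

def odds_multiplier(odds_ratio_per_unit, units):
    """Factor by which the odds change after `units` one-unit increases."""
    beta = math.log(odds_ratio_per_unit)   # logistic regression coefficient
    return math.exp(beta * units)

def probability_after(p, odds_ratio):
    """Convert a baseline probability to odds, scale the odds, convert back."""
    odds = p / (1 - p) * odds_ratio
    return odds / (1 + odds)

one_unit = odds_multiplier(1.142, 1)    # odds scaled by 1.142
two_units = odds_multiplier(1.142, 2)   # odds scaled by 1.142 squared
new_p = probability_after(0.5, 1.142)   # probability moves by less than a factor of 1.142
```

What "one unit" means depends on how the predictor was coded in the model, which is exactly what the missing exhibit would show.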

Question # 10

Refer to the ROC curve:

As you move along the curve, what changes?

A.

The priors in the population

B.

The true negative rate in the population

C.

The proportion of events in the training data

D.

The probability cutoff for scoring
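To make the relationship between an ROC curve and the scoring cutoff concrete, the sketch below computes one (false positive rate, true positive rate) point per cutoff on a small set of invented scores and labels. The data and the function name are hypothetical; the data set and the model stay fixed while only the cutoff varies.

```python
def roc_point(scores, labels, cutoff):
    """Return (false positive rate, true positive rate) at a given cutoff."""
    tp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 1)
    fp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 0)
    positives = sum(labels)
    negatives = len(labels) - positives
    return fp / negatives, tp / positives

# Hypothetical scored validation data.
scores = [0.9, 0.8, 0.6, 0.4, 0.3, 0.1]
labels = [1, 1, 0, 1, 0, 0]

# Each cutoff yields one point on the curve for the same data.
strict = roc_point(scores, labels, 0.85)
middle = roc_point(scores, labels, 0.50)
loose = roc_point(scores, labels, 0.20)
```

Lowering the cutoff classifies more cases as positive, which moves the point up and to the right along the curve.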

Question # 11

This question will ask you to provide a missing option.

Given the following SAS program:

What option must be added to the program to obtain a data set containing Spearman statistics?

A.

OUTCORR=estimates

B.

OUTS=estimates

C.

OUT=estimates

D.

OUTPUT=estimates

Question # 12

Spearman statistics in the CORR procedure are useful for screening for irrelevant variables by investigating the association between which function of the input variables?

A.

Concordant and discordant pairs of ranked observations

B.

Logit link (log(p/(1-p)))

C.

Rank-ordered values of the variables

D.

Weighted sum of chi-square statistics for 2x2 tables
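For reference, the Spearman correlation can be computed as the Pearson correlation of the rank-transformed values, with tied values given their average rank. The sketch below is a plain-Python illustration of that definition, not a reproduction of what PROC CORR does internally; the helper names and sample data are hypothetical.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, assigning tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# A monotone but nonlinear relationship (y = x cubed).
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]
rho = spearman(x, y)
r = pearson(x, y)
```

Because only the ordering matters, a monotone nonlinear relationship gives a Spearman value of 1 (up to rounding) even though the Pearson value on the raw data is lower.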

Question # 13

A marketing analyst assessed the effect of web page design (A, B, or C) on customers' intent to purchase an expensive product. The focus group was divided randomly into three sub-groups, each of which was asked to view one of the web pages and then give their intent to purchase on a scale from 0 to 100. The analyst also asked the customers to give their income, which was coded as: I (lowest), II (medium), or III (highest). After analyzing the data, the analyst claimed that there was significant interaction and the webpage design mainly influenced high income people.

Which graph supports the analyst's conclusion?

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Question # 14

What is a drawback to performing data cleansing (imputation, transformations, etc.) on raw data prior to partitioning the data for honest assessment as opposed to performing the data cleansing after partitioning the data?

A.

It violates assumptions of the model.

B.

It requires extra computational effort and time.

C.

It omits the training (and test) data sets from the benefits of the cleansing methods.

D.

There is no ability to compare the effectiveness of different cleansing methods.
