An analyst reviews the following data:
7
3
5
2
3
7
7
10
Which of the following is the value of the mode?
A data analyst needs to calculate the mean for Q1 sales using the data set below:
Which of the following is the mean?
Daniel is using the structured Query language to work with data stored in relational database.
He would like to add several new rows to a database table.
What command should he use?
A publishing group has requested a dashboard to track submissions before publication. A key requirement is that all changes are tracked, as multiple users will be checking out documents and editing them before submissions are considered final. Which of the following is the BEST way to meet this stakeholder requirement?
Randy scored 76 on a math test, Katie scored 86 on a science test, Ralph scored 80 on a history test, and Jean scored 80 on an English test. The table below contains the mean and standard deviation of the scores for each of the courses:
Using this information, which of the following students had the BEST score?
Which of the following data manipulation techniques should an analyst use to hide unnecessary data during analysis?
Kelly wants to get feedback on the final draft of a strategic report that has taken her six months to develop.
What can she do to get prevent confusion as see seeks feedback before publishing the report?
Choose the best answer.
A data analyst is developing a data dictionary that aligns with a company's data management processes and policies. Which of the following best describes what should be included in the data dictionary?
Which of the following report types is most appropriate for a high-level, year-end report requested by a Chief Executive Officer?
A data analyst is using a two-tailed, independent t-test to determine whether the type of stretching, dynamic or static, has any influence on a dancer's flexibility. Which of the following is the alternative hypothesis?
Which of the following programming languages are best suited for analysis and machine-learning applications? (Select two).
Which of the following query statements would be used when filtering data in a relational database management system? (Select two).
A sales analyst needs to report how the sales team is performing to target. Which of the following files will be important in determining 2019 performance attainment?
Which of the following can be used to translate data into another form so it can only be read by a user who has a key or a password?
Which of the following is a domain-specific language used in programming that is designed for managing data that is held in a relational data stream management system?
Which of the following types of analysis is used when comparing last week's sales to the previous week's sales?
An analyst collected data that includes primary account numbers, expiration dates, and service codes. Which of the following data governance classifications is used to describe this data?
The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company's year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?
Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?
Consider this dataset showing the retirement age of 11 people, in whole years:
54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60
This tables show a simple frequency distribution of the retirement age data.
What category of data stewardship work is focused on ensuring that the organization respects the wishes of data subjects?
What analytics suite is offered by Microsoft and directly integrates with SQL Server Databases?
A data analyst needs to write a SOL query measuring last month's website visits and distribute a summary report to the marketing team. Which of the following is the analyst creating?
A marketing analytics team received customer transaction data from two different sources. The data is complete and accurate; however, the field names appear to be inconsistent. Given the following tables:
Which of the following is considered best practice if the team wants to consolidate the files and conduct further analysis?
An analyst wants to extract data from a variety of sources and store the data in a cloud-based environment prior to cleaning. Which of the following integration techniques should the analyst use?
Given the following grocery store orders:
If a query is made to the table with the following logic:
Order_Total > 132 OR (Order Total >= 25 AND Order_Total < 74)
Which of the following is the number of orders that will be returned by the query?
A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be the MOST efficient way to deliver this report?
Encryption is a mechanism for protecting data.
When should encryption be applied to data?
Choose the best answer.
A report is scheduled to run and be distributed at the end of business each day. On Mondays, one of the recipients opens the previous week's reports and combines them to calculate the weekly totals and projections for the coming week. This is a tedious process, and the recipient asks an analyst for help. Which of the following should the analyst recommend?
A data analyst needs to create a data visualization that aids in un the cumulative impact of sequentially introduced values that are positive or negative. Which of the following
data visualization methods should the analyst use?
Standardized tests are given to students in the middle of each month, and the results are ready by the end of the month. The superintendent needs a quick view of test performance. Which of the following would be the best recommendation to meet the superintendent's requirements?
Given the following data tables:
Which of the following MDM processes needs to take place FIRST?
An analyst is working with the income data of suburban families in the United States. The data set has a lot of outliers, and the analyst needs to provide a measure that represents the typical income. Which of the following would BEST fulfill the analyst’s goal?
A data analyst has a set with more than 40.000 rows in the sample schema below:
The analyst would like to create one column that contains the customers’ birth dates. Which of the following data quality dimensions would BEST explain the reason for compilation?
An analyst has received the requirements for an internal user dashboard. The analyst confirms the data sources and then creates a wireframe. Which of the following is the NEXT step the analyst should take in the dashboard creation process?
Given the table below:
Which of the following variables can be considered inconsistent, and how many distinct values should the variable have?
A data set for sales per month includes the following data:
Which of the following cleaning and profiling methods should be applied to the data set?
An analyst is working on a project for a director. During this process. the analyst pulled the data. created summarized tables and graphs with descriptions, created a report summary, and inserted all items into a report. After writing the report, which of the following would be the most appropriate next step?
Joe. an analyst. tests the loading time on a dashboard he is preparing to go live and finds it is slower than he would like. Which of the following must occur to decrease the loading time?
Which of the following data types must be used when working with variables that require classification into two or more groups before analysis?
An analyst has conducted a review of business questions. Which of the following should the analyst do next to conduct an analysis?
Which of the following data sampling methods involves dividing a population into subgroups by similar characteristics?
An analyst is required to run a text analysis of data that is found in articles from a digital news outlet. Which of the following would be the BEST technique for the analyst to apply to acquire the data?
A military commander would like to see the health scorecards of the troops daily and filter them based on gender and rank. Considering this data is PHI, which of the following would be the best way for the commander to view the information?
A user imports a data file into the accounts payable system each day. On a regular basis. the field input is not what the system is expecting. so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts. though. Which of the following changes should be made to this process to reduce the number of errors?
An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:
Which of the following conclusions is accurate at a 95% confidence interval?
An organization would like to add a secondary email field to its customer database in order to enrich the customer profiles. Which of the following data manipulation techniques should the analyst use to add this information?
An analyst modified a data set that had a number of issues. Given the original and modified versions:
Which of the following data manipulation techniques did the analyst use?
Which one of the following would not normally be considered a summary statistic?
A data analyst has been asked to create a daily manufacturing report for the floor manager Which of the following metrics should be included in the report?
Given the information in the following tables:
Which of the following describes merging these tables to create a master file that includes all transactions for both online and in-store sales?
The director of operations at a power company needs data to help identify where company resources should be allocated in order to monitor activity for outages and restoration of power in the entire state. Specifically, the director wants to see the following:
* County outages
* Status
* Overall trend of outages
INSTRUCTIONS:
Please, select each visualization to fit the appropriate space on the dashboard and choose an appropriate color scheme. Once you have selected all visualizations, please, select the appropriate titles and labels, if applicable. Titles and labels may be used more than once.
If at any time you would like to bring back the initial state of the simulation, please click the Reset All button.
Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?
Given the following:
Which of the following is the most important thing for an analyst to do when transforming the table for a trend analysis?
A data analyst has received a data set that contains actual and projected sales for the fourth quarter of 2019. Which of the following statistical methods should the analyst use to find the measure of dispersion?
You are working with a dataset and want to change the names of categories that you used for different types of books.
What term best describes this action?
Given the following data table:
Which of the following are appropriate reasons to undertake data cleansing? (Select two).
An analyst wants to check the progress and performance regarding the number of customers an organization served in the last six years. Which of the following represents the type of analysis the analyst should perform?
Given the customer table below:
Which of the following chart types is the most appropriate to represent the average spending of active customers vs. inactive customers?
Which of the following differentiates a flat text file from other data types?
A data analyst must separate the column shown below into multiple columns for each component of the name:
Which of the following data manipulation techniques should the analyst perform?
Which of the following reports can be used when insight into operational performance is needed each Wednesday?
An analyst has been tracking company intranet usage and has been asked to create a chat to show the most-used/most-clicked portions of a homepage that contains more than 30 links. Which of the following visualizations would BEST illustrate this information?
A development company is constructing a new Init in its apartment complex. The complex has the following floor plans:
Using the average cost per square foot of the original floor plans. which of the following should be the price of the Rose Init?
Which of the following is a best practice when updating a legacy data source?
A data engineer is creating a database field to capture whether a customer likes vanilla ice cream. Which of the following data types is the best to capture this information?
A data analyst wants to create "Income Categories" that would be calculated based on the existing variable "Income". The "Income Categories" would be as follows:
Income category 1: less than $1.
Income category 2: more than $1 and less than $20,000.
Income category 3: more than $20,001 and less than $40,000.
Income category 4: more than $40,001.
Which of the following data manipulation techniques should the data analyst use to create "Income Categories"?
An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered to BEST display the data?
Which of the following data manipulation techniques is an example of a logical function?
Given the table below:
Which of the following boxes indicates that a Type Il error has occurred?
Which of the following would be used to store unstructured data from different sources?
An analysts building a monthly report for production and wants to ensure the audience is aware of its once-a-month cadence. Which of the following is the MOST important to convey that information?
Which of the following is a process that is used during data integration to collect, blend, and load data?
Which of the following statistical methods requires two or more categorical variables?
A customer's telephone number is in the format 123-456-7890. Which of the following data types is used for the phone number?
Which of the following statements would be used to append two tables that have the same number of columns?
A sales manager wants quarterly sales reports broken down by unit and week. Which of the following data output lists includes the most necessary information?
Jenny wants to study the academic performance of undergraduate sophomores and wants to determine the average grade point average at different points during an academic year.
What best describes the data set she needs?
A gambler thinks that a coin is fair and is equally likely to turn up heads or tails when the coin is flipped. Which of the following tests should the gambler use to fest this hypothesis?
Different people manually type a series of handwritten surveys into an online database. Which of the following issues will MOST likely arise with this data? (Choose two.)