Google Data Analytics Professional Certificate Course 5: Analyze Data – quiz answers

Coursera Google Data Analytics Professional Certificate Course 5 – Analyze Data to Answer Questions quiz answers to all weekly questions (weeks 1 – 4):

  • Week 1: Organizing data to begin analysis
  • Week 2: Formatting and adjusting data
  • Week 3: Aggregating data for analysis
  • Week 4: Performing data calculations


Week 1: Organizing data to begin analysis

Organizing data makes the data easier to use in your analysis. In this part of the course, you’ll learn the importance of organizing your data through sorting and filtering. You’ll explore these processes in both spreadsheets and SQL as you continue to prepare your data for analysis.

Learning Objectives

  • Describe what is involved in the data analysis process with reference to goals and key tasks
  • Discuss the importance of organizing data before analysis with references to sorts and filters
  • Describe sorting as it relates to data in a spreadsheet or database with reference to functionality and benefits
  • Demonstrate an understanding of the steps involved in sorting and filtering data through the use of SQL queries

Answers to week 1 quiz questions

L2 Data analysis basics

Question 1

You are creating a spreadsheet that contains data about a volunteer theater production. You ask the volunteers which tasks they have already completed, then add that data to the spreadsheet. Next, you will use the information provided by the volunteers to figure out which tasks still need to be done. This is an example of which phase of analysis?

  • Formatting and adjusting data
  • Getting input from others
  • Organizing data
  • Transforming data

This is an example of getting input from others. Getting input means soliciting information from other sources to inform your decisions. Transforming data involves identifying the trends and patterns between the data.

Question 2

You are working with three datasets about voter turnout in your county. First, you identify relationships and patterns between the datasets. Then, you use formulas and functions to make calculations based on your data. This is an example of which phase of analysis?

  • Organizing data
  • Getting input from others
  • Transforming data
  • Formatting and adjusting data

This is an example of transforming data, which involves identifying relationships and patterns between the datasets and making calculations based on the data.

Question 3

You are working with a dataset from a local community college. You sort the students alphabetically by last name. This is an example of which phase of analysis?

  • Format and adjust data
  • Transform data
  • Get input from others
  • Organize data

Sorting a list of students alphabetically is an example of formatting and adjusting data. This is a step analysts take to rearrange the data to make it easier to work with.

L3 Organize data for analysis

Question 1

Fill in the blank: A data analyst uses _ to decide which data is relevant to their analysis and which data types and variables are appropriate.

  • database relationships
  • database references
  • database organization
  • database normalization

Database organization enables analysts to make decisions about which data is relevant to pull for a specific analysis. Database references let them access objects from other databases.

Question 2

You are working with a dataset that lists student athletes at a school. The Sport column designates the sport each athlete plays. Which of the following SQL queries would return only the athletes who play volleyball?

  • WHERE Sport = “Volleyball”
  • SPORT(“Volleyball”)
  • WHERE Sport = Volleyball
  • SPORT = “Volleyball”

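The clause WHERE Sport = “Volleyball” will return only athletes who play volleyball. A WHERE clause names the column, then the value to match, and text values must be enclosed in quotation marks.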
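
To see how this clause fits into a complete query, here is a minimal sketch that runs it against an in-memory SQLite database from Python. The table name athletes, the Name column, and the sample rows are assumptions made up for this illustration; they are not part of the quiz.

```python
import sqlite3

# Hypothetical table and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE athletes (Name TEXT, Sport TEXT)")
conn.executemany(
    "INSERT INTO athletes VALUES (?, ?)",
    [("Ana", "Volleyball"), ("Ben", "Soccer"), ("Cy", "Volleyball")],
)

# WHERE keeps only the rows whose Sport column equals 'Volleyball'.
volleyball_players = [
    row[0]
    for row in conn.execute(
        "SELECT Name FROM athletes WHERE Sport = 'Volleyball' ORDER BY Name"
    )
]
print(volleyball_players)  # ['Ana', 'Cy']
```

The sketch uses single quotes around the text value, which is the standard SQL form for string literals.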

L4 Sort data in spreadsheets

Question 1

Which spreadsheet menu function is used to sort all data in a spreadsheet by the ranking of a specific sorted column?

  • Sort Data
  • Sort By Rank
  • Sort Range
  • Sort Sheet

Sort Sheet is used to sort all data in a spreadsheet by the ranking of a specific sorted column.

Question 2

In spreadsheets, data analysts can sort a range from the Data tab in the menu or by typing a function directly into an empty cell.

  • True
  • False

Sorting a range and sorting a sheet can both be done from the menu and written as a function. Analysts can work from the Data tab in the menu or type a function directly into an empty cell.

Question 3

An analyst uses =SORT to sort spreadsheet data in descending order. What do they type at the end of their sort function?

  • FALSE
  • DESCEND
  • LEFT
  • REVERSE

To sort a spreadsheet in descending order, the analyst types FALSE at the end of their sort function.

L5 Sort data in SQL

Question 1

A data analyst is writing a SQL query to sort data in a column in ascending order. The column is called column_title. What is the correct syntax for their query?

  • ORDER column_title ASC
  • ORDER BY column_title
  • ORDER BY column_title DESC
  • ORDER column_title

An ORDER BY statement sorts in ascending order by default. ORDER BY column_title is the syntax for this query.

Question 2

You want to sort a database table of newly released young adult novels. Which statement sorts action novels by word count in descending order?

  • WHERE genre = “Action” ORDER BY word_count DESC
  • WHERE word_count DESC ORDER BY genre = “Action”
  • WHERE genre = “Action” WHERE word_count DESC
  • WHERE genre = “Action” SORT BY word_count DESC

The correct statement is WHERE genre = "Action" ORDER BY word_count DESC. The WHERE clause filters for action novels, and the ORDER BY clause tells the database how to organize the data it returns.
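
The combination of filtering and sorting can be tried out with a small sketch like the one below, which uses Python's built-in sqlite3 module. The novels table, its title column, and the sample rows are hypothetical and exist only for this example.

```python
import sqlite3

# Hypothetical table and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE novels (title TEXT, genre TEXT, word_count INTEGER)")
conn.executemany(
    "INSERT INTO novels VALUES (?, ?, ?)",
    [
        ("Book A", "Action", 60000),
        ("Book B", "Romance", 90000),
        ("Book C", "Action", 85000),
    ],
)

# Filter first (WHERE), then sort the remaining rows (ORDER BY ... DESC).
action_by_length = conn.execute(
    "SELECT title, word_count FROM novels "
    "WHERE genre = 'Action' ORDER BY word_count DESC"
).fetchall()
print(action_by_length)  # [('Book C', 85000), ('Book A', 60000)]
```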

Weekly challenge 1

Question 1

In the data analysis process, which of the following refers to a phase of analysis? Select all that apply.

  • Visualize the data
  • Organize data into understandable sections
  • Get input from others
  • Format data using sorts and filters

There are four phases of analysis: organize data, format and adjust data, get input from others, and transform data by observing relationships between data points and making calculations.

Question 2

During which phase of analysis can you find a correlation between two variables?

  • Format and adjust data
  • Get input from others
  • Organize data
  • Transform data

Finding a correlation between two variables occurs while transforming data.

Question 3

You are performing a calculation during your analysis of a dataset. Which phase of analysis are you in?

  • Transform data
  • Get input from others
  • Organize data
  • Format and adjust data

You are in the transform data phase of analysis. Performing calculations is part of transforming data, along with identifying relationships and patterns in the data.

Question 4

Typically, a data analyst uses filters when they want to expand the amount of data they are working with.

  • True
  • False

Typically, a data analyst uses filters when they want to narrow down the amount of data they are working with.

Question 5

A data analyst is sorting spreadsheet data. They want to make sure that, when they rearrange the data, data across rows is kept together. What technique should they use to sort the data?

  • Sort Column
  • Sort Sheet
  • Sort Together
  • Sort Rows

Sort Sheet sorts all of the data in a spreadsheet by a specific sorted column. Data across rows is kept together during the sort.

Question 6

A data analyst uses a function to sort a spreadsheet range between cells H1 and K65. They sort in ascending order by the first column, Column H. What is the syntax they are using?

  • =SORT(H1:K65, 1, TRUE)
  • =SORT(H1:K65, A, FALSE)
  • =SORT(H1:K65, A, TRUE)
  • =SORT(H1:K65, 1, FALSE)

The syntax is =SORT(H1:K65, 1, TRUE). The first part of the function sorts the data in the specified range. The 1 represents the first column. And a TRUE statement sorts in ascending order.
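
The three arguments can be mirrored in a short Python sketch. The function name sort_range and the sample rows are made up for this illustration; this is not how a spreadsheet implements SORT, only a way to see what each argument does.

```python
# Hypothetical rows standing in for a spreadsheet range.
rows = [
    ["pear", 3, "x"],
    ["apple", 7, "y"],
    ["mango", 1, "z"],
]


def sort_range(data, sort_column, is_ascending):
    """Mimic =SORT(range, sort_column, is_ascending); sort_column is 1-based."""
    return sorted(
        data,
        key=lambda row: row[sort_column - 1],
        reverse=not is_ascending,
    )


ascending = sort_range(rows, 1, True)    # like =SORT(range, 1, TRUE)
descending = sort_range(rows, 1, False)  # like =SORT(range, 1, FALSE)
print(ascending[0])   # ['apple', 7, 'y']
print(descending[0])  # ['pear', 3, 'x']
```

Each row moves as a unit, so the values in the other columns stay with their original row.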

Question 7

A data analyst is querying a database that contains data about dental equipment inventory. They are only interested in data related to cleaning products. Which of the following sections of an SQL statement would return the correct result?

  • WHERE “Cleaning”
  • WHERE product = “Cleaning”
  • ORDER BY “Cleaning”
  • ORDER BY product = “Cleaning”

The correct section is WHERE product = "Cleaning". A WHERE statement in SQL includes the name of the column, an equals sign, and the value(s) in the column to include.

Question 8

A data analyst would write the following section of a SQL query to sort Golden Retrievers, ordered by birth date, in ascending order:

WHERE Breed = "Golden Retriever" ORDER BY Birth_date

  • True
  • False

The query will return Golden Retrievers, ordered by birth date, in ascending order.

Week 2: Formatting and adjusting data

As you move closer to analyzing your data, you’ll want to have the data formatted and ready to go. In this part of the course, you’ll learn all about converting and formatting data, including how SQL queries can help you combine data. You’ll also find out the value of feedback and support from your colleagues and how it can lead to new learning that you can apply to your work.

Learning Objectives

  • Demonstrate an understanding of what is involved in the conversion and formatting of data
  • Demonstrate an understanding of the use of spreadsheets and SQL queries to combine multiple pieces of data
  • Discuss the importance of seeking feedback and support from others

Answers to week 2 quiz questions

L2 Convert and format

Question 1

A spreadsheet cell contains the coldest temperature ever recorded in New Zealand: -22 °Celsius. What function could be used to display that temperature in Fahrenheit?

  • =CONVERT(-22, C, F)
  • =CONVERT(-22, F, C)
  • =CONVERT(-22, “C”, “F”)
  • =CONVERT(-22, “F”, “C”)

=CONVERT(-22, “C”, “F”) will display -22 °C in Fahrenheit.
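
The arithmetic behind this conversion is the standard Celsius-to-Fahrenheit formula, F = C × 9/5 + 32. A minimal Python sketch (the function name is an assumption for this example):

```python
def celsius_to_fahrenheit(celsius):
    """Standard conversion: multiply by 9/5, then add 32."""
    return celsius * 9 / 5 + 32


coldest_f = celsius_to_fahrenheit(-22)
print(round(coldest_f, 1))  # -7.6
```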

Question 2

A data analyst wants to ensure spreadsheet tools continue to run correctly, even if someone enters the wrong data by mistake. Which data-validation menu option should they select?

  • Deny Help Text
  • Reject Invalid Inputs
  • Forbid Entry
  • Remove Validation

To ensure spreadsheet tools continue to run correctly, even if someone enters the wrong data by mistake, select Reject Invalid Inputs.

Question 3

A data analyst clicks on the Format Cells If drop-down menu and selects the option Text Is Exactly November. This changes the color of all the cells that contain the word November. What spreadsheet tool is the analyst using?

  • Conditional formatting
  • CONVERT
  • Filtering
  • Data validation

The data analyst is using conditional formatting. Conditional formatting is a spreadsheet tool that changes how cells appear when values meet specific conditions.

L3 Combine multiple data sets

Question 1

You are working on a project related to rental properties in the United States. You write the following query:

SELECT * FROM rentals.us_housing_units

How can you instruct the database to retrieve only the first 10 results?

  • RETRIEVE 10
  • LIMIT 10
  • RETURN 10
  • FIRST 10

To display only the first 10 results, you would type LIMIT 10.

Question 2

What function can be used to confirm that spreadsheet cell B8 contains exactly 20 characters?

  • LEN = B8,20
  • =LEN(20)
  • LEN = B8
  • =LEN(B8)

The function =LEN(B8) will display the number of characters in cell B8. The LEN function returns the length of a string of text by counting the number of characters it contains.

Weekly challenge 2

Question 1

An analyst notes that the “160” in cell A9 is formatted as text, but it should be Australian dollars. What spreadsheet tool can help them select the right format?

  • CURRENCY
  • Format as Currency
  • EXCHANGE
  • Format as Dollar

The Format as Currency tool can be used to change the text to Australian dollars.

Question 2

You are creating a spreadsheet to help you with your job search. Every time you find an interesting job, you add it to the spreadsheet. Then, you want to indicate two possible options: Need to Apply or Applied. What spreadsheet tool will save you time by enabling you to create a dropdown list with Need to Apply and Applied as the possible options?

  • Data validation
  • FIND
  • Conditional formatting
  • Pop-up menus

Data validation can be used to add drop-down lists with predetermined options for Need to Apply and Applied.

Question 3

You are using a spreadsheet to keep track of your newspaper subscriptions. You add color to indicate if a subscription is current or has expired. Which spreadsheet tool changes how cells appear when values meet each expiration date?

  • Add color
  • CONVERT
  • Data validation
  • Conditional formatting

You are using conditional formatting. Conditional formatting changes how cells appear when values meet specific conditions.

Question 4

A data analyst wants to write a SQL query to combine data from two columns into a new column. What function can they use?

  • CONCAT
  • JOIN
  • COMBINE
  • GROUP

They can use CONCAT, which joins multiple text strings from multiple sources.

Question 5

You are querying a database of ice cream flavors to determine which stores are selling the most mint chip. For your project, you only need the first 80 records. What clause should you add to the following SQL query?

SELECT flavors FROM ice_cream_table WHERE flavor = "mint_chip"

  • LIMIT = 80
  • LIMIT_80
  • LIMIT,80
  • LIMIT 80

To return only the first 80 records, type LIMIT 80.
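
Here is a small sketch of LIMIT in action, run against an in-memory SQLite database. The store column and the 200 generated rows are assumptions made for this example; only the table name and the flavor value come from the question.

```python
import sqlite3

# Hypothetical schema and generated rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE ice_cream_table (store TEXT, flavor TEXT)")
conn.executemany(
    "INSERT INTO ice_cream_table VALUES (?, ?)",
    [(f"store_{i}", "mint_chip") for i in range(200)],
)

# LIMIT caps the number of rows returned, even though 200 rows match.
first_80 = conn.execute(
    "SELECT store FROM ice_cream_table WHERE flavor = 'mint_chip' LIMIT 80"
).fetchall()
print(len(first_80))  # 80
```

Without an ORDER BY clause, SQL does not guarantee which matching rows are returned, so add one if the choice of rows matters.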

Question 6

A data analyst is working with a spreadsheet that has very long text strings. They use a function to count the number of characters in cell G11. What is the correct syntax?

  • =LEN(G,11)
  • =LEN(G11)
  • =LEN(G:G11)
  • =LEN(“G11”)

The correct syntax is =LEN(G11). The LEN function counts the number of characters in a text string and the parameter for the function is the cell reference.

Question 7

Spreadsheet cell L6 contains the text string “Function.” To return the substring “Fun,” what is the correct syntax?

  • =RIGHT(3,L6)
  • =LEFT(L6, 3)
  • =RIGHT(L6, 3)
  • =LEFT(3,L6)

The function =LEFT(L6, 3) will return “Fun.” The LEFT function returns a set number of characters from the left side of a text string. In this case, it returns the first three characters of the string in L6.
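
For comparison, the same string operations can be sketched in Python with slicing. The helper names left and right are made up to mirror the spreadsheet functions.

```python
cell_l6 = "Function"  # stand-in for the contents of cell L6


def left(text, num_chars):
    """Like =LEFT(text, num_chars): characters from the start of the string."""
    return text[:num_chars]


def right(text, num_chars):
    """Like =RIGHT(text, num_chars): characters from the end of the string."""
    return text[-num_chars:] if num_chars > 0 else ""


left_result = left(cell_l6, 3)
right_result = right(cell_l6, 3)
length = len(cell_l6)  # comparable to =LEN(L6)
print(left_result, right_result, length)  # Fun ion 8
```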

Question 8

Fill in the blank: When working with a database, data analysts can use the _ function to locate specific characters in a string.

  • IDENTIFY
  • WHERE
  • FIND
  • FROM

When working with a database, data analysts can use the FIND function to locate specific characters in a string.

Week 3: Aggregating data for analysis

As part of your analysis, you’ll often have to combine data in order to gain insights and complete business objectives. In this part of the course, you’ll explore the functions, procedures, and syntax involved in combining, or aggregating, data. You’ll learn how to do this from multiple cells in spreadsheets and from multiple database tables using SQL queries.

Learning Objectives

  • Demonstrate an understanding of functions and procedures that may be used to combine data from multiple cells in a spreadsheet
  • Demonstrate an understanding of functions and syntax to create SQL queries for combining data from multiple database tables
  • Use VLOOKUP to query data, trim data, convert text data to numeric data, and create a summary table from queried information

Answers to week 3 quiz questions

L2 Avoid common VLOOKUP pitfalls

Question 1

To change a text string in spreadsheet cell F8 to a numerical value, what is the correct function?

  • =VALUE(F8)
  • =MATCH(F8)
  • =NUM(F8)
  • =CONVERT(F8)

To change the text string in spreadsheet cell F8 to a numerical value, the correct syntax is =VALUE(F8). Within the parentheses, the VALUE syntax must include a reference to the specific cell whose value the function should convert.

Question 2

What is the purpose of an absolute reference within a function, such as “$C$3”?

  • To remove unnecessary instructions from a formula or function
  • To lock rows and columns so they won’t change when a function is copied
  • To represent missing values in a formula or function
  • To make formulas and functions unconditional

The purpose of an absolute reference is to lock the reference to a row or column so values won’t change when a function is copied.

Question 3

In VLOOKUP, TRUE tells the function to search for exact matches, and FALSE tells the function to look for approximate matches.

  • True
  • False

In VLOOKUP, TRUE tells the function to search for approximate matches, and FALSE tells the function to look for exact matches.

Question 4

The following is a selection from a spreadsheet:

    A               B                               C
1   Country         Population in 2020 (millions)   Growth in population 2000-2020
2   China           1,439,323,776                   13.4%
3   India           1,380,004,385                   37.1%
4   United States   331,002,651                     17.3%
5   Indonesia       273,523,615                     27.7%
6   Pakistan        220,892,340                     44.9%
7   Brazil          212,559,417                     21.9%
8   Nigeria         206,139,589                     66.3%
9   Bangladesh      164,689,383                     27.9%
10  Russia          145,934,462                     -0.8%

To search for the population of Nigeria, what is the correct VLOOKUP syntax?

  • =VLOOKUP(“Nigeria”, A2:C10, 2, false)
  • =VLOOKUP(Nigeria, A2:C10, 3, false)
  • =VLOOKUP(Nigeria, A2:C10, 3, true)
  • =VLOOKUP(Nigeria, A2,C10, 2, true)

To search for the population of Nigeria, the syntax is =VLOOKUP(“Nigeria”, A2:C10, 2, false). “Nigeria” is the reference. A2:C10 is the table array. The 2 indicates the position of the column from which the value should be returned. And the word false instructs the function to return an exact match.

Question 5

The following is a selection from a spreadsheet:

    A                B                          C            D
1   Location         Building                   Height       Year completed
2   Dubai            Burj Khalifa               2,717 feet   2010
3   Shanghai         Shanghai Tower             2,073 feet   2015
4   Mecca            Makkah Royal Clock Tower   1,972 feet   2012
5   Shenzhen         Ping An Finance Center     1,965 feet   2017
6   St. Petersburg   Lakhta Center              1,516 feet   2019
7   Chicago          Willis Tower               1,451 feet   1974

To search for the height of the building in Mecca, what is the correct VLOOKUP syntax?

  • =VLOOKUP(Mecca, A2:D7, 2, false)
  • =VLOOKUP(Mecca, A2:D7, 2, true)
  • =VLOOKUP(Mecca, A2,D7, 3, true)
  • =VLOOKUP(“Mecca”, A2:D7, 3, false)

To search for the height of the building in Mecca, the correct syntax is =VLOOKUP(“Mecca”, A2:D7, 3, false). “Mecca” is the reference. A2:D7 is the table array. The 3 indicates the number of the column from which the value should be returned. And the word false instructs the function to return an exact match.
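
The exact-match lookup described here can be sketched in Python. The function name vlookup_exact is hypothetical, and the sketch is a simplification: it compares the search key to the first column exactly as written and returns None where a spreadsheet would show an error.

```python
# Rows 2 through 7 of the spreadsheet selection (range A2:D7).
buildings = [
    ("Dubai", "Burj Khalifa", "2,717 feet", 2010),
    ("Shanghai", "Shanghai Tower", "2,073 feet", 2015),
    ("Mecca", "Makkah Royal Clock Tower", "1,972 feet", 2012),
    ("Shenzhen", "Ping An Finance Center", "1,965 feet", 2017),
    ("St. Petersburg", "Lakhta Center", "1,516 feet", 2019),
    ("Chicago", "Willis Tower", "1,451 feet", 1974),
]


def vlookup_exact(search_key, table, index):
    """Sketch of =VLOOKUP(search_key, range, index, false); index is 1-based."""
    for row in table:
        if row[0] == search_key:   # search only the first column
            return row[index - 1]  # return the value from the requested column
    return None                    # no exact match found


mecca_height = vlookup_exact("Mecca", buildings, 3)
print(mecca_height)  # 1,972 feet
```

Because the search always happens in the first column of the range, the value to return has to sit in that column or in a column to its right.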

L3 Use JOINS to aggregate data in SQL

Question 1

A data analyst wants to retrieve only records from a database that have matching values in two different tables. Which JOIN function should they use?

  • INNER JOIN
  • RIGHT JOIN
  • LEFT JOIN
  • OUTER JOIN

To retrieve only records from a database that have matching values in two different tables, the analyst should use INNER JOIN.
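
A minimal sketch of an INNER JOIN, using Python's sqlite3 module. The employees and departments tables, their columns, and the rows are all invented for this example.

```python
import sqlite3

# Hypothetical tables and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE employees (employee_id INTEGER, name TEXT, department_id INTEGER);
    CREATE TABLE departments (department_id INTEGER, department TEXT);
    INSERT INTO employees VALUES (1, 'Ana', 10), (2, 'Ben', 20), (3, 'Cy', NULL);
    INSERT INTO departments VALUES (10, 'Sales'), (30, 'Legal');
    """
)

# INNER JOIN returns only rows whose department_id appears in both tables.
matched = conn.execute(
    "SELECT e.name, d.department "
    "FROM employees AS e "
    "INNER JOIN departments AS d ON e.department_id = d.department_id"
).fetchall()
print(matched)  # [('Ana', 'Sales')]
```

Ben, Cy, and the Legal department are left out because they have no matching value in the other table. The short names e and d are aliases, which keep the query easier to read.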

Question 2

You are writing a SQL query to instruct a database to count distinct values in a specified range. Which function should you include in your query?

  • COUNT DISTINCT
  • COUNT RANGE
  • COUNT VALUES
  • COUNT

To instruct a database to count distinct values in a specified range, include COUNT DISTINCT in your query.
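
The difference between counting all values and counting distinct values shows up in a short sketch like this one. The orders table and its rows are hypothetical.

```python
import sqlite3

# Hypothetical table and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (order_id INTEGER, customer TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, "Ana"), (2, "Ben"), (3, "Ana"), (4, "Cy")],
)

# COUNT counts every non-null value; COUNT(DISTINCT ...) counts each value once.
total_rows, distinct_customers = conn.execute(
    "SELECT COUNT(customer), COUNT(DISTINCT customer) FROM orders"
).fetchone()
print(total_rows, distinct_customers)  # 4 3
```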

Question 3

A data analyst wants to temporarily name a column in their query to make it easier to read and write. What technique should they use?

  • Aliasing
  • Tagging
  • Filtering
  • Naming

To temporarily name a column in a query to make it easier to read and write, the analyst should use aliasing.

L4 Work with subqueries

Question 1

Which of the following queries contain subqueries? Select all that apply.

  • SELECT call FROM recordings ORDER BY call.employee_id, call.start_time
  • SELECT first_name, last_name FROM customers WHERE …
  • SELECT employee_id FROM employees WHERE …
  • SELECT price FROM sales WHERE price …

The three queries with statements in parentheses contain subqueries.

Question 2

Fill in the blank: A data analyst uses aliasing to make it easier to read and write a query. Aliasing involves temporarily _ a table or column in a query.

  • hiding
  • removing
  • naming
  • copying

Aliasing involves temporarily naming a table or column in a query.

Weekly challenge 3

Question 1

Fill in the blank: Data aggregation involves creating a _ collection of data that originally came from multiple sources.

  • modified
  • summarized
  • localized
  • expanded

Data aggregation involves creating a summarized collection of data from multiple sources.

Question 2

A data analyst uses the SUM function to add together numbers from a spreadsheet. However, after getting a zero result, they realize the numbers are actually text. What function can they use to convert the text to a numeric value?

  • FIGURE
  • DIGIT
  • VALUE
  • CONVERT

The analyst can use the VALUE function to convert the text that represents a number to a numeric value.

Question 3

When using VLOOKUP, there are some common limitations that data analysts should be aware of. One of these limitations is that VLOOKUP can only return a value from the data to the left of the matched value.

  • True
  • False

One limitation of VLOOKUP is that it can only return a value from the data to the right of the matched value.

Question 4

Fill in the blank: When writing a function, a data analyst wraps a table array in dollar signs. This is an _ , which is used to lock the array so rows and columns don’t change if the function is copied.

  • arbitrary reference
  • accurate reference
  • absolute reference
  • authentic reference

Wrapping a table array in dollar signs creates an absolute reference, which locks the array so rows and columns don’t change if the function is copied.

Question 5

The following is a selection from a spreadsheet:

    A               B                               C
1   Country         Population in 2020 (millions)   Growth in population 2000-2020
2   China           1,439,323,776                   13.4%
3   India           1,380,004,385                   37.1%
4   United States   331,002,651                     17.3%
5   Indonesia       273,523,615                     27.7%
6   Pakistan        220,892,340                     44.9%
7   Brazil          212,559,417                     21.9%
8   Nigeria         206,139,589                     66.3%
9   Bangladesh      164,689,383                     27.9%
10  Russia          145,934,462                     -0.8%

To search for the population of Pakistan, what is the correct VLOOKUP syntax?

  • =VLOOKUP(Pakistan, A2:B10, 3, false)
  • =VLOOKUP(“Pakistan”, A2:B10, 3, false)
  • =VLOOKUP(Pakistan, A2*B10, 2, false)
  • =VLOOKUP(“Pakistan”, A2:B10, 2, false)

To search for the population of Pakistan, the syntax is =VLOOKUP(“Pakistan”, A2:B10, 2, false). “Pakistan” is the reference. A2:B10 is the table array. The 2 indicates the number of the column from which the value should be returned. And the word false instructs the function to return an exact match.

Question 6

When creating a SQL query, which JOIN clause returns all matching records in two or more database tables?

  • OUTER
  • RIGHT
  • INNER
  • LEFT

The INNER JOIN clause returns all matching records in two or more database tables.

Question 7

A data analyst writes a query that asks a database to return only distinct values in a specified range, rather than including repeating values. Which function do they use?

  • RETURN
  • COUNT DISTINCT
  • RETURN VALUES
  • COUNT

When writing SQL queries, an analyst can use the COUNT DISTINCT function to return only distinct values in a range.

Question 8

Which of the following terms describe a subquery? Select all that apply.

  • Inner select
  • Nested query
  • Inner query
  • Small query

A subquery can also be called an inner query, inner select, or nested query.
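
As an illustration, the sketch below runs a query with a nested SELECT in parentheses. The sales table, its columns, and the rows are assumptions made up for this example.

```python
import sqlite3

# Hypothetical table and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (item TEXT, price REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("lamp", 20.0), ("desk", 100.0), ("rug", 60.0)],
)

# The inner query (in parentheses) computes the average price first;
# the outer query then keeps only items priced above that average.
above_average = conn.execute(
    "SELECT item FROM sales "
    "WHERE price > (SELECT AVG(price) FROM sales) "
    "ORDER BY item"
).fetchall()
print(above_average)  # [('desk',)]
```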

Week 4: Performing data calculations

Calculations are one of the more common tasks that data analysts complete during analysis. In this part of the course, you’ll explore formulas, functions, and pivot tables in spreadsheets and queries in SQL, all of which will help with your calculations. You’ll also learn about the benefits of using SQL to manage temporary tables.

Learning Objectives

  • Describe the use of functions to conduct basic calculations on data in spreadsheets
  • Discuss the use of pivot tables to conduct calculations on data in spreadsheets
  • Demonstrate an understanding of the use of SQL queries to complete calculations
  • Explain the importance of the data-validation process for ensuring accuracy and consistency in analysis
  • Discuss the use of SQL queries to manage temporary tables
  • Reflect on how conditional statements can be used to create complex queries and functions
  • Generate multiple points of summary based on a wide variety of conditions using COUNTIF, SUMIF, MAXIF, and AVERAGEIF

Answers to week 4 quiz questions

L2 Data calculations

Question 1

What is the correct spreadsheet formula for multiplying 50 and 233?

  • 50×233
  • =50×233
  • =50*233
  • 50*233

=50*233 is the correct formula for multiplying 50 and 233. Formulas begin with an equal sign (=). This is followed by the values to be computed. An asterisk (*) is the multiplication operator in spreadsheets.

Question 2

The following is a selection of a spreadsheet:

    A                B
1   Expense          Amount
2   Rent             $680.00
3   Healthcare       $101.00
4   Groceries        $185.00
5   Clothing         $41.00
6   Transportation   $22.00
7   Mobile phone     $48.00
8   Dining out       $79.00
9   Car insurance    $65.00
10  Dog walker       $40.00
11  Gym membership   $19.00
12  Manicure         $23.00

You are trying to determine what percentage of your monthly income is spent on big-ticket items, such as rent and groceries. To add together only the values from Column B that cost more than $150, what is the correct syntax?

  • =SUMIF(B2:B12,"<150")
  • =SUMIF(B2:B12,>150)
  • =SUMIF(B2:B12,<150)
  • =SUMIF(B2:B12,">150")

To add together only the values from Column B that cost more than $150, the correct syntax is =SUMIF(B2:B12,">150"). B2:B12 is the range, and more than 150 (>150) is the criterion, which must be enclosed in quotation marks.
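
To check the result by hand, the same conditional sum can be sketched in Python using the amounts from the spreadsheet selection. The variable names are made up for this example.

```python
# Column A (Expense) mapped to Column B (Amount), from rows 2 through 12.
amounts = {
    "Rent": 680.00,
    "Healthcare": 101.00,
    "Groceries": 185.00,
    "Clothing": 41.00,
    "Transportation": 22.00,
    "Mobile phone": 48.00,
    "Dining out": 79.00,
    "Car insurance": 65.00,
    "Dog walker": 40.00,
    "Gym membership": 19.00,
    "Manicure": 23.00,
}

# Equivalent in spirit to =SUMIF(B2:B12,">150"): add only values above 150.
big_ticket_total = sum(value for value in amounts.values() if value > 150)
print(big_ticket_total)  # 865.0
```

Only Rent ($680.00) and Groceries ($185.00) exceed $150, so the total is $865.00.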

Question 3

A data analyst is working with a spreadsheet from a cosmetics company.

You may click the link to create a copy of the dataset: Cosmetics Inc.

Which of the following is an example of an array in this spreadsheet?

  • All cells with number values
  • All cells with values greater than 100
  • The values in cells B2 through B31
  • Cells D7 and D14

The values in cells B2 through B31 together are an example of an array. An array is a collection of values in spreadsheet cells.

L3 Pivot tables

Question 1

The following is a sample pivot table from a furniture company:

product       SUM of purchase_price
bed           $799.99
bookcase      $58.89
chair         $234.50
chaise        $399.95
couch         $9,000.00
desk          $509.85
fan           $111.92
lamp          $160.97
mirror        $199.95
ottoman       $299.99
rug           $808.65
vase          $19.98
Grand Total   12604.635

What is the purpose of the pivot table in this spreadsheet?

  • To organize all of the data into a smaller format
  • To find the average price of each product
  • To calculate purchase price data
  • To summarize data about each product

The purpose of the pivot table is to calculate purchase price data. The pivot table shows the total purchase price for each item and the total overall purchase price for all of the items. This pivot table doesn’t include all of the data from the transaction sheet.

Question 2

The following is a sample pivot table from a furniture company:

product       SUM of purchase_price
bed           $799.99
bookcase      $58.89
chair         $234.50
chaise        $399.95
couch         $9,000.00
desk          $509.85
fan           $111.92
lamp          $160.97
mirror        $199.95
ottoman       $299.99
rug           $808.65
vase          $19.98
Grand Total   12604.635

How could the pivot table be adjusted to show the same data, but only for products categorized as beige?

  • Add a filter to show only beige products
  • Add a new column labeled beige
  • Sort the current row by product color
  • Summarize the values by product

To show the same data, but only for products categorized as beige, add a filter to show only beige products.

Question 3

The following is a sample pivot table from a furniture company:

product       SUM of purchase_price
bed           $799.99
bookcase      $58.89
chair         $234.50
chaise        $399.95
couch         $9,000.00
desk          $509.85
fan           $111.92
lamp          $160.97
mirror        $199.95
ottoman       $299.99
rug           $808.65
vase          $19.98
Grand Total   12604.635

The value added to the pivot table is the purchase price of the products.

  • TRUE
  • FALSE

The value added to the pivot table is purchase price. In the pivot table editor, the Values menu shows “purchase_price” as the value in the pivot table.

Question 4

The following is a sample pivot table from a furniture company:

product       SUM of purchase_price   Calculated Field 1
bed           $799.99                 $0.00
bookcase      $58.89                  $0.00
chair         $234.50                 $0.00
chaise        $399.95                 $0.00
couch         $9,000.00               $0.00
desk          $509.85                 $0.00
fan           $111.92                 $0.00
lamp          $160.97                 $0.00
mirror        $199.95                 $0.00
ottoman       $299.99                 $0.00
rug           $808.65                 $0.00
vase          $19.98                  $0.00
Grand Total   12604.635               $0.00

Which spreadsheet tool should you use if you want to find an average value using values generated within a pivot table?

  • A filter
  • Conditional formatting
  • A calculated field
  • Data validation

To find an average value using values generated within a pivot table, use a calculated field. A calculated field is a new field within a pivot table that carries out certain calculations based on the values of other fields.

L4 SQL calculations

Question 1

You are creating a query to request worker information from your database. You will use that information to calculate employees’ weekly pay. What clause would you include to store pay values in a new weekly_pay column?

SELECT Employee_ID, number_of_hours, Hourly_rate
FROM Wages_table

  • (number_of_hours * Hourly_rate) AS weekly_pay
  • (weekly_pay * Hourly_rate) TO number_of_hours
  • (weekly_pay * Hourly_rate) AS number_of_hours
  • (number_of_hours * Hourly_rate) TO weekly_pay

To store pay values in the weekly_pay column, the correct statement is (number_of_hours * Hourly_rate) AS weekly_pay. The AS command gives a temporary name to the column.
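
Here is a sketch of where that expression sits in a complete query: it goes in the SELECT list alongside the other columns. The example runs in SQLite through Python, and the two sample rows are invented for illustration.

```python
import sqlite3

# Table and column names follow the question; the rows are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Wages_table "
    "(Employee_ID INTEGER, number_of_hours REAL, Hourly_rate REAL)"
)
conn.executemany(
    "INSERT INTO Wages_table VALUES (?, ?, ?)",
    [(1, 40, 20.0), (2, 30, 25.0)],
)

# The calculated expression is named weekly_pay with AS.
cursor = conn.execute(
    "SELECT Employee_ID, number_of_hours, Hourly_rate, "
    "(number_of_hours * Hourly_rate) AS weekly_pay "
    "FROM Wages_table ORDER BY Employee_ID"
)
column_names = [col[0] for col in cursor.description]
pay_rows = cursor.fetchall()
print(column_names[-1])             # weekly_pay
print([row[3] for row in pay_rows])  # [800.0, 750.0]
```

The weekly_pay column exists only in the query result; the underlying table is not changed.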

Question 2

In a SQL query, which calculation does the modulo (%) operator perform?

  • It converts a decimal to a percent
  • It finds the square root of a number
  • It applies an exponent to a value
  • It returns the remainder of a division calculation

The modulo operator returns the remainder of a division calculation when included in a SQL query.
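
A quick sketch of the modulo operator, run in SQLite through Python. SQLite is used here only because it ships with Python; other SQL dialects may spell the operation differently. The numbers are arbitrary examples.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# 17 divided by 5 is 3 with a remainder of 2.
remainder = conn.execute("SELECT 17 % 5").fetchone()[0]

# A remainder of 0 means the first number divides evenly by the second.
even_check = conn.execute("SELECT 10 % 2").fetchone()[0]
print(remainder, even_check)  # 2 0
```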

Question 3

You are working with a dataset with the column name “firstquarterexpenses.” How can you rename this column to make it more readable?

  • Firstquarterexpenses
  • first_quarter_expenses
  • first+quarter+expenses
  • first quarter expenses

You can rename the column first_quarter_expenses. Using underscores between words helps avoid potential issues while keeping the names readable.

L5 Data validation

Question 1

The entire data-validation process takes place before you begin your analysis.

  • True
  • False

The data-validation process takes place throughout your analysis. This process involves checking and rechecking the quality of your data so that it is complete, accurate, secure and consistent.

Question 2

You’re analyzing patient data for a health care company. During the data-validation process, you notice that the first date of service for some of the patients is later than the most recent date of service. Which type of data-validation check are you completing?

  • Data consistency
  • Data structure
  • Data type
  • Data range

This is a check for data consistency. During a data consistency check, you confirm that the data makes sense in the context of other related data.

Question 3

During analysis, you complete a data-validation check for errors in customer identification (ID) numbers. Customer IDs must be eight characters and can contain numbers only. Which of the following customer ID errors will a data-type check help you identify?

  • IDs with text
  • IDs that are repeated
  • IDs in the wrong column
  • IDs with more than eight characters

Completing a data-type check will help you identify customer IDs that contain text. The data type for IDs should be numeric only.
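As a rough illustration, a data-type check and a length check can be kept as two separate tests. The customer IDs below are hypothetical; the rules (eight characters, numbers only) come from the question.

```python
# Hypothetical customer IDs, for illustration only.
customer_ids = ["12345678", "1234ABCD", "87654321", "123456789"]

# Data-type check: flag IDs that contain anything other than the digits 0-9.
ids_with_text = [
    cid for cid in customer_ids if not (cid.isascii() and cid.isdigit())
]

# A length check is a different validation and catches a different error.
ids_wrong_length = [cid for cid in customer_ids if len(cid) != 8]

print(ids_with_text)
print(ids_wrong_length)
```

The data-type check finds the ID with letters but not the nine-digit ID, which is why the other answer options need other kinds of checks.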

L6 Using SQL with temporary tables

Question 1

When are temporary tables automatically deleted?

  • After running a query in your SQL database
  • After running a report from the table
  • After completing all calculations in the table
  • After ending the session in a SQL database

Temporary tables are automatically deleted after ending the session in a SQL database.
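This behavior can be observed with SQLite, used here only as a convenient stand-in for whichever SQL database you work with. In this sketch the table name scratch and the database file are hypothetical, and closing the connection plays the role of ending the session.

```python
import os
import sqlite3
import tempfile

# A file-based database so that a second session can open the same data.
db_path = os.path.join(tempfile.mkdtemp(), "demo.db")

# First session: create and use a temporary table.
session_1 = sqlite3.connect(db_path)
session_1.execute("CREATE TEMP TABLE scratch (n INTEGER)")
session_1.execute("INSERT INTO scratch VALUES (1)")
visible_in_session = session_1.execute(
    "SELECT COUNT(*) FROM scratch"
).fetchone()[0]
session_1.close()  # ending the session discards the temporary table

# Second session: the temporary table no longer exists.
session_2 = sqlite3.connect(db_path)
try:
    session_2.execute("SELECT COUNT(*) FROM scratch")
    survived = True
except sqlite3.OperationalError:
    survived = False
session_2.close()

print(visible_in_session, survived)
```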

Question 2

The following SQL query contains information about bike trips:

WITH 1_hr_trips AS (
    SELECT *
    FROM bigquery-public-data.new_york.citibike_trips
    WHERE tripduration = 60
)

What data will appear in the temporary table created through this query?

  • The total number of bike trips
  • Bike trips equal to or more than one hour
  • A random subset of bike trips
  • Bike trips that lasted exactly one hour

This temporary table will show bike trips that lasted exactly one hour. The name of the table is “1_hr_trips” and the WHERE clause keeps only the trips whose tripduration equals 60.
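To see how a WITH clause behaves, here is a minimal sketch that runs a similar query against a small local table with Python's sqlite3 module. The local citibike_trips table, the trip_id column, and the rows are hypothetical, and the temporary table is named one_hr_trips here as an assumption to avoid a name that starts with a digit.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical stand-in for the public Citi Bike dataset.
conn.execute(
    "CREATE TABLE citibike_trips (trip_id INTEGER, tripduration INTEGER)"
)
conn.executemany(
    "INSERT INTO citibike_trips VALUES (?, ?)",
    [(1, 45), (2, 60), (3, 60), (4, 90)],
)

# The WITH clause defines the temporary table; the SELECT after it reads it.
rows = conn.execute(
    """
    WITH one_hr_trips AS (
        SELECT *
        FROM citibike_trips
        WHERE tripduration = 60
    )
    SELECT trip_id, tripduration
    FROM one_hr_trips
    ORDER BY trip_id
    """
).fetchall()

print(rows)
```

Only the rows with a tripduration of exactly 60 reach the temporary table; shorter and longer trips are filtered out.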

Question 3

What benefit does a CREATE TABLE statement add to a temporary table?

  • Metadata about the data in the table
  • Access for anyone to use the table
  • Automated calculations
  • Specific naming conventions

A CREATE TABLE statement provides access for anyone to use the table, because the table is stored in the database rather than existing only within a single query.

Weekly challenge 4

Question 1

You are analyzing sales data in a spreadsheet. Which of the following could you find out by using the MAX function?

  • Total sales for the year
  • Difference between two months of sales
  • The month with the highest sales
  • Sales per month over a year

You could find out the month with the highest sales using the MAX function. The MAX function returns the largest numeric value from a range of cells.
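The same idea can be sketched outside a spreadsheet. The monthly sales figures below are hypothetical; the point is that MAX returns the largest value, and matching that value to its month is a separate lookup.

```python
# Hypothetical monthly sales figures, for illustration only.
monthly_sales = {"January": 1200, "February": 950, "March": 1430, "April": 1100}

# Equivalent of =MAX(range): the largest numeric value in the range.
highest_sales = max(monthly_sales.values())

# Finding which month that value belongs to is a second step.
best_month = max(monthly_sales, key=monthly_sales.get)

print(highest_sales, best_month)
```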

Question 2

A data analyst is working with a spreadsheet from a furniture company.

Sample Transaction Table.

The analyst inputs a function to find the number of product prices that are less than $150.00. Which formula will return that result?

  • =SUMIF(G2:G30, ">150")
  • =COUNTIF(G2:G30, "<150")
  • =SUMIF(G2:G30, "<150")
  • =COUNTIF(G2:G30, ">=150")

The COUNTIF formula =COUNTIF(G2:G30, "<150") will allow the analyst to count all product price values in Column G that are less than $150.
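For comparison, the logic of COUNTIF with a "<150" condition can be sketched in a few lines of Python. The prices are hypothetical stand-ins for the values in cells G2:G30.

```python
# Hypothetical product prices standing in for cells G2:G30.
prices = [89.99, 150.00, 210.50, 149.99, 35.00]

# Count the values that meet the condition, as COUNTIF does.
count_under_150 = sum(1 for price in prices if price < 150)

print(count_under_150)
```

A price of exactly 150.00 is not counted, because the condition is strictly less than 150.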

Question 3

A data analyst is working in a spreadsheet and uses the SUMIF function in the formula below as part of their analysis.

=SUMIF(A1:A25, "<10", C1:C25)

Which part of this formula is the criteria or condition?

  • "<10"
  • A1:A25
  • C1:C25
  • =SUMIF

The criteria or condition for this SUMIF formula is "<10". This means that if any values in the range A1 through A25 are less than 10, their corresponding values in the range C1 through C25 will be added together.
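The three-argument form of SUMIF pairs each value in the first range with the value in the same position of the sum range. A small sketch, with hypothetical values standing in for columns A and C:

```python
# Hypothetical values standing in for A1:A25 (tested) and C1:C25 (summed).
column_a = [4, 12, 9, 10, 7]
column_c = [100, 200, 300, 400, 500]

# Add a value from column C only when its partner in column A is below 10.
total = sum(c for a, c in zip(column_a, column_c) if a < 10)

print(total)
```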

Question 4

A data analyst is working in a spreadsheet and uses the SUMPRODUCT function in the formula below as part of their analysis.

=SUMPRODUCT(A2:A10,B2:B10)

How does the SUMPRODUCT function calculate the cell ranges identified in the parentheses?

  • It multiplies the values in the first range, then multiplies the values in the second range.
  • It adds the ranges, then multiplies them by the last value in the second array.
  • It adds the values in the first range, then adds the values in the second range.
  • It multiplies the ranges, then adds the sum of the products of the two ranges.

=SUMPRODUCT(A2:A10,B2:B10) calculates the cell ranges by multiplying each value in the first range by its corresponding value in the second range (the results are the products). Then, the formula adds those products together.
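The multiply-then-add order can be checked with a short sketch. The two lists are hypothetical stand-ins for the ranges A2:A10 and B2:B10.

```python
# Hypothetical values standing in for A2:A10 and B2:B10.
quantities = [2, 3, 5]
unit_prices = [10.0, 4.0, 1.5]

# Multiply each pair of corresponding values, then add the products.
products = [q * p for q, p in zip(quantities, unit_prices)]
sum_product = sum(products)

print(products, sum_product)
```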

Question 5

A data analyst creates a pivot table in a spreadsheet containing movie data.

Movie Data Project.

If the analyst wants to summarize the data using the AVERAGE function in the Values menu, which spreadsheet columns could they add data from? Select all that apply.

  • Box Office Revenue
  • Budget
  • Movie Title
  • Genre

To summarize the data using the AVERAGE function, the analyst could use the Budget column or the Box Office Revenue column. Both have numeric values that the AVERAGE function could calculate.

Question 6

A data analyst uses the following SQL query to perform basic calculations on their data. Which types of operators is the analyst using in this SQL query? Select all that apply.

SELECT
    Yes_Responses,
    No_Responses,
    Total_Surveys,
    (Yes_Responses + No_Responses) / Total_Surveys AS Responses_Per_Survey
FROM
    Survey_1

  • Subtraction
  • Multiplication
  • Addition
  • Division

The analyst is using two operators in this SQL query: the addition operator (+) to add the “yes” and “no” responses, and the division operator (/) to divide that sum by the total number of surveys.
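One practical detail when combining these operators is how the database handles division of whole numbers. The sketch below uses SQLite through Python's sqlite3 module, where dividing one integer by another discards the fractional part; other engines may return a decimal result instead, so treat this as an illustration for SQLite only. The sample row is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Survey_1 "
    "(Yes_Responses INTEGER, No_Responses INTEGER, Total_Surveys INTEGER)"
)
# Hypothetical survey counts.
conn.execute("INSERT INTO Survey_1 VALUES (30, 10, 50)")

integer_result, decimal_result = conn.execute(
    """
    SELECT
        (Yes_Responses + No_Responses) / Total_Surveys,
        (Yes_Responses + No_Responses) * 1.0 / Total_Surveys
    FROM Survey_1
    """
).fetchone()

# In SQLite, integer / integer drops the fraction; multiplying by 1.0
# first forces a floating-point division.
print(integer_result, decimal_result)
```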

Question 7

A data analyst uses the following query to perform a calculation on a company’s inventory. Which of the following will be the return in the “Overstock” column for this query?

SELECT
    Total_Inventory % Total_Stores AS Overstock
FROM
    Shipment_1

  • The remainder when the values in “Total_Inventory” are divided by the values in “Total_Stores”
  • The percentage of the “Total_Inventory” that is located in “Total_Stores”
  • The difference between the values in “Total_Inventory” and the values in “Total_Stores”
  • The combined total of the values in “Total_Inventory” and the values in “Total_Stores”

The return for this query will be the remainder when the total inventory is divided by the total number of stores. The modulo operator (%) calculates the remainder when two values are divided.
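The query can be run against a few made-up rows to see the remainders. This sketch uses Python's sqlite3 module; the table and column names come from the question, and the inventory figures are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Shipment_1 (Total_Inventory INTEGER, Total_Stores INTEGER)"
)
# Hypothetical inventory and store counts.
conn.executemany(
    "INSERT INTO Shipment_1 VALUES (?, ?)",
    [(103, 10), (250, 25), (47, 6)],
)

# The % operator returns what is left over after the division.
overstock = conn.execute(
    """
    SELECT Total_Inventory % Total_Stores AS Overstock
    FROM Shipment_1
    ORDER BY rowid
    """
).fetchall()

print(overstock)
```

For example, 103 items split evenly across 10 stores leaves 3 items over, and 250 items across 25 stores leaves none.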

Question 8

A data analyst completes a calculation in a SQL query using the AVG function. Which of the following best describes the return for this query?

SELECT
    AVG(salary) AS avg_employee_salary
FROM
    employees
WHERE
    salary < 30000

  • The number of all salaries in the “employees” table
  • A single average of all of the salaries less than $30,000
  • A single count of salaries that average less than $30,000
  • The annual salary for each employee

The return for this query would be a single average of all of the salaries less than $30,000. The AVG function is an aggregate function that returns the average value of a group. In this query, the group is “salary” and the condition is salaries less than $30,000.
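Running the query on sample data shows that it returns one row with one value. This is a minimal sketch with Python's sqlite3 module; the salaries are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (employee_id INTEGER, salary INTEGER)")
# Hypothetical salaries; only the first two are below 30000.
conn.executemany(
    "INSERT INTO employees VALUES (?, ?)",
    [(1, 25000), (2, 28000), (3, 45000), (4, 60000)],
)

# WHERE filters the rows first, then AVG collapses them into one value.
rows = conn.execute(
    """
    SELECT AVG(salary) AS avg_employee_salary
    FROM employees
    WHERE salary < 30000
    """
).fetchall()

print(rows)
```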

Question 9

Use the following SQL query to answer the question:

SELECT
    location,
    SUM(customer_orders) AS total_orders
FROM
    bulk_orders

Which statement should you add after the FROM statement to organize rows by location?

  • EXTRACT location
  • WHERE location
  • AS location
  • GROUP BY location

You should add the GROUP BY statement to organize rows by location. In this query, GROUP BY groups rows from the bulk_orders table with the same location value into summary rows.
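The effect of GROUP BY is easiest to see on a small table. In this sketch, built on Python's sqlite3 module, the table and column names come from the question, while the locations, order counts, and the added ORDER BY (used only to make the output order predictable) are assumptions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE bulk_orders (location TEXT, customer_orders INTEGER)")
# Hypothetical orders; some locations appear more than once.
conn.executemany(
    "INSERT INTO bulk_orders VALUES (?, ?)",
    [("Austin", 10), ("Boston", 5), ("Austin", 7), ("Boston", 3), ("Chicago", 4)],
)

# Rows with the same location are combined into one summary row each.
totals = conn.execute(
    """
    SELECT
        location,
        SUM(customer_orders) AS total_orders
    FROM
        bulk_orders
    GROUP BY location
    ORDER BY location
    """
).fetchall()

print(totals)
```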

Question 10

Fill in the blank: The data validation process involves checking and rechecking the quality of your data to make sure that it is complete and _. Select all that apply.

  • cited
  • accurate
  • consistent
  • secure

Data validation involves checking and rechecking the quality of your data to make sure it is complete, accurate, secure, and consistent.
