A data organization leader is upset about the data analysis teams reports being different from the data engineering teams reports. The leader believes the siloed nature of their organizations data engineering and data analysis architectures is to blame.
Which of the following describes how a data lakehouse could alleviate this issue?
b
Which of the following tools is used by Auto Loader process data incrementally?
b
Which of the following is hosted completely in the control plane of the classic Databricks architecture?
e
A data engineer is running code in a Databricks Repo that is cloned from a central Git repository. A colleague of the data engineer informs them that changes have been made and synced to the central Git repository. The data engineer now needs to sync their Databricks Repo to get the changes from the central Git repository.
Which of the following Git operations does the data engineer need to run to accomplish this task?
c
A data engineer needs to create a table in Databricks using data from a CSV file at location /path/to/csv.
They run the following command:
Which of the following lines of code fills in the above blank to successfully complete the task?
b
Which of the following must be specified when creating a new Delta Live Tables pipeline?
e
A data engineer is working with two tables. Each of these tables is displayed below in its entirety.
The data engineer runs the following query to join these tables together:
Which of the following will be returned by the above query?
d
A data engineer has a Python variable table_name that they would like to use in a SQL query. They want to construct a Python code block that will run the query using table_name.
They have the following incomplete code block:
____(fSELECT customer_id, spend FROM {table_name})
Which of the following can be used to fill in the blank to successfully complete the task?
e
Which of the following SQL keywords can be used to convert a table from a long format to a wide format?
b
A data engineer wants to create a data entity from a couple of tables. The data entity must be used by other data engineers in other sessions. It also must be saved to a physical location.
Which of the following data entities should the data engineer create?
c