You deploy Apache Kafka to an Azure HDInsight cluster.
You plan to load data into a topic that has a specific schema.
You need to load the data while maintaining the existing schema.
Which file format should you use to receive the data?
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You are implementing a batch processing solution by using Azure HDInsight.
You have two tables. Each table is larger than 250 TB. Both tables have approximately the same number of rows and columns.
You need to match the tables based on a key column. You must minimize the size of the data table that is produced.
What should you do?
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You are implementing a batch processing solution by using Azure HDInsight.
You have a workflow that retrieves data by using a U-SQL query.
You need to provide the ability to query and combine data from multiple data sources.
What should you do?
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have an Apache Pig table named Sales in Apache HCatalog.
You need to make the data in the table accessible from Apache Pig.
Solution: You use the following script.
Does this meet the goal?
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have an Apache Pig table named Sales in Apache HCatalog.
You need to make the data in the table accessible from Apache Pig.
Solution: You use the following script.
Does this meet the goal?