Apache MapReduce divides tasks into ________ and ________ phases for processing large datasets.
- Input, Output
- Map, Reduce
- Map, Shuffle
- Sort, Combine
Apache MapReduce divides tasks into Map and Reduce phases for processing large datasets. The Map phase handles input data and generates key-value pairs, while the Reduce phase aggregates and processes these pairs.
Loading...
Related Quiz
- The integration of ________ in monitoring systems enables proactive identification and resolution of issues before they impact data pipeline performance.
- Which of the following best describes the relationship between normalization and data redundancy?
- ________ is a data extraction technique that involves extracting data from a source system's log files, typically in real-time.
- A well-defined data ________ helps ensure that data is consistent, accurate, and reliable across the organization.
- Which component of Kafka is responsible for storing the published messages?