To improve performance, ____ is often used in MapReduce to process data before it reaches the Reducer.
- Aggregator
- Combiner
- Sorter
- Transformer
To improve performance, a Combiner is often used in MapReduce to process data before it reaches the Reducer. The Combiner performs a local aggregation of the data output by the Mapper, reducing the volume of data that needs to be transferred over the network.
Loading...
Related Quiz
- ____ in Flume are responsible for storing events until they are consumed by sinks.
- In advanced Oozie workflows, ____ is used to manage job retries and error handling.
- HBase ____ are used to categorize columns into logical groups.
- The integration of Scala with Hadoop is often facilitated through the ____ framework for distributed computing.
- The ____ of a Hadoop cluster indicates the balance of load across its nodes.