____ is a column-oriented file format in Hadoop, optimized for querying large datasets.
- Avro
- ORC
- Parquet
- SequenceFile
Parquet is a column-oriented file format in Hadoop designed for optimal query performance on large datasets. It organizes data in a columnar fashion, allowing for efficient compression and improved read performance, making it suitable for analytical workloads.
Loading...
Related Quiz
- How does data latency in batch processing compare to real-time processing?
- What is the primary storage model used by Apache HBase?
- How does Apache Kafka complement Hadoop in building robust, scalable data pipelines?
- Which Java-based framework is commonly used for unit testing in Hadoop applications?
- For a financial institution requiring immediate fraud detection, what type of processing in Hadoop would be most effective?