In a scenario involving streaming data, which Hadoop file format would be most efficient?

  • Avro
  • ORC
  • Parquet
  • SequenceFile
In a streaming-data scenario, Avro is the most efficient choice of the four. Avro is a row-oriented binary serialization format, so each record can be serialized and appended as soon as it arrives, whereas columnar formats such as ORC and Parquet have to buffer a batch of rows before they can write. Its compact encoding and support for schema evolution make it well suited to real-time data processing in Hadoop.
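
The difference between row-oriented and columnar writing can be sketched with a toy example. This is only an illustration under stated assumptions: `RowWriter`, `ColumnarWriter`, and `demo` are hypothetical names, JSON stands in for the real binary encodings, and none of this is the actual Avro, ORC, or Parquet API.

```python
import io
import json


class RowWriter:
    """Toy row-oriented writer (hypothetical): writes each record on arrival."""

    def __init__(self, sink):
        self.sink = sink

    def append(self, record):
        # One record in, one encoded row out; nothing is held back.
        self.sink.write(json.dumps(record) + "\n")


class ColumnarWriter:
    """Toy columnar writer (hypothetical): buffers rows into a row group."""

    def __init__(self, sink, row_group_size):
        self.sink = sink
        self.row_group_size = row_group_size
        self.buffer = []

    def append(self, record):
        # Records stay in memory until a full row group is available.
        self.buffer.append(record)
        if len(self.buffer) >= self.row_group_size:
            self.flush()

    def flush(self):
        if not self.buffer:
            return
        # Pivot the buffered rows into one list of values per column.
        columns = {key: [r[key] for r in self.buffer] for key in self.buffer[0]}
        self.sink.write(json.dumps(columns) + "\n")
        self.buffer.clear()


def demo():
    """Feed three events to both writers and return what each has written."""
    events = [{"id": i, "value": i * 10} for i in range(3)]
    row_sink, col_sink = io.StringIO(), io.StringIO()
    row_writer = RowWriter(row_sink)
    col_writer = ColumnarWriter(col_sink, row_group_size=4)
    for event in events:
        row_writer.append(event)
        col_writer.append(event)
    return row_sink.getvalue(), col_sink.getvalue()
```

After three events, the row-oriented sink already holds three written records, while the columnar sink is still empty because its row group of four has not filled. That buffering delay is the reason a row-oriented format tends to fit record-at-a-time streaming better.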