Given the need for near-real-time data processing in Hadoop, which tool would be best for ingesting streaming data from various sources?

  • Flume
  • Kafka
  • Sqoop
  • Storm
Kafka is the preferred tool for ingesting streaming data from various sources in Hadoop when near-real-time data processing is required. It acts as a distributed, fault-tolerant, and scalable messaging system, efficiently handling real-time data streams.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *