Scenario: Your team needs to build a recommendation system that requires real-time access to user data stored in HDFS. Which Hadoop component would you recommend for this use case, and how would you implement it?

  • Apache Flume
  • Apache HBase
  • Apache Spark Streaming
  • Apache Storm
Apache Spark Streaming is well-suited for real-time processing and accessing user data stored in HDFS. It allows for continuous computation on streaming data, making it ideal for building real-time recommendation systems.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *