Apache Spark leverages a distributed storage system called ________ for fault-tolerant storage of RDDs.

  • Apache HBase
  • Cassandra
  • HDFS
  • S3
Apache Spark utilizes HDFS (Hadoop Distributed File System) for fault-tolerant storage of Resilient Distributed Datasets (RDDs). HDFS provides the necessary durability and fault tolerance required for distributed processing in Spark.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *