In Spark, ____ are immutable collections of data items distributed over a cluster.

  • Data Blocks
  • DataFrames
  • DataSets
  • Resilient Distributed Datasets (RDDs)
In Spark, Resilient Distributed Datasets (RDDs) are immutable collections of data items distributed over a cluster. RDDs are the fundamental data structure in Spark, providing fault tolerance and parallel processing capabilities.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *