Apache Spark's ____ abstraction provides an efficient way of handling distributed data across nodes.

  • DataFrame
  • RDD (Resilient Distributed Dataset)
  • SparkContext
  • SparkSQL
Apache Spark's RDD (Resilient Distributed Dataset) abstraction is a fundamental data structure that provides fault-tolerant distributed processing of data across nodes. It allows efficient data handling and transformation in a parallel and resilient manner.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *