What is a broadcast variable in Apache Spark, and how is it used?

  • A variable cached in memory for faster access
  • A variable replicated to every executor node
  • A variable shared across all nodes in a cluster
  • A variable used for inter-process communication
A broadcast variable in Apache Spark is replicated to every executor node for efficient data distribution. It's used for broadcasting large read-only datasets to all tasks across the cluster to avoid excessive data shuffling.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *