12. Spark Internals
SPARK INTERNALS
An Execution Plan Is Created From Your RDD'S
The Job Is Broken Into Stages Based On When Data Needs To Be Organized
Each Stage Is Broken Into Tasks (Which May Be Distributed Across A Cluster)
Finally The Tasks Are Scheduled Across Your Cluster And Executed
Previous11. Ratings Histogram WalkthroughNext13. Key /Value RDD's, and the Average Friends by Age example
Last updated