Spark Physical and Logical Plan Analysis
Spark Physical and Logical Plan Analysis
• The code written is first noted as an unresolved logical plan, if it is valid then Spark converts this into a logical plan
• The logical plan is passed through the Catalyst Optimizer to apply optimized rules.
• The Optimized Logical Plan is then converted into a Physical Plan
• The Physical Plan is executed by the Spark executors.
DataFrame.explain(extended=None, mode=None)[source]
Prints the (logical and physical) plans to the console for debugging purpose.
specifies the expected output format of plans:
PySpark example
We have used aggregation function to achieve the above problem statement.