108: PySpark - Jonathan Rioux
Apache Spark is a unified analytics engine for large-scale data processing.
PySpark blends the powerful Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task.
PySpark blends the powerful Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task.
Johnathan Rioux, author of "PySpark in Action", joins the show and gives us a great introduction of Spark and PySpark to help us decide how to get started and decide whether or not to decide if Spark and PySpark are right you.
Special Guest: Jonathan Rioux.
Links:
- Show notes and resources can be found at testandcode.com
- This podcast produced in conjunction with pythontest.com
Creators and Guests
