Apache Spark: What? Why? When?
Apache Spark is a unified analytics engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
This presentation focuses on Spark cluster internals and performance.
Slides can also be viewed on slideshare
Thanks!