The major libraries that constitute the Spark ecosystem are:
Spark MLlib - Spark's machine learning library, providing commonly used learning algorithms such as clustering, regression, and classification.
Spark Streaming - Library for processing real-time streaming data.
Spark GraphX - Spark API for graph-parallel computations, with basic operators such as joinVertices, subgraph, and aggregateMessages.
Spark SQL - Helps execute SQL-like queries on Spark data using standard visualization or BI tools.
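
Of the GraphX operators named above, aggregateMessages is probably the least self-explanatory: each edge sends messages to its endpoint vertices, and messages arriving at the same vertex are merged into one value. The sketch below models that idea in plain Python so it runs without a Spark installation. It is not the GraphX API: the Edge tuple, the aggregate_messages function, and the sample edges are all hypothetical names and data chosen for illustration.

```python
from collections import namedtuple

# Hypothetical edge record for this sketch: source id, destination id, attribute.
Edge = namedtuple("Edge", ["src", "dst", "attr"])


def aggregate_messages(edges, send_msg, merge_msg):
    """Plain-Python model of the send/merge pattern (not the GraphX API).

    send_msg(edge) returns a list of (vertex_id, message) pairs.
    merge_msg(a, b) combines two messages addressed to the same vertex.
    Only vertices that receive at least one message appear in the result.
    """
    inbox = {}
    for edge in edges:
        for vertex_id, msg in send_msg(edge):
            if vertex_id in inbox:
                inbox[vertex_id] = merge_msg(inbox[vertex_id], msg)
            else:
                inbox[vertex_id] = msg
    return inbox


# Made-up sample graph: 1 -> 2, 1 -> 3, 2 -> 3.
edges = [Edge(1, 2, "follows"), Edge(1, 3, "follows"), Edge(2, 3, "follows")]

# In-degree: every edge sends the message 1 to its destination; messages are summed.
in_degree = aggregate_messages(
    edges,
    send_msg=lambda e: [(e.dst, 1)],
    merge_msg=lambda a, b: a + b,
)
print(in_degree)
```

Vertex 1 has no incoming edges, so it receives no message and is absent from the result. In Spark itself the same send-and-merge steps run in parallel across the partitions of the graph rather than in a single loop.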