The Tidbits of Apache Spark

Why Spark for Big Data?

Features of Apache Spark

Apache Spark Architecture

Apache Spark Cluster Manager

Features of Cluster Manager:

Apache Spark supports mainly three Types of Cluster Managers:

(./sbin/start-master.sh)
($./sbin/start-slave.sh master-spark-URL.)

Resilient Distributed Dataset (RDD)

Features of an RDD in Spark

Operations of RDD

When to use RDD?

Limitations of RDD:

Conclusion

What’s Next?

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store