SPADOCK – Network Analysis with Spark Streaming using Docker containers

Our application presents a whole new ecosystem for Big Data analysis by combining the strengths of Apache Spark, HDFS and Docker. We eliminate the need for virtual machines and implement a complex social analysis data model using Docker containers instead. This is expected to improve performance and solve most of the big data problems encountered. Also we plan on implementing certain APIs and capitalize on the machine learning and GraphX libraries of Spark to enhance performance and make complex computations and delays due to munging data a thing of the past.


Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s