SPADOCK – Network Analysis with Spark Streaming using Docker containers

Our application proposes a new ecosystem for Big Data analysis that combines the strengths of Apache Spark, HDFS, and Docker. Instead of virtual machines, we implement a complex social analysis data model in Docker containers, which we expect to improve performance and address many of the big data problems we have encountered. We also plan to implement certain APIs and to use Spark's machine learning and GraphX libraries to speed up complex computations and reduce the delays caused by data munging.
