Comparative Study of Open Source Processing Frameworks for Analysis of Big Data

Sanisha Chandel


Multiple terabytes of data are generated every moment by a wide range of devices. Merely collecting data is worthless unless it is used to make better decisions. Big data analytics extracts value from data and helps improve decision making. Various data processing frameworks, both open source and enterprise, can process data in batches as well as in real time. This paper provides an overview and comparative study of such open source frameworks, highlighting their main features. Its aim is to give better insight into open source frameworks and to help researchers find the best framework for their application.


Big Data; Data analytics; Big Data Analytics; Hadoop; Spark; Flink; Samza; Storm


Copyright (c) 2017 International Journal of Advanced Research in Computer Science