Hadoop Vs Spark: An Integrated Big Data Analytics Architecture For Distributed Computing Frameworks

Authors

  • S R V Prasad Reddy Author
  • Dr.K. Parthiban Author
  • Rajendrakumar Ramadass Author
  • Kakarla Hari Kishore Author

DOI:

https://doi.org/10.64252/73cceq93

Keywords:

Hadoop MapReduce, Spark, decentralized frameworks, parallel computing, big data, and big data analysis.

Abstract

In the digital age, decision-makers may now access enormous amounts of data.   The phrase "big data" refers to databases that are difficult to handle using traditional tools and techniques due to their size, diversity, and dynamic nature.The last few years have seen an increase in the generation of data from multiple sources due to the introduction of cloud computing technology.Today's data processing equipment must be able to handle the enormous volumes of newly created information.  For the DS &BDA sector in SC & L, we first propose a novel method of systematic review in this research.   In order to classify the existing models and approaches, arrange their areas of practical usage, identify research needs, and propose potential future research methods; we then use the recommended methodology for an organized literature review on DS &BDA methods in the SC &L fields. The Adoop framework rose to prominence with its dispersed file structure and MapReduce programming methodology. On the other hand, Spark is a newly created framework for big data management and analysis that allows you to investigate an infinite number of the underlying properties of large data.The research compares Hadoop MapReduce with Spark's operating principles, effectiveness, and expense, simplicity of use, connectivity, data mining, disaster tolerance, and hygiene. Experimental observations of Hadoop MapReduce and Spark's performance have been made to ascertain their suitability for use in a variety of distributed computing scenarios.

Downloads

Download data is not yet available.

Downloads

Published

2025-08-11

Issue

Section

Articles

How to Cite

Hadoop Vs Spark: An Integrated Big Data Analytics Architecture For Distributed Computing Frameworks. (2025). International Journal of Environmental Sciences, 477-488. https://doi.org/10.64252/73cceq93