Adobe PDF (7.27 MB)
Title Details:
Computational Methods for Big Data Analysis (Hadoop and MapReduce)
Authors: Verykios, Vasileios
Kagklis, Vasileios
Stavropoulos, Ilias
Reviewer: Kalles, Dimitrios
Description:
Abstract:
The solution of Hadoop & MapReduce and their associated applications includes among others a simple but very powerful method for the processing and analysis of extremely large data sets, even up to the level of multiple PetaBytes. At its core, MapReduce is a process for combining data from multiple inputs (creating the "map"), and then reducing (reduce) it using a function that will refine the desired results. This chapter presents multiple use cases of Hadoop & MapReduce and their applications for multi-TB and even multi-PB instances; Hadoop uses a distributed file system, HDFS. The Hadoop/MapReduce system is useful for data that is less structured such as Internet pages or multiple document or image data that is not fully organized-structured.
Type: Chapter
Creation Date: 2015
Item Details:
License: http://creativecommons.org/licenses/by-nc-nd/3.0/gr
Handle http://hdl.handle.net/11419/2973
Bibliographic Reference: Verykios, V., Kagklis, V., & Stavropoulos, I. (2015). Computational Methods for Big Data Analysis (Hadoop and MapReduce) [Chapter]. In Verykios, V., Kagklis, V., & Stavropoulos, E. 2015. Data science through the R language [Undergraduate textbook]. Kallipos, Open Academic Editions. https://hdl.handle.net/11419/2973
Language: Greek
Is Part of: Data science through the R language
Publication Origin: Kallipos, Open Academic Editions