| Title Details: | |
|
Computational Methods for Big Data Analysis (Hadoop and MapReduce) |
|
| Authors: |
Verykios, Vasileios Kagklis, Vasileios Stavropoulos, Ilias |
| Reviewer: |
Kalles, Dimitrios |
| Description: | |
| Abstract: |
The solution of Hadoop & MapReduce and their associated applications includes among others a simple but very powerful method for the processing and analysis of extremely large data sets, even up to the level of multiple PetaBytes. At its core, MapReduce is a process for combining data from multiple inputs (creating the "map"), and then reducing (reduce) it using a function that will refine the desired results. This chapter presents multiple use cases of Hadoop & MapReduce and their applications for multi-TB and even multi-PB instances; Hadoop uses a distributed file system, HDFS. The Hadoop/MapReduce system is useful for data that is less structured such as Internet pages or multiple document or image data that is not fully organized-structured.
|
| Type: |
Chapter |
| Creation Date: | 2015 |
| Item Details: | |
| License: |
http://creativecommons.org/licenses/by-nc-nd/3.0/gr |
| Handle | http://hdl.handle.net/11419/2973 |
| Bibliographic Reference: | Verykios, V., Kagklis, V., & Stavropoulos, I. (2015). Computational Methods for Big Data Analysis (Hadoop and MapReduce) [Chapter]. In Verykios, V., Kagklis, V., & Stavropoulos, E. 2015. Data science through the R language [Undergraduate textbook]. Kallipos, Open Academic Editions. https://hdl.handle.net/11419/2973 |
| Language: |
Greek |
| Is Part of: |
Data science through the R language |
| Publication Origin: |
Kallipos, Open Academic Editions |
