Title Details: | |
Technologies for Big Data Analytics |
|
Authors: |
Karakasidis, Alexandros Koloniari, Georgia Gounaris, Anastasios Papadopoulos, Apostolos |
Subject: | MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT > DATA MINING MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT > INFORMATION STORAGE AND RETRIEVAL MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > PARALLEL AND DISTRIBUTED COMPUTING > DISTRIBUTED SYSTEMS MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > OPERATING SYSTEMS > FILE SYSTEMS MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT > DATABASE SYSTEMS MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT > QUERY LANGUAGES MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > OPERATING SYSTEMS > FAULT TOLERANCE |
Keywords: |
Big data
Big data analytics Distributed and parallel systems NOSQL systems Management and analysis of data streams Management and analysis of graphs GPUs |
Description: | |
Abstract: |
This book comes to fill a large gap that exists in the Greek literature regarding the processing and analysis of big data. More specifically, it aims to present and use the most commonly used systems and techniques for managing and extracting useful knowledge from data characterized by large volume, potentially fast renewal rates and diversity in terms of their structure. In more detail, in this book we study specific application development methodologies as well as specific systems. For example, we study the MapReduce model and how it is implemented in Hadoop, Spark, and other systems. In addition, the Hadoop ecosystem is studied in detail, as well as the Spark application development methodology, while examples are given in Scala and Python. Also, NOSQL systems are presented, emphasizing the four different categories of systems, depending on their functionality. In the book we also study techniques for data stream management and how it is carried out by the systems, while in a separate chapter we study issues of graph data processing and analysis. Then, the basics of managing resources in a distributed system using different resource management systems and how the applications are run are reviewed. Also, in a separate chapter we study ways of processing data in GPU systems that are also widely used for data analysis in a parallel way. Installation details are also given in the book. The book is addressed to postgraduate students and mainly to courses directly related to topics related to the processing and analysis of big data using modern distributed and parallel systems. However, part of the book could work as a supplement in undergraduate courses with a similar subject.
|
Linguistic Editors: |
Alexopoulou, Katerina |
Technical Editors: |
Karakasidis, Alexandros Koloniari, Georgia Gounaris, Anastasios Papadopoulos, Apostolos |
Type: |
Postgraduate textbook |
Creation Date: | 16-12-2024 |
Item Details: | |
ISBN |
978-618-228-309-7 |
License: |
Attribution - NonCommercial - ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
DOI | http://dx.doi.org/10.57713/kallipos-1058 |
Handle | http://hdl.handle.net/11419/14277 |
Bibliographic Reference: | Karakasidis, A., Koloniari, G., Gounaris, A., & Papadopoulos, A. (2024). Technologies for Big Data Analytics [Postgraduate textbook]. Kallipos, Open Academic Editions. https://dx.doi.org/10.57713/kallipos-1058 |
Language: |
Greek |
Consists of: |
1. Introduction to Big Data 2. Distributed File Systems 3. Introduction to MapReduce 4. NOSQL Systems 5. Introduction to Apache Spark 6. Advanced Topics in Apache Spark 7. Data Stream Processing 8. Network Analysis 9. Resource Management and Application Execution 10. Parallelism with MapReduce in GPUs 11. Algorithmic Issues in Big Data Analytics |
Number of pages |
362 |
Publication Origin: |
Kallipos, Open Academic Editions |
You can also view | |
User comments | |
There are no published comments available! | |