Title Details: | |
Data science through the R language |
|
Authors: |
Verykios, Vasileios Kagklis, Vasileios Stavropoulos, Elias |
Reviewer: |
Kalles, Dimitrios |
Subject: | MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT > DATA MINING MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT > DATABASE SYSTEMS MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INTELLIGENT SYSTEMS > BASIC MACHINE LEARNING MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INTELLIGENT SYSTEMS > ADVANCED MACHINE LEARNING ENGINEERING AND TECHNOLOGY > TECHNOLOGICAL SCIENCES AND ENGINEERING > INDUSTRIAL TECHNOLOGY AND ENGINEERING > DECISION SUPPORT SYSTEMS AND EXPERT SYSTEMS |
Keywords: |
Data Mining
Statistics Data Science Big Data Analytics Machine Learning Decision Trees Association Rules Classification Clustering Data Cleaning NoSQL Databases OLAP Hadoop Mapreduce |
Description: | |
Abstract: |
The increasing accumulation of large amounts of data creates new opportunities in the following areas: science, economy, education, research, etc. In the past, either the storage of such data was not feasible or its analysis was far beyond the computational capabilities of modern computer systems. Today, with the technological convergence in the above mentioned areas, there is a need to train scientists who will be able to cope with modern needs, providing solutions to serious issues through intelligent data analysis. The purpose of this book is to document the basic principles underlying this new data science, also known as Data Mining. The book starts with a description of the current state of the art in data mining, the domains on which it is based and the new challenges. It then covers issues of multidimensional data and report generation, as well as issues of new database formats, (NoSQL systems). Data types and data quality are an integral part of the book, which together with processing and similarity measures form the basis of a data mining project. Summary statistics and visualization are the first step in data exploration. The main part of the book deals with the detailed description and analysis of alternative algorithms for performing the basic functions of Data Mining, which are Categorization, Correlation Analysis and Cluster Analysis. More advanced issues concerning the estimation of the generated models, their comparison and faster execution are also covered in the last part of the book. All topics are discussed in depth through programs in the R language. A large number of solved exercises help to understand and consolidate the concepts, while solvable problems help the student to develop a comprehensive view on the subject.
|
Linguistic Editors: |
Spanaka, Adamantia |
Graphic Editors: |
Filoni, Valentina |
Type: |
Undergraduate textbook |
Creation Date: | 2015 |
Item Details: | |
ISBN |
978-960-603-394-0 |
License: |
http://creativecommons.org/licenses/by-nc-nd/3.0/gr |
DOI | http://dx.doi.org/10.57713/kallipos-734 |
Handle | http://hdl.handle.net/11419/2965 |
Bibliographic Reference: | Verykios, V., Kagklis, V., & Stavropoulos, E. (2015). Data science through the R language [Undergraduate textbook]. Kallipos, Open Academic Editions. https://dx.doi.org/10.57713/kallipos-734 |
Language: |
Greek |
Consists of: |
1. Εισαγωγή στην Εξόρυξη Δεδομένων 2. Εισαγωγή στην R 3. Τύποι, Ποιότητα και Προεπεξεργασία Δεδομένων 4. Συνοπτική Στατιστική και Οπτικοποίηση 5. Κατηγοριοποίηση και Πρόβλεψη 6. Εξόρυξη Συχνών Στοιχειοσυνόλων και Κανόνων Συσχέτισης 7. Συσταδοποίηση 8. Υπολογιστικές Μέθοδοι για Ανάλυση Μεγάλων Δεδομένων (Hadoop και MapReduce) 9. Λυμένα Θέματα |
Number of pages |
232 |
Publication Origin: |
Kallipos, Open Academic Editions |
You can also view | |
User comments | |
There are no published comments available! | |