Adobe PDF (6.94 MB)
Table of Contents - Adobe PDF (245.17 kB)
Brochure
Download
User comments
Similar Books
Title Details:
Data science through the R language
Authors: Verykios, Vasileios
Kagklis, Vasileios
Stavropoulos, Elias
Reviewer: Kalles, Dimitrios
Subject: MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT > DATA MINING
MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT
MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INFORMATION MANAGEMENT > DATABASE SYSTEMS
MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INTELLIGENT SYSTEMS > BASIC MACHINE LEARNING
MATHEMATICS AND COMPUTER SCIENCE > COMPUTER SCIENCE > INTELLIGENT SYSTEMS > ADVANCED MACHINE LEARNING
ENGINEERING AND TECHNOLOGY > TECHNOLOGICAL SCIENCES AND ENGINEERING > INDUSTRIAL TECHNOLOGY AND ENGINEERING > DECISION SUPPORT SYSTEMS AND EXPERT SYSTEMS
Keywords:
Data Mining
Statistics
Data Science
Big Data Analytics
Machine Learning
Decision Trees
Association Rules
Classification
Clustering
Data Cleaning
NoSQL Databases
OLAP
Hadoop
Mapreduce
Description:
Abstract:
The increasing accumulation of large amounts of data creates new opportunities in the following areas: science, economy, education, research, etc. In the past, either the storage of such data was not feasible or its analysis was far beyond the computational capabilities of modern computer systems. Today, with the technological convergence in the above mentioned areas, there is a need to train scientists who will be able to cope with modern needs, providing solutions to serious issues through intelligent data analysis. The purpose of this book is to document the basic principles underlying this new data science, also known as Data Mining. The book starts with a description of the current state of the art in data mining, the domains on which it is based and the new challenges. It then covers issues of multidimensional data and report generation, as well as issues of new database formats, (NoSQL systems). Data types and data quality are an integral part of the book, which together with processing and similarity measures form the basis of a data mining project. Summary statistics and visualization are the first step in data exploration. The main part of the book deals with the detailed description and analysis of alternative algorithms for performing the basic functions of Data Mining, which are Categorization, Correlation Analysis and Cluster Analysis. More advanced issues concerning the estimation of the generated models, their comparison and faster execution are also covered in the last part of the book. All topics are discussed in depth through programs in the R language. A large number of solved exercises help to understand and consolidate the concepts, while solvable problems help the student to develop a comprehensive view on the subject.
Linguistic Editors: Spanaka, Adamantia
Graphic Editors: Filoni, Valentina
Type: Undergraduate textbook
Creation Date: 2015
Item Details:
ISBN 978-960-603-394-0
License: http://creativecommons.org/licenses/by-nc-nd/3.0/gr
DOI http://dx.doi.org/10.57713/kallipos-734
Handle http://hdl.handle.net/11419/2965
Bibliographic Reference: Verykios, V., Kagklis, V., & Stavropoulos, E. (2015). Data science through the R language [Undergraduate textbook]. Kallipos, Open Academic Editions. https://dx.doi.org/10.57713/kallipos-734
Language: Greek
Consists of:
1. Εισαγωγή στην Εξόρυξη Δεδομένων
2. Εισαγωγή στην R
3. Τύποι, Ποιότητα και Προεπεξεργασία Δεδομένων
4. Συνοπτική Στατιστική και Οπτικοποίηση
5. Κατηγοριοποίηση και Πρόβλεψη
6. Εξόρυξη Συχνών Στοιχειοσυνόλων και Κανόνων Συσχέτισης
7. Συσταδοποίηση
8. Υπολογιστικές Μέθοδοι για Ανάλυση Μεγάλων Δεδομένων (Hadoop και MapReduce)
9. Λυμένα Θέματα
Number of pages 232
Publication Origin: Kallipos, Open Academic Editions
You can also view
User comments
There are no published comments available!