The versatile capabilities and large set of add-on packages make R an excellent alternative to many existing and often expensive data mining tools. Exploring this area from the perspective of a practitioner, Data Mining with R: Learning with Case Studies uses practical examples to illustrate the power of R and data mining.
Assuming no prior knowledge of R or data mining/statistical techniques, the book covers a diverse set of problems that pose different challenges in terms of size, type of data, goals of analysis, and analytical tools. To present the main data mining processes and techniques, the author takes a hands-on approach that utilizes a series of detailed, real-world case studies:
- Predicting algae blooms
- Predicting stock market returns
- Detecting fraudulent transactions
- Classifying microarray samples
With these case studies, the author supplies all necessary steps, code, and data.
Web Resource
A supporting website mirrors the do-it-yourself approach of the text. It offers a collection of freely available R source files that encompass all the code used in the case studies. The site also provides the data sets from the case studies as well as an R package of several functions.
This is certainly one of the best books for a direct implementation of data mining algorithms. Another good point of the book is that for most of the problems there are different ways to solve them. ... an invaluable resource for data miners, R programmers, as well as people involved in fields such as fraud detection and stock market prediction. If you’re serious about data mining and want to learn from experiences in the field, don’t hesitate!
―Sandro Saitta, Data Mining Research blog, May 2011
If you want to learn how to analyze your data with a free software package that has been built by expert statisticians and data miners, this is your book. A broad range of real-world case studies highlights the breadth and depth of the R software.
―Bernhard Pfahringer, University of Waikato, New Zealand
Both R novices and experts will find this a great reference for data mining.
―Intelligent Trading blog and R-bloggers, November 2010