Now in its second edition, this textbook introduces readers to the IBM SPSS Modeler and guides them through data mining processes and relevant statistical methods. Focusing on step-by-step tutorials and well-documented examples that help demystify complex mathematical algorithms and computer programs, it also features a variety of exercises and solutions, as well as an accompanying website with data sets and SPSS Modeler streams. While the book is intended for students, the simplicity of the Modeler makes it useful for anyone wishing to learn about basic and more advanced data mining and to put this knowledge into practice. This revised and updated second edition includes a new chapter on imbalanced data and resampling techniques as well as an extensive case study on the cross-industry standard process for data mining.
Makes data mining accessible to those without programming skills
Provides a step-by-step introduction to the IBM SPSS Modeler
Includes a new chapter on imbalanced data and resampling techniques, and an extensive case study on CRISP-DM
Features numerous explanatory illustrations and data examples, together with easy access to ready-to-use datasets and solution streams
Covers all levels of complexity, from basic statistics to advanced predictive modeling