Skip to main content Skip to book tools

Data-Mining Projects and Database Essentials

An Overview of Practical Skills for CRISP-DM

Authors

Mark Keith, PhD
Brigham Young University
Edition 1
(rev. 2812)

"Data are just summaries of thousands of stories – tell a few of those stories to help make the data meaningful." — Chip and Dan Heath

Data mining techniques are changing daily and, thus, requiring academic curriculums to constantly evolve to maintain relevance. Because of this, paper-based books full of text-based instruction are inadequate because they cannot keep up with the rate of change. Therefore, the purpose of this online book is to teach--through practice-based video tutorials--the latest and most common techniques for both descriptive and predictive data analytics. We use currently industry-leading tool, Tableau, to teach dashboard design and story telling which describes the current state of an organization based on measurable data. However, the supreme value of data is in it's ability to predict the future. This is also the most difficult and risky directive. Therefore, we begin by teaching basic methods in Excel for multiple regression and the assumptions of linear regression. Afterward, the bulk of the course is spent covering more advanced algorithms and techniques using an industry-leading tool for predictive analysis: Microsoft Azure Machine Learning Studio. We chose these tools, first, because they are mainstream industry tools that you are likely to use across a variety of industries, but second, because both come with free versions for students :)

  • Any Device, Any Location, Any Time

    The course reader looks great on desktops, laptops, tablets, phones, and just about any other device. Students are able to read, watch, and listen how, where, and when they want.
  • Embedded, Customizable Assessments

    Your students complete assessments inline with the text, making for a seamless, engaging experience. As the instructor, you can customize existing assessments or create entirely new ones within the text.
  • Advanced Analytics

    Monitor the progress of your class collectively as well as students individually. Course analytics provide you with student engagement and performance in reading, videos, and assessments. Analytics help you focus directly on those topics or students that need it.
  • Flashcards

    Flashcards are a great way to learn the key terms. Each course comes pre-loaded with cards for every term. Students can also make their own to enhance their learning experience.
  • Searchable Glossary

    A searchable glossary is available at any point in the course reader, allowing students to search for any key term they want and jump straight to that word in the text - no matter where they're reading.
  • Highlights and Notes

    Students can highlight the text and make notes throughout the content using their touch screen or mouse.
  • Text, Video, Audio, Assessments

    Course content is provided in multiple ways, allowing your students to learn the way they learn. The HTML5-based course reader works like any other web page--the way the web was meant to be used.