Data Mining: Concepts and Techniques provides the concepts and techniques in processing gathered data or information, which will be used in various applications. Specifically, it explains data mining and the tools used in discovering knowledge from the collected data. This book is referred as the knowledge discovery from data (KDD). It focuses on the feasibility, usefulness, effectiveness, and scalability of techniques of large data sets. After describing data mining, this edition explains the methods of knowing, preprocessing, processing, and warehousing data. It then presents information about data warehouses, online analytical processing (OLAP), and data cube technology. Then, the methods involved in mining frequent patterns, associations, and correlations for large data sets are described. The book details the methods for data classification and introduces the concepts and methods for data clustering. The remaining chapters discuss the outlier detection and the trends, applications, and research frontiers in data mining.
This book is intended for Computer Science students, application developers, business professionals, and researchers who seek information on data mining.
- Presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects
- Addresses advanced topics such as mining object-relational databases, spatial databases, multimedia databases, time-series databases, text databases, the World Wide Web, and applications in several fields
- Provides a comprehensive, practical look at the concepts and techniques you need to get the most out of your data
Chapter 1 Introduction
Chapter 2. Getting to Know Your Data
Chapter 3. Preprocessing: Data Reduction, Transformation, and Integration
Chapter 4. Data Warehousing and On-Line Analytical Processing
Chapter 5. Data Cube Technology
Chapter 6. Mining Frequent Patterns, Associations and Correlations: Concepts and
Chapter 7. Advanced Frequent Pattern Mining
Chapter 8. Classification: Basic Concepts
Chapter 9. Classification: Advanced Methods
Chapter 10. Cluster Analysis: Basic Concepts and Methods
Chapter 11. Cluster Analysis: Advanced Methods
Chapter 12. Outlier Analysis
Chapter 13. Trends and Research Frontiers in Data Mining
Jiawei Han is Professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign. Well known for his research in the areas of data mining and database systems, he has received many awards for his contributions in the field, including the 2004 ACM SIGKDD Innovations Award. He has served as Editor-in-Chief of ACM Transactions on Knowledge Discovery from Data, and on editorial boards of several journals, including IEEE Transactions on Knowledge and Data Engineering and Data Mining and Knowledge Discovery.
Micheline Kamber is a researcher with a passion for writing in easy-to-understand terms. She has a master's degree in computer science (specializing in artificial intelligence) from Concordia University, Canada.
Jian Pei is currently a Canada Research Chair (Tier 1) in Big Data Science and a Professor in the School of Computing Science at Simon Fraser University. He is also an associate member of the Department of Statistics and Actuarial Science. He is a well-known leading researcher in the general areas of data science, big data, data mining, and database systems. His expertise is on developing effective and efficient data analysis techniques for novel data intensive applications. He is recognized as a Fellow of the Association of Computing Machinery (ACM) for his "contributions to the foundation, methodology and applications of data mining and as a Fellow of the Institute of Electrical and Electronics Engineers (IEEE) for his "contributions to data mining and knowledge discovery. He is the editor-in-chief of the IEEE Transactions of Knowledge and Data Engineering (TKDE), a director of the Special Interest Group on Knowledge Discovery in Data (SIGKDD) of the Association for Computing Machinery (ACM), and a general co-chair or program committee co-chair of many premier conferences.