Conclusions/Lessons Learned
- Data structures that are optimal for query software are definitely not optimal for mining software; a “two-tiered” approach to data warehousing will frequently be necessary.
- Most of the effort in data mining (possibly 85 to 95%) goes into data preparation and data cleansing.
- Optimal use of mining software requires “perfect” data, structured with fillers for missing records and missing fields.
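
As a rough illustration of the last point, the sketch below pads a small data set so that every expected record exists and every field holds a value. The function name `fill_gaps`, the `key`/`period` layout, the `filler_record` flag, and the `"MISSING"` token are all assumptions made for this example; a given mining package may expect a different filler convention.

```python
MISSING = "MISSING"  # assumed filler token; actual tools may require another sentinel


def fill_gaps(rows, keys, periods, value_fields, filler=MISSING):
    """Return one row per (key, period), adding filler records and filler field values."""
    # Index the rows that actually exist by their (key, period) pair.
    seen = {(r["key"], r["period"]): r for r in rows}
    out = []
    for key in keys:
        for period in periods:
            src = seen.get((key, period))
            # Flag rows that were absent from the input so they can be told apart later.
            row = {"key": key, "period": period, "filler_record": src is None}
            for field in value_fields:
                value = None if src is None else src.get(field)
                row[field] = filler if value is None else value
            out.append(row)
    return out


# Hypothetical input: key "A" has no record for period 2 and lacks revenue in period 3.
rows = [
    {"key": "A", "period": 1, "units": 10, "revenue": 100.0},
    {"key": "A", "period": 3, "units": 7},
    {"key": "B", "period": 2, "units": 4, "revenue": 40.0},
]
complete = fill_gaps(rows, keys=["A", "B"], periods=[1, 2, 3],
                     value_fields=["units", "revenue"])
```

Keeping the `filler_record` flag lets a later step distinguish padded rows from observed ones, which matters if the filler value could otherwise be mistaken for real data.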