Abstract
A conceptual model of a database, such as an Entity-Relationship (ER) model, is a specification of objects, their attributes, and the relationships among them. A conceptual model plays an important role in developing successful database applications. Although critical, a conceptual model of a legacy database may not always be available in practice, and discovering and constructing such a model from the data, and from the data only, is a challenging problem. In this paper, we develop a new approach to object identification and model construction. Our approach has many favorable features, including robustness to noisy data and scalability to large databases and data sets. We implement this approach in a system called McKey (Model Construction with Key identification) for discovering and building ER models from instances of large legacy databases. We apply McKey to three very large legacy databases and obtain comprehensive models within hours, which saves orders of magnitude in manpower.
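To make the idea of key identification from data instances concrete, the following is a minimal sketch and not the McKey algorithm, whose details the abstract does not give. It assumes a table is a list of row dictionaries with hashable values, and it treats a column as a candidate key when the fraction of rows with a missing or duplicated value stays within a noise tolerance. The function name `candidate_keys` and the parameter `tolerance` are hypothetical, introduced only for illustration.

```python
from typing import Dict, Hashable, List, Optional, Sequence


def candidate_keys(
    rows: Sequence[Dict[str, Optional[Hashable]]],
    tolerance: float = 0.0,
) -> List[str]:
    """Return columns whose values are unique, up to a noise tolerance.

    Illustrative assumption: a column qualifies when the share of rows
    that are missing a value or repeat an earlier value is <= tolerance.
    """
    if not rows:
        return []
    keys = []
    for col in rows[0]:
        values = [row.get(col) for row in rows]
        present = [v for v in values if v is not None]
        missing = len(values) - len(present)
        # Each repeated value beyond its first occurrence is one violation.
        duplicates = len(present) - len(set(present))
        if (missing + duplicates) / len(rows) <= tolerance:
            keys.append(col)
    return keys


rows = [
    {"id": 1, "name": "a", "dept": "x"},
    {"id": 2, "name": "b", "dept": "x"},
    {"id": 3, "name": "b", "dept": "y"},
    {"id": 3, "name": "c", "dept": "y"},  # noisy row: duplicated id
]
strict = candidate_keys(rows)
tolerant = candidate_keys(rows, tolerance=0.25)
```

With a strict tolerance, the single noisy row disqualifies every column, whereas allowing a quarter of the rows to violate uniqueness keeps `id` and `name` as candidates. That contrast is one reason a noise-tolerant criterion matters on legacy data.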
Original language | English |
---|---|
Pages (from-to) | 108-119 |
Number of pages | 12 |
Journal | Proceedings of SPIE - The International Society for Optical Engineering |
Volume | 3695 |
State | Published - 1999 |
Event | Proceedings of the 1999 Data Mining and Knowledge Discovery: Theory, Tools, and Technology - Orlando, FL, USA |
Duration | Apr 5 1999 → Apr 6 1999 |