American University
Browse

Data mining in multi-relations databases

Download (1.99 MB)
thesis
posted on 2023-08-04, 21:42 authored by Mark Yen-Ming Goh

Tools used to apply knowledge discovery to relational databases are focused on single tables. Unfortunately, the data needed for knowledge discovery is rarely isolated to a single relation. Rather, the data is spread out over several relations. Relevant data relations are to be joined in order to create a single relation called a Universal Relation (UR). However, from a data mining point of view, this could lead to many issues such as universal relations of unmanageable sizes. In this thesis, we consider the problem of knowledge discovery in multi-relation databases. In particular, we examine a knowledge discovery algorithm for multiple databases based on distributed decision tree induction, knowledge discovery algorithms based on primary and foreign keys, peculiar and surprising data, and the foreign set - which allows multi-relations mining without a primary or foreign key. Lastly, we propose extensions of these methods with the foreign set.

History

Publisher

ProQuest

Language

English

Notes

Thesis (M.S.)--American University, 2009.

Handle

http://hdl.handle.net/1961/thesesdissertations:6085

Media type

application/pdf

Access statement

Unprocessed

Usage metrics

    Theses and Dissertations

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC