Mining for Significant Rare Events from Large Databases: Discover the Unknown Facts - Suman Katragadda - Books - LAP Lambert Academic Publishing - 9783838340258 - June 23, 2010
In case cover and title do not match, the title is correct

Mining for Significant Rare Events from Large Databases: Discover the Unknown Facts

Suman Katragadda

Mining for Significant Rare Events from Large Databases: Discover the Unknown Facts

In this work, we present a novel algorithm for extracting valuable knowledge from large databases. Rare events are difficult to mine due to very little support they possess. Our algorithm, SARG (Significant Association Rule Generator) helps us to mine for significant patterns (including rare events) from large databases by defining the support fraction per cell in the contingency table instead of per the entire contingency table. It uses a combination of both support ? confidence and chi square statistic framework for mining significant patterns from vast raw data. In this algorithm, we introduce the notion of critical attribute and critical attribute value which are passed as input parameters to the SARG algorithm to make the mining process more selective. We ran our algorithm against a huge medical file provided by the Cleveland Clinic Foundation, Cleveland, OH. We compared the results of SARG algorithm with the results produced by Brin?s chi square algorithm. Some of the results produced by SARG are unknown medical facts that are not produced by Brin's chi square algorithm.

Media Books     Paperback Book   (Book with soft cover and glued back)
Released June 23, 2010
ISBN13 9783838340258
Publishers LAP Lambert Academic Publishing
Pages 124
Dimensions 225 × 7 × 150 mm   ·   190 g
Language English