<conference paper>
Online algorithms for mining semi-structured data stream

Creator
Language
Publisher
Date
Source Title
First Page
Last Page
Publication Type
Access Rights
Rights
Related DOI
Related DOI
Related URI
Related URI
Related HDL
Relation
Abstract In this paper, we study an online data mining problem from streams of semi-structured data such as XML data. Modeling semi-structured data and patterns as labeled ordered trees, we present an online a...lgorithm StreamT that receives fragments of an unseen possibly infinite semi-structured data in the document order through a data stream, and can return the current set of frequent patterns immediately on request at any time. A crucial part of our algorithm is the incremental maintenance of the occurrences of possibly frequent patterns using a tree sweeping technique. We give modifications of the algorithm to other online mining model. We present theoretical and empirical analyses to evaluate the performance of the algorithm.show more

Hide fulltext details.

pdf 01183882 pdf 541 KB 473  

Details

Record ID
Peer-Reviewed
Related URI
Subject Terms
DOI
Notes
Type
Created Date 2009.04.22
Modified Date 2020.12.09

People who viewed this item also viewed