XML Data Mining

Romei, Andrea and Turini, Franco (2009) XML Data Mining. Technical Report del Dipartimento di Informatica. Università di Pisa, Pisa, IT.

Available under License Creative Commons Attribution No Derivatives.

Official URL: http://compass2.di.unipi.it/TR/Files/TR-09-07.pdf....

Abstract

XML is the standard language for representing semi-structured data. With the spread of XML sources, mining XML data may become an important objective in the near future. This paper presents a project focused on designing a general-purpose query language to support mining XML data. In our framework, raw data, mining models and domain knowledge are represented as XML documents and stored in native XML databases. Data mining tasks are expressed in an extension of XQuery. Special attention is given to the frequent pattern discovery problem, and we present a way of pushing domain-dependent optimizations and efficient data structures as deep as possible into the extraction process. We report the results of a first set of experiments, showing that a good trade-off between expressiveness and efficiency in XML data mining is achievable.

Item Type:	Book
Uncontrolled Keywords:	XML; Data Mining; Knowledge Discovery; XQuery; Apriori; Constraint-based frequent itemsets
Subjects:	Area01 - Scienze matematiche e informatiche > INF/01 - Informatica
Divisions:	Dipartimenti (until 2012) > DIPARTIMENTO DI INFORMATICA
Depositing User:	dott.ssa Sandra Faita
Date Deposited:	04 Dec 2014 14:26
Last Modified:	04 Dec 2014 14:26
URI:	http://eprints.adm.unipi.it/id/eprint/2227
