The collation of large electronic databases of scienti?c and commercial
infor- tion has led to a dramatic growth of interest in methods for
discovering struc- res in such databases. These methods often go under
the general name of data mining. One important subdiscipline within data
mining is concerned with the identi?cation and detection of anomalous,
interesting, unusual, or valuable - cords or groups of records, which we
call patterns. Familiar examples are the detection of fraud in
credit-card transactions, of particular coincident purchases in
supermarket transactions, of important nucleotide sequences in gene
sequence analysis, and of characteristic traces in EEG records. Tools
for the detection of such patterns have been developed within the data
mining community, but also within other research communities, typically
without an awareness that the - sic problem was common to many
disciplines. This is not unreasonable: each of these disciplines has a
large literature of its own, and a literature which is growing rapidly.
Keeping up with any one of these is di?cult enough, let alone keeping up
with others as well, which may in any case be couched in an - familiar
technical language. But, of course, this means that opportunities are
being lost, discoveries relating to the common problem made in one area
are not transferred to the other area, and breakthroughs and problem
solutions are being rediscovered, or not discovered for a long time,
meaning that e?ort is being wasted and opportunities may be lost.