Discovering Unexpected Information for Technology Watch
Christine Largeron and François Jacquenet
In: ECML/PKDD 2004, 21-23 Sep 2004, Pisa, Italy.
The purpose of technology watch is to gather, process and integrate the
scientific and technical information that is useful to economic players. In
this article, we propose to use text mining techniques to automate
processing of data found in scientific text databases. The watch activity
introduces an unusual difficulty compared with conventional areas of
application for text mining techniques since, instead of searching for
frequent knowledge hidden in the texts, the target is unexpected knowledge.
As a result, the usual measures used for knowledge discovery have to be
revised. For that purpose, we have developed the UnexpectedMiner system
using new measures for to estimate the unexpectedness of a document. Our
system is evaluated using a base that contains articles
relating to the field of machine learning.
|Postscript - PASCAL Members only - Requires a viewer, such as GhostView|