PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

How many missing answers can be tolerated by query learners?
Hans Simon
Theory of Computing Systems, Volume 37, Number 1, pp. 77-94, 2004.


We consider the model of exact learning using an equivalence oracle and an incomplete membership oracle. In this model, a random subset of the learner's membership queries is left unanswered. Our results are as follows. First, we analyze the obvious method for coping with missing answers: searching exhaustively through all possible "answer patterns" associated with the unanswered queries. Second, we present two specific concept classes that are efficiently learnable using an equivalence oracle and a completely reliable membership oracle, but are provably not polynomially learnable if the membership oracle becomes slightly incomplete. The first class demonstrates that, despite its apparent simplicity, the exhaustive search through all possible answer patterns cannot be substantially improved in general. The second class demonstrates that the incomplete membership oracle can be rendered useless even if it leaves only a fraction $1/\mathrm{poly}(n)$ of all queries unanswered. Finally, we present a learning algorithm for monotone DNF formulas that can cope with a relatively large fraction of missing answers (more than sixty percent), yet is as efficient (in terms of run-time and number of queries) as the classical algorithm whose questions are always answered reliably.
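
To make the exhaustive search over answer patterns concrete, the following is a minimal Python sketch, not the algorithm analyzed in the paper. It assumes a hypothetical query log that maps each queried point to True, False, or None, where None marks a membership query the incomplete oracle left unanswered; the function name `complete_answers` and the sample log are illustrative only. With $k$ unanswered queries there are $2^k$ candidate patterns, which is the cost the abstract refers to.

```python
from itertools import product


def complete_answers(answers):
    """Yield every completion of a partially answered query log.

    `answers` is a hypothetical log mapping each queried point to
    True, False, or None (None = the oracle gave no answer).
    Each yielded dict fixes one possible answer pattern for the
    unanswered queries and keeps the answered ones unchanged.
    """
    missing = [q for q, a in answers.items() if a is None]
    # One pattern per assignment of False/True to the unanswered queries.
    for pattern in product((False, True), repeat=len(missing)):
        completed = dict(answers)
        completed.update(zip(missing, pattern))
        yield completed


# Illustrative log: two of the four membership queries went unanswered.
log = {"000": False, "011": None, "101": True, "110": None}
completions = list(complete_answers(log))
print(len(completions))  # 2 unanswered queries -> 4 candidate patterns
```

A learner following this naive strategy would run its noise-free procedure once per completion, so the overhead grows exponentially in the number of unanswered queries.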

EPrint Type: Article
Project Keyword: UNSPECIFIED
Subjects: Theory & Algorithms
ID Code: 2217
Deposited By: Hans Simon
Deposited On: 29 September 2006