Royal Holloway, University of London

LEARNING FIXED-DIMENSION LINEAR THRESHOLDS FROM FRAGMENTED DATA
Dr Paul Goldberg, Department of Computer Science, University of Warwick

Abstract: We investigate PAC-learning in a situation in which examples (each consisting of an input vector and a 0/1 label) have some of the components of the input vector concealed from the learner. This is a special case of Restricted Focus of Attention (RFA) learning. Our interest here is in 1-RFA learning, where only a single component of the input vector is given for each example. We argue that 1-RFA learning merits special consideration within the wider field of RFA learning. It is the most restrictive form of RFA learning (so that positive results apply in general), and it models a type of "data fusion" scenario, where we have sets of observations from a number of separate sensors, but these sensors are uncorrelated sources.
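The 1-RFA setting described above can be illustrated with a minimal sketch. It is not taken from the talk: the function names, the `draw_input` callback and the convention that the learner supplies the index of the component it wishes to see are all assumptions made for illustration.

```python
import random

def linear_threshold(weights, threshold, x):
    """Return 1 if the weighted sum of x reaches the threshold, else 0."""
    return 1 if sum(w * xi for w, xi in zip(weights, x)) >= threshold else 0

def one_rfa_example(weights, threshold, draw_input, index, rng):
    """Draw one labelled example but reveal only a single input component.

    draw_input(rng) is a hypothetical sampler for the input distribution.
    index is the one component the learner is allowed to observe.
    The full vector x is used to compute the label and is then discarded.
    """
    x = draw_input(rng)
    label = linear_threshold(weights, threshold, x)
    return index, x[index], label

if __name__ == "__main__":
    rng = random.Random(0)
    uniform_square = lambda r: [r.random(), r.random()]  # assumed input distribution
    # The learner sees (component index, component value, label) only.
    for i in (0, 1, 0, 1):
        print(one_rfa_example([1.0, 1.0], 1.0, uniform_square, i, rng))
```

Each observation carries a label computed from the whole vector, yet the learner never sees two components of the same example together. This is the sense in which the data are fragmented.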

Within this setting we study the well-known class of linear threshold functions, the characteristic functions of Euclidean half-spaces. The sample complexity (i.e. sample-size requirement as a function of the parameters) of this learning problem is affected by the input distribution. We show that the sample complexity is always finite, for any given input distribution, but we also exhibit methods for defining "bad" input distributions for which the sample complexity can grow arbitrarily fast. We identify fairly general sufficient conditions for an input distribution to give rise to sample complexity that is polynomial in the PAC parameters $\epsilon^{-1}$ and $\delta^{-1}$. We give an algorithm (using an empirical $\epsilon$-cover) whose sample complexity is polynomial in these parameters and the dimension (number of inputs), for input distributions that satisfy our conditions. The runtime is polynomial in $\epsilon^{-1}$ and $\delta^{-1}$ provided that the dimension is a fixed constant. We show how to adapt the algorithm to handle uniform misclassification noise.
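For reference, the standard textbook definitions behind the terms used above can be written as follows. The symbols $w$, $\theta$, $h$, $D$, $\tilde{y}$ and $\eta$ are notation assumed here and do not come from the abstract; the talk may have used different conventions.

```latex
% Linear threshold function on d inputs (characteristic function of a half-space)
f_{w,\theta}(x) =
\begin{cases}
  1 & \text{if } w \cdot x \ge \theta, \\
  0 & \text{otherwise,}
\end{cases}
\qquad x, w \in \mathbb{R}^d,\ \theta \in \mathbb{R}.

% PAC criterion for a hypothesis h under input distribution D
\Pr\Bigl[\, \Pr_{x \sim D}\bigl[h(x) \ne f_{w,\theta}(x)\bigr] \le \epsilon \,\Bigr] \ge 1 - \delta .

% Uniform misclassification noise at rate \eta < 1/2:
% each observed label \tilde{y} is flipped independently
\tilde{y} =
\begin{cases}
  f_{w,\theta}(x)     & \text{with probability } 1 - \eta, \\
  1 - f_{w,\theta}(x) & \text{with probability } \eta .
\end{cases}
```

The sample complexity is then the number of examples needed to meet the PAC criterion, viewed as a function of $\epsilon^{-1}$, $\delta^{-1}$ and the dimension $d$.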

This seminar was held at the Department of Computer Science, Royal Holloway, University of London on 25 October, 1999.

Last updated Mon, 15-Dec-2008 14:49 GMT / PS
Department of Computer Science, University of London, Egham, Surrey TW20 0EX
Tel/Fax : +44 (0)1784 443421 /439786