1. Learning General Halfspaces with General Massart Noise under the Gaussian Distribution
- Author
-
Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, and Nikos Zarifis
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Massart Noise ,Statistics - Machine Learning ,Computer Science - Data Structures and Algorithms ,PAC learning ,FOS: Mathematics ,Halfspaces ,Mathematics - Statistics Theory ,Data Structures and Algorithms (cs.DS) ,Machine Learning (stat.ML) ,Statistics Theory (math.ST) ,Machine Learning (cs.LG) - Abstract
We study the problem of PAC learning halfspaces on $\mathbb{R}^d$ with Massart noise under the Gaussian distribution. In the Massart model, an adversary is allowed to flip the label of each point $\mathbf{x}$ with unknown probability $\eta(\mathbf{x}) \leq \eta$, for some parameter $\eta \in [0,1/2]$. The goal is to find a hypothesis with misclassification error of $\mathrm{OPT} + \epsilon$, where $\mathrm{OPT}$ is the error of the target halfspace. This problem had been previously studied under two assumptions: (i) the target halfspace is homogeneous (i.e., the separating hyperplane goes through the origin), and (ii) the parameter $\eta$ is strictly smaller than $1/2$. Prior to this work, no nontrivial bounds were known when either of these assumptions is removed. We study the general problem and establish the following: For $\eta, Comment: Revised presentation
- Published
- 2021
- Full Text
- View/download PDF