1. Machine Learning Requires Probability and Statistics [Perspectives]
- Author
-
Edward R. Dougherty and Ulisses Braga-Neto
- Subjects
Training set ,business.industry ,Computer science ,Applied Mathematics ,Computation ,Probabilistic logic ,020206 networking & telecommunications ,Probability and statistics ,02 engineering and technology ,Machine learning ,computer.software_genre ,Discriminant ,Test set ,Signal Processing ,0202 electrical engineering, electronic engineering, information engineering ,Statistical inference ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,computer ,Coding (social sciences) - Abstract
Machine Learning Requires Probability and Statistics The contemporary practice of machine learning often involves the application of deterministic, computationally intensive algorithms to iteratively minimize a criterion of fit between a discriminant and sample data. There is often little interest in using probability to model the uncertainty in the problem and statistics to characterize the behavior of predictors derived from data, with the emphasis being on computation and coding. It follows that little can be stated about performance on future data, beyond perhaps a simple error count on a given test set. In this article, we argue that the knowledge imparted by deterministic computational methods is not rigorously related to the real world and, in particular, future events. This connection requires rigorous probabilistic modeling and statistical inference as well as an understanding of the proper role of computation and an appreciation of epistemological issues.
- Published
- 2020