1. Dealing with Multi-Dimensional Data and the Burden of Annotation
- Author
-
Benjamin R. Mitchell, Stanley N. Cohen, and Marion C. Cohen
- Subjects
Pathology ,medicine.medical_specialty ,Artificial neural network ,business.industry ,Computer science ,Deep learning ,02 engineering and technology ,Bottleneck ,Field (computer science) ,Pathology and Forensic Medicine ,Hebbian theory ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,medicine ,Leverage (statistics) ,Reinforcement learning ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Transfer of learning - Abstract
The need for huge data sets represents a bottleneck for the application of artificial intelligence. Substantially fewer annotated target lesions than normal tissues for comparison present an additional problem in the field of pathology. Organic brains overcome these limitations by utilizing large numbers of specialized neural nets arranged in both linear and parallel fashion, with each solving a restricted classification problem. They rely on local Hebbian error corrections as compared to the nonlocal back-propagation used in most artificial neural nets, and leverage reinforcement. For these reasons, even toddlers are able to classify objects after only a few examples. Rather than provide an overview of current AI research in pathology, this review focuses on general strategies for overcoming the data bottleneck. These include transfer learning, zero-shot learning, Siamese networks, one-class models, generative networks, and reinforcement learning. Neither an extensive mathematic background nor advanced programing skills are needed to make these subjects accessible to pathologists. However, some familiarity with the basic principles of deep learning, briefly reviewed here, is expected to be useful in understanding both the current limitations of machine learning and determining ways to address them.
- Published
- 2021
- Full Text
- View/download PDF