1. A scalable and modular automated pipeline for stitching of large electron microscopy datasets
- Author
-
Sharmishtaa Seshamani, Sam Kinn, Clay Reid, Julie Nyhus, Wenjing Yin, Eric T. Trautman, Stephan Saalfeld, JoAnn Buchanan, Stephen J. Smith, Khaled Khairy, R. P. Gwinn, Russel Torres, Tim P. Fliss, N. Macarico da Costa, Daniel J. Bumbarger, Gayathri Mahalingam, Marc Takeno, Ed S. Lein, Daniel Kapner, R.D. Young, Forrest Collman, and Eric Perlman
- Subjects
Assembly software ,General Immunology and Microbiology ,business.industry ,Computer science ,Pipeline (computing) ,General Neuroscience ,Volume (computing) ,Brain ,General Medicine ,Workflow engine ,General Biochemistry, Genetics and Molecular Biology ,Rendering (computer graphics) ,Image stitching ,Mice ,Microscopy, Electron ,Software ,Drosophila melanogaster ,Image Processing, Computer-Assisted ,Animals ,Humans ,business ,Throughput (business) ,Computer hardware ,Algorithms - Abstract
Serial-section electron microscopy (ssEM) is the method of choice for studying macroscopic biological samples at extremely high resolution in three dimensions. In the nervous system, nanometer-scale images are necessary to reconstruct dense neural wiring diagrams in the brain, so calledconnectomes. In order to use this data, consisting of up to 108individual EM images, it must be assembled into a volume, requiring seamless 2D stitching from each physical section followed by 3D alignment of the stitched sections. The high throughput of ssEM necessitates 2D stitching to be done at the pace of imaging, which currently produces tens of terabytes per day. To achieve this, we present a modular volume assembly software pipelineASAP(Assembly Stitching and Alignment Pipeline) that is scalable to datasets containing petabytes of data and parallelized to work in a distributed computational environment. The pipeline is built on top of theRender[18] services used in the volume assembly of the brain of adultDrosophila melanogaster[2]. It achieves high throughput by operating on the meta-data and transformations of each image stored in a database, thus eliminating the need to render intermediate output. ASAP is modular, allowing for easy incorporation of new algorithms without significant changes in the workflow. The entire software pipeline includes a complete set of tools for stitching, automated quality control, 3D section alignment, and final rendering of the assembled volume to disk. ASAP has been deployed for continuous processing of several large-scale datasets of the mouse visual cortex and human brain samples including one cubic millimeter of mouse visual cortex [1, 25] at speeds that exceed imaging. The pipeline also has multi-channel processing capabilities and can be applied to fluorescence and multi-modal datasets like array tomography.
- Published
- 2022