1. A Real Time Processing system for big data in astronomy: Applications to HERA
- Author
La Plante, P., Williams, P.K.G., Kolopanis, M., Dillon, J.S., Beardsley, A.P., Kern, N.S., Wilensky, M., Ali, Z.S., Abdurashidova, Z., Aguirre, J.E., Alexander, P., Balfour, Y., Bernardi, G., Billings, T.S., Bowman, J.D., Bradley, R.F., Bull, P., Burba, J., Carey, S., Carilli, C.L., Cheng, C., DeBoer, D.R., Dexter, M., de Lera Acedo, E., Ely, J., Ewall-Wice, A., Fagnoni, N., Fritz, R., Furlanetto, S.R., Gale-Sides, K., Glendenning, B., Gorthi, D., Greig, B., Grobbelaar, J., Halday, Z., Hazelton, B.J., Hewitt, J.N., Hickish, J., Jacobs, D.C., Julius, A., Kerrigan, J., Kittiwisit, P., Kohn, S.A., Lanman, A., Lekalake, T., Lewis, D., Liu, A., MacMahon, D., Malan, L., Malgas, C., Maree, M., Martinot, Z.E., Matsetela, E., Mesinger, A., Molewa, M., Morales, M.F., Mosiane, T., Murray, S., Neben, A.R., Nikolic, B., Parsons, A.R., Pascua, R., Patra, N., Pieterse, S., Pober, J.C., Razavi-Ghods, N., Ringuette, J., Robnett, J., Rosie, K., Santos, M.G., Sims, P., Smith, C., Syce, A., Thyagarajan, N., and Zheng, H.
- Abstract
As current- and next-generation astronomical instruments come online, they will generate an unprecedented deluge of data. Analyzing these data in real time presents unique conceptual and computational challenges, and their long-term storage and archiving is scientifically essential for generating reliable, reproducible results. We present here the real-time processing (RTP) system for the Hydrogen Epoch of Reionization Array (HERA), a radio interferometer endeavoring to provide the first detection of the highly redshifted 21 cm signal from Cosmic Dawn and the Epoch of Reionization by an interferometer. The RTP system consists of analysis routines run on raw data shortly after they are acquired, such as calibration and detection of radio-frequency interference (RFI) events. RTP works closely with the Librarian, the HERA data storage and transfer manager, which automatically ingests data and transfers copies to other clusters for post-processing analysis. Both the RTP system and the Librarian are public, open-source software, so they can be modified for use in other scientific collaborations. When fully constructed, HERA is projected to generate over 50 terabytes (TB) of data each night, and the RTP system enables the successful scientific analysis of these data.
- Published
- 2021