Back to Search
Start Over
KARAJ: An Efficient Adaptive Multi-Processor Tool to Streamline Genomic and Transcriptomic Sequence Data Acquisition.
- Source :
-
International Journal of Molecular Sciences . Nov2022, Vol. 23 Issue 22, p14418. 9p. - Publication Year :
- 2022
-
Abstract
- Here we developed KARAJ, a fast and flexible Linux command-line tool to automate the end-to-end process of querying and downloading a wide range of genomic and transcriptomic sequence data types. The input to KARAJ is a list of PMCIDs or publication URLs or various types of accession numbers to automate four tasks as follows; firstly, it provides a summary list of accessible datasets generated by or used in these scientific articles, enabling users to select appropriate datasets; secondly, KARAJ calculates the size of files that users want to download and confirms the availability of adequate space on the local disk; thirdly, it generates a metadata table containing sample information and the experimental design of the corresponding study; and lastly, it enables users to download supplementary data tables attached to publications. Further, KARAJ provides a parallel downloading framework powered by Aspera connect which reduces the downloading time significantly. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 16616596
- Volume :
- 23
- Issue :
- 22
- Database :
- Academic Search Index
- Journal :
- International Journal of Molecular Sciences
- Publication Type :
- Academic Journal
- Accession number :
- 160433041
- Full Text :
- https://doi.org/10.3390/ijms232214418