Loading...
Thumbnail Image
Publication

Bioframe: operations on genomic intervals in Pandas dataframes

Abdennur, Nezar
Fudenberg, Geoffrey
Flyamer, Ilya M
Galitsyna, Aleksandra A
Goloborodko, Anton
Imakaev, Maxim
Venev, Sergey V
Embargo Expiration Date
Abstract

Motivation: Genomic intervals are one of the most prevalent data structures in computational genome biology, and used to represent features ranging from genes, to DNA binding sites, to disease variants. Operations on genomic intervals provide a language for asking questions about relationships between features. While there are excellent interval arithmetic tools for the command line, they are not smoothly integrated into Python, one of the most popular general-purpose computational and visualization environments.

Results: Bioframe is a library to enable flexible and performant operations on genomic interval dataframes in Python. Bioframe extends the Python data science stack to use cases for computational genome biology by building directly on top of two of the most commonly-used Python libraries, NumPy and Pandas. The bioframe API enables flexible name and column orders, and decouples operations from data formats to avoid unnecessary conversions, a common scourge for bioinformaticians. Bioframe achieves these goals while maintaining high performance and a rich set of features.

Availability and implementation: Bioframe is open-source under MIT license, cross-platform, and can be installed from the Python Package Index. The source code is maintained by Open2C on GitHub at https://github.com/open2c/bioframe.

Source

Open2C; Abdennur N, Fudenberg G, Flyamer IM, Galitsyna AA, Goloborodko A, Imakaev M, Venev S. Bioframe: operations on genomic intervals in Pandas dataframes. Bioinformatics. 2024 Feb 1;40(2):btae088. doi: 10.1093/bioinformatics/btae088. PMID: 38402507; PMCID: PMC10903647.

Year of Medical School at Time of Visit
Sponsors
Dates of Travel
DOI
10.1093/bioinformatics/btae088
PubMed ID
38402507
Other Identifiers
Notes
Funding and Acknowledgements
Corresponding Author
Related Resources
Related Resources
Repository Citation
Rights
Copyright The Author(s) 2024. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.Attribution 4.0 International