Note
Go to the end to download the full example code or to run this example in your browser via Binder
Local Data Loading#
This example loads data from a local file
Load up a local data file#
from biolearn.data_library import GeoData
from biolearn.util import get_test_data_file
#Files formatted as described in the standard https://bio-learn.github.io/methylation-standard.html will load correctly.
file_path = get_test_data_file("example_data.csv")
data = GeoData.from_methylation_matrix(file_path)
data.dnam
You can now use it like any other Biolearn dataset#
data.quality_report().show()
Quality Report Summary
------------------------------------------------
Sample Count: 4
Methylation Sites: 3
Missing Methylation Data: 1 (8.33%)
Samples With High Deviation: 0 (0.00%)
Methylation Sites With Over 20% of Reads Missing: 1 (33.33%)
Notes:
------------------------------------------------
- Your data set includes methylation sites that have over 20% of reads missing. Default imputation may replace the values for all reads from this site with a gold standard.
Total running time of the script: (0 minutes 0.421 seconds)
Estimated memory usage: 36 MB