DML Analysis Part 19
methylKit gene background
The next big step in the C. virginica project is to conduct a gene enrichment (if necessary…we’ll get to this later). I started this notebook with that intention. Before I could get into gene enrichment or description, I needed to characterize overlaps between the gene background used in
methylKit and various genome feature files. I started this small analysis in December, but didn’t follow through because I was focusing on my MEPS resubmission.
The gene background from
methylKit is formed using
unite. I took this background and saved it as a BEDfile in this R Markdown file. I used
intersectBed to find the overlaps between the gene background, exons, introns, mRNA coding regions, and transposable elements. Finally, I calculated the overlap proportions. My next step is to use these overlap proportions, and overlap proportions from DML and DMR to conduct a proportion test.
- Conduct a proportion test with gene background and DML/DMR overlaps with genome feature files
- See how
min_cov, alignment stringency, or SNPs affect clustering
- Determine if a formal gene enrichment is necessary
- If necessary, select the most appropriate gene enrichment method
- Describe functions of most interesting genes with DML and DMR