Killifish Hypoxia RRBS Part 10
Moving forward with summarizing methylation
After meeting with Neel, we decided that a good way forward would be to run samples through BAT_summarize
since we know that step works. While BAT_overview
would give us interesting results, we could do that manually if needed. The important step would be getting the BAT_summarize
output through BAT_DMRcalling
.
Testing BAT_DMRcalling
Before proceeding, Neel suggested I take the bedGraph from my BAT_summarize
test and put it through BAT_DMRcalling
to ensure I can get DMR from the output. If I can’t, then there’s a bigger problem with the BAT_summarize
output than I originally thought. I ran the following code to try calling DMR:
Singularity> BAT_DMRcalling -q ${ALL_POP}_metilene_N_S.txt -o /scratch/06-DMR/ -a N -b S
[INFO] Fri Apr 29, 11:36:45, 2022 BAT_DMRcalling v0.1 started
[INFO] Fri Apr 29, 11:36:45, 2022 Checking flags
[INFO] Fri Apr 29, 11:36:45, 2022 Calling DMRs.
[INFO] Fri Apr 29, 11:36:45, 2022 Filtering DMRs.
[INFO] Fri Apr 29, 11:36:45, 2022 Wrote 0 DMRs with adj. p-value < 0.05, a minimum absolute difference >= 0.1, a minimum length [CpG] >= 10 and a minimum length [nt] >= 0
[INFO] Fri Apr 29, 11:36:45, 2022 Bedgraph file containing DMR difference: /scratch/06-DMR_qval.0.05.bedgraph
Bed file containing unique DMR identifier and hypo/hypermethylated annotation: /scratch/06-DMR_qval.0.05.bed
[INFO] Fri Apr 29, 11:36:45, 2022 Plotting DMR statistics (/scratch/06-DMR_qval.0.05.pdf).
I checked the output and it looked good, so the weird formatting is only an issue for the R script! This makes me think that the best solution to fix BAT_overview.R
would be to edit the source code in the singularity container somehow. But that’s for another day.
Finishing BAT_summarize
Since the populations are very different genetically, it makes the most sense to investigate oxygen conditions within populations. Comparing populations at the same oxygen conditions would also show how different populations may respond to the same stressor. Neel and I decided on the following contrasts:
- All New Bedford vs. all Scorton Creek samples, irrespective of oxygen condition (i.e. hypoxia, normoxia, outside control)
- NB OC vs. SC OC
- NB normoxia vs. SC normoxia
- NB hypoxia vs. SC hypoxia
- NB normoxia vs. NB OC
- NB normoxia vs. NB hypoxia
- SC normoxia vs. SC OC
- SC normoxia vs. SC hypoxia
I originally tried running my code directly in my SLURM script, but ran into the same issue of not being able to assign variables properly. I created this bash script for BAT_summarize
that I called within this SLURM script with this variable definition file. I ended up with all the output needed, which I saved in this directory.
Going forward
- Figure out
BAT_overview
error (add line of code to make all column data numerical to BAT_overview.R?) - Investigate any issues with
BAT_filtering
output bedGraphs - Write methods and results
- Obtain other important methylation landscape information
- Start RNA-Seq analysis
- Create OSF repository for all intermediate files