Tag Archives: BS-seq

SRA Submission – Olympia oyster Whole Genome BS-seq Data

Submitted our whole genome bisulfite sequencing data to NCBI Sequence Read Archive (SRA).

Relevant SRA info is below.

Have updated nightingales Google Sheet with SRA info.

SAMPLE	SRA (Study)	BioProject	BioSample
1NF11	SRP163248	PRJNA494552	SAMN10172233
1NF15	SRP163248	PRJNA494552	SAMN10172234
1NF16	SRP163248	PRJNA494552	SAMN10172235
1NF17	SRP163248	PRJNA494552	SAMN10172236
2NF5	SRP163248	PRJNA494552	SAMN10172237
2NF6	SRP163248	PRJNA494552	SAMN10172238
2NF7	SRP163248	PRJNA494552	SAMN10172239
2NF8	SRP163248	PRJNA494552	SAMN10172240

DNA Methylation Analysis – Bismark Pipeline on All Olympia oyster BSseq Datasets

0000-0002-2747-368X

Bismark analysis of all of our current Olympia oyster (Ostrea lurida) DNA methylation high-throughput sequencing data.

Analysis was run on Emu (Ubuntu 16.04 LTS, Apple Xserve). The primary analysis took ~14 days to complete.

All operations are documented in a Jupyter notebook (GitHub):

20180709_emu_oly_methylation_mapping.ipynb

Genome used:

Olurida_v080.fa ( run was initiated prior to creation of v081; see Genomic Resources wiki for more info )

Input files ( see Olympia oyster Genomic GitHub wiki for more info ):

WG BSseq of Fidalgo Bay offspring grown in Fidalgo Bay & Oyster Bay

1_ATCACG_L001_R1_001.fastq.gz
2_CGATGT_L001_R1_001.fastq.gz
3_TTAGGC_L001_R1_001.fastq.gz
4_TGACCA_L001_R1_001.fastq.gz
5_ACAGTG_L001_R1_001.fastq.gz
6_GCCAAT_L001_R1_001.fastq.gz
7_CAGATC_L001_R1_001.fastq.gz
8_ACTTGA_L001_R1_001.fastq.gz

MBDseq of two populations (Hood Canal & Oyster Bay) grown in Clam Bay

zr1394_10_s456.fastq.gz
zr1394_11_s456.fastq.gz
zr1394_12_s456.fastq.gz
zr1394_13_s456.fastq.gz
zr1394_14_s456.fastq.gz
zr1394_15_s456.fastq.gz
zr1394_16_s456.fastq.gz
zr1394_17_s456.fastq.gz
zr1394_18_s456.fastq.gz
zr1394_1_s456.fastq.gz
zr1394_2_s456.fastq.gz
zr1394_3_s456.fastq.gz
zr1394_4_s456.fastq.gz
zr1394_5_s456.fastq.gz
zr1394_6_s456.fastq.gz
zr1394_7_s456.fastq.gz
zr1394_8_s456.fastq.gz
zr1394_9_s456.fastq.gz

RESULTS:

With Bismark complete, the two data sets can now be analyzed further (and separately, since they are separate experiments) with tools such as MethylKit (R package) and the Integrative Genomics Viewer (IGV).

Output folder:

owl/Athaliana/20180709_oly_methylseq

Bismark Summary Report:

20180709_oly_methylseq/bismark_summary_report.html

Individual Sample Reports:

BS-seq Mapping – Olympia oyster bisulfite sequencing: Bismark Continued

Previously took the analysis just through the mapping, but didn’t realize Steven wanted me to fully process the data.

So, as an exercise, I followed through with deduplication and sorting of the BAM files.

Then ran a quick analysis with MethylKit in R. The analysis simply copies what Steven did with another data set; I haven’t examined it thoroughly, so I’m not yet well-versed in what it’s doing or why.

Jupyter Notebook (GitHub):

20180530_emu_oly_methylation_mapping_deduplication.ipynb

R Studio Project (download the folder, load project in R Studio, and then run the script in the scripts subdirectory to run the analysis):

20180531_oly_methylkit/

Will take the full data sets through this whole pipeline.

TrimGalore/FastQC/MultiQC – TrimGalore! RRBS Geoduck BS-seq FASTQ data (directional)

Earlier this week, I ran TrimGalore! but, due to a copy/paste mistake, incorrectly set the trimming as --non_directional, so I re-ran it with the correct settings.

Steven requested that I trim the Geoduck RRBS libraries that we have, in preparation to run them through Bismark.

These libraries were originally created by Hollie Putnam using the TruSeq DNA Methylation Kit (Illumina):

project_juvenile_geoduck_OA/Sample_Processing (GitHub)

All analysis is documented in a Jupyter Notebook; see link below.

Overview of process:

Run TrimGalore! with --paired and --rrbs settings.
Run FastQC and MultiQC on trimmed files.
Copy all data to owl (see Results below for link).
Confirm data integrity via MD5 checksums.

Jupyter Notebook:

20180516_roadrunner_geoduck_RRBS_trimming.ipynb (GitHub)

Results:

TrimGalore/FastQC/MultiQC – TrimGalore! RRBS Geoduck BS-seq FASTQ data

20180516 – UPDATE!!

THIS WAS RUN WITH THE INCORRECT SETTING IN TRIMGALORE! `--non_directional`

WILL RE-RUN

Steven requested that I trim the Geoduck RRBS libraries that we have, in preparation to run them through Bismark.

These libraries were originally created by Hollie Putnam using the TruSeq DNA Methylation Kit (Illumina):

project_juvenile_geoduck_OA/Sample_Processing (GitHub)

All analysis is documented in a Jupyter Notebook; see link below.

Overview of process:

Copy EPI* FastQ files from owl/P_generosa to roadrunner.
Confirm data integrity via MD5 checksums.
Run TrimGalore! with --paired, --rrbs, and --non_directional settings.
Run FastQC and MultiQC on trimmed files.
Copy all data to owl (see Results below for link).
Confirm data integrity via MD5 checksums.

Jupyter Notebook:

20180514_roadrunner_geoduck_RRBS_trimming.ipynb (GitHub)

Results:

TrimGalore! output folder:

20180514_geoduck_trimgalore_rrbs

FastQC output folder:

20180514_geoduck_trimgalore_rrbs/20180514_geoduck_trimmed_fastqc/

MultiQC output folder:

20180514_geoduck_trimgalore_rrbs/20180514_geoduck_trimmed_fastqc/multiqc_data

MultiQC report (HTML):

multiqc_report.html

BS-seq Mapping – Olympia oyster bisulfite sequencing: TrimGalore > FastQC > Bismark

Steven asked me to evaluate our methylation sequencing data sets for Olympia oyster.

According to our Olympia oyster genome wiki, we have the following two sets of BS-seq data:

All computing was conducted on our Apple Xserve: emu.

All steps were documented in this Jupyter Notebook (GitHub): 20180503_emu_oly_methylation_mapping.ipynb

NOTE: The Jupyter Notebook linked above is very large, so it will not render on GitHub. Download it and open it on a computer that can run Jupyter Notebooks.

Here’s a brief overview of what was done.

Samples were trimmed with TrimGalore and then evaluated with FastQC. MultiQC was used to generate a nice visual summary report of all samples.

The Olympia oyster genome assembly, pbjelly_sjw_01, was used as the reference genome and was prepared for use in Bismark:


/home/shared/Bismark-0.19.1/bismark_genome_preparation \
--path_to_bowtie /home/shared/bowtie2-2.3.4.1-linux-x86_64/ \
--verbose /home/sam/data/oly_methylseq/oly_genome/ \
2> 20180507_bismark_genome_prep.err

Bismark was run on trimmed samples with the following command:


/home/shared/Bismark-0.19.1/bismark \
--path_to_bowtie /home/shared/bowtie2-2.3.4.1-linux-x86_64/ \
--genome /home/sam/data/oly_methylseq/oly_genome/ \
-u 1000000 \
-p 16 \
--non_directional \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/1_ATCACG_L001_R1_001_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/2_CGATGT_L001_R1_001_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/3_TTAGGC_L001_R1_001_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/4_TGACCA_L001_R1_001_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/5_ACAGTG_L001_R1_001_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/6_GCCAAT_L001_R1_001_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/7_CAGATC_L001_R1_001_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/8_ACTTGA_L001_R1_001_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_10_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_11_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_12_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_13_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_14_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_15_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_16_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_17_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_18_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_1_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_2_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_3_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_4_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_5_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_6_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_7_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_8_s456_trimmed.fq.gz \
/home/sam/analyses/20180503_oly_methylseq_trimgalore/zr1394_9_s456_trimmed.fq.gz \
2> 20180507_bismark_02.err
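The command above lists every trimmed file by hand. As a rough sketch only, the same list could be built from a glob. The function name build_bismark_cmd, the /path/to/genome placeholder, and the empty demo files are made up for illustration; the Bismark flags are the ones used above (minus --path_to_bowtie). The function only prints the command, it does not run Bismark.

```shell
# Print (do not run) a Bismark command for every trimmed FASTQ in a directory.
build_bismark_cmd() {
    genome_dir=$1
    reads_dir=$2
    printf 'bismark --genome %s -u 1000000 -p 16 --non_directional' "$genome_dir"
    for fq in "$reads_dir"/*_trimmed.fq.gz; do
        printf ' %s' "$fq"
    done
    printf '\n'
}

# Empty placeholder files, only so the glob has something to match.
tmpdir=$(mktemp -d)
touch "$tmpdir/1_ATCACG_L001_R1_001_trimmed.fq.gz" "$tmpdir/zr1394_1_s456_trimmed.fq.gz"
demo_cmd=$(build_bismark_cmd /path/to/genome "$tmpdir")
echo "$demo_cmd"
rm -rf "$tmpdir"
```

Printing the command first makes it easy to check the file list before committing to a long run.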

Results:

TrimGalore output folder:

20180503_oly_methylseq_trimgalore

FastQC output folder:

20180503_oly_methylseq_trimgalore/20180503_trim_fastqc/

MultiQC output folder:

20180503_oly_methylseq_trimgalore/20180503_trim_fastqc/multiqc_data/

MultiQC Report (HTML):

20180503_oly_methylseq_trimgalore/20180503_trim_fastqc/multiqc_data/multiqc_report.html

Bismark genome folder: 20180503_oly_genome_pbjelly_sjw_01_bismark/

Bismark output folder:

20180507_oly_methylseq_bismark

Whole genome BS-seq (2015)

Prep overview

Library prep: Roberts Lab
Sequencing: Genewiz

Bismark Report	Mapping Percentage
1_ATCACG_L001_R1_001_trimmed_bismark_bt2_SE_report.txt	40.3%
2_CGATGT_L001_R1_001_trimmed_bismark_bt2_SE_report.txt	39.9%
3_TTAGGC_L001_R1_001_trimmed_bismark_bt2_SE_report.txt	40.2%
4_TGACCA_L001_R1_001_trimmed_bismark_bt2_SE_report.txt	40.4%
5_ACAGTG_L001_R1_001_trimmed_bismark_bt2_SE_report.txt	39.9%
6_GCCAAT_L001_R1_001_trimmed_bismark_bt2_SE_report.txt	39.6%
7_CAGATC_L001_R1_001_trimmed_bismark_bt2_SE_report.txt	39.9%
8_ACTTGA_L001_R1_001_trimmed_bismark_bt2_SE_report.txt	39.7%

MBD BS-seq (2015)

Prep overview

MBD: Roberts Lab
Library prep: ZymoResearch
Sequencing: ZymoResearch

Bismark Report	Mapping Percentage
zr1394_1_s456_trimmed_bismark_bt2_SE_report.txt	33.0%
zr1394_2_s456_trimmed_bismark_bt2_SE_report.txt	34.1%
zr1394_3_s456_trimmed_bismark_bt2_SE_report.txt	32.5%
zr1394_4_s456_trimmed_bismark_bt2_SE_report.txt	32.8%
zr1394_5_s456_trimmed_bismark_bt2_SE_report.txt	35.2%
zr1394_6_s456_trimmed_bismark_bt2_SE_report.txt	35.5%
zr1394_7_s456_trimmed_bismark_bt2_SE_report.txt	32.8%
zr1394_8_s456_trimmed_bismark_bt2_SE_report.txt	33.0%
zr1394_9_s456_trimmed_bismark_bt2_SE_report.txt	34.7%
zr1394_10_s456_trimmed_bismark_bt2_SE_report.txt	34.9%
zr1394_11_s456_trimmed_bismark_bt2_SE_report.txt	30.5%
zr1394_12_s456_trimmed_bismark_bt2_SE_report.txt	35.8%
zr1394_13_s456_trimmed_bismark_bt2_SE_report.txt	32.5%
zr1394_14_s456_trimmed_bismark_bt2_SE_report.txt	30.8%
zr1394_15_s456_trimmed_bismark_bt2_SE_report.txt	31.3%
zr1394_16_s456_trimmed_bismark_bt2_SE_report.txt	30.7%
zr1394_17_s456_trimmed_bismark_bt2_SE_report.txt	32.4%
zr1394_18_s456_trimmed_bismark_bt2_SE_report.txt	34.9%
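A table like the ones above could be pulled from the report files with a few lines of shell. This is a sketch, not the method used in the notebook: it assumes each report contains a line starting with "Mapping efficiency:" followed by the percentage, and the function name mapping_table and the stand-in report are invented for the demonstration.

```shell
# Print "report<TAB>percentage" for each Bismark report passed as an argument.
# Assumes a line of the form "Mapping efficiency:<TAB>40.3%" in each report.
mapping_table() {
    for report in "$@"; do
        pct=$(grep '^Mapping efficiency:' "$report" | awk '{print $NF}')
        printf '%s\t%s\n' "$(basename "$report")" "$pct"
    done
}

# Stand-in report for demonstration only (not real Bismark output).
tmpdir=$(mktemp -d)
printf 'Sequences analysed in total:\t1000000\nMapping efficiency:\t40.3%%\n' \
    > "$tmpdir/demo_bismark_bt2_SE_report.txt"
demo_row=$(mapping_table "$tmpdir"/*_SE_report.txt)
echo "$demo_row"
rm -rf "$tmpdir"
```

With real data, the argument would be the *_SE_report.txt files in the Bismark output folder.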

FastQC/MultiQC – C. virginica MBD BS-seq Data

Per Steven’s GitHub Issues request, I ran FastQC on the Eastern oyster MBD bisulfite sequencing data we recently got back from ZymoResearch.

Ran FastQC locally with the following script: 20180409_fastqc_Cvirginica_MBD.sh


#!/bin/bash
/home/sam/software/FastQC/fastqc \
--threads 18 \
--outdir /home/sam/20180409_fastqc_Cvirginica_MBD \
/mnt/owl/nightingales/C_virginica/zr2096_10_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_10_s1_R2.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_1_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_1_s1_R2.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_2_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_2_s1_R2.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_3_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_3_s1_R2.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_4_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_4_s1_R2.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_5_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_5_s1_R2.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_6_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_6_s1_R2.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_7_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_7_s1_R2.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_8_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_8_s1_R2.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_9_s1_R1.fastq.gz \
/mnt/owl/nightingales/C_virginica/zr2096_9_s1_R2.fastq.gz

MultiQC was then run on the FastQC output files.

All files were moved to Owl after the jobs completed.

Results:

FastQC Output folder: 20180409_fastqc_Cvirginica_MBD/

MultiQC Output folder: 20180409_fastqc_Cvirginica_MBD/multiqc_data/

MultiQC report (HTML): 20180409_fastqc_Cvirginica_MBD/multiqc_data/multiqc_report.html

Everything looks good to me.

Steven’s interested in seeing what the trimmed output would look like (and, how it would impact mapping efficiencies). Will initiate trimming.

See the GitHub issue linked above for the full discussion.

Data Received – Crassostrea virginica MBD BS-seq from ZymoResearch

Received the sequencing data from ZymoResearch for the Crassostrea virginica gonad MBD DNA that was sent to them on 20180207 for bisulfite conversion, library construction, and sequencing.

Gzipped FASTQ files were:

downloaded to Owl/nightingales/C_virginica
MD5 checksums verified
MD5 checksums appended to the checksums.md5 file
readme.md file updated
Updated nightingales Google Sheet
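The verify-then-append step can be sketched roughly as follows, assuming GNU md5sum is available (on macOS the equivalent tool is md5). The file names demo.fastq.gz and received.md5 are placeholders, and the checksum here is generated from the demo file itself, so it verifies trivially; with real data, received.md5 would come from the sequencing provider.

```shell
tmpdir=$(mktemp -d)
printf 'placeholder contents\n' > "$tmpdir/demo.fastq.gz"
# Stand-in for the checksum file supplied with the data.
( cd "$tmpdir" && md5sum demo.fastq.gz > received.md5 )

# Only append to the running checksums file when verification succeeds.
if ( cd "$tmpdir" && md5sum -c received.md5 > /dev/null 2>&1 ); then
    cat "$tmpdir/received.md5" >> "$tmpdir/checksums.md5"
    md5_status="verified"
else
    md5_status="mismatch"
fi
echo "$md5_status"
recorded=$(wc -l < "$tmpdir/checksums.md5" | tr -d ' ')
rm -rf "$tmpdir"
```

Appending only after a successful check keeps unverified entries out of checksums.md5.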

Here’s the list of files received:

zr2096_10_s1_R1.fastq.gz
zr2096_10_s1_R2.fastq.gz
zr2096_1_s1_R1.fastq.gz
zr2096_1_s1_R2.fastq.gz
zr2096_2_s1_R1.fastq.gz
zr2096_2_s1_R2.fastq.gz
zr2096_3_s1_R1.fastq.gz
zr2096_3_s1_R2.fastq.gz
zr2096_4_s1_R1.fastq.gz
zr2096_4_s1_R2.fastq.gz
zr2096_5_s1_R1.fastq.gz
zr2096_5_s1_R2.fastq.gz
zr2096_6_s1_R1.fastq.gz
zr2096_6_s1_R2.fastq.gz
zr2096_7_s1_R1.fastq.gz
zr2096_7_s1_R2.fastq.gz
zr2096_8_s1_R1.fastq.gz
zr2096_8_s1_R2.fastq.gz
zr2096_9_s1_R1.fastq.gz
zr2096_9_s1_R2.fastq.gz

Here’s the sample processing history:

Data Received – Bisulfite-treated Illumina Sequencing from Genewiz

Received notice the sequencing data was ready from Genewiz for the samples submitted 20151222.

Downloaded the FASTQ files from the Genewiz project directory:

wget -r -np -nc -A "*.gz" ftp://username:password@ftp2.genewiz.com/Project_BS1512183

Since two species were sequenced (C.gigas & O.lurida), the corresponding files are in the following locations:

http://owl.fish.washington.edu/nightingales/O_lurida/

http://owl.fish.washington.edu/nightingales/C_gigas/

To process the files, I needed to identify just the FASTQ files from this project and save the list of files to a bash variable called “bsseq”:

bsseq=$(ls | grep '^[0-9]\{1\}_*' | grep -v "2bRAD")

Explanation:

bsseq=

This assigns the output of the command following the equals sign to a variable called “bsseq”.

$(ls | grep '^[0-9]\{1\}_*' | grep -v "2bRAD")

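This lists (ls) all files and pipes (|) them to grep, which keeps the names that begin with (^) a single digit ([0-9]\{1\}) followed by zero or more underscores (_*). Because _* also matches zero underscores, any file name starting with a digit passes, which is why the two-digit samples (10_, 11_, 12_) are included. Those results are piped (|) to a second grep, which excludes (-v) any names containing the text “2bRAD”.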
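To see how the pattern behaves, here is a small test on a handful of names. Two names come from this project; 12_2bRAD_example.fastq.gz, 100reads.txt, and readme.md are hypothetical, added only to exercise the filters. The stricter pattern (one or two digits, then a required underscore) is a possible alternative, not what was actually run.

```shell
# Sample names: two from this project, three hypothetical ones.
names='1_ATCACG_L001_R1_001.fastq.gz
10_TAGCTT_L001_R1_001.fastq.gz
12_2bRAD_example.fastq.gz
100reads.txt
readme.md'

# Original pattern: one digit, then zero or more underscores.
loose=$(printf '%s\n' "$names" | grep '^[0-9]\{1\}_*' | grep -v "2bRAD")

# Stricter alternative: one or two digits, then a required underscore.
strict=$(printf '%s\n' "$names" | grep '^[0-9]\{1,2\}_' | grep -v "2bRAD")

printf 'loose:\n%s\n\nstrict:\n%s\n' "$loose" "$strict"
```

The original pattern also keeps 100reads.txt, while the stricter one keeps only the two sample files. In a directory where every digit-prefixed file is a FASTQ file, both give the same list.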

FILENAME	SAMPLE NAME	SPECIES
1_ATCACG_L001_R1_001.fastq.gz	1NF11	O.lurida
2_CGATGT_L001_R1_001.fastq.gz	1NF15	O.lurida
3_TTAGGC_L001_R1_001.fastq.gz	1NF16	O.lurida
4_TGACCA_L001_R1_001.fastq.gz	1NF17	O.lurida
5_ACAGTG_L001_R1_001.fastq.gz	2NF5	O.lurida
6_GCCAAT_L001_R1_001.fastq.gz	2NF6	O.lurida
7_CAGATC_L001_R1_001.fastq.gz	2NF7	O.lurida
8_ACTTGA_L001_R1_001.fastq.gz	2NF8	O.lurida
9_GATCAG_L001_R1_001.fastq.gz	M2	C.gigas
10_TAGCTT_L001_R1_001.fastq.gz	M3	C.gigas
11_GGCTAC_L001_R1_001.fastq.gz	NF2_6	O.lurida
12_CTTGTA_L001_R1_001.fastq.gz	NF_18	O.lurida

I wanted to add some information about the project to the readme file, like total number of sequencing reads generated and the number of reads in each FASTQ file.

Here’s how to count the total number of reads generated in this project:

totalreads=0; for i in $bsseq; do linecount=`gunzip -c "$i" | wc -l`; readcount=$((linecount/4)); totalreads=$((readcount+totalreads)); done; echo $totalreads

Total reads = 138,530,448

C.gigas reads: 22,249,631

O.lurida reads: 116,280,817

Code explanation:

totalreads=0;

Creates variable called “totalreads” and initializes value to 0.

for i in $bsseq;

Initiates a for loop to process the list of files stored in $bsseq variable. The FASTQ files have been compressed with gzip and end with the .gz extension.

do linecount=

Creates variable called “linecount” that stores the results of the following command:

`gunzip -c "$i" | wc -l`;

Decompresses the file ($i) to stdout (-c), leaving the compressed file on disk unchanged. The output is piped to the word count command with the line flag (wc -l) to count the number of lines in the file.

readcount=$((linecount/4));

Divides the value stored in linecount by 4. This is because an entry for a single Illumina read comprises four lines. This value is stored in the “readcount” variable.

totalreads=$((readcount+totalreads));

Adds the readcount for the current file to the running total in totalreads.

done;

End the for loop.

echo $totalreads

Prints the value of totalreads to the screen.
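The four-lines-per-read arithmetic can be checked on a tiny file. The sketch below wraps the counting step in a function (count_reads is a name made up here) and runs it on a two-read FASTQ file built from made-up sequences.

```shell
# Count reads in a gzipped FASTQ file: four lines per read.
count_reads() {
    lines=$(gunzip -c "$1" | wc -l)
    echo $((lines / 4))
}

# Build a tiny two-read FASTQ file in a temporary directory (made-up data).
tmpdir=$(mktemp -d)
printf '@read1\nACGT\n+\nIIII\n@read2\nTTGA\n+\nIIII\n' | gzip -c > "$tmpdir/demo.fastq.gz"

demo_reads=$(count_reads "$tmpdir/demo.fastq.gz")
echo "$demo_reads"
rm -rf "$tmpdir"
```

The demo file has eight lines, so the function reports two reads.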

Next, I wanted to generate a list of the FASTQ files and corresponding read counts, and append this information to the readme file.

for i in $bsseq; do linecount=`gunzip -c "$i" | wc -l`; readcount=$(($linecount/4)); printf "%s\t%s\n\n" "$i" "$readcount" >> readme.md; done

Code explanation:

for i in $bsseq; do linecount=`gunzip -c "$i" | wc -l`; readcount=$(($linecount/4));

Same for loop as above that calculates the number of reads in each FASTQ file.

printf "%s\t%s\n\n" "$i" "$readcount" >> readme.md;

This formats the printed output. The “%s\t%s\n\n” portion prints the value in $i as a string (%s), followed by a tab (\t), followed by the value in $readcount as a string (%s), followed by two consecutive newlines (\n\n) to provide an empty line between the entries. See the readme file linked above to see how the output looks.

>> readme.md; done

This appends the result from each loop to the readme.md file and ends the for loop (done).
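Both loops decompress every file, so the total and the per-file counts could also be collected in a single pass. A rough sketch follows, run on two made-up demo files and writing to readme_demo.md (a hypothetical name) rather than the real readme.md.

```shell
# Made-up demo files: one read and two reads.
tmpdir=$(mktemp -d)
printf '@r1\nACGT\n+\nIIII\n' | gzip -c > "$tmpdir/a.fastq.gz"
printf '@r1\nACGT\n+\nIIII\n@r2\nTTGA\n+\nIIII\n' | gzip -c > "$tmpdir/b.fastq.gz"

total=0
for f in "$tmpdir"/*.fastq.gz; do
    lines=$(gunzip -c "$f" | wc -l)
    reads=$((lines / 4))
    total=$((total + reads))
    # One tab-separated entry per file, followed by a blank line.
    printf '%s\t%s\n\n' "$(basename "$f")" "$reads" >> "$tmpdir/readme_demo.md"
done
printf 'Total reads\t%s\n' "$total" >> "$tmpdir/readme_demo.md"

summary=$(cat "$tmpdir/readme_demo.md")
echo "$summary"
rm -rf "$tmpdir"
```

For files of this project's size, halving the number of decompression passes saves a noticeable amount of time.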

Sample Submission – BS-seq Library Pool to Genewiz

Pooled 10ng of each of the libraries prepared yesterday with TruSeq DNA Methylation Library Kit (Illumina) for sequencing at Genewiz.

SAMPLE	VOLUME FOR 10ng (μL)
1NF11	4.13
1NF15	5.32
1NF16	3.65
1NF17	3.94
2NF4	3.68
2NF5	4.10
2NF6	4.20
2NF7	5.32
M2	4.59
M3	3.91
NF2_6	4.00
NF_18	3.76
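Each volume follows from volume = mass / concentration, so 10 ng in 4.13 μL implies a library concentration of about 2.42 ng/μL. As a quick check of the pool itself, the volumes in the table can be summed; this is only arithmetic on the values listed above, and the variable names are arbitrary.

```shell
# Volumes (uL) from the table above, each delivering 10 ng.
volumes="4.13 5.32 3.65 3.94 3.68 4.10 4.20 5.32 4.59 3.91 4.00 3.76"

# $volumes is left unquoted on purpose so each value becomes its own line.
pool_summary=$(printf '%s\n' $volumes | awk '
    { total += $1; n++ }
    END { printf "%d libraries, %d ng, %.2f uL", n, n * 10, total }')
echo "$pool_summary"
```

This gives 12 libraries, 120 ng of DNA, and a pooled volume of 50.60 μL.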

Samples were sent to Genewiz on dry ice via standard overnight FedEx.

Sam's Notebook

University of Washington – Fishery Sciences – Roberts Lab
