Tag Archives: assembly

Transcriptome Assembly – Olympia oyster RNAseq Data with Trinity

Used all of our current oly RNAseq data to assemble a transcriptome using Trinity.

Trinity was run on our Mox HPC node.

Reads were trimmed using the built-in version of Trimmomatic with the default settings.

SBATCH script:
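
For orientation, here's a minimal sketch of what such a Trinity SBATCH job could look like on Mox; the read file names and the memory/CPU values are placeholders, and the --trimmomatic flag runs Trinity's built-in Trimmomatic with default settings, as described above:

#!/bin/bash
#SBATCH --job-name=20180827_trinity_oly_RNAseq
#SBATCH --account=srlab
#SBATCH --partition=srlab
#SBATCH --nodes=1
#SBATCH --time=30-00:00:00
#SBATCH --mem=500G
#SBATCH --workdir=/gscratch/scrubbed/samwhite/20180827_trinity_oly_RNAseq

# Hypothetical Trinity invocation; read files and resource values are placeholders.
Trinity \
--seqType fq \
--max_memory 100G \
--CPU 24 \
--trimmomatic \
--left oly_RNAseq_R1.fastq.gz \
--right oly_RNAseq_R2.fastq.gz \
--output trinity_out_dir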

Despite the naming conventions, this job was submitted to the Mox scheduler on 20180912 and finished on 20180913.

After job completion, the entire folder was gzipped using an interactive node (the following method of gzipping is SUPER fast, btw):

tar -c 20180827_trinity_oly_RNAseq | pigz > 20180827_trinity_oly_RNAseq.tar.gz
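
For the record, the matching decompression is just the reverse pipe (same archive name as above):

pigz -dc 20180827_trinity_oly_RNAseq.tar.gz | tar -x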

RESULTS:

Output folder:

Trinity assembly (FastA):

Next up, I’ll follow up on this GitHub issue and get some bedgraphs generated.

Assembly Stats – Geoduck Hi-C Final Assembly Comparison

We received the final geoduck genome assembly data from Phase Genomics, in which they updated the assembly by performing some manual curation:

There are additional files that provide supplementary assembly data. See the following directory:

The actual sequencing data and the two previous assemblies were received on 20180421.

All assembly data (both old and new) from Phase Genomics was downloaded in full from the Google Drive link provided by them and stored here on Owl:

Ran Quast to compare all three assemblies provided (command run on Swoose):


/home/sam/software/quast-4.5/quast.py \
-t 24 \
--labels 20180403_pga,20180421_pga,20180810_geo_manual \
/mnt/owl/Athaliana/20180421_geoduck_hi-c/Results/geoduck_roberts\ results\ 2018-04-03\ 11\:05\:41.596285/PGA_assembly.fasta \
/mnt/owl/Athaliana/20180421_geoduck_hi-c/Results/geoduck_roberts\ results\ 2018-04-21\ 18\:09\:04.514704/PGA_assembly.fasta \
/mnt/owl/Athaliana/20180822_phase_genomics_geoduck_Results/geoduck_manual/geoduck_manual_scaffolds.fasta

Results:

Quast output folder: results_2018_08_23_07_38_28/

Quast report (HTML): results_2018_08_23_07_38_28/report.html

Read Mapping – Mapping Illumina Data to Geoduck Genome Assemblies with Bowtie2

We have an upcoming meeting with Illumina to discuss how the geoduck genome project is coming along and to decide how we want to proceed.

So, we wanted to get a quick idea of how good our geoduck assemblies are by performing some alignments using Bowtie2.

Used the following assemblies as references:

  • sn_ph_01 : SuperNova assembly of 10x Genomics data

  • sparse_03 : SparseAssembler assembly of BGI and Illumina project data

  • pga_02 : Hi-C assembly of Phase Genomics data

The analysis is documented in a Jupyter Notebook.

Jupyter Notebook (GitHub):

NOTE: Due to the large amount of stdout from the first genome index command, the notebook does not render well on GitHub. I recommend downloading the notebook and opening it in a local installation of Jupyter.

Here’s a brief overview of the process (a minimal command sketch follows the list):

  1. Generate Bowtie2 indexes for each of the genome assemblies.
  2. Map 1,000,000 reads from the following Illumina NovaSeq FastQ files:
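
A minimal sketch of those two steps, assuming hypothetical index and FastQ file names (Bowtie2's -u option limits mapping to the first 1,000,000 read pairs, and the alignment summary goes to stderr):

# 1. Build a Bowtie2 index for one of the assemblies (repeat per assembly).
bowtie2-build pga_02.fasta pga_02

# 2. Map the first 1,000,000 read pairs against that index.
bowtie2 \
-x pga_02 \
-1 novaseq_R1.fastq.gz \
-2 novaseq_R2.fastq.gz \
-u 1000000 \
-p 24 \
-S pga_02_alignment.sam \
2> pga_02_alignment.err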

Results:

Bowtie2 Genome Indexes:

Bowtie2 sn_ph_01 alignment folder:

Bowtie2 sparse_03 alignment folder:

Bowtie2 pga_02 alignment folder:


MAPPING SUMMARY TABLE

All mapping data was pulled from the respective *.err file in the Bowtie2 alignment folders.
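
A hedged one-liner for pulling those rates out of the *.err files (Bowtie2 prints the overall alignment rate to stderr, which is what ends up in those files):

grep "overall alignment rate" *.err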

| sequence_ID | Assembler | Alignment Rate (%) |
|-------------|------------------------|--------------------|
| sn_ph_01 | SuperNova (10x) | 79.89 |
| sparse_03 | SparseAssembler | 85.83 |
| pga_02 | Hi-C (Phase Genomics) | 79.90 |

Mapping efficiency is similar for all assemblies. After speaking with Steven, we’ve decided we’ll begin exploring genome annotation pipelines.

Assembly Stats – Geoduck Genome Assembly Comparisons w/Quast – SparseAssembler, SuperNova, Hi-C

Steven requested a comparison of geoduck genome assemblies.

Ran the following Quast command:

/home/sam/software/quast-4.5/quast.py \
-t 24 \
--labels 20180405_sparse_kmer101,supernova_pseudohap_duck4-p,20180421_Hi-C \
/mnt/owl/Athaliana/20180405_sparseassembler_kmer101_geoduck/Contigs.txt \
/mnt/owl//halfshell/bu-mox/analyses/0305b/duck4-p.fasta.gz \
/mnt/owl/Athaliana/20180419_geoduck_hi-c/Results/geoduck_roberts\ results\ 2018-04-21\ 18\:09\:04.514704/PGA_assembly.fasta
Results:

Quast output folder: results_2018_04_30_08_00_42/

Quast report (HTML): results_2018_04_30_08_00_42/report.html

The data’s pretty interesting and cool!

SparseAssembler has over 2x the amount of data (in base pairs), yet produces the worst assembly.

SuperNova and Hi-C assemblies are very close in nearly all categories. This isn’t surprising, as the SuperNova assembly was used as a reference assembly for the Hi-C assembly.

However, the Hi-C assembly is insanely better than the SuperNova assembly! For example:

  • Largest contig is ~7x larger than the SuperNova assembly.
  • The N50 size is ~243x larger than the SuperNova assembly!!
  • L50 is only 18, 46x smaller than the SuperNova assembly!

This is pretty amazing, honestly. Even more amazing is that this data was sent over to us as some “preliminary” data for us to take a peek at!

Assembly Stats – Geoduck Hi-C Assembly Comparison

Ran the following Quast command to compare the two geoduck assemblies provided to us by Phase Genomics:

/home/sam/software/quast-4.5/quast.py \
-t 24 \
--labels 20180403_pga,20180421_pga \
/mnt/owl/Athaliana/20180421_geoduck_hi-c/Results/geoduck_roberts\ results\ 2018-04-03\ 11\:05\:41.596285/PGA_assembly.fasta \
/mnt/owl/Athaliana/20180421_geoduck_hi-c/Results/geoduck_roberts\ results\ 2018-04-21\ 18\:09\:04.514704/PGA_assembly.fasta
Results:

Quast Output folder: results_2018_04_30_11_16_04/

Quast report (HTML): results_2018_04_30_11_16_04/report.html

The two assemblies are nearly identical. Interesting…

Data Management – Geoduck Phase Genomics Hi-C Data

We received sequencing/assembly data from Phase Genomics.

The data contains two assemblies, produced on two different dates.

All data is here: 20180421_geoduck_hi-c

All FASTQ files (four files; Geoduck_HiC*.gz) were copied to Nightingales:

MD5 checksums were verified and appended to the Nightingales checksum file:
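
A sketch of those checksum steps (the checksum file names are hypothetical):

# Verify the MD5 checksums supplied with the FASTQ files.
md5sum -c provided_checksums.md5

# Append checksums for the new files to the Nightingales checksum file.
md5sum Geoduck_HiC*.gz >> nightingales_checksums.md5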

Nightingales sequencing inventory was updated (Google Sheet):

The two assemblies (and assembly stats) they provided are here:

I’ve updated the project-geoduck-genome GitHub wiki with this info.

Assembly – Geoduck NovaSeq using SparseAssembler (failed)

Steven came across a 2012 paper in BMC Bioinformatics (“Exploiting sparseness in de novo genome assembly”) that utilized an assembly program we hadn’t previously encountered: SparseAssembler

This software is intended to greatly reduce the amount of RAM required to process very large assembly data sets. As I previously learned, RAM is a limiting factor for assembly programs, and the install (if you can even call it that) was simply unpacking a zip file (program installations on Mox are not trivial), so this seems like it has promise!

The job was run on our Mox node.

Here’s the batch script to initiate the job:

20180308_soap_novaseq_geoduck_slurm.sh

#!/bin/bash
## Job Name
#SBATCH --job-name=20180308_sparse_assembler_geo_novaseq
## Allocation Definition 
#SBATCH --account=srlab
#SBATCH --partition=srlab
## Resources
## Nodes (We only get 1, so this is fixed)
#SBATCH --nodes=1   
## Walltime (days-hours:minutes:seconds format)
#SBATCH --time=30-00:00:00
## Memory per node
#SBATCH --mem=500G
##turn on e-mail notification
#SBATCH --mail-type=ALL
#SBATCH --mail-user=samwhite@uw.edu
## Specify the working directory for this job
#SBATCH --workdir=/gscratch/scrubbed/samwhite/20180308_SparseAssembler_novaseq_geoduck

/gscratch/srlab/programs/SparseAssembler/SparseAssembler \
LD 0 \
NodeCovTh 1 \
EdgeCovTh 0 \
k 117 \
g 15 \
PathCovTh 100 \
GS 2200000000 \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/AD002_S9_L001_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/AD002_S9_L001_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/AD002_S9_L002_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/AD002_S9_L002_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR005_S4_L001_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR005_S4_L001_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR005_S4_L002_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR005_S4_L002_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR006_S3_L001_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR006_S3_L001_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR006_S3_L002_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR006_S3_L002_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR012_S1_L001_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR012_S1_L001_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR012_S1_L002_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR012_S1_L002_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR013_AD013_S2_L001_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR013_AD013_S2_L001_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR013_AD013_S2_L002_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR013_AD013_S2_L002_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR014_AD014_S5_L001_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR014_AD014_S5_L001_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR014_AD014_S5_L002_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR014_AD014_S5_L002_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR015_AD015_S6_L001_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR015_AD015_S6_L001_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR015_AD015_S6_L002_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR015_AD015_S6_L002_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR019_S7_L001_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR019_S7_L001_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR019_S7_L002_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR019_S7_L002_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR021_S8_L001_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR021_S8_L001_R2_001_val_2_val_2.fastq \
i1 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR021_S8_L002_R1_001_val_1_val_1.fastq \
i2 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR021_S8_L002_R2_001_val_2_val_2.fastq
Results:

Output folder: 20180308_SparseAssembler_novaseq_geoduck/

Well, this failed, but not because of memory issues (which is a good start)!

Instead, it failed because the kmer size was too large??!!

See the slurm output log file:

Kmergenie had indicated a kmer size of 117bp.

Will reduce kmer size and try again. Fingers crossed…
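
For the re-run, everything in the script above can stay the same except the kmer setting; a later Quast comparison in this archive labels a SparseAssembler run with kmer 101, so the only line that changes would be something like:

k 101 \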

NovaSeq Assembly – The Struggle is Real – Real Annoying!

Well, I continue to struggle to make progress on assembling the geoduck Illumina NovaSeq data. Granted, there is a ton of data (374GB!!!!), but it’s still frustrating that we can’t get an assembly anywhere…

Here are some of the struggles so far:

Meraculous:

SOAPdenovo2:

JR-Assembler:

  • Can’t install one of the dependencies (SOAP error correction)
  • Actually, I need to try the binary version of this, instead of the source version (the source version fails at the make step)

So, next up will be trying the following two assemblers:

  • JR-Assembler: Will see if SOAPec binary will work, and then run an assembly.
  • AllPaths-LG: I was able to install this successfully on Mox.

Additionally, we’ve ordered some additional hard drives and will be converting the old head/master node on the Apple Xserve cluster to Linux. The old master node is a little better equipped than the other Apple Xserve “birds”, so will try to re-run Meraculous on it once we get it converted.

Assembly – Geoduck Illumina NovaSeq SOAPdenovo2 on Mox (FAIL)

Trying to get the NovaSeq data assembled using SOAPdenovo2 on the Mox HPC node we have and it will not work.

Tried a couple of times and it hasn’t run successfully. Here are links to the files used on Mox (including the batch script and slurm output files). I made slight changes to the formatting of the batch script because I thought there was something wrong. Specifically, the slurm output file from the 20180215 runs does not accurately reflect the command I issued (i.e. the command contains 1> ass.log, but the slurm output shows > ass.log).
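
For context, here's a minimal sketch of the kind of SOAPdenovo2 invocation being described; the binary name and output prefix are assumptions, while the config path, -K, and -p values match the log below (K > 63 requires the 127mer build):

SOAPdenovo-127mer all \
-s /gscratch/scrubbed/samwhite/20180218_soapdenovo2_novaseq_geoduck/soap_config \
-K 117 \
-p 24 \
-o /gscratch/scrubbed/samwhite/20180218_soapdenovo2_novaseq_geoduck/geoduck \
1> ass.log \
2> ass.err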

NOTE: In the 20180218 run, I have excluded transferring the core dump file due to its crazy size:

Here’s the error log generated by SOAPdenovo2 in the 20180218 run (the last line is all you really need to see, though):

Version 2.04: released on July 13th, 2012
Compile May 10 2017 12:50:52

********************
Pregraph
********************

Parameters: pregraph -s /gscratch/scrubbed/samwhite/20180218_soapdenovo2_novaseq_geoduck/soap_config -K 117 -p 24 -o /gscratch/scrubbed/samwhite/20180218_soapdenovo2_novaseq_geoduck/ 

In /gscratch/scrubbed/samwhite/20180218_soapdenovo2_novaseq_geoduck/soap_config, 1 lib(s), maximum read length 150, maximum name length 256.

24 thread(s) initialized.
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/AD002_S9_L001_R1_001_val_1_val_1.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/AD002_S9_L001_R2_001_val_2_val_2.fq.gz
--- 100000000th reads.
--- 200000000th reads.
--- 300000000th reads.
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/AD002_S9_L002_R1_001_val_1_val_1.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/AD002_S9_L002_R2_001_val_2_val_2.fq.gz
--- 400000000th reads.
--- 500000000th reads.
--- 600000000th reads.
--- 700000000th reads.
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR005_S4_L001_R1_001_val_1_val_1.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR005_S4_L001_R2_001_val_2_val_2.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR005_S4_L002_R1_001_val_1_val_1.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR005_S4_L002_R2_001_val_2_val_2.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR006_S3_L001_R1_001_val_1_val_1.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR006_S3_L001_R2_001_val_2_val_2.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR006_S3_L002_R1_001_val_1_val_1.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR006_S3_L002_R2_001_val_2_val_2.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR012_S1_L001_R1_001_val_1_val_1.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR012_S1_L001_R2_001_val_2_val_2.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR012_S1_L002_R1_001_val_1_val_1.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR012_S1_L002_R2_001_val_2_val_2.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR013_AD013_S2_L001_R1_001_val_1_val_1.fq.gz
Import reads from file:
 /gscratch/scrubbed/samwhite/20180129_trimmed_again/NR013_AD013_S2_L001_R2_001_val_2_val_2.fq.gz
--- 800000000th reads.
--- 900000000th reads.
-- Out of memory --

I guess I’ll explore some other options for assembling these? I’m having a difficult time accepting that 500GB of RAM is insufficient, but that seems to be the case. Ouch.

NovaSeq Assembly – Trimmed Geoduck NovaSeq with Meraculous

Attempted to use Meraculous to assemble the trimmed geoduck NovaSeq data.

Here’s the Meraculous manual (PDF).

After a bunch of issues (running out of hard drive space multiple times, config file issues, typos), I’ve finally given up on running Meraculous. It failed, again, saying it couldn’t find a file in a directory that Meraculous created! I’ve emailed the authors and if they have an easy fix, I’ll implement it and see what happens.

Anyway, it’s all documented in the Jupyter Notebook below.

One good thing that came out of all of this is that I had to run kmergenie to identify an appropriate kmer size to use for assembly, as well as an estimated genome size (this info is needed for both Meraculous and SOAPdenovo2, which I’ll be trying next); a sketch of the kmergenie step follows the results below:

kmergenie output folder: http://owl.fish.washington.edu/Athaliana/20180125_geoduck_novaseq/20180206_kmergenie/
kmergenie HTML report (doesn’t display histograms for some reason): 20180206_kmergenie/histograms_report.html
kmer size: 117
Est. genome size: 2.17Gbp
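
A minimal sketch of the kmergenie step referenced above (the reads list file and its path are hypothetical; kmergenie takes a plain-text file listing one read file per line, and its default output prefix is "histograms", which matches the report name above):

# List the trimmed NovaSeq FASTQ files, one path per line.
ls /path/to/20180129_trimmed_again/*.fq.gz > novaseq_reads.txt

# Estimate the best kmer size (and genome size) from the reads.
kmergenie novaseq_reads.txt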

Jupyter Notebook (GitHub): 20180205_roadrunner_meraculous_geoduck_novaseq.ipynb