Data Management – Illumina Geoduck HiSeq & MiSeq Data

The HDD we received from Illumina last week only had data (i.e. fastq files) from the NovaSeq runs they performed – nothing from either the MiSeq, nor the HiSeq runs.

We contacted them about the missing data, they confirmed it was missing, and uploaded the remaining data to BaseSpace.

Began downloading the data – will take awhile…

 

Files will be temporarily stored in these locations:

/volume1/web/nightingales/Geoduck_MiSeq/170317_M03814_0172_000000000-B2K79/Data/GeoDuckRNAMiSeq-35978947

/volume1/web/nightingales/Geoduck_HiSeq/170228_ST-K00104_0382_BHHGTLBBXX/Data/Ironman-35682656

/volume1/web/nightingales/Geoduck_HiSeq/170228_ST-K00104_0381_AHHHWNBBXX/Data/Ironman-35682656

Goals – August 2017

Data, data, data! That is the theme for this month.

Olympia oyster data – Going to assess and continue working on assembly the Olympia oyster genome we have from our BGI sequencing project, along with the PacBio data we have. To-date, it’s been difficult to get these two datasets to play nicely with one another. I’ll take a look to see what’s worked, and what hasn’t, as well as try out some other means by which to get a decent assembly.

Geoduck dataWe just got back an insane amount of RNAseq and genome sequencing data from an Illumina pilot project. This data needs to be properly documented and catalogued.

Hackweek 2017 – Although this is only a brief period during this month, it’s designed to tackle some lower priority tasks that will help streamline lab operations. Check out the GtiHub repo issues for Hackweek 2017 to get an idea of some of the projects.

Data Received – Geoduck Genome Sequencing by Illumina

We previously sent some geoduck samples to Illumina, as part of a pilot project for them to test out a new sequencing platform. The data has finally arrived!

It was sent on a 4TB Seagate external hard drive.

Due to weird connection issues we’ve recently encountered with our server, Owl (Synology DS1812+), I connected the HDD directly to Owl via USB (instead of connecting to a computer and transferring). I transferred the data using the Synology web interface to avoid any computer/NAS connection issues that might interrupt the transfer.

We have a meeting with the Illumina people tomorrow afternoon to review the data they’ve provided (looks like it’s going to take awhile, though). Once that meeting takes place, we’ll figure out how to document this project in our data management plan.

 

 

 

RNA Isolation – Olympia oyster gonad tissue in paraffin histology blocks

My previous go at this was a little premature – I didn’t wait for Laura to fully annotate her slides/blocks. Little did I know, the tissue was mostly visceral mass and, as such, I didn’t hit much in the way of actual gonad tissue. So, I’m redoing this, now that Grace has gone through and annotated the blocks to point out gonad tissue. SN-10-16 was sent to Katherine Silliman on 20170720.

Isolated RNA from Olympia oyster gonad previously preserved with the PAXgene Tissue Fixative and Stabilizer and then embedded in paraffin blocks. See Laura’s notebook for full details on samples and preservation.

 

RNA was isolated from the following samples using the PAXgene Tissue RNA Kit (Qiagen). Gouged samples from the blocks weighing ~10mg from each of the tissues and processed according the protocol for isolating RNA from blocks of paraffin-embedded tissues.

Background on all of this is in this GitHub Issue

NF-10-22
NF-10-23
NF-10-24
NF-10-26
NF-10-28
NF-10-30
SN-10-16
SN-10-17
SN-10-20
SN-10-25
SN-10-26
SN-10-31

IMPORTANT:

  • Prior to beginning, I prepared an aliquot of Buffer TR1 by adding 40μL of β-mercaptoethanol (β-ME) to 4000μL of Buffer TR1)

Isolated RNA according to the PAXgene Tissue RNA Kit protocol with the following alterations:

  • “Max speed” spins were performed at 20,000g.
  • Tissue disruption was performed by adding ~25-50 glass beads (425 – 600μm diameter) with the Disruptor Genie @ 45C for 15mins (in the Friedman Lab).
  • Shaking incubation step was performed with Disruptor Genie
  • Samples were eluted with 27μL of Buffer TR4 x 2, incubated @ 65C for 5mins, immediately placed on ice.

 

Results:

Samples were not quantified due to lack of proper RNA Qubit assay AND the computer that our NanoDrop1000 is hooked up to is dead. Will have Katherine Silliman perform quantification.

Samples were stored at -80C temporarily.

Samples will be sent to Katherine Silliman for high-throughput library construction and sequencing once I hear back from her regarding her availability to receive the samples.

RNA Isolation – Olympia oyster gonad tissue in paraffin histology blocks

UPDATE 20170712: The RNA I isolated below is from incorrect regions of tissue. I misunderstood exactly what this tissue was, and admittedly, jumped the gun. The tissue is actually collected from the visceral mass – which contains gonad (a small amount) and digestive gland (a large amount). The RNA isolated below will be stored in one of the Shellfish RNA boxes and I will isolate RNA from the correct regions indicated by Grace

Isolated RNA from Olympia oyster gonad previously preserved with the PAXgene Tissue Fixative and Stabilizer and then embedded in paraffin blocks. See Laura’s notebook for full details on samples and preservation.

 

RNA was isolated from the following samples using the PAXgene Tissue RNA Kit (Qiagen). Gouged samples from the blocks weighing ~10mg from each of the tissues and processed according the protocol for isolating RNA from blocks of paraffin-embedded tissues.

Tissue identification is available in this GitHub Issue

NF-10-22
NF-10-23
NF-10-24
NF-10-26
NF-10-28
NF-10-30
SN-10-16
SN-10-17
SN-10-20
SN-10-25
SN-10-26
SN-10-31

IMPORTANT:

  • Prior to beginning, I prepared an aliquot of Buffer TR1 by adding 40μL of β-mercaptoethanol (β-ME) to 4000μL of Buffer TR1).
  • Reconstituted DNase I with 550μL of RNase-free H2O. Aliquoted in 100μL volumes and stored @ -20C in the “-20C Kit Components” box.

Isolated RNA according to the PAXgene Tissue RNA Kit protocol with the following alterations:

  • “Max speed” spins were performed at 20,000g.
  • Tissue disruption was performed by adding ~25-50 glass beads (425 – 600μm diameter) with the Disruptor Genie @ 45C for 15mins (in the Friedman Lab).
  • Shaking incubation step was performed with Disruptor Genie
  • Samples were eluted with 27μL of Buffer TR4 x 2, incubated @ 65C for 5mins, immediately placed on ice and quantified on the Roberts Lab Qubit 3.0 with the RNA High Sensitivity Assay (ThermoFisher Scientific) using 5μL of each sample.

Results:

Concentrations (Google Sheet): 20170710_RNA_qubit_oly_histo_blocks

Well, the good news is that there’s RNA from all the samples and it seems to be in relatively high concentrations!

The bad news is that the concentrations for 10 of the 12 samples were too high and outside the range of the Qubit RNA HS Assay! Since we don’t have the broad range RNA assay, I can’t properly quantify the remaining samples. However, these samples are being sent to Katherine Silliman at some point, so I’ll leave it up to her to quantify the samples. I’m also guessing that she’ll run them on a Bioanalyzer to assess their integrity prior to beginning library construction, so that will also yield concentrations for the samples.

Samples were stored at -80C temporarily.

Samples will be sent to Katherine Silliman for high-throughput library construction and sequencing once I hear back from her regarding her availability to receive the samples.

Data Management – SRA Submission Olympia Oyster UW PacBio Data from 20170323

Submitted the FASTQ files from the UW PacBio Data from 20170323 to the NCBI sequence read archive (SRA).

FTP’d the data to NCBI’s servers, following their instructions. Briefly,

Change to the directory where the FASTQ files are (Owl/web/nightingales/O_lurida) and then initiate an FTP session:

ftp -i ftp-private.ncbi.nlm.nih.gov

Enter provided username/password, change to my designated uploads directory, create new folder dedicate to this particular upload. Then, upload all the files using the mput command:

mput *filtered_subreads*

SRA deets are below (assigned FASTQ files to existing BioProject and created a new BioSample). Will update post with SRA number when processing is complete on the NCBI end.

SRA: SRS2339870
Study: SRR5809355
BioProject: PRJNA316624
BioSample: SAMN07326085

Data Management – Tarball of Olympia oyster UW PacBio Data from 20170323

I’d previously attempted to archive this data set on multiple occasions, across multiple days, but network dropouts kept killing my connection to the server (Owl) and, in turn, interrupting the tarball operation.

Today, I came in to a successful creation of the tarball of this PacBio data set (it only took 10hrs)! And, it’s a big file: 162GB!! Remember, that’s the compressed size!

Now, we’ll have to decide where we want to keep the tarball. I guess this’ll be part of our next data management plan discussions.

 

Sample Annotation – Olympia oyster histology blocks (from Laura Spencer)

I’ve been asked to isolate RNA from some paraffin-embedded Olympia oyster gonad tissue.

Despite some excellent documentation by Laura Spencer (images of tissue layouts in histology cassettes and a corresponding cassette mapping key file), the histology facility seems to have flipped some things around and/or repositioned/split the contents of each cassette. This makes ID-ing the proper tissues tedious and, at times, difficult.

The list of tissues that needs to be processed is listed in this GitHub Issue #648. I’ve also added the list below:

NF-10 22
NF-10-23
NF-10-24
NF-10-26
NF-10-28
NF-10-30
SN-10-16
SN-10-17
SN-10-20
SN-10-25
SN-10-26
SN-10-31

Prior to beginning RNA isolations, I have annotated images of the histology blocks and will be waiting for Laura to confirm that my annotations are correct. I will be posting a link to this notebook entry in the GitHub issue listed above for her to view and wait for her confirmation.

UPDATE 201700707 – Laura has indicated that many of my annotations are incorrect. Katie has gone through and made proper identification: https://github.com/sr320/LabDocs/issues/648#issuecomment-313792588

 

Additionally, as indicated in the GitHub Issue above, histology block “Oly 14″ does not have a corresponding tissue cassette photo (containing sample NF-10 26). Without the original image, I don’t think I can make an accurate guess on how the tissues are oriented in the resulting two histo blocks (see below).

 

BLOCKS 5

 

BLOCK 6

 

 

BLOCK 9

 

 

BLOCK 10

 

 

BLOCKS 14 (unable to annotate at time of posting)

 

 

 

BLOCK 15

 

 

 

BLOCK 21

 

 

 

 

BLOCK 22