{ "metadata": { "name": "TJGR_Mgo_Expression" }, "nbformat": 3, "nbformat_minor": 0, "worksheets": [ { "cells": [ { "cell_type": "markdown", "metadata": {}, "source": "#Developing Oyster Male Gonad Gene Expression Tracks\n\nAttempting to develop IGV genome browser tracks from Zhang et al data." }, { "cell_type": "markdown", "metadata": {}, "source": "Overview\n\n1.\n2.\n3.\n" }, { "cell_type": "markdown", "metadata": {}, "source": "\n\n##1. Track based on transcript specific expression\n\nTable is available in SQLShare \n\n\n_Description:_\n```\nPaired end Male Gonad RNA-Seq data from Zhang et al 2012 Exported file from CLCL v6\nData provided at exon level\nDerived using Dataset:\nZhang, G; Fang, X; Guo, X; Li, L; Luo, R; Xu, F; Yang, P; Zhang, L; Wang, X; Qi, H; Zhu, Y; Yang, L; Huang, Z (2012): Genomic data from the Pacific oyster (Crassostrea gigas). GigaScience. http://dx.doi.org/10.5524/100030 \n Maximum paired distance = 250\n Unspecific match limit = 10\n Minimum exon coverage fraction = 0.2\n Count paired reads as two = No\n Additional downstream bases = 0\n Minimum paired distance = 180\n Minimum similarity fraction = 0.8\n Additional upstream bases = 0\n Strand = Forward\n Use annotations for gene and transcript identification = Yes\n Organism type = EUKARYOTE\n Minimum length fraction (long reads) = 0.9\n Minimum read count fusion gene table = 5\n Exon discovery = Yes\n Expression level = Transcripts\n Create report = Yes\n Use colorspace encoding = No\n Expression value = RPKM\n Create list of unmapped reads = No\n Create fusion gene table = No\n Minimum number of reads = 10\n Maximum number of mismatches allowed (applies to short reads) = 2\n References = oyster.v9_90-7\n Use strand specific assembly = No\n Minimum length of putative exons = 25\n Expression value = Read Per Kilobase of exon Model value\nFound: 25123 genes.\nTotal number of reads : 54739722 ( single reads: 0, paired reads: 54739722 )\nTotal number of mapped reads : 25685813 ( single reads: 0, paired reads: 25685813 )\nTotal number of unmapped reads : 29053909 ( single reads: 0, paired reads: 29053909 )\n ```\n \n_Screenshot_\n \n\"Screenshot%205/29/13%202:28%20PM\"\n " }, { "cell_type": "markdown", "metadata": {}, "source": "Query\n" }, { "cell_type": "raw", "metadata": {}, "source": "SELECT\n Chromosome,\n \"Chromosome region start\" - 1 as start,\n \"Chromosome region end\",\n 'exon' as Feature,\n \"Expression Values\"\n \n FROM [sr320@washington.edu].[Mgo_RNAseq_transcript]\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\n \n\u200bOrder by \"Expression Values\"\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b\u200b Desc\u200b\u200b" }, { "cell_type": "markdown", "metadata": {}, "source": "_Screenshot_\n\n\"Screenshot%205/29/13%202:39%20PM\"" }, { "cell_type": "markdown", "metadata": {}, "source": "_Downloading via python client_" }, { "cell_type": "markdown", "metadata": {}, "source": "\n\n`python fetchdata.py -d \"[sr320@washington.edu].[Mgo_RNAseq_transcript_IGV]\" -f tsv -o /Volumes/web/cnidarian/Mgo_RNAseq_transcript.igv`" }, { "cell_type": "markdown", "metadata": {}, "source": "\n \nSorted within IGV\n \n\"Screenshot%205/29/13%202:53%20PM\"\n\nThis creates a new file\n " }, { "cell_type": "markdown", "metadata": {}, "source": "Seems to have worked but not certain on coordinates...\n\n\"Screenshot%205/29/13%203:00%20PM\"" }, { "cell_type": "markdown", "metadata": {}, "source": "
\n##2. Track based on gene specific expression\n" }, { "cell_type": "markdown", "metadata": {}, "source": "
\n##3. Expression data as SAM file\n" }, { "cell_type": "code", "collapsed": false, "input": "", "language": "python", "metadata": {}, "outputs": [] }, { "cell_type": "code", "collapsed": false, "input": "", "language": "python", "metadata": {}, "outputs": [] } ], "metadata": {} } ] }