There are 28,027 genes

http://aquacul4.fish.washington.edu/~steven/armina/oyster_v9_gene.fasta

also in SQLShare

 




Can get lengths 
SELECT CGI_ID, len(sequence) AS CDS_length FROM [sr320@washington.edu].[qDOD_Cgigas_gene_fasta]
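
The same lengths could also be computed locally from the fasta file as a cross-check. A minimal sketch in Python, assuming a standard fasta layout where the first whitespace-delimited token of each header is the CGI_ID. The function name and the toy records are hypothetical, not taken from the oyster file.

```python
import io

def fasta_lengths(handle):
    """Return {sequence ID: total sequence length} for an open fasta handle."""
    lengths = {}
    current_id = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Assumption: the ID is the first token after ">"
            parts = line[1:].split()
            current_id = parts[0] if parts else ""
            lengths[current_id] = 0
        elif current_id is not None:
            # Sequence may wrap over several lines, so accumulate
            lengths[current_id] += len(line)
    return lengths

# Toy records for illustration only (not real oyster sequences)
example = io.StringIO(">CGI_A some description\nATGC\nATG\n>CGI_B\nAT\n")
lengths = fasta_lengths(example)
```

For the real file, the handle would come from `open(...)` on a downloaded copy of oyster_v9_gene.fasta.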






--------------------

Now want to get the genomic structure of the genes.

http://aquacul4.fish.washington.edu/~steven/armina/oyster.v9.glean.final.rename.mRNA.gff





GFF has Start and Stop and presumably includes introns.

Reconfigured to get ID out


http://eagle.fish.washington.edu/cnidarian/TJGR_Gene_28027_column_mod.gff
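
Pulling the ID out of a GFF line and getting the genomic span (introns included) can be sketched as below. This is only a sketch: it assumes the ninth column carries an `ID=...` attribute and relies on GFF coordinates being 1-based and inclusive. The function name and the example line are hypothetical, not copied from the actual file.

```python
def parse_gff_line(line):
    """Return (feature_id, start, end, span) for one GFF line, or None if not a feature."""
    if not line.strip() or line.startswith("#"):
        return None  # blank line or comment/directive
    cols = line.rstrip("\n").split("\t")
    if len(cols) < 9:
        return None  # not a complete 9-column feature line
    start, end = int(cols[3]), int(cols[4])
    # Column 9 holds semicolon-separated key=value attributes
    attrs = {}
    for field in cols[8].split(";"):
        if "=" in field:
            key, value = field.split("=", 1)
            attrs[key.strip()] = value.strip()
    # Coordinates are inclusive, so the span is end - start + 1
    return attrs.get("ID"), start, end, end - start + 1

# Hypothetical line for illustration only
example = "scaffold1\tGLEAN\tmRNA\t101\t500\t.\t+\t.\tID=CGI_X;"
record = parse_gff_line(example)
```

Comparing this span with the CDS length for the same ID would give a rough idea of how much of each gene is intron.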





Now let's get the corresponding fasta (again)

to avoid http://genetwit.tumblr.com/image/51023089882

 

missing ID is in GFF.

---
no idea what is going on 


CGI_10006842 
is in fasta
and gff 







Missing in SQLShare
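
One way to pin down which IDs go missing between sources is to compare the ID sets directly. A minimal sketch, assuming the IDs have already been pulled out of each source as lists of strings. The function name and the second ID are hypothetical. Stripping whitespace is a guess at one thing worth ruling out, not a diagnosis.

```python
def compare_ids(ids_a, ids_b):
    """Return IDs found only in A, only in B, and in both, as sorted lists."""
    # Assumption: stray whitespace should not make two IDs count as different
    set_a = {i.strip() for i in ids_a}
    set_b = {i.strip() for i in ids_b}
    return {
        "only_a": sorted(set_a - set_b),
        "only_b": sorted(set_b - set_a),
        "both": sorted(set_a & set_b),
    }

# Illustration: the ID from the notes plus a made-up placeholder ID
gff_ids = ["CGI_10006842", "CGI_PLACEHOLDER"]
sqlshare_ids = ["CGI_PLACEHOLDER "]
result = compare_ids(gff_ids, sqlshare_ids)
```

Here `result["only_a"]` lists the IDs present in the GFF list but absent from the SQLShare list.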




Downloading to desktop to look at in TextWrangler.




In Short
https://sqlshare.escience.washington.edu/sqlshare#s=query/sr320%40washington.edu/TJGR_genomic_gene.txt

CGI_10006842

---