Tribolium
iBeetle Bioinformatics Resources
This page is summarizing some web ressources for the iBeetle project.It is mostly for internal use.
For all current data please go to the iBeetle browser hosted in Göttingen.
The data below is based on different assemblies available from the BeetleBase ftp server.
genome annotation on Tcas 5.2
The Annotation is based on Assembly 5.2Batch 6 files
- batch6.fa : The FASTA formatted sequences.
- batch6.html : A link list.
- batch6.s.tbl : A map of the fragments and their parental transcripts.
NCBI genes
Link list of pubished NCBI genes to view in gbrowse with publication information: ncbi.genes.htmlBatch 5 files
- batch5.fa : The FASTA formatted sequences.
- batch5.html : A link list together with a summary of applied filter criteria.
- batch5.s.tbl : A map of the fragments and their parental transcripts.
Batch 4 files
Note that files changed, removed fragments that have been filtered out by Mirko (leaving 942 fragments), additional 442 fragments (generated in January 2014)- batch4.fa : The FASTA formatted sequences.
- batch4.html : A link list together with a summary of applied filter criteria.
- batch4.s.tbl : A map of the fragments and their parental transcripts.
- batch4.category.combination.txt : The proportion of transcripts in 10 categories crucial for the fragment selection.
au3 reannotation
The prediction was done on Assembly 4.0.au3 is a combination of 11729 genes predicted by AUGUSTUS and 2563 genes from the OGS.
- au3.gff: gff file
- au3.aa.fa: protein fasta file
- au3.mrna.fa: mRNA fasta file
Big correspondence table
up to batch 3 (plates 1-62). corrtab.xlsx, corrtab.tab.gzThe columns are
- iBeetle number, e.g. iB_00002
- dsRNA sequence, e.g. CACCACAGCACGACAAA...
- batch number ("safe fragment"), e.g. b1.ds2
- gene id from official gene set OGS), e.g. TC000021
- coding sequence (CDS) from OGS gene, e.g. ATGCGGTCCCATAAAAAAA...
- Drosophila ortholog gene name, e.g. JIL
- Drosophila ortholog protein isoform id, e.g. JIL-1-PA
- CDS of protein Drosophila isoform, e.g. ATGAGTCGCTTGCAAAA
au2 OGS mapping
Tab-separated list of OGS transcripts (TC number) and their corresponding au2 transcripts in one-to-one relation: au2-ogs-mappingsample of au2 genes for testing
mRNA sequences of au2 genes- supported by neither RNA-Seq nor protein homology
- supported by protein homology but not by RNA-Seq
- supported by RNA-Seq but not by protein homology
A Gene is called supported by RNA-Seq if at least one of its mRNAs is supported by RNA-Seq. The same applies to protein homology.
A mRNA sequence is called supported by RNA-Seq if it is at least half covered by an interval of reads where each covered position is covered by at least 2 reads and gaps of at most 20 bp are allowed.
If for one gene several mRNAs satisfy the above condition the most probable one was taken.
- au2.no.support.50.fa , au2.only.hom.support.48.fa, au2.only.cov.support.50.fa: The FASTA formatted sequences.
- au2.no.support.html , au2.only.hom.support.html, au2.only.cov.support.html: A link list.
We provide a list with the number of genes for each category.
Batch 3 files
- batch3.fa : The FASTA formatted sequences.
- batch3.html : A link list together with a summary of applied filter criteria.
- batch3.s.tbl : A map of the fragments and their parental transcripts.
Batch 2 files
- batch2.fa : The FASTA formatted sequences.
- batch2.html : A link list together with a summary of applied filter criteria.
- batch2.s.tbl : A map of the fragments and their parental transcripts.
Batch 1 files
- batch1.fa : The FASTA formatted sequences.
- batch1.html : A link list leading to the GBrowse2 sites.
- batch1.s.tbl : A map of the fragments and their parental transcripts.
GBrowse: developmental genome browser
This browser holds tracks that may be useful for iBeetle. Some tracks are experimental. Currently, there are
- OGS: The official gene set (OGS 2) from BeetleBase ftp
- "safe" mRNA fragments, first, second and third batch: mRNA fragments for the above batches of dsRNA construction
- polyA hints: 3' termini identified by nontemplated polyA stretches in the raw ESTs
- AUGUSTUS ab initio with Tribolium parameters (CDS only): first AUGUSTUS prediction for T.cas.
- AUGUSTUS (UTR and hints from cDNA, revised): latest AUGUSTUS prediction, including UTR prediction and incorporating evidence from RNA-Seq data
Mario Stanke, Universität Greifswald
Last modified: Tue Oct 11 15:35:42 CEST 2011