Difference between revisions of "ABBA"

From AMOS WIKI
 
 
  
 

Latest revision as of 20:42, 17 February 2011

ABBA: Assembly Boosted By Amino acid sequences

Overview

ABBA (Assembly Boosted By Amino acid sequences) is a comparative gene assembler that uses amino acid sequences from predicted proteins to help build a better assembly. See the journal paper listed under References.

Download

NOTE: ABBA does protein assembly but does not find the reference proteins to assemble against. You need to identify the proteins that run off the ends of contigs separately, then pass those proteins to ABBA to fill in the gaps.

  • Two ways to find the reference proteins:
    • Do a draft annotation of the genome using an annotation pipeline. ABBA will not annotate your assembly.
    • Align the draft assembly contigs to a close relative and find where the contig ends intersect protein coding regions.
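
The second approach comes down to an interval check: once the contigs have reference coordinates, look for contig ends that fall inside a protein-coding region. Below is a minimal sketch of that check, not part of ABBA itself. It assumes you have already obtained the contig alignment coordinates (from an aligner of your choice) and the gene coordinates (from the relative's annotation), all on the same reference sequence. The names `Interval` and `genes_at_contig_ends` and the example coordinates are hypothetical.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    """A named span on the reference sequence (hypothetical helper type)."""
    name: str
    start: int  # 0-based, inclusive
    end: int    # 0-based, exclusive


def genes_at_contig_ends(contigs, genes):
    """Return (gene, contig, side) triples where a contig end lies inside a gene."""
    hits = []
    for contig in contigs:
        # The two reference positions where this contig's alignment stops.
        ends = (("left", contig.start), ("right", contig.end))
        for gene in genes:
            for side, pos in ends:
                # An end strictly inside a gene means the contig covers
                # only part of that gene, so the gene runs off the contig.
                if gene.start < pos < gene.end:
                    hits.append((gene.name, contig.name, side))
    return hits


# Made-up coordinates, for illustration only.
contigs = [Interval("contig1", 0, 1000), Interval("contig2", 1200, 3000)]
genes = [
    Interval("geneA", 100, 400),
    Interval("geneB", 900, 1300),
    Interval("geneC", 2900, 3200),
]

for hit in genes_at_contig_ends(contigs, genes):
    print(hit)
```

In this example, geneB spans the gap between contig1 and contig2, so its protein would be a candidate to pass to ABBA. The nested loop is quadratic and is only meant to show the logic; a sorted sweep or an interval tree would be a better fit for a full genome.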

ABBA is built on top of the AMOS framework but has its own distribution. The AMOS framework is included in the ABBA tarball and is installed along with ABBA if you don't already have AMOS installed. The tarball is here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer/abba.tgz

References

Salzberg SL, Sommer DD, Puiu D, Lee VT (2008). Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads. PLoS Computational Biology 4(9): e1000186. doi:10.1371/journal.pcbi.1000186


Acknowledgements

The development of ABBA was supported by the National Institutes of Health under grants R01-LM06845 and R01-LM007938 to SLS.