<?xml version="1.0"?>
<?xml-stylesheet type="text/css" href="http://amos.sourceforge.net/wiki/skins/common/feed.css?303"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>http://amos.sourceforge.net/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Ddsommer</id>
		<title>AMOS WIKI - User contributions [en]</title>
		<link rel="self" type="application/atom+xml" href="http://amos.sourceforge.net/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Ddsommer"/>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Special:Contributions/Ddsommer"/>
		<updated>2026-04-30T05:54:02Z</updated>
		<subtitle>User contributions</subtitle>
		<generator>MediaWiki 1.23.5</generator>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/AMOS</id>
		<title>AMOS</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/AMOS"/>
				<updated>2014-10-16T01:52:19Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| align=&amp;quot;right&amp;quot;&lt;br /&gt;
 | __TOC__&lt;br /&gt;
 |}&lt;br /&gt;
&lt;br /&gt;
The AMOS consortium is committed to the development of open-source whole genome assembly software. The project acronym (AMOS) represents our primary goal -- to produce A Modular, Open-Source whole genome assembler. Open-source so that everyone is welcome to contribute and help build outstanding assembly tools, and modular in nature so that new contributions can be easily inserted into an existing assembly pipeline. This modular design will foster the development of new assembly algorithms and allow the AMOS project to continually grow and improve in hopes of eventually becoming a widely accepted and deployed assembly infrastructure. In this sense, AMOS is both a design philosophy and a software system.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Quick links:&lt;br /&gt;
* [[AMOS Getting Started]]&lt;br /&gt;
* [http://sourceforge.net/project/showfiles.php?group_id=134326 Download]&lt;br /&gt;
* [http://sourceforge.net/projects/amos SourceForge project page]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Announcements ==&lt;br /&gt;
&lt;br /&gt;
* August 5, 2011 - [http://sourceforge.net/projects/amos/files/amos/3.1.0/ Version 3.1.0] of AMOS released!&lt;br /&gt;
* August 2, 2011 - [http://sourceforge.net/projects/amos/files/sample_data/ AMOS Sample Data] posted&lt;br /&gt;
* December 7, 2010 - [http://sourceforge.net/projects/amos/files/amos/3.0.0/ Version 3.0.0] of AMOS released!&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
=== Assemblers ===&lt;br /&gt;
* [[AMOScmp]] - comparative assembler&lt;br /&gt;
* [[AMOScmp-shortReads]] - comparative assembler for short reads (Solexa,454)&lt;br /&gt;
* [[AMOScmp-shortReads-alignmentTrimmed]] - comparative assembler for short reads that uses alignment based trimming&lt;br /&gt;
* [[minimus]] - basic genome assembler for small datasets&lt;br /&gt;
* [[Minimo]] - the minimus assembler with many more options: short read support, variable stringency, strand-specificity, various outputs formats&lt;br /&gt;
* [[minimus2]] - basic genome assembler for two datasets; can also be used as an assembly merge pipeline&lt;br /&gt;
* [[minimus2-blat]] - Same as minimus2 but uses BLAT instead of Nucmer for added speed&lt;br /&gt;
&lt;br /&gt;
=== Validation and Visualization ===&lt;br /&gt;
* [[Hawkeye]] - assembly viewer&lt;br /&gt;
* [[amosvalidate]] - assembly forensics&lt;br /&gt;
* [[FRCurve]] - Feature-Response Curve&lt;br /&gt;
* [[Benchmark]] - assembly benchmark data&lt;br /&gt;
&lt;br /&gt;
=== Scaffolding ===&lt;br /&gt;
* [[Bambus]] - Open source standalone hierarchical scaffolding&lt;br /&gt;
* [[Bambus2]] - Scaffolding Polymorphic Genomes and Metagenomes&lt;br /&gt;
&lt;br /&gt;
=== Trimming, Overlapping, &amp;amp; Error Correction ===&lt;br /&gt;
* [[Figaro]] - statistical vector trimmer&lt;br /&gt;
* [[UMD Overlapper]] - High quality overlap computations&lt;br /&gt;
* [[KI Overlapper]] - Repeat aware overlapper&lt;br /&gt;
* [[AutoEditor]] - Automatic correction of genome sequencing errors&lt;br /&gt;
* [[FastqQC]] - Read composition and quality&lt;br /&gt;
&lt;br /&gt;
=== Utilities ===&lt;br /&gt;
* [[File conversion utilities]] - converting data to and from AMOS&lt;br /&gt;
* [[AMOS Utilities | AMOS Utilities]] - general utilities&lt;br /&gt;
* [[runAmos]] - Pipeline executor&lt;br /&gt;
&lt;br /&gt;
=== AMOS Development ===&lt;br /&gt;
* [[Programmer&amp;#039;s guide]] - Getting started with the Source code&lt;br /&gt;
* [[Infrastructure]] - Developer level details&lt;br /&gt;
* [[Wiki guide]] - Guide for editing the wiki&lt;br /&gt;
&lt;br /&gt;
=== Assembly Tutorials ===&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/assembly_primer.shtml Assembly primer] - overview of genome assembly.&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/contig_representation.shtml Representing assemblies (not just in AMOS)]&lt;br /&gt;
* [http://wgs-assembler.sourceforge.net Running Celera Assembler]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Additional documentation in development through the [[AMOS Documentation Project]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download == &lt;br /&gt;
The AMOS source if freely available for download from the File Release Section of our SourceForge project page. Please refer to the COPYING license included in the package for a description of the Artistic License, the same OSI certified open source license used by Perl and countless other packages. Not all of the above packages are included with the standard AMOS distribution, please see the homepage for the software you wish to download to verify that it is included with the AMOS source distribution.&lt;br /&gt;
&lt;br /&gt;
[http://sourceforge.net/project/showfiles.php?group_id=134326 Download from SourceForge]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Consortium members ==&lt;br /&gt;
&lt;br /&gt;
There have been numerous positive responses regarding the AMOS initiative, and we expect the list of involved organizations to grow significantly as the project matures. Please contact us if you want to join. The groups currently involved with the development of AMOS are listed below, along with their responsibilities and areas of expertise.&lt;br /&gt;
&lt;br /&gt;
* University of Maryland, Center for Bioinformatics and Computational Biology&lt;br /&gt;
** project organization and direction&lt;br /&gt;
** infrastructure&lt;br /&gt;
** consensus&lt;br /&gt;
** automated sequence editing&lt;br /&gt;
** scaffolding&lt;br /&gt;
** overlap detection&lt;br /&gt;
** contig construction&lt;br /&gt;
&lt;br /&gt;
* The Institute for Genomic Research&lt;br /&gt;
** production pipelines&lt;br /&gt;
** automated finishing tools&lt;br /&gt;
** error correction&lt;br /&gt;
&lt;br /&gt;
* Karolinska Institutet&lt;br /&gt;
** overlap detection&lt;br /&gt;
** error correction&lt;br /&gt;
&lt;br /&gt;
* Marine Biological Laboratory - Woods Hole&lt;br /&gt;
** graphical interface&lt;br /&gt;
** integration of assembly data with analysis (gene, polymorphism, etc.) information&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Join the consortium ==&lt;br /&gt;
&lt;br /&gt;
All interested parties are welcome to join or aid the AMOS consortium. Please address all correspondence via Email to:&lt;br /&gt;
&lt;br /&gt;
 amos-help (at) lists (dot) sourceforge (dot) net&lt;br /&gt;
&lt;br /&gt;
To receive information regarding new releases and developments, please subscribe to our moderated, low-traffic users&amp;#039; mailing list:&lt;br /&gt;
&lt;br /&gt;
 amos-users (at) lists (dot) sourceforge (dot) net&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Bug reports and support ==&lt;br /&gt;
&lt;br /&gt;
For AMOS bug reports or support requests, please browse our SourceForge project page or Email us at:&lt;br /&gt;
&lt;br /&gt;
 amos-help (at) lists (dot) sourceforge (dot) net&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
&lt;br /&gt;
The AMOS consortium would like to thank the following organizations for their funding and/or support:&lt;br /&gt;
* The National Institutes of Health - grants R01-LM06845, N01-AI-15447&lt;br /&gt;
* The National Science Foundation - grants IIS-9902923, IIS-9820497&lt;br /&gt;
* Department of Homeland Security - cooperative agreement W81XWH-05-2-0051&lt;br /&gt;
* SourceForge.net&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/AMOS_Getting_Started</id>
		<title>AMOS Getting Started</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/AMOS_Getting_Started"/>
				<updated>2011-07-22T17:58:20Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: /* Downloading the development version */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOC}}&lt;br /&gt;
&lt;br /&gt;
Is AMOS an assembler?  is one of the first questions we are asked.  The short answer is no.  AMOS is not an assembler, rather a software infrastructure for developing assembly tools.  If you are only interested in running an off-the-shelf assembler on your shotgun data, do not despair, AMOS provides two such assemblers: AMOScmp - a comparative assembler; and Minimus - a basic assembler for small datasets.  However it is important to realize that, with a little bit of programming, you can use AMOS to put together your own shotgun assembler customized for the specific characteristics of your data.  &lt;br /&gt;
&lt;br /&gt;
This page will provide you with the basic information needed to get started using AMOS.  Advanced AMOS users can go directly to in-depth resources from the main page [[AMOS]].&lt;br /&gt;
&lt;br /&gt;
== Downloading AMOS ==&lt;br /&gt;
AMOS can be downloaded from Sourceforge using the following link: [http://sourceforge.net/project/showfiles.php?group_id=134326 http://sourceforge.net/project/showfiles.php?group_id=134326]&lt;br /&gt;
&lt;br /&gt;
No need to remember this URL as you can easily reach it from the [AMOS main page].&lt;br /&gt;
&lt;br /&gt;
This link will bring you to the Sourceforge download page for our project.  While older versions of our code are also available for download from this page we recommend you download the latest version to take advantage of the full functionality of the code.&lt;br /&gt;
&lt;br /&gt;
AMOS is released as a source-code package, with the exception of the OSX version of the assembly viewer Hawkeye, that can be downloaded as a binary from the File Release section of the download page.  Instructions for compiling and installing AMOS are provided below.  &lt;br /&gt;
&lt;br /&gt;
=== Downloading the development version ===&lt;br /&gt;
&lt;br /&gt;
If you want the bleeding-edge of AMOS, e.g. to edit the source code, you should download the development version of AMOS using CVS following the directions here: [http://sourceforge.net/scm/?type=git&amp;amp;group_id=134326 http://sourceforge.net/scm/?type=git&amp;amp;group_id=134326]&lt;br /&gt;
&lt;br /&gt;
Or in short:&lt;br /&gt;
   git clone git://amos.git.sourceforge.net/gitroot/amos/amos&lt;br /&gt;
&lt;br /&gt;
== Installing AMOS ==&lt;br /&gt;
After reading this section make sure you also read the INSTALL file distributed with AMOS.  This file may contain information pertaining to the latest version of AMOS that is not included here.&lt;br /&gt;
&lt;br /&gt;
=== Installing the development version ===&lt;br /&gt;
&lt;br /&gt;
The first step to install the CVS version of AMOS is to type:&lt;br /&gt;
   ./bootstrap&lt;br /&gt;
&lt;br /&gt;
Then proceed with the instructions for the normal installation below.&lt;br /&gt;
&lt;br /&gt;
=== Normal installation ===&lt;br /&gt;
The AMOS source package has a name like: amos-1.4.5.tar.gz where 1.4.5 is the version of the code.  Once you untar this file (using &amp;quot;tar -xzf amos-1.4.5.tar.gz&amp;quot; in Linux, or &amp;quot;gunzip -d amos-1.4.5.tar.gz | tar xf -&amp;quot; in other flavors of Unix) you will find the current AMOS distribution in a directory named amos-1.4.5.  The next steps assume you have cd&amp;#039;d into this directory.&lt;br /&gt;
&lt;br /&gt;
AMOS uses the [http://www.gnu.org/software/autoconf GNU autoconf] package to reduce cross-platform compatibility issues.  Before compiling the code you will need to run the configure script that will probe your system for the locations of all software packages required by AMOS.&lt;br /&gt;
&lt;br /&gt;
By simply running:&lt;br /&gt;
&lt;br /&gt;
 ./configure&lt;br /&gt;
&lt;br /&gt;
you will prepare AMOS to be installed in the directory hosting the source package.  This is OK if you are just testing AMOS.  We recommend, however, that you provide the configure script with a more permanent home for AMOS, e.g.:&lt;br /&gt;
&lt;br /&gt;
 ./configure --prefix=/usr/local/AMOS&lt;br /&gt;
&lt;br /&gt;
will ultimately lead the AMOS directory hierarchy to be installed underneath /usr/local/.  &lt;br /&gt;
&lt;br /&gt;
After running configure, make sure you check the messages left on your screen to make sure no errors occured.  Errors during the configure step can lead to an incomplete build.&lt;br /&gt;
&lt;br /&gt;
To compile the code you need to simply run:&lt;br /&gt;
&lt;br /&gt;
 make&lt;br /&gt;
&lt;br /&gt;
followed by&lt;br /&gt;
&lt;br /&gt;
 make install&lt;br /&gt;
&lt;br /&gt;
to install AMOS into the directory selected with the --prefix option to configure.  &lt;br /&gt;
&lt;br /&gt;
Normally, these steps are sufficient to install AMOS on most UNIX systems.  If you encounter errors during configuration or compilation, or if you are trying to install AMOS on an OSX or Cygwin system, please read the following sub-sections.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Specifying the location of dependencies ===&lt;br /&gt;
If the configure script gives you a message like:&lt;br /&gt;
&lt;br /&gt;
 WARNING! nucmer was not found but is required to run AMOScmp&lt;br /&gt;
    install nucmer if planning on using AMOScmp&lt;br /&gt;
&lt;br /&gt;
you either have not installed the [http://mummer.sourceforge.net/ MUMmer] package, or you have installed it in a location where the configure script cannot find it.  MUMmer (the nucmer program in particular) is required by the comparative assembler [[AMOScmp]].&lt;br /&gt;
&lt;br /&gt;
To remedy this situation, please install MUMmer following instructions found at [http://mummer.sourceforge.net http://mummer.sourceforge.net].  &lt;br /&gt;
&lt;br /&gt;
If MUMmer is already installed, but configure cannot find it, you can specify the location of the nucmer program by setting the environment variably NUCMER, e.g.:&lt;br /&gt;
&lt;br /&gt;
 NUCMER=/usr/local/bin/mummer/nucmer&lt;br /&gt;
 export NUCMER&lt;br /&gt;
&lt;br /&gt;
in a &amp;quot;traditional&amp;quot; shell (sh, bash, ksh, etc.), or&lt;br /&gt;
&lt;br /&gt;
 setenv NUCMER /usr/local/bin/mummer/nucmer&lt;br /&gt;
&lt;br /&gt;
in csh or tcsh.  Of course you&amp;#039;ll need to replace /usr/local/bin/mummer/nucmer with the actual location of this program on your system.&lt;br /&gt;
Specifying the location of the QT library&lt;br /&gt;
On most Unix installations (see below for OSX and Cygwin), the QT library should be properly installed and AMOS will make without any problems. If, however, you notice a message like:&lt;br /&gt;
&lt;br /&gt;
 WARNING! Qt3 toolkit was not found but is required to run AMOS GUIs&lt;br /&gt;
&lt;br /&gt;
the configure process was not able to find the QT library on your system.  Check with your system administrator to have this toolkit installed on your system.  If, however, you are certain the toolkit is installed, but AMOS still didn&amp;#039;t find it, you can directly specify the location of the toolkit directory, or specifically the include, bin, and lib directories, where QT is installed, and the name of the library file, using the following options to the configure script:&lt;br /&gt;
&lt;br /&gt;
 --with-Qt-dir&lt;br /&gt;
 --with-Qt-include_dir&lt;br /&gt;
 --with-Qt-lib_dir&lt;br /&gt;
 --with-Qt-bin_dir&lt;br /&gt;
 --with-Qt-lib&lt;br /&gt;
&lt;br /&gt;
Similarly, if you get the message:&lt;br /&gt;
&lt;br /&gt;
 WARNING! Boost graph toolkit was not found but is required to run parts of the AMOS Scaffolder (Bambus 2)&lt;br /&gt;
&lt;br /&gt;
try specifying the location of Boost with the option:&lt;br /&gt;
&lt;br /&gt;
 --with-Boost-dir&lt;br /&gt;
&lt;br /&gt;
=== Debian and Ubuntu installation ===&lt;br /&gt;
[[Debian installation]]&lt;br /&gt;
&lt;br /&gt;
=== Fedora, RedHat, CentOS installation ===&lt;br /&gt;
[[Fedora installation]]&lt;br /&gt;
&lt;br /&gt;
=== Mac OS X installation ===&lt;br /&gt;
&lt;br /&gt;
[[OSX installation]]&lt;br /&gt;
&lt;br /&gt;
=== Cygwin installation ===&lt;br /&gt;
[[Cygwin installation]]&lt;br /&gt;
&lt;br /&gt;
== Running AMOS ==&lt;br /&gt;
&lt;br /&gt;
=== Basic AMOS concepts ===&lt;br /&gt;
AMOS consists of a collection of modules that operate on a central data-structure called a bank.  A bank is really just a directory that contains a database (organized as a collection of indexed files) comprising assembly related objects such as reads, contigs, scaffolds, etc.  The modules thus communicate with each other by making changes to the bank.  For example, an assembler might consist of three modules: an overlapper, a contigger, and a multi-aligner.  The overlapper will first read the shotgun reads from the bank, compare them to each other and write back to the bank a list of overlaps, i.e. pairs of reads that match each other.  The contigger then reads the collection of overlaps and makes sense out of it, by producing a layout of the reads that is consistent with most of the observed overlaps.  The contigger then writes these contigs (contiguous chunks of the genome) to the bank.  Finally, the multi-aligner reads from the bank both the reads and the contigs, builds a multiple alignment of the reads, using as a guide the layout of the reads produced by the contigger, then updates the contigs with the detailed alignment information.  Thus, the three programs were able to communicate with each other using the bank as an intermediate storage space.  If this litle description didn&amp;#039;t make much sense to you, check out our [http://www.cbcb.umd.edu/research/assembly_primer.shtml Genome Assembly Primer].  It also has pointers to future reading.&lt;br /&gt;
&lt;br /&gt;
Objects in the bank may be identified by one, or both of the following identifiers:  IID (internal identifier) - an integer identifier, internal to AMOS; and EID (external identifier) - a string representing some external identifier of the record, e.g. the original name of a sequencing read.  Both identifiers must be unique for a specific object type, but may be shared by multiple objects.  For example, there can only be one contig with an IID equal to 1, however there can be both a contig, and a read, and an overlap, all with the IID = 1.&lt;br /&gt;
Message files&lt;br /&gt;
The AMOS banks are not the only mechanism for AMOS modules to communicate with each other, and to the &amp;quot;outside world&amp;quot;.  AMOS also uses a flat-file format (AMOS message files) inspired  by the format used in Celera Assembler.  This format is generally used as an intermediate format for converting to and from external file formats.  The AMOS message files are then used to populate the data-structures present in a bank.  &lt;br /&gt;
&lt;br /&gt;
For more details on the AMOS message file format check out the [[Infrastructure]] pages.  The use of message files will be described in more detail in the remainder of this tutorial.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Reading and writing banks ====&lt;br /&gt;
To learn how to generate AMOS message files check out the section called Creating inputs for AMOS.  Assuming you already have an AMOS message file, most of the modules will require that the information from this file be loaded into a bank.  This section describes the commands used to transfer information between a bank and the message file.&lt;br /&gt;
&lt;br /&gt;
The command bank-transact can be used to load a message file into a bank.  In its simplest invocation:&lt;br /&gt;
&lt;br /&gt;
 bank-transact -b mybank -m mymessagefile&lt;br /&gt;
&lt;br /&gt;
bank-transact loads the messages in mymessagefile into the bank mybank.  Note that this invocation assume the bank already exists, and bank-transact will fail otherwise.  When creating a new bank you can run:&lt;br /&gt;
&lt;br /&gt;
 bank-transact -c -b mybank -m mymessagefile&lt;br /&gt;
&lt;br /&gt;
The option -c stands for &amp;quot;create&amp;quot;.  By also providing the option -f  (force), the bank will be overwritten if it already exists.&lt;br /&gt;
&lt;br /&gt;
The contents of a bank can be output into a flat-file format with the command:&lt;br /&gt;
&lt;br /&gt;
 bank-report -b mybank&lt;br /&gt;
&lt;br /&gt;
By default bank-report outputs all the data in the bank.  The output can be restricted to certain message types by providing the 3 letter codes of the messages to be output, e.g:&lt;br /&gt;
&lt;br /&gt;
 bank-report -b mybank CTG RED&lt;br /&gt;
&lt;br /&gt;
will output all the contigs (CTG) and read (RED) records.  In addition bank-report allows the user to specify a list of EIDs (option -E) or a list of IIDs (option -I) that will be reported.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Bank locking ====&lt;br /&gt;
To allow concurrent access to the bank, AMOS programs lock the bank while the operate on it.  There are two types of locks: for reading, and writing.  If a bank is locked for reading, other read accesses are allowed but no writes.  If a bank is locked for writing, no concurrent accesses are allowed.  Some of the AMOS tools (such as the viewer Hawkeye), have an option to load a bank in &amp;quot;inspect&amp;quot; mode, i.e. the code ignores any locks placed on the bank.&lt;br /&gt;
&lt;br /&gt;
In certain situations, if a program accessing the bank crashes, the bank may remain locked, prohibiting further access.  All existing locks can be removed with the command (make sure that another user is not accessing the same bank):&lt;br /&gt;
&lt;br /&gt;
 bank-unlock mybank&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Bank versions ====&lt;br /&gt;
The specific format of the AMOS bank is closely related to the current version of the AMOS software.  The banks are not backward compatible, i.e., a bank produced by AMOS 1.0 will not be readable by AMOS 1.5.  A simple solution for reading a bank created by an older version of AMOS is to output the contents of the bank using bank-report (the AMOS distribution contains old versions of the bank-report code, e.g. bank-report-1.1) , then reload the bank with the most recent bank-transact command.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Pipelines ====&lt;br /&gt;
As it has hopefully become clear from the introduction to AMOS above, most genome assembly tasks involve the sequential execution of several modules, in an assembly line (or pipeline) fashion.   AMOS provides a mechanism for quickly putting together simple pipelines.   By &amp;quot;simple&amp;quot; we mean situations where the specific assembly task involves running several programs in order, without the need for more complex control structures such as &amp;quot;if&amp;quot; statements or loops.  To implement complex pipelines you will have to rely on Perl or another complex programming language.  &lt;br /&gt;
&lt;br /&gt;
An AMOS pipelines are described in a simple interpreted language, and consist of a series of steps that are executed in order.  The steps are meant to provide a logical breakdown of the individual assembly tasks, representing the execution of one or more programs.  Each step in a pipeline is identified by a step number (a throw-back to the days of the Basic language) providing the user with a mechanism to execute only some of the steps of a pipeline.  &lt;br /&gt;
&lt;br /&gt;
To learn more about AMOS pipelines and how to write them, check out the documentation for [[runAmos]] (the pipeline executor), or check out one of the pipelines distributed with AMOS (AMOScmp and minimus are good starting points).&lt;br /&gt;
&lt;br /&gt;
=== Creating inputs for AMOS ===&lt;br /&gt;
The inputs to most AMOS programs must be provided in the AMOS message format.  For help converting non-AMOS file formats into message files see the [[File conversion utilities]].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Running AMOScmp ===&lt;br /&gt;
AMOScmp is a comparative assembler that can be used to assemble reads from one genome (called the target) using as a template the sequence of a related genome (called the reference).  Read the AMOScmp documentation for a  detailed description of this program.&lt;br /&gt;
&lt;br /&gt;
By default, running AMOScmp as follows:&lt;br /&gt;
&lt;br /&gt;
 AMOScmp prefix&lt;br /&gt;
&lt;br /&gt;
assumes that the target is provided in the AMOS message file prefix.afg, and the reference in the file prefix.1con.  To use different file locations, you can set the variables TGT and REF, either directly within the AMOScmp script, or on the command line:&lt;br /&gt;
&lt;br /&gt;
 AMOScmp -D &amp;quot;TGT=mytarget.afg&amp;quot; -D &amp;quot;REF=myreference.1con&amp;quot;  prefix&lt;br /&gt;
&lt;br /&gt;
The prefix must still be provided as it is used to generate the name of the output files.&lt;br /&gt;
&lt;br /&gt;
AMOScmp will populate a bank named prefix.bnk, and will load into it a set of contigs, as well as a scaffold, linking together contigs that are adjacent along the reference.  In addition, AMOScmp outputs the set of contigs as both a multi-FASTA file prefix.fasta, and a TIGR .contig file prefix.contig.  Note that the consensus of the contigs (reported in the FASTA file) is generated from the target genome, and may differ from the reference genome (after all, the goal of the assembler is to assemble the target).  In fact, AMOScmp uses sophisticated algorithms for detecting differences between the target and reference in order to prevent misassemblies.  For more information refer to:&lt;br /&gt;
&lt;br /&gt;
M. Pop, A. Phillippy, A.L. Delcher and S.L. Salzberg. [http://www.cbcb.umd.edu/papers/Pop%20et%20al%20Comparative.pdf Comparative genome assembly]. Briefings in Bioinformatics. 5(3), pp. 237-248, 2004.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Running minimus ===&lt;br /&gt;
Minimus is a basic genome assembler that can be used for small assembly jobs (e.g. a single gene, or a viral genome).  Minimus is currently used as a central component of the Influenza A sequencing pipeline at The Institute for Genomic Research. Read the [[minimus]] documentation for more information.&lt;br /&gt;
&lt;br /&gt;
To run minimus you must provide a set of shotgun reads in an AMOS message file.  Running:&lt;br /&gt;
&lt;br /&gt;
 minimus prefix&lt;br /&gt;
&lt;br /&gt;
assumes the input is in file prefix.afg.  After running, minimus populates the bank prefix.bnk with a set of contigs, furthermore it reports the contigs in both a FASTA file (prefix.fasta) and a TIGR .contig file (prefix.contig).   Note that minimus does not use mate-pairs.  In essence it is, in Celera Assembler terminology, a unitigger.  Any mate-pair information provided in the .afg will be silently ignored.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Viewing the result of an assembly ===&lt;br /&gt;
The content of a  bank can be viewed with a program called Hawkeye:&lt;br /&gt;
&lt;br /&gt;
 hawkeye mybank&lt;br /&gt;
&lt;br /&gt;
For detailed information on how to use Hawkeye, refer to the [[Hawkeye]] documentation.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Validating assemblies ===&lt;br /&gt;
Even the best genome assemblers sometimes make mistakes.  AMOS provides a mechanism to run several checks on the output of an assembler (assuming the data are already stored in a bank), through a script called amosvalidate.  Amosvalidate runs through the assembly and identifies several types of inconsistencies, such as clusters of SNPs in the assembled reads, clusters of mate-pairs that are too close or too far from each other (with respect to the estimated library sizes), and unassembled reads that do not properly match the assembly.  A full description of these measures is beyond the scope of this document.  We are currently submitting a manuscript describing the tools included in amosvalidate and will update this page when it gets published.&lt;br /&gt;
&lt;br /&gt;
All the potential assembly problems identified by amosvalidate are written back into the bank as features, i.e ranges along the assembly.  Each feature is tagged with the problem that was identified in that region.  Typically, users then load the assembly in the Hawkeye viewer and examine the assembly in the tagged regions.  Alternatively, the features may be extracted from the bank and processed automatically by specialized software (e.g. several assemblies of a same genome can be compared by the number of features identified in the assembly - the assembly with fewer features is likely &amp;quot;better&amp;quot;).  &lt;br /&gt;
&lt;br /&gt;
Running amosvalidate is as simple as:&lt;br /&gt;
&lt;br /&gt;
 amosvalidate prefix&lt;br /&gt;
&lt;br /&gt;
where prefix.bnk is the location of the bank.&lt;br /&gt;
&lt;br /&gt;
== Getting help ==&lt;br /&gt;
To report bugs in AMOS, or to get help, email us at:&lt;br /&gt;
&lt;br /&gt;
 amos-help (at) lists (dot) sourceforget (dot) net&lt;br /&gt;
&lt;br /&gt;
To receive information regarding new releases and developments, please [http://lists.sourceforge.net/lists/listinfo/amos-users subscribe] to our moderated, low-traffic users&amp;#039; mailing list:&lt;br /&gt;
&lt;br /&gt;
 amos-users (at) lists (dot) sourceforget (dot) net&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/AMOS_Getting_Started</id>
		<title>AMOS Getting Started</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/AMOS_Getting_Started"/>
				<updated>2011-07-22T17:56:28Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: /* Downloading the development version */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOC}}&lt;br /&gt;
&lt;br /&gt;
Is AMOS an assembler?  is one of the first questions we are asked.  The short answer is no.  AMOS is not an assembler, rather a software infrastructure for developing assembly tools.  If you are only interested in running an off-the-shelf assembler on your shotgun data, do not despair, AMOS provides two such assemblers: AMOScmp - a comparative assembler; and Minimus - a basic assembler for small datasets.  However it is important to realize that, with a little bit of programming, you can use AMOS to put together your own shotgun assembler customized for the specific characteristics of your data.  &lt;br /&gt;
&lt;br /&gt;
This page will provide you with the basic information needed to get started using AMOS.  Advanced AMOS users can go directly to in-depth resources from the main page [[AMOS]].&lt;br /&gt;
&lt;br /&gt;
== Downloading AMOS ==&lt;br /&gt;
AMOS can be downloaded from Sourceforge using the following link: [http://sourceforge.net/project/showfiles.php?group_id=134326 http://sourceforge.net/project/showfiles.php?group_id=134326]&lt;br /&gt;
&lt;br /&gt;
No need to remember this URL as you can easily reach it from the [AMOS main page].&lt;br /&gt;
&lt;br /&gt;
This link will bring you to the Sourceforge download page for our project.  While older versions of our code are also available for download from this page we recommend you download the latest version to take advantage of the full functionality of the code.&lt;br /&gt;
&lt;br /&gt;
AMOS is released as a source-code package, with the exception of the OSX version of the assembly viewer Hawkeye, that can be downloaded as a binary from the File Release section of the download page.  Instructions for compiling and installing AMOS are provided below.  &lt;br /&gt;
&lt;br /&gt;
=== Downloading the development version ===&lt;br /&gt;
&lt;br /&gt;
If you want the bleeding-edge of AMOS, e.g. to edit the source code, you should download the development version of AMOS using CVS following the directions here: [http://sourceforge.net/scm/?type=cvs&amp;amp;group_id=134326 http://sourceforge.net/scm/?type=cvs&amp;amp;group_id=134326]&lt;br /&gt;
&lt;br /&gt;
Or in short:&lt;br /&gt;
   git clone git://amos.git.sourceforge.net/gitroot/amos/amos&lt;br /&gt;
&lt;br /&gt;
== Installing AMOS ==&lt;br /&gt;
After reading this section make sure you also read the INSTALL file distributed with AMOS.  This file may contain information pertaining to the latest version of AMOS that is not included here.&lt;br /&gt;
&lt;br /&gt;
=== Installing the development version ===&lt;br /&gt;
&lt;br /&gt;
The first step to install the CVS version of AMOS is to type:&lt;br /&gt;
   ./bootstrap&lt;br /&gt;
&lt;br /&gt;
Then proceed with the instructions for the normal installation below.&lt;br /&gt;
&lt;br /&gt;
=== Normal installation ===&lt;br /&gt;
The AMOS source package has a name like: amos-1.4.5.tar.gz where 1.4.5 is the version of the code.  Once you untar this file (using &amp;quot;tar -xzf amos-1.4.5.tar.gz&amp;quot; in Linux, or &amp;quot;gunzip -d amos-1.4.5.tar.gz | tar xf -&amp;quot; in other flavors of Unix) you will find the current AMOS distribution in a directory named amos-1.4.5.  The next steps assume you have cd&amp;#039;d into this directory.&lt;br /&gt;
&lt;br /&gt;
AMOS uses the [http://www.gnu.org/software/autoconf GNU autoconf] package to reduce cross-platform compatibility issues.  Before compiling the code you will need to run the configure script that will probe your system for the locations of all software packages required by AMOS.&lt;br /&gt;
&lt;br /&gt;
By simply running:&lt;br /&gt;
&lt;br /&gt;
 ./configure&lt;br /&gt;
&lt;br /&gt;
you will prepare AMOS to be installed in the directory hosting the source package.  This is OK if you are just testing AMOS.  We recommend, however, that you provide the configure script with a more permanent home for AMOS, e.g.:&lt;br /&gt;
&lt;br /&gt;
 ./configure --prefix=/usr/local/AMOS&lt;br /&gt;
&lt;br /&gt;
will ultimately lead the AMOS directory hierarchy to be installed underneath /usr/local/.  &lt;br /&gt;
&lt;br /&gt;
After running configure, make sure you check the messages left on your screen to make sure no errors occured.  Errors during the configure step can lead to an incomplete build.&lt;br /&gt;
&lt;br /&gt;
To compile the code you need to simply run:&lt;br /&gt;
&lt;br /&gt;
 make&lt;br /&gt;
&lt;br /&gt;
followed by&lt;br /&gt;
&lt;br /&gt;
 make install&lt;br /&gt;
&lt;br /&gt;
to install AMOS into the directory selected with the --prefix option to configure.  &lt;br /&gt;
&lt;br /&gt;
Normally, these steps are sufficient to install AMOS on most UNIX systems.  If you encounter errors during configuration or compilation, or if you are trying to install AMOS on an OSX or Cygwin system, please read the following sub-sections.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Specifying the location of dependencies ===&lt;br /&gt;
If the configure script gives you a message like:&lt;br /&gt;
&lt;br /&gt;
 WARNING! nucmer was not found but is required to run AMOScmp&lt;br /&gt;
    install nucmer if planning on using AMOScmp&lt;br /&gt;
&lt;br /&gt;
you either have not installed the [http://mummer.sourceforge.net/ MUMmer] package, or you have installed it in a location where the configure script cannot find it.  MUMmer (the nucmer program in particular) is required by the comparative assembler [[AMOScmp]].&lt;br /&gt;
&lt;br /&gt;
To remedy this situation, please install MUMmer following instructions found at [http://mummer.sourceforge.net http://mummer.sourceforge.net].  &lt;br /&gt;
&lt;br /&gt;
If MUMmer is already installed, but configure cannot find it, you can specify the location of the nucmer program by setting the environment variably NUCMER, e.g.:&lt;br /&gt;
&lt;br /&gt;
 NUCMER=/usr/local/bin/mummer/nucmer&lt;br /&gt;
 export NUCMER&lt;br /&gt;
&lt;br /&gt;
in a &amp;quot;traditional&amp;quot; shell (sh, bash, ksh, etc.), or&lt;br /&gt;
&lt;br /&gt;
 setenv NUCMER /usr/local/bin/mummer/nucmer&lt;br /&gt;
&lt;br /&gt;
in csh or tcsh.  Of course you&amp;#039;ll need to replace /usr/local/bin/mummer/nucmer with the actual location of this program on your system.&lt;br /&gt;
Specifying the location of the QT library&lt;br /&gt;
On most Unix installations (see below for OSX and Cygwin), the QT library should be properly installed and AMOS will make without any problems. If, however, you notice a message like:&lt;br /&gt;
&lt;br /&gt;
 WARNING! Qt3 toolkit was not found but is required to run AMOS GUIs&lt;br /&gt;
&lt;br /&gt;
the configure process was not able to find the QT library on your system.  Check with your system administrator to have this toolkit installed on your system.  If, however, you are certain the toolkit is installed, but AMOS still didn&amp;#039;t find it, you can directly specify the location of the toolkit directory, or specifically the include, bin, and lib directories, where QT is installed, and the name of the library file, using the following options to the configure script:&lt;br /&gt;
&lt;br /&gt;
 --with-Qt-dir&lt;br /&gt;
 --with-Qt-include_dir&lt;br /&gt;
 --with-Qt-lib_dir&lt;br /&gt;
 --with-Qt-bin_dir&lt;br /&gt;
 --with-Qt-lib&lt;br /&gt;
&lt;br /&gt;
Similarly, if you get the message:&lt;br /&gt;
&lt;br /&gt;
 WARNING! Boost graph toolkit was not found but is required to run parts of the AMOS Scaffolder (Bambus 2)&lt;br /&gt;
&lt;br /&gt;
try specifying the location of Boost with the option:&lt;br /&gt;
&lt;br /&gt;
 --with-Boost-dir&lt;br /&gt;
&lt;br /&gt;
=== Debian and Ubuntu installation ===&lt;br /&gt;
[[Debian installation]]&lt;br /&gt;
&lt;br /&gt;
=== Fedora, RedHat, CentOS installation ===&lt;br /&gt;
[[Fedora installation]]&lt;br /&gt;
&lt;br /&gt;
=== Mac OS X installation ===&lt;br /&gt;
&lt;br /&gt;
[[OSX installation]]&lt;br /&gt;
&lt;br /&gt;
=== Cygwin installation ===&lt;br /&gt;
[[Cygwin installation]]&lt;br /&gt;
&lt;br /&gt;
== Running AMOS ==&lt;br /&gt;
&lt;br /&gt;
=== Basic AMOS concepts ===&lt;br /&gt;
AMOS consists of a collection of modules that operate on a central data-structure called a bank.  A bank is really just a directory that contains a database (organized as a collection of indexed files) comprising assembly related objects such as reads, contigs, scaffolds, etc.  The modules thus communicate with each other by making changes to the bank.  For example, an assembler might consist of three modules: an overlapper, a contigger, and a multi-aligner.  The overlapper will first read the shotgun reads from the bank, compare them to each other and write back to the bank a list of overlaps, i.e. pairs of reads that match each other.  The contigger then reads the collection of overlaps and makes sense out of it, by producing a layout of the reads that is consistent with most of the observed overlaps.  The contigger then writes these contigs (contiguous chunks of the genome) to the bank.  Finally, the multi-aligner reads from the bank both the reads and the contigs, builds a multiple alignment of the reads, using as a guide the layout of the reads produced by the contigger, then updates the contigs with the detailed alignment information.  Thus, the three programs were able to communicate with each other using the bank as an intermediate storage space.  If this litle description didn&amp;#039;t make much sense to you, check out our [http://www.cbcb.umd.edu/research/assembly_primer.shtml Genome Assembly Primer].  It also has pointers to future reading.&lt;br /&gt;
&lt;br /&gt;
Objects in the bank may be identified by one, or both of the following identifiers:  IID (internal identifier) - an integer identifier, internal to AMOS; and EID (external identifier) - a string representing some external identifier of the record, e.g. the original name of a sequencing read.  Both identifiers must be unique for a specific object type, but may be shared by multiple objects.  For example, there can only be one contig with an IID equal to 1, however there can be both a contig, and a read, and an overlap, all with the IID = 1.&lt;br /&gt;
Message files&lt;br /&gt;
The AMOS banks are not the only mechanism for AMOS modules to communicate with each other, and to the &amp;quot;outside world&amp;quot;.  AMOS also uses a flat-file format (AMOS message files) inspired  by the format used in Celera Assembler.  This format is generally used as an intermediate format for converting to and from external file formats.  The AMOS message files are then used to populate the data-structures present in a bank.  &lt;br /&gt;
&lt;br /&gt;
For more details on the AMOS message file format check out the [[Infrastructure]] pages.  The use of message files will be described in more detail in the remainder of this tutorial.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Reading and writing banks ====&lt;br /&gt;
To learn how to generate AMOS message files check out the section called Creating inputs for AMOS.  Assuming you already have an AMOS message file, most of the modules will require that the information from this file be loaded into a bank.  This section describes the commands used to transfer information between a bank and the message file.&lt;br /&gt;
&lt;br /&gt;
The command bank-transact can be used to load a message file into a bank.  In its simplest invocation:&lt;br /&gt;
&lt;br /&gt;
 bank-transact -b mybank -m mymessagefile&lt;br /&gt;
&lt;br /&gt;
bank-transact loads the messages in mymessagefile into the bank mybank.  Note that this invocation assume the bank already exists, and bank-transact will fail otherwise.  When creating a new bank you can run:&lt;br /&gt;
&lt;br /&gt;
 bank-transact -c -b mybank -m mymessagefile&lt;br /&gt;
&lt;br /&gt;
The option -c stands for &amp;quot;create&amp;quot;.  By also providing the option -f  (force), the bank will be overwritten if it already exists.&lt;br /&gt;
&lt;br /&gt;
The contents of a bank can be output into a flat-file format with the command:&lt;br /&gt;
&lt;br /&gt;
 bank-report -b mybank&lt;br /&gt;
&lt;br /&gt;
By default bank-report outputs all the data in the bank.  The output can be restricted to certain message types by providing the 3 letter codes of the messages to be output, e.g:&lt;br /&gt;
&lt;br /&gt;
 bank-report -b mybank CTG RED&lt;br /&gt;
&lt;br /&gt;
will output all the contigs (CTG) and read (RED) records.  In addition bank-report allows the user to specify a list of EIDs (option -E) or a list of IIDs (option -I) that will be reported.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Bank locking ====&lt;br /&gt;
To allow concurrent access to the bank, AMOS programs lock the bank while the operate on it.  There are two types of locks: for reading, and writing.  If a bank is locked for reading, other read accesses are allowed but no writes.  If a bank is locked for writing, no concurrent accesses are allowed.  Some of the AMOS tools (such as the viewer Hawkeye), have an option to load a bank in &amp;quot;inspect&amp;quot; mode, i.e. the code ignores any locks placed on the bank.&lt;br /&gt;
&lt;br /&gt;
In certain situations, if a program accessing the bank crashes, the bank may remain locked, prohibiting further access.  All existing locks can be removed with the command (make sure that another user is not accessing the same bank):&lt;br /&gt;
&lt;br /&gt;
 bank-unlock mybank&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Bank versions ====&lt;br /&gt;
The specific format of the AMOS bank is closely related to the current version of the AMOS software.  The banks are not backward compatible, i.e., a bank produced by AMOS 1.0 will not be readable by AMOS 1.5.  A simple solution for reading a bank created by an older version of AMOS is to output the contents of the bank using bank-report (the AMOS distribution contains old versions of the bank-report code, e.g. bank-report-1.1) , then reload the bank with the most recent bank-transact command.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Pipelines ====&lt;br /&gt;
As it has hopefully become clear from the introduction to AMOS above, most genome assembly tasks involve the sequential execution of several modules, in an assembly line (or pipeline) fashion.   AMOS provides a mechanism for quickly putting together simple pipelines.   By &amp;quot;simple&amp;quot; we mean situations where the specific assembly task involves running several programs in order, without the need for more complex control structures such as &amp;quot;if&amp;quot; statements or loops.  To implement complex pipelines you will have to rely on Perl or another complex programming language.  &lt;br /&gt;
&lt;br /&gt;
An AMOS pipelines are described in a simple interpreted language, and consist of a series of steps that are executed in order.  The steps are meant to provide a logical breakdown of the individual assembly tasks, representing the execution of one or more programs.  Each step in a pipeline is identified by a step number (a throw-back to the days of the Basic language) providing the user with a mechanism to execute only some of the steps of a pipeline.  &lt;br /&gt;
&lt;br /&gt;
To learn more about AMOS pipelines and how to write them, check out the documentation for [[runAmos]] (the pipeline executor), or check out one of the pipelines distributed with AMOS (AMOScmp and minimus are good starting points).&lt;br /&gt;
&lt;br /&gt;
=== Creating inputs for AMOS ===&lt;br /&gt;
The inputs to most AMOS programs must be provided in the AMOS message format.  For help converting non-AMOS file formats into message files see the [[File conversion utilities]].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Running AMOScmp ===&lt;br /&gt;
AMOScmp is a comparative assembler that can be used to assemble reads from one genome (called the target) using as a template the sequence of a related genome (called the reference).  Read the AMOScmp documentation for a  detailed description of this program.&lt;br /&gt;
&lt;br /&gt;
By default, running AMOScmp as follows:&lt;br /&gt;
&lt;br /&gt;
 AMOScmp prefix&lt;br /&gt;
&lt;br /&gt;
assumes that the target is provided in the AMOS message file prefix.afg, and the reference in the file prefix.1con.  To use different file locations, you can set the variables TGT and REF, either directly within the AMOScmp script, or on the command line:&lt;br /&gt;
&lt;br /&gt;
 AMOScmp -D &amp;quot;TGT=mytarget.afg&amp;quot; -D &amp;quot;REF=myreference.1con&amp;quot;  prefix&lt;br /&gt;
&lt;br /&gt;
The prefix must still be provided as it is used to generate the name of the output files.&lt;br /&gt;
&lt;br /&gt;
AMOScmp will populate a bank named prefix.bnk, and will load into it a set of contigs, as well as a scaffold, linking together contigs that are adjacent along the reference.  In addition, AMOScmp outputs the set of contigs as both a multi-FASTA file prefix.fasta, and a TIGR .contig file prefix.contig.  Note that the consensus of the contigs (reported in the FASTA file) is generated from the target genome, and may differ from the reference genome (after all, the goal of the assembler is to assemble the target).  In fact, AMOScmp uses sophisticated algorithms for detecting differences between the target and reference in order to prevent misassemblies.  For more information refer to:&lt;br /&gt;
&lt;br /&gt;
M. Pop, A. Phillippy, A.L. Delcher and S.L. Salzberg. [http://www.cbcb.umd.edu/papers/Pop%20et%20al%20Comparative.pdf Comparative genome assembly]. Briefings in Bioinformatics. 5(3), pp. 237-248, 2004.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Running minimus ===&lt;br /&gt;
Minimus is a basic genome assembler that can be used for small assembly jobs (e.g. a single gene, or a viral genome).  Minimus is currently used as a central component of the Influenza A sequencing pipeline at The Institute for Genomic Research. Read the [[minimus]] documentation for more information.&lt;br /&gt;
&lt;br /&gt;
To run minimus you must provide a set of shotgun reads in an AMOS message file.  Running:&lt;br /&gt;
&lt;br /&gt;
 minimus prefix&lt;br /&gt;
&lt;br /&gt;
assumes the input is in file prefix.afg.  After running, minimus populates the bank prefix.bnk with a set of contigs, furthermore it reports the contigs in both a FASTA file (prefix.fasta) and a TIGR .contig file (prefix.contig).   Note that minimus does not use mate-pairs.  In essence it is, in Celera Assembler terminology, a unitigger.  Any mate-pair information provided in the .afg will be silently ignored.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Viewing the result of an assembly ===&lt;br /&gt;
The content of a  bank can be viewed with a program called Hawkeye:&lt;br /&gt;
&lt;br /&gt;
 hawkeye mybank&lt;br /&gt;
&lt;br /&gt;
For detailed information on how to use Hawkeye, refer to the [[Hawkeye]] documentation.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Validating assemblies ===&lt;br /&gt;
Even the best genome assemblers sometimes make mistakes.  AMOS provides a mechanism to run several checks on the output of an assembler (assuming the data are already stored in a bank), through a script called amosvalidate.  Amosvalidate runs through the assembly and identifies several types of inconsistencies, such as clusters of SNPs in the assembled reads, clusters of mate-pairs that are too close or too far from each other (with respect to the estimated library sizes), and unassembled reads that do not properly match the assembly.  A full description of these measures is beyond the scope of this document.  We are currently submitting a manuscript describing the tools included in amosvalidate and will update this page when it gets published.&lt;br /&gt;
&lt;br /&gt;
All the potential assembly problems identified by amosvalidate are written back into the bank as features, i.e ranges along the assembly.  Each feature is tagged with the problem that was identified in that region.  Typically, users then load the assembly in the Hawkeye viewer and examine the assembly in the tagged regions.  Alternatively, the features may be extracted from the bank and processed automatically by specialized software (e.g. several assemblies of a same genome can be compared by the number of features identified in the assembly - the assembly with fewer features is likely &amp;quot;better&amp;quot;).  &lt;br /&gt;
&lt;br /&gt;
Running amosvalidate is as simple as:&lt;br /&gt;
&lt;br /&gt;
 amosvalidate prefix&lt;br /&gt;
&lt;br /&gt;
where prefix.bnk is the location of the bank.&lt;br /&gt;
&lt;br /&gt;
== Getting help ==&lt;br /&gt;
To report bugs in AMOS, or to get help, email us at:&lt;br /&gt;
&lt;br /&gt;
 amos-help (at) lists (dot) sourceforget (dot) net&lt;br /&gt;
&lt;br /&gt;
To receive information regarding new releases and developments, please [http://lists.sourceforge.net/lists/listinfo/amos-users subscribe] to our moderated, low-traffic users&amp;#039; mailing list:&lt;br /&gt;
&lt;br /&gt;
 amos-users (at) lists (dot) sourceforget (dot) net&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T20:42:50Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly.  see the journal paper.&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
NOTE : ABBA does protein assembly but doesn&amp;#039;t find the reference proteins to assemble.  You will need to find the proteins running off the ends of contigs separately and then pass the proteins to ABBA to fill in the gaps.&lt;br /&gt;
* Two ways to find the reference proteins:&lt;br /&gt;
** Do a draft annotation of the genome using a annotation pipline. ABBA will not annotate your assembly.&lt;br /&gt;
** Align the draft assembly contigs to a close relative and find where the contig ends intersect protein coding regions.&lt;br /&gt;
&lt;br /&gt;
ABBA is built on top of the AMOS framework but has it&amp;#039;s own distribution.   The AMOS framework is included in the ABBA tarball and will install with AMOS if you don&amp;#039;t already have AMOS installed. The tarball here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer/abba.tgz&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T20:39:34Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: /* Download */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly.  see the journal paper.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For additional information on short read assembly check the following University of Maryland CBCB web sites:&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly.shtml Genome Assembly with Short Reads]&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly-tutorial.shtml Genome Assembly with Short Reads Tutorial]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
NOTE : ABBA does protein assembly but doesn&amp;#039;t find the reference proteins to assemble.  You will need to find the proteins running off the ends of contigs separately and then pass the proteins to ABBA to fill in the gaps.&lt;br /&gt;
* Two ways to find the reference proteins:&lt;br /&gt;
** Do a draft annotation of the genome using a annotation pipline. ABBA will not annotate your assembly.&lt;br /&gt;
** Align the draft assembly contigs to a close relative and find where the contig ends intersect protein coding regions.&lt;br /&gt;
&lt;br /&gt;
ABBA is built on top of the AMOS framework but has it&amp;#039;s own distribution.   The AMOS framework is included in the ABBA tarball and will install with AMOS if you don&amp;#039;t already have AMOS installed. The tarball here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer/abba.tgz&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T20:38:38Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly.  see the journal paper.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For additional information on short read assembly check the following University of Maryland CBCB web sites:&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly.shtml Genome Assembly with Short Reads]&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly-tutorial.shtml Genome Assembly with Short Reads Tutorial]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
NOTE : ABBA does protein assembly but doesn&amp;#039;t find the reference proteins to assemble.  You will need to find the proteins running off the ends of contigs separately and then pass the proteins to ABBA to fill in the gaps.&lt;br /&gt;
* Two ways to find the reference proteins:&lt;br /&gt;
** Do a draft annotation of the genome&lt;br /&gt;
** Align the draft assembly contigs to a close relative and find where the contig ends intersect protein coding regions.&lt;br /&gt;
&lt;br /&gt;
ABBA is built on top of the AMOS framework but has it&amp;#039;s own distribution.   The AMOS framework is included in the ABBA tarball and will install with AMOS if you don&amp;#039;t already have AMOS installed. The tarball here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer/abba.tgz&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T20:37:47Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: /* Download */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly.  see the journal paper.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For additional information on short read assembly check the following University of Maryland CBCB web sites:&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly.shtml Genome Assembly with Short Reads]&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly-tutorial.shtml Genome Assembly with Short Reads Tutorial]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
IMPORTANT TO NOTE : ABBA does protein assembly but doesn&amp;#039;t find the reference proteins to assemble.  You will need to find the proteins running off the ends of contigs separately and then pass the proteins to ABBA to fill in the gaps.&lt;br /&gt;
Two ways to find the reference proteins:&lt;br /&gt;
** Do a draft annotation of the genome&lt;br /&gt;
** Align the draft assembly contigs to a close relative and find where the contig ends intersect protein coding regions.&lt;br /&gt;
&lt;br /&gt;
ABBA is built on top of the AMOS framework but has it&amp;#039;s own distribution.   The AMOS framework is included in the ABBA tarball and will install with AMOS if you don&amp;#039;t already have AMOS installed. The tarball here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer/abba.tgz&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T20:37:03Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly.  see the journal paper.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For additional information on short read assembly check the following University of Maryland CBCB web sites:&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly.shtml Genome Assembly with Short Reads]&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly-tutorial.shtml Genome Assembly with Short Reads Tutorial]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
IMPORTANT TO NOTE : ABBA does protein assembly but doesn&amp;#039;t find the reference proteins to assemble.  You will need to find the proteins running off the ends of contigs separately and then pass the proteins to ABBA to fill in the gaps.&lt;br /&gt;
Two ways to find the reference proteins:&lt;br /&gt;
1. Do a draft annotation of the genome&lt;br /&gt;
2. Align the draft assembly contigs to a close relative and find where the contig ends intersect protein coding regions.&lt;br /&gt;
&lt;br /&gt;
ABBA is built on top of the AMOS framework but has it&amp;#039;s own distribution.   The AMOS framework is included in the ABBA tarball and will install with AMOS if you don&amp;#039;t already have AMOS installed. The tarball here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer/abba.tgz&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T20:35:10Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly.  see the journal paper.&lt;br /&gt;
&lt;br /&gt;
Two ways to find the reference proteins:&lt;br /&gt;
1. Do a draft annotation of the genome&lt;br /&gt;
2. Align the draft assembly contigs to a close relative and find where the contig ends intersect protein coding regions.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For additional information on short read assembly check the following University of Maryland CBCB web sites:&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly.shtml Genome Assembly with Short Reads]&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly-tutorial.shtml Genome Assembly with Short Reads Tutorial]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
ABBA is built on top of the AMOS framework but has it&amp;#039;s own distribution.   The AMOS framework is included in the ABBA tarball and will install with AMOS if you don&amp;#039;t already have AMOS installed. The tarball here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer/abba.tgz&lt;br /&gt;
&lt;br /&gt;
IMPORTANT TO NOTE : ABBA does protein assembly but doesn&amp;#039;t find the reference proteins to assemble.  You will need to find the proteins running off the ends of contigs separately and then pass the proteins to ABBA to fill in the gaps.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T20:34:26Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly.  see the journal paper.&lt;br /&gt;
&lt;br /&gt;
Two ways to find the reference proteins:&lt;br /&gt;
1. Do a draft annotation of the genome&lt;br /&gt;
2. Align the draft assembly contigs to a close relative and find where the contig ends intersect protein coding regions.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For additional information on short read assembly check the following University of Maryland CBCB web sites:&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly.shtml Genome Assembly with Short Reads]&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly-tutorial.shtml Genome Assembly with Short Reads Tutorial]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
ABBA is built on top of the AMOS framework but has it&amp;#039;s own distribution.   The AMOS framework is included in the ABBA tarball and will install with AMOS if you don&amp;#039;t already have AMOS installed. The tarball here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer/abba.tgz&lt;br /&gt;
&lt;br /&gt;
 IT IS IMPORTANT TO NOTE: ABBA does protein assembly but doesn&amp;#039;t find the reference proteins to assemble.  You will need to find the proteins running off the ends of contigs separately and then pass the proteins to ABBA to fill in the gaps.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T19:55:28Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly. See the journal paper.&lt;br /&gt;
&lt;br /&gt;
For additional information on short read assembly check the following University of Maryland CBCB web sites:&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly.shtml Genome Assembly with Short Reads]&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly-tutorial.shtml Genome Assembly with Short Reads Tutorial]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
Documentation on running ABBA is included in the tarball here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer/abba.tgz&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T19:21:19Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly. See the journal paper.&lt;br /&gt;
&lt;br /&gt;
For additional information on short read assembly check the following University of Maryland CBCB web sites:&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly.shtml Genome Assembly with Short Reads]&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly-tutorial.shtml Genome Assembly with Short Reads Tutorial]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
Documentation on running ABBA is included in the tarball here: ftp://ftp.cbcb.umd.edu/pub/data/dsommer&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;quot;NOTE: ABBA relies on other tools in the AMOS package, so you need&lt;br /&gt;
to download and install AMOS as well.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/ABBA</id>
		<title>ABBA</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/ABBA"/>
				<updated>2011-02-17T19:13:39Z</updated>
		
		<summary type="html">&lt;p&gt;Ddsommer: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;ABBA: Assembly Boosted By Amino acid sequences&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly. See the journal paper.&lt;br /&gt;
&lt;br /&gt;
For additional information on short read assembly check the following University of Maryland CBCB web sites:&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly.shtml Genome Assembly with Short Reads]&lt;br /&gt;
* [http://www.cbcb.umd.edu/research/SR-assembly-tutorial.shtml Genome Assembly with Short Reads Tutorial]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
Documentation on running ABBA is included in the tarball&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;quot;NOTE: ABBA relies on other tools in the AMOS package, so you need&lt;br /&gt;
to download and install AMOS as well.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
[http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads].&lt;br /&gt;
&lt;br /&gt;
Salzberg SL, Sommer DD, Puiu D, Lee VT 2008 PLoS Computational Biology 4(9): e1000186 doi:10.1371/journal.pcbi.1000186&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of ABBA was supported by the National Institutes of Health under grants [http://www.google.com/search?q=R01-LM06845 R01-LM06845] and [http://www.google.com/search?q=R01-LM007938 R01-LM007938] to SLS.&lt;/div&gt;</summary>
		<author><name>Ddsommer</name></author>	</entry>

	</feed>