<?xml version="1.0"?>
<?xml-stylesheet type="text/css" href="http://amos.sourceforge.net/wiki/skins/common/feed.css?303"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>http://amos.sourceforge.net/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Trgibbons</id>
		<title>AMOS WIKI - User contributions [en]</title>
		<link rel="self" type="application/atom+xml" href="http://amos.sourceforge.net/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Trgibbons"/>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Special:Contributions/Trgibbons"/>
		<updated>2026-04-30T08:35:43Z</updated>
		<subtitle>User contributions</subtitle>
		<generator>MediaWiki 1.23.5</generator>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus</id>
		<title>Minimus</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus"/>
				<updated>2010-01-29T16:25:49Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
Minimus is one of several assembly pipelines included in the AMOS software package. It is designed specifically for small data-sets, such as the set of reads covering a specific gene. Note that the code will work for larger assemblies (we have used it to assemble bacterial genomes), however, due to its stringency, the resulting assembly will be highly fragmented. For large and/or complex assemblies the execution of Minimus should be followed by additional processing steps, such as scaffolding.&lt;br /&gt;
&lt;br /&gt;
Minimus follows the Overlap-Layout-Consensus paradigm and consists of three main modules which share information through a central file bank:&lt;br /&gt;
&lt;br /&gt;
* [[hash-overlap]] - Computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
* [[tigger]] - Uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
* [[make-consensus]] - Refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Minimus uses AMOS message files as both the inputs and the outputs. Please see the [[File conversion utilities]] documentation for more information.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[minimus2 | Minimus2]] is a modified version of the minimus pipeline designed for merging two sequence sets. Instead of hash-overlap it uses a nucmer based overlap detector which is much faster.&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
Documentation on running minimus is included with the distribution in the /docs subdirectory.&lt;br /&gt;
&lt;br /&gt;
See [[Minimus/README]].&lt;br /&gt;
&lt;br /&gt;
== Examples ==&lt;br /&gt;
&lt;br /&gt;
Examples of a flu assembly and a Zebrafish gene can be found in the test/minimus directory created when the AMOS distribution is untarred. Documentation on the examples is included with the distribution in /docs/minimus.README.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Basic usage  ==&lt;br /&gt;
&lt;br /&gt;
To run minimus will you need a set of sequence files. Assuming you have a set of reads in fasta format called &amp;#039;&amp;#039;&amp;#039;my_reads.seq&amp;#039;&amp;#039;&amp;#039;, you can run minimus with the following two commands:&lt;br /&gt;
&lt;br /&gt;
 toAmos -s my_reads.seq -o my_reads.afg&lt;br /&gt;
 &lt;br /&gt;
 minimus my_reads&lt;br /&gt;
&lt;br /&gt;
The output will be a fasta formatted file called &amp;#039;&amp;quot;my_reads.fasta&amp;quot;&amp;#039;, a contig file with details about the assembly of each contig called &amp;#039;&amp;quot;my_reads.contig&amp;quot;&amp;#039;, and an AMOS bank folder with various files used internally by minimus.&lt;br /&gt;
The toAmos file conversion utility is the most general and probably the most useful of the file conversion utilities included with minimus. More information about toAmos and the [[File_conversion_utilities | other file conversion utilities]] can be found in the [[AMOS | AMOS documentation wiki]]. For example, you can include quality data from a Phred style quality score file by running [[ToAmos | toAmos]] with the -q option as follows:&lt;br /&gt;
&lt;br /&gt;
 toAmos -s my_reads.fasta -q my_reads.qual -o my_reads.afg&lt;br /&gt;
&lt;br /&gt;
Minimus can also be called with the following equivalent command:&lt;br /&gt;
&lt;br /&gt;
 runAmos -C $AMOSBASE/src/Pipeline/minimus.acf my_reads&lt;br /&gt;
&lt;br /&gt;
The AMOS package also includes other helpful tools such as [[Hawkeye]], which is useful for evaluating your assembly with respect to paired-end reads. It can be run on the minimus bank with the following command:&lt;br /&gt;
&lt;br /&gt;
 hawkeye my_reads.bnk/&lt;br /&gt;
&lt;br /&gt;
== Publication ==&lt;br /&gt;
&lt;br /&gt;
[http://www.biomedcentral.com/1471-2105/8/64 Minimus: a fast, lightweight genome assembler]&lt;br /&gt;
&lt;br /&gt;
Sommer, DD, Delcher, AL, Salzberg, SL, and Pop, M. (2007) BMC Bioinformatics, 8:64doi:10.1186/1471-2105-8-64.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of minimus was supported by the National Institutes of Health under grants R01-LM06845 and R01-LM007938 to SLS and by Department of Homeland Security cooperative agreement W81XWH-05-2-0051.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Tarchive2amos</id>
		<title>Tarchive2amos</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Tarchive2amos"/>
				<updated>2010-01-20T19:25:11Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Required inputs */ Renamed section, since qual and xml files are not actually required inputs&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;tarchive2amos: utility for generating AMOS message files&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
The AMOS package uses a compact representation for the information exchange to and from the assembler. This representation, the AMOS message format, is described in detail here, and was inspired by the interchange format developed at Celera Genomics for use in Celera Assembler.&lt;br /&gt;
&lt;br /&gt;
Tarchive2amos is a utility that allows users to convert files from the NCBI Trace Archive format into the AMOS message format.&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
== Input Files ==&lt;br /&gt;
&lt;br /&gt;
tarchive2amos can use data specified in the following three formats:&lt;br /&gt;
&lt;br /&gt;
* sequence data in one or more multi-fasta formatted files. These files must be named fasta.* (Trace Archive standard) or *.seq.&lt;br /&gt;
* quality data in zero or more multi-fasta formatted files. These files must be named qual.* (Trace Archive standard) or *.qual and must match the names of the sequence files. Note that the quality files are not necessary: if they are absent all bases will be assigned quality value of 20 (1 error in 100 bp).&lt;br /&gt;
* anciliary data in xml format. These files must be named xml.* (Trace Archive standard) or *.xml and must match the names of the sequence files. The information specified in these files includes (but is not limited to) clipping information, library size information, etc. For more information please refer to the Trace Archive documentation. Like the quality information, the XML files are not required.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In addition to these files, the user can provide a list of clear ranges (clipping coordinates) in a separate file. This information will override any set by the xml files. Furthermore, reads not present in the clear range file will be excluded from the conversion.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Note that if a clear range file is not specified, reads with no clear range set in the XML or the sequence file (see below) will be assigned a clear range that spans the entire extent of the read.&lt;br /&gt;
&lt;br /&gt;
== Sequence file formats ==&lt;br /&gt;
&lt;br /&gt;
tarchive2amos accepts four different formats for the header lines in the sequence file:&lt;br /&gt;
&lt;br /&gt;
* Trace Archive format generated by a query (either through website or query_tracedb script)&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;gnl|ti|145655111 name:38245161 ...&lt;br /&gt;
&lt;br /&gt;
The first identifier is the TRACE_ID in the XML file and the second one is the name assigned to the trace (TRACE_NAME) in the xml file.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* Trace Archive format:&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;gnl|ti|145655111 38245161&lt;br /&gt;
&lt;br /&gt;
The first identifier is the trace identifier (TRACE_ID in the XML file) while the second one is the assigned name for the trace (TRACE_NAME in the XML file). The output message file will only contain the trace name (in the eid: field of each read record).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* TIGR sequence format (also produced by the trimming package lucy) :&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;GBRAA01TF 1000 2000 1500 17 823&lt;br /&gt;
&lt;br /&gt;
The first identifier is the trace name, followed by three numbers representing the library size estimates (ignored by tarchive2amos), then followed by the clear range.&lt;br /&gt;
&lt;br /&gt;
* Generic multi-fasta&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;GBRAA01TF&lt;br /&gt;
&lt;br /&gt;
Note that the sequence and quality files are linked through the first identifier on the multi-fasta header line. The XML and the sequence files are linked through the TRACE_NAME field in the XML (it has to match the trace name portion of the header in the Trace Archive format, or the trace identifier in the other two formats).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Synopsis ==&lt;br /&gt;
&lt;br /&gt;
tarchive2amos assumes that for each file called &amp;lt;file&amp;gt;.seq there is a &amp;lt;file&amp;gt;.qual and a &amp;lt;file&amp;gt;.xml. (alternatively the files may be called fasta.&amp;lt;file&amp;gt;, qual.&amp;lt;file&amp;gt; and xml.&amp;lt;file&amp;gt;). If no .xml file is present the program will only produce a set of RED (read) records.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 tarchive2amos -o &amp;lt;prefix&amp;gt; [-c &amp;lt;clear_ranges&amp;gt;] [-l &amp;lt;libs&amp;gt;] &lt;br /&gt;
              [-m &amp;lt;mates&amp;gt;] &amp;lt;seq_file1&amp;gt; &amp;lt;seq_file2&amp;gt; ...&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
tarchive2amos will read one or more sequence files (as described above) and place the ouptut in a file called &amp;lt;prefix&amp;gt;.afg. Note that the -o option is required. Use the -h option for a complete list of options.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
A set of clear ranges may be specified in an addional file (with option -c) in the format:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;read id&amp;gt; &amp;lt;clip_left&amp;gt; &amp;lt;clip_right&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
These values will overwrite any value specified in the XML or sequence files.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In addition to Trace Archive XMLs, tarchive2amos also accepts library and read mate information in a Bambus-style .mates file.   Furthermore, library information can also be provided with the -l option in a file formatted as follows:&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;lib_id&amp;gt;  &amp;lt;mean_size&amp;gt;   &amp;lt;size_stdev&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Additional options ==&lt;br /&gt;
&lt;br /&gt;
* -i &amp;lt;id&amp;gt; - specifies the starting identifier for the messages generated. This option is useful when appending to an already existing AMOS bank.&lt;br /&gt;
* -min &amp;lt;len&amp;gt; - minimum length of reads accepted (default 100 bp)&lt;br /&gt;
* -max &amp;lt;len&amp;gt; - maximum length of reads accepted (default 2048 bp)&lt;br /&gt;
* -qual &amp;lt;qval&amp;gt; - quality value to be assigned to qualityless reads (default 20)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Notes ==&lt;br /&gt;
&lt;br /&gt;
The program produces rather verbose output when inconsistencies are found in the data.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Tarchive2amos</id>
		<title>Tarchive2amos</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Tarchive2amos"/>
				<updated>2010-01-20T19:24:16Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Synopsis */ Added the description of the implicit use of qual and xml files, as well as a mention to the -h option&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;tarchive2amos: utility for generating AMOS message files&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
The AMOS package uses a compact representation for the information exchange to and from the assembler. This representation, the AMOS message format, is described in detail here, and was inspired by the interchange format developed at Celera Genomics for use in Celera Assembler.&lt;br /&gt;
&lt;br /&gt;
Tarchive2amos is a utility that allows users to convert files from the NCBI Trace Archive format into the AMOS message format.&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
== Required inputs ==&lt;br /&gt;
&lt;br /&gt;
tarchive2amos can use data specified in the following three formats:&lt;br /&gt;
&lt;br /&gt;
* sequence data in one or more multi-fasta formatted files. These files must be named fasta.* (Trace Archive standard) or *.seq.&lt;br /&gt;
* quality data in zero or more multi-fasta formatted files. These files must be named qual.* (Trace Archive standard) or *.qual and must match the names of the sequence files. Note that the quality files are not necessary: if they are absent all bases will be assigned quality value of 20 (1 error in 100 bp).&lt;br /&gt;
* anciliary data in xml format. These files must be named xml.* (Trace Archive standard) or *.xml and must match the names of the sequence files. The information specified in these files includes (but is not limited to) clipping information, library size information, etc. For more information please refer to the Trace Archive documentation. Like the quality information, the XML files are not required.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In addition to these files, the user can provide a list of clear ranges (clipping coordinates) in a separate file. This information will override any set by the xml files. Furthermore, reads not present in the clear range file will be excluded from the conversion.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Note that if a clear range file is not specified, reads with no clear range set in the XML or the sequence file (see below) will be assigned a clear range that spans the entire extent of the read.&lt;br /&gt;
&lt;br /&gt;
== Sequence file formats ==&lt;br /&gt;
&lt;br /&gt;
tarchive2amos accepts four different formats for the header lines in the sequence file:&lt;br /&gt;
&lt;br /&gt;
* Trace Archive format generated by a query (either through website or query_tracedb script)&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;gnl|ti|145655111 name:38245161 ...&lt;br /&gt;
&lt;br /&gt;
The first identifier is the TRACE_ID in the XML file and the second one is the name assigned to the trace (TRACE_NAME) in the xml file.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* Trace Archive format:&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;gnl|ti|145655111 38245161&lt;br /&gt;
&lt;br /&gt;
The first identifier is the trace identifier (TRACE_ID in the XML file) while the second one is the assigned name for the trace (TRACE_NAME in the XML file). The output message file will only contain the trace name (in the eid: field of each read record).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* TIGR sequence format (also produced by the trimming package lucy) :&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;GBRAA01TF 1000 2000 1500 17 823&lt;br /&gt;
&lt;br /&gt;
The first identifier is the trace name, followed by three numbers representing the library size estimates (ignored by tarchive2amos), then followed by the clear range.&lt;br /&gt;
&lt;br /&gt;
* Generic multi-fasta&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;GBRAA01TF&lt;br /&gt;
&lt;br /&gt;
Note that the sequence and quality files are linked through the first identifier on the multi-fasta header line. The XML and the sequence files are linked through the TRACE_NAME field in the XML (it has to match the trace name portion of the header in the Trace Archive format, or the trace identifier in the other two formats).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Synopsis ==&lt;br /&gt;
&lt;br /&gt;
tarchive2amos assumes that for each file called &amp;lt;file&amp;gt;.seq there is a &amp;lt;file&amp;gt;.qual and a &amp;lt;file&amp;gt;.xml. (alternatively the files may be called fasta.&amp;lt;file&amp;gt;, qual.&amp;lt;file&amp;gt; and xml.&amp;lt;file&amp;gt;). If no .xml file is present the program will only produce a set of RED (read) records.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 tarchive2amos -o &amp;lt;prefix&amp;gt; [-c &amp;lt;clear_ranges&amp;gt;] [-l &amp;lt;libs&amp;gt;] &lt;br /&gt;
              [-m &amp;lt;mates&amp;gt;] &amp;lt;seq_file1&amp;gt; &amp;lt;seq_file2&amp;gt; ...&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
tarchive2amos will read one or more sequence files (as described above) and place the ouptut in a file called &amp;lt;prefix&amp;gt;.afg. Note that the -o option is required. Use the -h option for a complete list of options.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
A set of clear ranges may be specified in an addional file (with option -c) in the format:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;read id&amp;gt; &amp;lt;clip_left&amp;gt; &amp;lt;clip_right&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
These values will overwrite any value specified in the XML or sequence files.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In addition to Trace Archive XMLs, tarchive2amos also accepts library and read mate information in a Bambus-style .mates file.   Furthermore, library information can also be provided with the -l option in a file formatted as follows:&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;lib_id&amp;gt;  &amp;lt;mean_size&amp;gt;   &amp;lt;size_stdev&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Additional options ==&lt;br /&gt;
&lt;br /&gt;
* -i &amp;lt;id&amp;gt; - specifies the starting identifier for the messages generated. This option is useful when appending to an already existing AMOS bank.&lt;br /&gt;
* -min &amp;lt;len&amp;gt; - minimum length of reads accepted (default 100 bp)&lt;br /&gt;
* -max &amp;lt;len&amp;gt; - maximum length of reads accepted (default 2048 bp)&lt;br /&gt;
* -qual &amp;lt;qval&amp;gt; - quality value to be assigned to qualityless reads (default 20)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Notes ==&lt;br /&gt;
&lt;br /&gt;
The program produces rather verbose output when inconsistencies are found in the data.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Tarchive2amos</id>
		<title>Tarchive2amos</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Tarchive2amos"/>
				<updated>2010-01-20T19:20:29Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Required inputs */ Beginning to update this page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;tarchive2amos: utility for generating AMOS message files&lt;br /&gt;
&lt;br /&gt;
== Overview ==&lt;br /&gt;
&lt;br /&gt;
The AMOS package uses a compact representation for the information exchange to and from the assembler. This representation, the AMOS message format, is described in detail here, and was inspired by the interchange format developed at Celera Genomics for use in Celera Assembler.&lt;br /&gt;
&lt;br /&gt;
Tarchive2amos is a utility that allows users to convert files from the NCBI Trace Archive format into the AMOS message format.&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
== Required inputs ==&lt;br /&gt;
&lt;br /&gt;
tarchive2amos can use data specified in the following three formats:&lt;br /&gt;
&lt;br /&gt;
* sequence data in one or more multi-fasta formatted files. These files must be named fasta.* (Trace Archive standard) or *.seq.&lt;br /&gt;
* quality data in zero or more multi-fasta formatted files. These files must be named qual.* (Trace Archive standard) or *.qual and must match the names of the sequence files. Note that the quality files are not necessary: if they are absent all bases will be assigned quality value of 20 (1 error in 100 bp).&lt;br /&gt;
* anciliary data in xml format. These files must be named xml.* (Trace Archive standard) or *.xml and must match the names of the sequence files. The information specified in these files includes (but is not limited to) clipping information, library size information, etc. For more information please refer to the Trace Archive documentation. Like the quality information, the XML files are not required.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In addition to these files, the user can provide a list of clear ranges (clipping coordinates) in a separate file. This information will override any set by the xml files. Furthermore, reads not present in the clear range file will be excluded from the conversion.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Note that if a clear range file is not specified, reads with no clear range set in the XML or the sequence file (see below) will be assigned a clear range that spans the entire extent of the read.&lt;br /&gt;
&lt;br /&gt;
== Sequence file formats ==&lt;br /&gt;
&lt;br /&gt;
tarchive2amos accepts four different formats for the header lines in the sequence file:&lt;br /&gt;
&lt;br /&gt;
* Trace Archive format generated by a query (either through website or query_tracedb script)&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;gnl|ti|145655111 name:38245161 ...&lt;br /&gt;
&lt;br /&gt;
The first identifier is the TRACE_ID in the XML file and the second one is the name assigned to the trace (TRACE_NAME) in the xml file.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* Trace Archive format:&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;gnl|ti|145655111 38245161&lt;br /&gt;
&lt;br /&gt;
The first identifier is the trace identifier (TRACE_ID in the XML file) while the second one is the assigned name for the trace (TRACE_NAME in the XML file). The output message file will only contain the trace name (in the eid: field of each read record).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* TIGR sequence format (also produced by the trimming package lucy) :&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;GBRAA01TF 1000 2000 1500 17 823&lt;br /&gt;
&lt;br /&gt;
The first identifier is the trace name, followed by three numbers representing the library size estimates (ignored by tarchive2amos), then followed by the clear range.&lt;br /&gt;
&lt;br /&gt;
* Generic multi-fasta&lt;br /&gt;
&lt;br /&gt;
 &amp;gt;GBRAA01TF&lt;br /&gt;
&lt;br /&gt;
Note that the sequence and quality files are linked through the first identifier on the multi-fasta header line. The XML and the sequence files are linked through the TRACE_NAME field in the XML (it has to match the trace name portion of the header in the Trace Archive format, or the trace identifier in the other two formats).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Synopsis ==&lt;br /&gt;
&lt;br /&gt;
 tarchive2amos -o &amp;lt;prefix&amp;gt; [-c &amp;lt;clear_ranges&amp;gt;] [-l &amp;lt;libs&amp;gt;] &lt;br /&gt;
              [-m &amp;lt;mates&amp;gt;] &amp;lt;seq_file1&amp;gt; &amp;lt;seq_file2&amp;gt; ...&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
tarchive2amos will read one or more sequence files (as described above) and place the ouptut in a file called &amp;lt;prefix&amp;gt;.afg. Note that the -o option is required.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
A set of clear ranges may be specified in an addional file (with option -c) in the format:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;read id&amp;gt; &amp;lt;clip_left&amp;gt; &amp;lt;clip_right&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
These values will overwrite any value specified in the XML or sequence files.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In addition to Trace Archive XMLs, tarchive2amos also accepts library and read mate information in a Bambus-style .mates file.   Furthermore, library information can also be provided with the -l option in a file formatted as follows:&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;lib_id&amp;gt;  &amp;lt;mean_size&amp;gt;   &amp;lt;size_stdev&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Additional options ==&lt;br /&gt;
&lt;br /&gt;
* -i &amp;lt;id&amp;gt; - specifies the starting identifier for the messages generated. This option is useful when appending to an already existing AMOS bank.&lt;br /&gt;
* -min &amp;lt;len&amp;gt; - minimum length of reads accepted (default 100 bp)&lt;br /&gt;
* -max &amp;lt;len&amp;gt; - maximum length of reads accepted (default 2048 bp)&lt;br /&gt;
* -qual &amp;lt;qval&amp;gt; - quality value to be assigned to qualityless reads (default 20)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Notes ==&lt;br /&gt;
&lt;br /&gt;
The program produces rather verbose output when inconsistencies are found in the data.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus/README</id>
		<title>Minimus/README</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus/README"/>
				<updated>2010-01-14T22:31:44Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Brief Summary */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;minimus - The AMOS Lightweight Assembler&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Brief Summary ==&lt;br /&gt;
Minimus is an assembly pipeline designed specifically for small&lt;br /&gt;
data-sets, such as the set of reads covering a specific gene. Note that&lt;br /&gt;
the code will work for larger assemblies (we have used it to assemble&lt;br /&gt;
bacterial genomes), however, due to its stringency, the resulting assembly&lt;br /&gt;
will be highly fragmented.  For large and/or complex assemblies the execution&lt;br /&gt;
of Minimus should be followed by additional processing steps, such as&lt;br /&gt;
scaffolding.&lt;br /&gt;
&lt;br /&gt;
Minimus follows the Overlap-Layout-Consensus paradigm and consists of three main modules which share information through a central file bank:&lt;br /&gt;
&lt;br /&gt;
* [[hash-overlap]] - Computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
* [[tigger]] - Uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
* [[make-consensus]] - Refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
==Dependencies==&lt;br /&gt;
None.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Running==&lt;br /&gt;
Either execute the minimus configuration script directly from&lt;br /&gt;
$bindir OR copy it to your local directory, edit it, and run it with&lt;br /&gt;
the `runAmos&amp;#039; command interpreter. The following variables must be set&lt;br /&gt;
on the command line or added to the script for the pipeline to operate&lt;br /&gt;
properly:&lt;br /&gt;
&lt;br /&gt;
TGT - The target genome sequences in AMOS message format (.afg)&lt;br /&gt;
 minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&lt;br /&gt;
OR&lt;br /&gt;
 runAmos -C minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; will be the output file prefix, and &amp;lt;target&amp;gt; is the&lt;br /&gt;
input AMOS message file. Check the `runAmos&amp;#039; documentation or type&lt;br /&gt;
`runAmos --help&amp;#039; for details on operating an AMOS pipeline. The&lt;br /&gt;
minimus pipeline config file can be easily modified by the user to add&lt;br /&gt;
additional processing steps.&lt;br /&gt;
&lt;br /&gt;
In order to run minimus you need to provide an AMOS formatted file&lt;br /&gt;
of the reads. Such a file (commonly with extension .afg) can be&lt;br /&gt;
generated from a combination of sequence (.seq), quality (.qual), and&lt;br /&gt;
Trace Archive XML (.xml) files using the [[ToAmos | toAmos]] or &lt;br /&gt;
[[Tarchive2amos | tarchive2amos]] programs which will appear in the &lt;br /&gt;
$bindir directory upon installation.&lt;br /&gt;
&lt;br /&gt;
The default TGT file is &amp;lt;prefix&amp;gt;.afg, thus if our input file is&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg we can run minimus simply by typing:&lt;br /&gt;
&lt;br /&gt;
 minimus &amp;lt;prefix&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Output ==&lt;br /&gt;
Output will be a TIGR .contig file and a FastA .fasta file. The&lt;br /&gt;
TIGR contig file contains the gapped consensus and multi-alignment&lt;br /&gt;
information for the assembly. Each contig sequence is preceded by a&lt;br /&gt;
header line which starts with &amp;#039;##&amp;#039;, followed by the gapped consensus&lt;br /&gt;
sequence with gaps represented as a &amp;#039;-&amp;#039; character. Following the&lt;br /&gt;
consensus is the gapped read sequence preceded by a header line&lt;br /&gt;
beginning with &amp;#039;#&amp;#039;. The .fasta file contains all the contigs produced&lt;br /&gt;
by AMOScmp in a multi-FastA formatted file. These sequences will match&lt;br /&gt;
the sequences in the .contig file, but without the gaps.&lt;br /&gt;
&lt;br /&gt;
To obtain an ACE format representation of the assembly, we can run&lt;br /&gt;
the following to obtain a &amp;lt;prefix&amp;gt;.ace file:&lt;br /&gt;
&lt;br /&gt;
 bank-report -b &amp;lt;prefix&amp;gt;.bnk CTG &amp;gt; &amp;lt;prefix&amp;gt;.ctg&lt;br /&gt;
 amos2ace &amp;lt;prefix&amp;gt;.afg &amp;lt;prefix&amp;gt;.ctg&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; is the same as was used in the above section and&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg is the original input to the assembly pipeline. We can&lt;br /&gt;
simply add these commands to the runAmos config file to produce an ACE&lt;br /&gt;
file every time we run minimus.&lt;br /&gt;
&lt;br /&gt;
==Example==&lt;br /&gt;
Assume we have a set of Trace Archive data with the names&lt;br /&gt;
`target.seq&amp;#039;, `target.qual&amp;#039; and `target.xml&amp;#039; which contain the&lt;br /&gt;
sequence information for a small assembly task. To run the minimus&lt;br /&gt;
pipeline and generate the default output, we would type the following:&lt;br /&gt;
&lt;br /&gt;
 tarchive2amos -o  target.seq&lt;br /&gt;
 minimus -D TGT=target.afg target&lt;br /&gt;
&lt;br /&gt;
This will generate the default output named `target.contig&amp;#039; and&lt;br /&gt;
`target.fasta&amp;#039;. We could then generate an ACE assembly format file by&lt;br /&gt;
following the instructions in the above section, substituting &amp;quot;target&amp;quot;&lt;br /&gt;
for &amp;quot;&amp;lt;prefix&amp;gt;&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
Minimus is now packaged with two example assemblies. The two examples&lt;br /&gt;
are an Influenza A assembly and a Zebra Fish Gene assembly under the &amp;#039;test&amp;#039;&lt;br /&gt;
directory. The &amp;#039;test&amp;#039; directory in located in the main AMOS directory after you untar &lt;br /&gt;
the AMOS tarball.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus/README</id>
		<title>Minimus/README</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus/README"/>
				<updated>2010-01-14T22:29:29Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;minimus - The AMOS Lightweight Assembler&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Brief Summary ==&lt;br /&gt;
minimus is an assembly pipeline designed specifically for small&lt;br /&gt;
data-sets, such as the set of reads covering a specific gene. Note that&lt;br /&gt;
the code will work for larger assemblies (we have used it to assemble&lt;br /&gt;
bacterial genomes), however, due to its stringency, the resulting assembly&lt;br /&gt;
will be highly fragmented.  For large and/or complex assemblies the execution&lt;br /&gt;
of Minimus should be followed by additional processing steps, such as&lt;br /&gt;
scaffolding.&lt;br /&gt;
&lt;br /&gt;
Minimus follows the Overlap-Layout-Consensus paradigm and consists of&lt;br /&gt;
three main modules:&lt;br /&gt;
&lt;br /&gt;
* overlapper - computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
&lt;br /&gt;
* tigger - uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
&lt;br /&gt;
* make-consensus - refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
==Dependencies==&lt;br /&gt;
None.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Running==&lt;br /&gt;
Either execute the minimus configuration script directly from&lt;br /&gt;
$bindir OR copy it to your local directory, edit it, and run it with&lt;br /&gt;
the `runAmos&amp;#039; command interpreter. The following variables must be set&lt;br /&gt;
on the command line or added to the script for the pipeline to operate&lt;br /&gt;
properly:&lt;br /&gt;
&lt;br /&gt;
TGT - The target genome sequences in AMOS message format (.afg)&lt;br /&gt;
 minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&lt;br /&gt;
OR&lt;br /&gt;
 runAmos -C minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; will be the output file prefix, and &amp;lt;target&amp;gt; is the&lt;br /&gt;
input AMOS message file. Check the `runAmos&amp;#039; documentation or type&lt;br /&gt;
`runAmos --help&amp;#039; for details on operating an AMOS pipeline. The&lt;br /&gt;
minimus pipeline config file can be easily modified by the user to add&lt;br /&gt;
additional processing steps.&lt;br /&gt;
&lt;br /&gt;
In order to run minimus you need to provide an AMOS formatted file&lt;br /&gt;
of the reads. Such a file (commonly with extension .afg) can be&lt;br /&gt;
generated from a combination of sequence (.seq), quality (.qual), and&lt;br /&gt;
Trace Archive XML (.xml) files using the [[ToAmos | toAmos]] or &lt;br /&gt;
[[Tarchive2amos | tarchive2amos]] programs which will appear in the &lt;br /&gt;
$bindir directory upon installation.&lt;br /&gt;
&lt;br /&gt;
The default TGT file is &amp;lt;prefix&amp;gt;.afg, thus if our input file is&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg we can run minimus simply by typing:&lt;br /&gt;
&lt;br /&gt;
 minimus &amp;lt;prefix&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Output ==&lt;br /&gt;
Output will be a TIGR .contig file and a FastA .fasta file. The&lt;br /&gt;
TIGR contig file contains the gapped consensus and multi-alignment&lt;br /&gt;
information for the assembly. Each contig sequence is preceded by a&lt;br /&gt;
header line which starts with &amp;#039;##&amp;#039;, followed by the gapped consensus&lt;br /&gt;
sequence with gaps represented as a &amp;#039;-&amp;#039; character. Following the&lt;br /&gt;
consensus is the gapped read sequence preceded by a header line&lt;br /&gt;
beginning with &amp;#039;#&amp;#039;. The .fasta file contains all the contigs produced&lt;br /&gt;
by AMOScmp in a multi-FastA formatted file. These sequences will match&lt;br /&gt;
the sequences in the .contig file, but without the gaps.&lt;br /&gt;
&lt;br /&gt;
To obtain an ACE format representation of the assembly, we can run&lt;br /&gt;
the following to obtain a &amp;lt;prefix&amp;gt;.ace file:&lt;br /&gt;
&lt;br /&gt;
 bank-report -b &amp;lt;prefix&amp;gt;.bnk CTG &amp;gt; &amp;lt;prefix&amp;gt;.ctg&lt;br /&gt;
 amos2ace &amp;lt;prefix&amp;gt;.afg &amp;lt;prefix&amp;gt;.ctg&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; is the same as was used in the above section and&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg is the original input to the assembly pipeline. We can&lt;br /&gt;
simply add these commands to the runAmos config file to produce an ACE&lt;br /&gt;
file every time we run minimus.&lt;br /&gt;
&lt;br /&gt;
==Example==&lt;br /&gt;
Assume we have a set of Trace Archive data with the names&lt;br /&gt;
`target.seq&amp;#039;, `target.qual&amp;#039; and `target.xml&amp;#039; which contain the&lt;br /&gt;
sequence information for a small assembly task. To run the minimus&lt;br /&gt;
pipeline and generate the default output, we would type the following:&lt;br /&gt;
&lt;br /&gt;
 tarchive2amos -o  target.seq&lt;br /&gt;
 minimus -D TGT=target.afg target&lt;br /&gt;
&lt;br /&gt;
This will generate the default output named `target.contig&amp;#039; and&lt;br /&gt;
`target.fasta&amp;#039;. We could then generate an ACE assembly format file by&lt;br /&gt;
following the instructions in the above section, substituting &amp;quot;target&amp;quot;&lt;br /&gt;
for &amp;quot;&amp;lt;prefix&amp;gt;&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
Minimus is now packaged with two example assemblies. The two examples&lt;br /&gt;
are an Influenza A assembly and a Zebra Fish Gene assembly under the &amp;#039;test&amp;#039;&lt;br /&gt;
directory. The &amp;#039;test&amp;#039; directory in located in the main AMOS directory after you untar &lt;br /&gt;
the AMOS tarball.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus</id>
		<title>Minimus</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus"/>
				<updated>2010-01-14T22:28:16Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Basic usage */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
Minimus is an assembly pipeline designed specifically for small data-sets, such as the set of reads covering a specific gene. Note that the code will work for larger assemblies (we have used it to assemble bacterial genomes), however, due to its stringency, the resulting assembly will be highly fragmented. For large and/or complex assemblies the execution of Minimus should be followed by additional processing steps, such as scaffolding.&lt;br /&gt;
&lt;br /&gt;
Minimus follows the Overlap-Layout-Consensus paradigm and consists of three main modules which share information through a central file bank:&lt;br /&gt;
&lt;br /&gt;
* [[hash-overlap]] - Computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
* [[tigger]] - Uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
* [[make-consensus]] - Refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Minimus uses AMOS message files as both the inputs and the outputs. Please see the [[File conversion utilities]] documentation for more information.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[minimus2 | Minimus2]] is a modified version of the minimus pipeline designed for merging two sequence sets. Instead of hash-overlap it uses a nucmer based overlap detector which is much faster.&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
Documentation on running minimus is included with the distribution in the /docs subdirectory.&lt;br /&gt;
&lt;br /&gt;
See [[Minimus/README]].&lt;br /&gt;
&lt;br /&gt;
== Examples ==&lt;br /&gt;
&lt;br /&gt;
Examples of a flu assembly and a Zebrafish gene can be found in the test/minimus directory created when the AMOS distribution is untarred. Documentation on the examples is included with the distribution in /docs/minimus.README.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Basic usage  ==&lt;br /&gt;
&lt;br /&gt;
To run minimus will you need a set of sequence files. Assuming you have a set of reads in fasta format called &amp;#039;&amp;#039;&amp;#039;my_reads.seq&amp;#039;&amp;#039;&amp;#039;, you can run minimus with the following two commands:&lt;br /&gt;
&lt;br /&gt;
 toAmos -s my_reads.seq -o my_reads.afg&lt;br /&gt;
 &lt;br /&gt;
 minimus my_reads&lt;br /&gt;
&lt;br /&gt;
The output will be a fasta formatted file called &amp;#039;&amp;quot;my_reads.fasta&amp;quot;&amp;#039;, a contig file with details about the assembly of each contig called &amp;#039;&amp;quot;my_reads.contig&amp;quot;&amp;#039;, and an AMOS bank folder with various files used internally by minimus.&lt;br /&gt;
The toAmos file conversion utility is the most general and probably the most useful of the file conversion utilities included with minimus. More information about toAmos and the [[File_conversion_utilities | other file conversion utilities]] can be found in the [[AMOS | AMOS documentation wiki]]. For example, you can include quality data from a Phred style quality score file by running [[ToAmos | toAmos]] with the -q option as follows:&lt;br /&gt;
&lt;br /&gt;
 toAmos -s my_reads.fasta -q my_reads.qual -o my_reads.afg&lt;br /&gt;
&lt;br /&gt;
Minimus can also be called with the following equivalent command:&lt;br /&gt;
&lt;br /&gt;
 runAmos -C $AMOSBASE/src/Pipeline/minimus.acf my_reads&lt;br /&gt;
&lt;br /&gt;
The AMOS package also includes other helpful tools such as [[Hawkeye]], which is useful for evaluating your assembly with respect to paired-end reads. It can be run on the minimus bank with the following command:&lt;br /&gt;
&lt;br /&gt;
 hawkeye my_reads.bnk/&lt;br /&gt;
&lt;br /&gt;
== Publication ==&lt;br /&gt;
&lt;br /&gt;
[http://www.biomedcentral.com/1471-2105/8/64 Minimus: a fast, lightweight genome assembler]&lt;br /&gt;
&lt;br /&gt;
Sommer, DD, Delcher, AL, Salzberg, SL, and Pop, M. (2007) BMC Bioinformatics, 8:64doi:10.1186/1471-2105-8-64.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of minimus was supported by the National Institutes of Health under grants R01-LM06845 and R01-LM007938 to SLS and by Department of Homeland Security cooperative agreement W81XWH-05-2-0051.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus</id>
		<title>Minimus</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus"/>
				<updated>2010-01-14T22:26:56Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Basic usage */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
Minimus is an assembly pipeline designed specifically for small data-sets, such as the set of reads covering a specific gene. Note that the code will work for larger assemblies (we have used it to assemble bacterial genomes), however, due to its stringency, the resulting assembly will be highly fragmented. For large and/or complex assemblies the execution of Minimus should be followed by additional processing steps, such as scaffolding.&lt;br /&gt;
&lt;br /&gt;
Minimus follows the Overlap-Layout-Consensus paradigm and consists of three main modules which share information through a central file bank:&lt;br /&gt;
&lt;br /&gt;
* [[hash-overlap]] - Computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
* [[tigger]] - Uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
* [[make-consensus]] - Refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Minimus uses AMOS message files as both the inputs and the outputs. Please see the [[File conversion utilities]] documentation for more information.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[minimus2 | Minimus2]] is a modified version of the minimus pipeline designed for merging two sequence sets. Instead of hash-overlap it uses a nucmer based overlap detector which is much faster.&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
Documentation on running minimus is included with the distribution in the /docs subdirectory.&lt;br /&gt;
&lt;br /&gt;
See [[Minimus/README]].&lt;br /&gt;
&lt;br /&gt;
== Examples ==&lt;br /&gt;
&lt;br /&gt;
Examples of a flu assembly and a Zebrafish gene can be found in the test/minimus directory created when the AMOS distribution is untarred. Documentation on the examples is included with the distribution in /docs/minimus.README.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Basic usage  ==&lt;br /&gt;
&lt;br /&gt;
To run minimus will you need a set of sequence files. Assuming you have a set of reads in fasta format called &amp;#039;&amp;#039;&amp;#039;my_reads.seq&amp;#039;&amp;#039;&amp;#039;, you can run minimus with the following two commands:&lt;br /&gt;
&lt;br /&gt;
 `toAmos -s my_reads.seq -o my_reads.afg&amp;#039;&lt;br /&gt;
 &lt;br /&gt;
 `minimus my_reads&amp;#039;&lt;br /&gt;
&lt;br /&gt;
The output will be a fasta formatted file called &amp;#039;&amp;quot;my_reads.fasta&amp;quot;&amp;#039;, a contig file with details about the assembly of each contig called &amp;#039;&amp;quot;my_reads.contig&amp;quot;&amp;#039;, and an AMOS bank folder with various files used internally by minimus.&lt;br /&gt;
The toAmos file conversion utility is the most general and probably the most useful of the file conversion utilities included with minimus. More information about toAmos and the [[File_conversion_utilities | other file conversion utilities]] can be found in the [[AMOS | AMOS documentation wiki]]. For example, you can include quality data from a Phred style quality score file by running [[ToAmos | toAmos]] with the -q option as follows:&lt;br /&gt;
&lt;br /&gt;
 `toAmos -s my_reads.fasta -q my_reads.qual -o my_reads.afg&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Minimus can also be called with the following equivalent command:&lt;br /&gt;
&lt;br /&gt;
 `runAmos -C $AMOSBASE/src/Pipeline/minimus.acf my_reads&amp;#039;&lt;br /&gt;
&lt;br /&gt;
The AMOS package also includes other helpful tools such as [[Hawkeye]], which is useful for evaluating your assembly with respect to paired-end reads. It can be run on the minimus bank with the following command:&lt;br /&gt;
&lt;br /&gt;
 `hawkeye my_reads.bnk/&amp;#039;&lt;br /&gt;
&lt;br /&gt;
== Publication ==&lt;br /&gt;
&lt;br /&gt;
[http://www.biomedcentral.com/1471-2105/8/64 Minimus: a fast, lightweight genome assembler]&lt;br /&gt;
&lt;br /&gt;
Sommer, DD, Delcher, AL, Salzberg, SL, and Pop, M. (2007) BMC Bioinformatics, 8:64doi:10.1186/1471-2105-8-64.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of minimus was supported by the National Institutes of Health under grants R01-LM06845 and R01-LM007938 to SLS and by Department of Homeland Security cooperative agreement W81XWH-05-2-0051.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus/README</id>
		<title>Minimus/README</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus/README"/>
				<updated>2010-01-14T22:24:50Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Example */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;minimus - The AMOS Lightweight Assembler&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Brief Summary ==&lt;br /&gt;
minimus is an assembly pipeline designed specifically for small&lt;br /&gt;
data-sets, such as the set of reads covering a specific gene. Note that&lt;br /&gt;
the code will work for larger assemblies (we have used it to assemble&lt;br /&gt;
bacterial genomes), however, due to its stringency, the resulting assembly&lt;br /&gt;
will be highly fragmented.  For large and/or complex assemblies the execution&lt;br /&gt;
of Minimus should be followed by additional processing steps, such as&lt;br /&gt;
scaffolding.&lt;br /&gt;
&lt;br /&gt;
Minimus follows the Overlap-Layout-Consensus paradigm and consists of&lt;br /&gt;
three main modules:&lt;br /&gt;
&lt;br /&gt;
* overlapper - computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
&lt;br /&gt;
* tigger - uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
&lt;br /&gt;
* make-consensus - refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
==Dependencies==&lt;br /&gt;
None.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Running==&lt;br /&gt;
Either execute the minimus configuration script directly from&lt;br /&gt;
$bindir OR copy it to your local directory, edit it, and run it with&lt;br /&gt;
the `runAmos&amp;#039; command interpreter. The following variables must be set&lt;br /&gt;
on the command line or added to the script for the pipeline to operate&lt;br /&gt;
properly:&lt;br /&gt;
&lt;br /&gt;
TGT - The target genome sequences in AMOS message format (.afg)&lt;br /&gt;
 `minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&amp;#039;&lt;br /&gt;
OR&lt;br /&gt;
 `runAmos -C minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; will be the output file prefix, and &amp;lt;target&amp;gt; is the&lt;br /&gt;
input AMOS message file. Check the `runAmos&amp;#039; documentation or type&lt;br /&gt;
`runAmos --help&amp;#039; for details on operating an AMOS pipeline. The&lt;br /&gt;
minimus pipeline config file can be easily modified by the user to add&lt;br /&gt;
additional processing steps.&lt;br /&gt;
&lt;br /&gt;
In order to run minimus you need to provide an AMOS formatted file&lt;br /&gt;
of the reads. Such a file (commonly with extension .afg) can be&lt;br /&gt;
generated from a combination of sequence (.seq), quality (.qual), and&lt;br /&gt;
Trace Archive XML (.xml) files using the [[ToAmos | toAmos]] or &lt;br /&gt;
[[Tarchive2amos | tarchive2amos]] programs which will appear in the &lt;br /&gt;
$bindir directory upon installation.&lt;br /&gt;
&lt;br /&gt;
The default TGT file is &amp;lt;prefix&amp;gt;.afg, thus if our input file is&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg we can run minimus simply by typing:&lt;br /&gt;
&lt;br /&gt;
 `minimus &amp;lt;prefix&amp;gt;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
== Output ==&lt;br /&gt;
Output will be a TIGR .contig file and a FastA .fasta file. The&lt;br /&gt;
TIGR contig file contains the gapped consensus and multi-alignment&lt;br /&gt;
information for the assembly. Each contig sequence is preceded by a&lt;br /&gt;
header line which starts with &amp;#039;##&amp;#039;, followed by the gapped consensus&lt;br /&gt;
sequence with gaps represented as a &amp;#039;-&amp;#039; character. Following the&lt;br /&gt;
consensus is the gapped read sequence preceded by a header line&lt;br /&gt;
beginning with &amp;#039;#&amp;#039;. The .fasta file contains all the contigs produced&lt;br /&gt;
by AMOScmp in a multi-FastA formatted file. These sequences will match&lt;br /&gt;
the sequences in the .contig file, but without the gaps.&lt;br /&gt;
&lt;br /&gt;
To obtain an ACE format representation of the assembly, we can run&lt;br /&gt;
the following to obtain a &amp;lt;prefix&amp;gt;.ace file:&lt;br /&gt;
&lt;br /&gt;
 `bank-report -b &amp;lt;prefix&amp;gt;.bnk CTG &amp;gt; &amp;lt;prefix&amp;gt;.ctg&amp;#039;&lt;br /&gt;
 `amos2ace &amp;lt;prefix&amp;gt;.afg &amp;lt;prefix&amp;gt;.ctg&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; is the same as was used in the above section and&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg is the original input to the assembly pipeline. We can&lt;br /&gt;
simply add these commands to the runAmos config file to produce an ACE&lt;br /&gt;
file every time we run minimus.&lt;br /&gt;
&lt;br /&gt;
==Example==&lt;br /&gt;
Assume we have a set of Trace Archive data with the names&lt;br /&gt;
`target.seq&amp;#039;, `target.qual&amp;#039; and `target.xml&amp;#039; which contain the&lt;br /&gt;
sequence information for a small assembly task. To run the minimus&lt;br /&gt;
pipeline and generate the default output, we would type the following:&lt;br /&gt;
&lt;br /&gt;
 `tarchive2amos -o  target.seq&amp;#039;&lt;br /&gt;
 `minimus -D TGT=target.afg target&amp;#039;&lt;br /&gt;
&lt;br /&gt;
This will generate the default output named `target.contig&amp;#039; and&lt;br /&gt;
`target.fasta&amp;#039;. We could then generate an ACE assembly format file by&lt;br /&gt;
following the instructions in the above section, substituting &amp;quot;target&amp;quot;&lt;br /&gt;
for &amp;quot;&amp;lt;prefix&amp;gt;&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
Minimus is now packaged with two example assemblies. The two examples&lt;br /&gt;
are an Influenza A assembly and a Zebra Fish Gene assembly under the &amp;#039;test&amp;#039;&lt;br /&gt;
directory. The &amp;#039;test&amp;#039; directory in located in the main AMOS directory after you untar &lt;br /&gt;
the AMOS tarball.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus/README</id>
		<title>Minimus/README</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus/README"/>
				<updated>2010-01-14T22:24:09Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Output */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;minimus - The AMOS Lightweight Assembler&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Brief Summary ==&lt;br /&gt;
minimus is an assembly pipeline designed specifically for small&lt;br /&gt;
data-sets, such as the set of reads covering a specific gene. Note that&lt;br /&gt;
the code will work for larger assemblies (we have used it to assemble&lt;br /&gt;
bacterial genomes), however, due to its stringency, the resulting assembly&lt;br /&gt;
will be highly fragmented.  For large and/or complex assemblies the execution&lt;br /&gt;
of Minimus should be followed by additional processing steps, such as&lt;br /&gt;
scaffolding.&lt;br /&gt;
&lt;br /&gt;
Minimus follows the Overlap-Layout-Consensus paradigm and consists of&lt;br /&gt;
three main modules:&lt;br /&gt;
&lt;br /&gt;
* overlapper - computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
&lt;br /&gt;
* tigger - uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
&lt;br /&gt;
* make-consensus - refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
==Dependencies==&lt;br /&gt;
None.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Running==&lt;br /&gt;
Either execute the minimus configuration script directly from&lt;br /&gt;
$bindir OR copy it to your local directory, edit it, and run it with&lt;br /&gt;
the `runAmos&amp;#039; command interpreter. The following variables must be set&lt;br /&gt;
on the command line or added to the script for the pipeline to operate&lt;br /&gt;
properly:&lt;br /&gt;
&lt;br /&gt;
TGT - The target genome sequences in AMOS message format (.afg)&lt;br /&gt;
 `minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&amp;#039;&lt;br /&gt;
OR&lt;br /&gt;
 `runAmos -C minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; will be the output file prefix, and &amp;lt;target&amp;gt; is the&lt;br /&gt;
input AMOS message file. Check the `runAmos&amp;#039; documentation or type&lt;br /&gt;
`runAmos --help&amp;#039; for details on operating an AMOS pipeline. The&lt;br /&gt;
minimus pipeline config file can be easily modified by the user to add&lt;br /&gt;
additional processing steps.&lt;br /&gt;
&lt;br /&gt;
In order to run minimus you need to provide an AMOS formatted file&lt;br /&gt;
of the reads. Such a file (commonly with extension .afg) can be&lt;br /&gt;
generated from a combination of sequence (.seq), quality (.qual), and&lt;br /&gt;
Trace Archive XML (.xml) files using the [[ToAmos | toAmos]] or &lt;br /&gt;
[[Tarchive2amos | tarchive2amos]] programs which will appear in the &lt;br /&gt;
$bindir directory upon installation.&lt;br /&gt;
&lt;br /&gt;
The default TGT file is &amp;lt;prefix&amp;gt;.afg, thus if our input file is&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg we can run minimus simply by typing:&lt;br /&gt;
&lt;br /&gt;
 `minimus &amp;lt;prefix&amp;gt;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
== Output ==&lt;br /&gt;
Output will be a TIGR .contig file and a FastA .fasta file. The&lt;br /&gt;
TIGR contig file contains the gapped consensus and multi-alignment&lt;br /&gt;
information for the assembly. Each contig sequence is preceded by a&lt;br /&gt;
header line which starts with &amp;#039;##&amp;#039;, followed by the gapped consensus&lt;br /&gt;
sequence with gaps represented as a &amp;#039;-&amp;#039; character. Following the&lt;br /&gt;
consensus is the gapped read sequence preceded by a header line&lt;br /&gt;
beginning with &amp;#039;#&amp;#039;. The .fasta file contains all the contigs produced&lt;br /&gt;
by AMOScmp in a multi-FastA formatted file. These sequences will match&lt;br /&gt;
the sequences in the .contig file, but without the gaps.&lt;br /&gt;
&lt;br /&gt;
To obtain an ACE format representation of the assembly, we can run&lt;br /&gt;
the following to obtain a &amp;lt;prefix&amp;gt;.ace file:&lt;br /&gt;
&lt;br /&gt;
 `bank-report -b &amp;lt;prefix&amp;gt;.bnk CTG &amp;gt; &amp;lt;prefix&amp;gt;.ctg&amp;#039;&lt;br /&gt;
 `amos2ace &amp;lt;prefix&amp;gt;.afg &amp;lt;prefix&amp;gt;.ctg&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; is the same as was used in the above section and&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg is the original input to the assembly pipeline. We can&lt;br /&gt;
simply add these commands to the runAmos config file to produce an ACE&lt;br /&gt;
file every time we run minimus.&lt;br /&gt;
&lt;br /&gt;
==Example==&lt;br /&gt;
Assume we have a set of Trace Archive data with the names&lt;br /&gt;
`target.seq&amp;#039;, `target.qual&amp;#039; and `target.xml&amp;#039; which contain the&lt;br /&gt;
sequence information for a small assembly task. To run the minimus&lt;br /&gt;
pipeline and generate the default output, we would type the following:&lt;br /&gt;
&lt;br /&gt;
`tarchive2amos -o  target.seq&amp;#039;&lt;br /&gt;
`minimus -D TGT=target.afg target&amp;#039;&lt;br /&gt;
&lt;br /&gt;
This will generate the default output named `target.contig&amp;#039; and&lt;br /&gt;
`target.fasta&amp;#039;. We could then generate an ACE assembly format file by&lt;br /&gt;
following the instructions in the above section, substituting &amp;quot;target&amp;quot;&lt;br /&gt;
for &amp;quot;&amp;lt;prefix&amp;gt;&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
Minimus is now packaged with two example assemblies. The two examples&lt;br /&gt;
are an Influenza A assembly and a Zebra Fish Gene assembly under the &amp;#039;test&amp;#039;&lt;br /&gt;
directory. The &amp;#039;test&amp;#039; directory in located in the main AMOS directory after you untar &lt;br /&gt;
the AMOS tarball.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus/README</id>
		<title>Minimus/README</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus/README"/>
				<updated>2010-01-14T22:23:23Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Running */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;minimus - The AMOS Lightweight Assembler&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Brief Summary ==&lt;br /&gt;
minimus is an assembly pipeline designed specifically for small&lt;br /&gt;
data-sets, such as the set of reads covering a specific gene. Note that&lt;br /&gt;
the code will work for larger assemblies (we have used it to assemble&lt;br /&gt;
bacterial genomes), however, due to its stringency, the resulting assembly&lt;br /&gt;
will be highly fragmented.  For large and/or complex assemblies the execution&lt;br /&gt;
of Minimus should be followed by additional processing steps, such as&lt;br /&gt;
scaffolding.&lt;br /&gt;
&lt;br /&gt;
Minimus follows the Overlap-Layout-Consensus paradigm and consists of&lt;br /&gt;
three main modules:&lt;br /&gt;
&lt;br /&gt;
* overlapper - computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
&lt;br /&gt;
* tigger - uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
&lt;br /&gt;
* make-consensus - refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
==Dependencies==&lt;br /&gt;
None.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Running==&lt;br /&gt;
Either execute the minimus configuration script directly from&lt;br /&gt;
$bindir OR copy it to your local directory, edit it, and run it with&lt;br /&gt;
the `runAmos&amp;#039; command interpreter. The following variables must be set&lt;br /&gt;
on the command line or added to the script for the pipeline to operate&lt;br /&gt;
properly:&lt;br /&gt;
&lt;br /&gt;
TGT - The target genome sequences in AMOS message format (.afg)&lt;br /&gt;
 `minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&amp;#039;&lt;br /&gt;
OR&lt;br /&gt;
 `runAmos -C minimus -D TGT=&amp;lt;target&amp;gt; &amp;lt;prefix&amp;gt;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; will be the output file prefix, and &amp;lt;target&amp;gt; is the&lt;br /&gt;
input AMOS message file. Check the `runAmos&amp;#039; documentation or type&lt;br /&gt;
`runAmos --help&amp;#039; for details on operating an AMOS pipeline. The&lt;br /&gt;
minimus pipeline config file can be easily modified by the user to add&lt;br /&gt;
additional processing steps.&lt;br /&gt;
&lt;br /&gt;
In order to run minimus you need to provide an AMOS formatted file&lt;br /&gt;
of the reads. Such a file (commonly with extension .afg) can be&lt;br /&gt;
generated from a combination of sequence (.seq), quality (.qual), and&lt;br /&gt;
Trace Archive XML (.xml) files using the [[ToAmos | toAmos]] or &lt;br /&gt;
[[Tarchive2amos | tarchive2amos]] programs which will appear in the &lt;br /&gt;
$bindir directory upon installation.&lt;br /&gt;
&lt;br /&gt;
The default TGT file is &amp;lt;prefix&amp;gt;.afg, thus if our input file is&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg we can run minimus simply by typing:&lt;br /&gt;
&lt;br /&gt;
 `minimus &amp;lt;prefix&amp;gt;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
== Output ==&lt;br /&gt;
Output will be a TIGR .contig file and a FastA .fasta file. The&lt;br /&gt;
TIGR contig file contains the gapped consensus and multi-alignment&lt;br /&gt;
information for the assembly. Each contig sequence is preceded by a&lt;br /&gt;
header line which starts with &amp;#039;##&amp;#039;, followed by the gapped consensus&lt;br /&gt;
sequence with gaps represented as a &amp;#039;-&amp;#039; character. Following the&lt;br /&gt;
consensus is the gapped read sequence preceded by a header line&lt;br /&gt;
beginning with &amp;#039;#&amp;#039;. The .fasta file contains all the contigs produced&lt;br /&gt;
by AMOScmp in a multi-FastA formatted file. These sequences will match&lt;br /&gt;
the sequences in the .contig file, but without the gaps.&lt;br /&gt;
&lt;br /&gt;
To obtain an ACE format representation of the assembly, we can run&lt;br /&gt;
the following to obtain a &amp;lt;prefix&amp;gt;.ace file:&lt;br /&gt;
&lt;br /&gt;
`bank-report -b &amp;lt;prefix&amp;gt;.bnk CTG &amp;gt; &amp;lt;prefix&amp;gt;.ctg&amp;#039;&lt;br /&gt;
`amos2ace &amp;lt;prefix&amp;gt;.afg &amp;lt;prefix&amp;gt;.ctg&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;prefix&amp;gt; is the same as was used in the above section and&lt;br /&gt;
&amp;lt;prefix&amp;gt;.afg is the original input to the assembly pipeline. We can&lt;br /&gt;
simply add these commands to the runAmos config file to produce an ACE&lt;br /&gt;
file every time we run minimus.&lt;br /&gt;
&lt;br /&gt;
==Example==&lt;br /&gt;
Assume we have a set of Trace Archive data with the names&lt;br /&gt;
`target.seq&amp;#039;, `target.qual&amp;#039; and `target.xml&amp;#039; which contain the&lt;br /&gt;
sequence information for a small assembly task. To run the minimus&lt;br /&gt;
pipeline and generate the default output, we would type the following:&lt;br /&gt;
&lt;br /&gt;
`tarchive2amos -o  target.seq&amp;#039;&lt;br /&gt;
`minimus -D TGT=target.afg target&amp;#039;&lt;br /&gt;
&lt;br /&gt;
This will generate the default output named `target.contig&amp;#039; and&lt;br /&gt;
`target.fasta&amp;#039;. We could then generate an ACE assembly format file by&lt;br /&gt;
following the instructions in the above section, substituting &amp;quot;target&amp;quot;&lt;br /&gt;
for &amp;quot;&amp;lt;prefix&amp;gt;&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
Minimus is now packaged with two example assemblies. The two examples&lt;br /&gt;
are an Influenza A assembly and a Zebra Fish Gene assembly under the &amp;#039;test&amp;#039;&lt;br /&gt;
directory. The &amp;#039;test&amp;#039; directory in located in the main AMOS directory after you untar &lt;br /&gt;
the AMOS tarball.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus</id>
		<title>Minimus</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus"/>
				<updated>2010-01-14T22:17:46Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
Minimus is an assembly pipeline designed specifically for small data-sets, such as the set of reads covering a specific gene. Note that the code will work for larger assemblies (we have used it to assemble bacterial genomes), however, due to its stringency, the resulting assembly will be highly fragmented. For large and/or complex assemblies the execution of Minimus should be followed by additional processing steps, such as scaffolding.&lt;br /&gt;
&lt;br /&gt;
Minimus follows the Overlap-Layout-Consensus paradigm and consists of three main modules which share information through a central file bank:&lt;br /&gt;
&lt;br /&gt;
* [[hash-overlap]] - Computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
* [[tigger]] - Uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
* [[make-consensus]] - Refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Minimus uses AMOS message files as both the inputs and the outputs. Please see the [[File conversion utilities]] documentation for more information.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[minimus2 | Minimus2]] is a modified version of the minimus pipeline designed for merging two sequence sets. Instead of hash-overlap it uses a nucmer based overlap detector which is much faster.&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
Documentation on running minimus is included with the distribution in the /docs subdirectory.&lt;br /&gt;
&lt;br /&gt;
See [[Minimus/README]].&lt;br /&gt;
&lt;br /&gt;
== Examples ==&lt;br /&gt;
&lt;br /&gt;
Examples of a flu assembly and a Zebrafish gene can be found in the test/minimus directory created when the AMOS distribution is untarred. Documentation on the examples is included with the distribution in /docs/minimus.README.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Basic usage  ==&lt;br /&gt;
&lt;br /&gt;
To run minimus will you need a set of sequence files. Assuming you have a set of reads in fasta format called &amp;#039;&amp;#039;&amp;#039;my_reads.seq&amp;#039;&amp;#039;&amp;#039;, you can run minimus with the following two commands:&lt;br /&gt;
&lt;br /&gt;
 toAmos -s my_reads.seq -o my_reads.afg&lt;br /&gt;
 &lt;br /&gt;
 minimus my_reads&lt;br /&gt;
&lt;br /&gt;
The output will be a fasta formatted file called &amp;#039;&amp;quot;my_reads.fasta&amp;quot;&amp;#039;, a contig file with details about the assembly of each contig called &amp;#039;&amp;quot;my_reads.contig&amp;quot;&amp;#039;, and an AMOS bank folder with various files used internally by minimus.&lt;br /&gt;
The toAmos file conversion utility is the most general and probably the most useful of the file conversion utilities included with minimus. More information about toAmos and the [[File_conversion_utilities | other file conversion utilities]] can be found in the [[AMOS | AMOS documentation wiki]]. For example, you can include quality data from a Phred style quality score file by running [[ToAmos | toAmos]] with the -q option as follows:&lt;br /&gt;
&lt;br /&gt;
 toAmos -s my_reads.fasta -q my_reads.qual -o my_reads.afg&lt;br /&gt;
&lt;br /&gt;
Minimus can also be called with the following equivalent command:&lt;br /&gt;
&lt;br /&gt;
 runAmos -C $AMOSBASE/src/Pipeline/minimus.acf my_reads&lt;br /&gt;
&lt;br /&gt;
The AMOS package also includes other helpful tools such as [[Hawkeye]], which is useful for evaluating your assembly with respect to paired-end reads. It can be run on the minimus bank with the following command:&lt;br /&gt;
&lt;br /&gt;
 hawkeye my_reads.bnk/&lt;br /&gt;
&lt;br /&gt;
== Publication ==&lt;br /&gt;
&lt;br /&gt;
[http://www.biomedcentral.com/1471-2105/8/64 Minimus: a fast, lightweight genome assembler]&lt;br /&gt;
&lt;br /&gt;
Sommer, DD, Delcher, AL, Salzberg, SL, and Pop, M. (2007) BMC Bioinformatics, 8:64doi:10.1186/1471-2105-8-64.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of minimus was supported by the National Institutes of Health under grants R01-LM06845 and R01-LM007938 to SLS and by Department of Homeland Security cooperative agreement W81XWH-05-2-0051.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	<entry>
		<id>http://amos.sourceforge.net/wiki/index.php/Minimus</id>
		<title>Minimus</title>
		<link rel="alternate" type="text/html" href="http://amos.sourceforge.net/wiki/index.php/Minimus"/>
				<updated>2010-01-14T20:11:12Z</updated>
		
		<summary type="html">&lt;p&gt;Trgibbons: /* Basic usage example */ I updated the minimus home page in an attempt to make it more approachable. I will not be offended if this is rolled back or radically modified.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
minimus is an assembly pipeline designed specifically for small data-sets, such as the set of reads covering a specific gene. Note that the code will work for larger assemblies (we have used it to assemble bacterial genomes), however, due to its stringency, the resulting assembly will be highly fragmented. For large and/or complex assemblies the execution of Minimus should be followed by additional processing steps, such as scaffolding.&lt;br /&gt;
&lt;br /&gt;
minimus follows the Overlap-Layout-Consensus paradigm and consists of three main modules:&lt;br /&gt;
&lt;br /&gt;
* [[hash-overlap]] - computes the overlaps between the reads using a modified version of the Smith-Waterman local alignment algorithm&lt;br /&gt;
* [[tigger]] - uses the read overlaps to generate the layouts of reads representing individual contigs&lt;br /&gt;
* [[make-consensus]] - refines the layouts produced by the tigger to generate accurate multiple alignments within the reads&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
minimus uses as AMOS messages as both the inputs and the outputs. Please see the [[File conversion utilities]] documentation for more information.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[minimus2]] is a modified version of the minimus pipeline designed for merging two sequence sets. Instead of hash-overlap it uses a nucmer based overlap detector which is much faster.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
Documentation on running minimus is included with the distribution in the /docs subdirectory.&lt;br /&gt;
&lt;br /&gt;
See [[Minimus/README]].&lt;br /&gt;
&lt;br /&gt;
== Examples ==&lt;br /&gt;
&lt;br /&gt;
Examples of a flu assembly and a Zebrafish gene can be found in the test/minimus directory created when the AMOS distribution is untarred. Documentation on the examples is included with the distribution in /docs/minimus.README.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Basic usage  ==&lt;br /&gt;
&lt;br /&gt;
To run minimus will you need a set of sequence files. Assuming you have a set of reads in fasta format called &amp;#039;&amp;#039;&amp;#039;my_reads.seq&amp;#039;&amp;#039;&amp;#039;, you can run minimus with the following two commands:&lt;br /&gt;
&lt;br /&gt;
 toAmos -s my_reads.seq -o my_reads.afg&lt;br /&gt;
 &lt;br /&gt;
 minimus my_reads&lt;br /&gt;
&lt;br /&gt;
The output will be a fasta formatted file called &amp;#039;&amp;quot;my_reads.fasta&amp;quot;&amp;#039;, a contig file with details about the assembly of each contig called &amp;#039;&amp;quot;my_reads.contig&amp;quot;&amp;#039;, and an AMOS bank folder with various files used internally by minimus.&lt;br /&gt;
The toAmos file conversion utility is the most general and probably the most useful of the file conversion utilities included with minimus. More information about toAmos and the [[File_conversion_utilities | other file conversion utilities]] can be found in the [[AMOS | AMOS documentation wiki]]. For example, you can include quality data from a Phred style quality score file by running [[ToAmos | toAmos]] with the -q option as follows:&lt;br /&gt;
&lt;br /&gt;
 toAmos -s my_reads.fasta -q my_reads.qual -o my_reads.afg&lt;br /&gt;
&lt;br /&gt;
Minimus can also be called with the following equivalent command:&lt;br /&gt;
&lt;br /&gt;
 runAmos -C $AMOSBASE/src/Pipeline/minimus.acf my_reads&lt;br /&gt;
&lt;br /&gt;
The AMOS package also includes other helpful tools such as [[Hawkeye]], which is useful for evaluating your assembly with respect to paired-end reads. It can be run on the minimus bank with the following command:&lt;br /&gt;
&lt;br /&gt;
 hawkeye my_reads.bnk/&lt;br /&gt;
&lt;br /&gt;
== Publication ==&lt;br /&gt;
&lt;br /&gt;
[http://www.biomedcentral.com/1471-2105/8/64 Minimus: a fast, lightweight genome assembler]&lt;br /&gt;
&lt;br /&gt;
Sommer, DD, Delcher, AL, Salzberg, SL, and Pop, M. (2007) BMC Bioinformatics, 8:64doi:10.1186/1471-2105-8-64.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Acknowledgements ==&lt;br /&gt;
The development of minimus was supported by the National Institutes of Health under grants R01-LM06845 and R01-LM007938 to SLS and by Department of Homeland Security cooperative agreement W81XWH-05-2-0051.&lt;/div&gt;</summary>
		<author><name>Trgibbons</name></author>	</entry>

	</feed>