|
|
||||
|
|
|||||
|
An integrated genome and ortholog viewer (inGeno) is designed for genome sequence comparisons, which has been proven to be powerful, in particular to prokaryotic genomes of close phylogenetic distances. The original purpose of this software is to user-friendly visualize the corresponding relationships between orthologous genes. Step by step, a series of algorithms are implemented and integrated together, thus enable a noise-reducing process, a locus collinear block recognition and a text-mining step, which are helpful for users to extract biological information precisely. Features
This part is specially prepared for our students as a practice to acquire a quick impression of pairwise genome comparisons, through the sample files, people could notice easily the strain-specific genes in red color, the potential operons present in the genome, foreign insertion caused by phage and many rearrangement events. Package The practice package can be fetched Steps 1. unpack the downloaded package, run it by simply clicking the executable file "ingeno.bat "(WIN) or typing "./ingeno" (LIN). 2. within inGeno, the menu "File" / "Restore from a snapshot" is designed for importing these snapshot files. Afterwards, users please sequentially click the buttons "Dotplot" and "Compare", the comparison figures will be generated in the screen. -Recommended Demo file is currently located at ./SampleData/Listeria/lmolwe.snapshot and ./SampleData/Listeria/lmolin.snapshot 3. in the panel, there are operation controls for helping users to optimize the view, e.g., zoom, distance, color clusters, thresholds. Moreover, mouse operations are allowed to be applied directly on the gene position and observe the corresponding information. 4. comparison from original files. Download the genome genbank files (lmo.genbank, lin.genbank) from the "SAMPLE FILES" below. Blast them with appropriate parameters and collect the reports. Or for convenience, download two reports (lmolin.report, linlmo.report), which are pairwise comparison for eliminating the false-negative noises. Load them (4 FILES) into inGeno software, note you have to upload all these files together in the "Import data from disk" window. Then you couldperform the comparison function.
Program
(JarRAR file is a package file suitable for most users, users could unpack it, then execute it by ingeno.bat or ./ingeno)
Sourcecode The Java sourcecode of inGeno is freely available here.
Tutorial The latest version of supplementary material can be downloaded here, including a PDF document. Users can get instructions how to initiate the program, download and import a sequence file or data mirror, further analyze it using the interactive interface. A couple of sample files are also downloadable, which are helpful for new users to get fit of the software.
AvailabilityPentium CPU compatible or higher, 256M RAM is recommended. When the genome size is large than 5,000,000 BP, 512M or higher is optimal for obtaining a satisfied speed. A
Because the L. welshimeri genome has still not been published, so that here we are not allowed to provide it free at this moment. However, these nice examples will be released in one or two months, once the genome paper is accepted. Alternatively, L. inncua is also a good control example suitable for genome comparisons between listeria spp., although it is extremely simlar to L. monocytogenes, even we can not observe large islands of strain-specific genes. However, using inGeno, the characteristic genes related to pathogenicity can be distinctly noticed, such as factors (PrfA etc.), regulators, function proteins (Internalins). The detail description is written in the supplementary material. 1. Sequence input files (Genbank sample, GenBank, EMBL, GenbankXML), Blast alignment Report (Blast, Smith-Waterman as well as Genematcher version supported). These four files can be used to be imported into program, (Menu->File->Import), thus the program can analyze them. The alternative rapid review way is to restore the data mirror directly.
2. Data Snapshot File (recommended for visitors)
Here are some screenshots of the inGeno, which illustrate some features and applications useful for studing genome information, such as linear conservation, rearrangement events, phylogenetic evolution, specific functions which are necessary for reconstructing the metabolism. All the figures are taken from inGeno Ver 0.5a.
Figure 1. Typically investigation grahpic user interface (GUI). Detail description please read supplemenary materials.
Analysis Example Screenshots I. Visualization a strain-specific gene islands, which enable a novel metabolism pathway. These genes are continuous in genome, which might to form an operon (Potential operon detection sample (a strain-specific gene island), click to view large figure
II. Genome rearrangment events investigation samples (green lines in figure), LCB defined by color clusters.
Acknowledgements We thank DFG (Da208/7-1; Bo-1099/5-3) and BMBF for support (PathoGenoMik 0313134). We thank all biologists, bioinformaticians and computer scientists who contribute to the open-source Biojava effort.
Copyright by Dandekar Group, Bioinformatik, Uni Wuerzburg, 2006
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
updated:
2012.02.15 |
||||||
|
||||||