BZ Logo



 

An integrated genome and ortholog viewer (inGeno) is designed for genome sequence comparisons, which has been proven to be powerful, in particular to prokaryotic genomes of close phylogenetic distances. The original purpose of this software is to user-friendly visualize the corresponding relationships between orthologous genes. Step by step, a series of algorithms are implemented and integrated together, thus enable a noise-reducing process, a locus collinear block recognition and a text-mining step, which are helpful for users to extract biological information precisely.

Features

  • Dot-analysis and plot the results
  • User-friendly comparison interface
  • Gene information retrieval.
  • Linear regression and heuristic locus collinear blocks (LCB) recognition process
  • Pseudo-positive orthologous genes reducing procedure (low-noise control)
  • Color clusters re-definition using easy-used control panel.
  • Text-mining operations helpful for precisely extracting the gene information.
  • A sequence conversion tool which can transform rich-format genome sequence files into proteome multi-FASTA format files, which is suitable for making a BLAST or Smith-Waterman analysis.
  • Paracel GeneMatcher alignment report is supported, which make sense to obtain ideal sensitive results rapidly.

The detail operation method is written in detail in supplemental materials. The corresponding interface is recorded as screenshots.

Practice

This part is specially prepared for our students as a practice to acquire a quick impression of pairwise genome comparisons, through the sample files, people could notice easily the strain-specific genes in red color, the potential operons present in the genome, foreign insertion caused by phage and many rearrangement events.

Package

The practice package can be fetched HERE , which includes a lot of demo snapshot files, i.e., strain comparison results between Listeria, Staphylococcus, Blochmannia and E. coli species.

Steps

1. unpack the downloaded package, run it by simply clicking the executable file "ingeno.bat "(WIN) or typing "./ingeno" (LIN).

2. within inGeno, the menu "File" / "Restore from a snapshot" is designed for importing these snapshot files. Afterwards, users please sequentially click the buttons "Dotplot" and "Compare", the comparison figures will be generated in the screen. -Recommended Demo file is currently located at ./SampleData/Listeria/lmolwe.snapshot and ./SampleData/Listeria/lmolin.snapshot

3. in the panel, there are operation controls for helping users to optimize the view, e.g., zoom, distance, color clusters, thresholds. Moreover, mouse operations are allowed to be applied directly on the gene position and observe the corresponding information.

4. comparison from original files. Download the genome genbank files (lmo.genbank, lin.genbank) from the "SAMPLE FILES" below. Blast them with appropriate parameters and collect the reports. Or for convenience, download two reports (lmolin.report, linlmo.report), which are pairwise comparison for eliminating the false-negative noises. Load them (4 FILES) into inGeno software, note you have to upload all these files together in the "Import data from disk" window. Then you couldperform the comparison function.

 


News

 2006.12 ingeno 0.6 available (stable)

 2006.3 ingeno 0.5 available
 2006.2 ingeno 0.4 available
 2006.1 ingeno 0.3 available
 2005.12 ingeno 0.2 available
 2005.10 ingeno 0.1 available

Download

Program

Windows 

Exe:Download

 JarRAR :Download

 Linux

Bin: Download

 JarRAR: Download

 MaxOS

Zip: Download

 Solaris

bin: Download

(JarRAR file is a package file suitable for most users, users could unpack it, then execute it by ingeno.bat or ./ingeno)

 

Sourcecode

The Java sourcecode of inGeno is freely available here.

Development version Download for Linux , (Java JRE 6 required)

Supplementary material

Tutorial

The latest version of supplementary material can be downloaded here, including a PDF document. Users can get instructions how to initiate the program, download and import a sequence file or data mirror, further analyze it using the interactive interface. A couple of sample files are also downloadable, which are helpful for new users to get fit of the software.

       Supplementary material updated in July, 2006

 

Availability

Pentium CPU compatible or higher, 256M RAM is recommended. When the genome size is large than 5,000,000 BP, 512M or higher is optimal for obtaining a satisfied speed. A  JRE environment 1.5 or later version is demanded, since it is completely implemented in Java, the program can run properly on a various platforms (Linux/Windows-stable-,  MAC platforms-unaccomplished).

 


Sample Files

 

Because the L. welshimeri genome has still not been published, so that here we are not allowed to provide it free at this moment. However, these nice examples will be released in one or two months, once the genome paper is accepted. 

Alternatively, L. inncua is also a good control example suitable for genome comparisons between listeria spp., although it is extremely simlar to L. monocytogenes, even we can not observe large islands of strain-specific genes. However, using inGeno, the characteristic genes related to pathogenicity can be distinctly noticed, such as factors (PrfA etc.), regulators, function proteins (Internalins).  The detail description is written in the supplementary material.

1. Sequence input files (Genbank sample, GenBank, EMBL, GenbankXML), Blast alignment Report (Blast, Smith-Waterman as well as Genematcher version supported). These four files can be used to be imported into program, (Menu->File->Import), thus the program can analyze them. The alternative rapid review way is to restore the data mirror directly.

 Genome Files 

 L. monocytogenes 

 L. innocua 

 Blast reports

 lmoVSlin

 linVSlmo 

Please download these FOUR files into one directory, e.g. sampledata folder. then load two genome sequence files using in the first/second rows (File1/File2), load two report files in the third/fourth rows (BlastResult1 / BlastResult2). Then specify the desired threshold with the sliderbars, eventually simply click the <Let's GO> button. 

 

2. Data Snapshot File (recommended for visitors)

 Snapshot file  

 lmoVSlin

By this way, new users can restore the data mirror easily (operation: Menu->File-> Restore a snapshot), so as to review the comparison result and have a fast impression of the software. Hereby, we suggest new-users start using this file.

Screenshots

Here are some screenshots of the inGeno, which illustrate some features and applications useful for studing genome information, such as linear conservation, rearrangement events, phylogenetic evolution, specific functions which are necessary for reconstructing the metabolism. All the figures are taken from inGeno Ver 0.5a.

Figure 1. Typically investigation grahpic user interface (GUI). Detail description please read supplemenary materials.

 

Analysis Example Screenshots

I. Visualization a strain-specific gene islands, which enable a novel metabolism pathway. These genes are continuous in genome, which might to form an operon (Potential operon detection sample (a strain-specific gene island), click to view large figure

     

  

II. Genome rearrangment events investigation samples (green lines in figure), LCB defined by color clusters.

   

Impressum

Acknowledgements

We thank DFG (Da208/7-1; Bo-1099/5-3) and BMBF for support (PathoGenoMik 0313134). We thank all biologists, bioinformaticians and computer scientists who contribute to the open-source Biojava effort.

Contact information

Prof. Dr. Dandekar

 Dandekar@biozentrum.uni-wuerzburg.de

Chunguang Liang

 liang@biozentrum.uni-wuerzburg.de

 

 

Tel.: +49 (0)931 888 4551

Tel.: +49 (0)931 888 4561 (tech.)

Fax: +49 (0)931 888 4552

Post: Bioinformatics Dept., Biocenter Am Hubland, University of Wuerzburg, Germany

PLZ: D-97074


Copyright by Dandekar Group, Bioinformatik, Uni Wuerzburg, 2006
All rights reservered.



Zukunft
updated:
2012.02.15

Link zur neuen Homepage    Impress (german only)