Helix bioinformatics


	Context and situation

	Research activities

	Partnerships

	Teaching activities

	Members

	Former members


	Evolution of species and gene families

	Spatial organization of genomic information

	Syntaxic and functionnal genome annotation

	Proteomics

	Modeling and simulation of genetic regulatory networks

	Information extraction from texts


	Evolution of gene and gene families

	Spatial organization of genomic information

	Syntaxic and functionnal genome annotation

	Proteomics

	Modeling and simulation of genetic regulatory networks

	Information extraction from text


	Publications by year

	Publications by author

	Export


	The GenoStar integrated bioinformatics platform for exploratory genomics

	GEB: GenoExpertBacteria

	GNA: Genetic Network Analyzer

	PepLine: high throughput proteomics

	Herbs: checking the consistency of proteome annotations

	ISee: In Silico biology e-learning environment

	BOX: XML specifications of genomic data

	AROM: entity-relationship knowledge modeling


	Software and database releases

	Talks, seminars, poster presentations,...

	PhD and Master thesis defenses

	Training and job opportunities

Software and databases

PepLine: high throughput proteomics


		Software pipeline for protein identification and gene localization from tandem mass spectrometry data

Tandem mass spectrometry (MS/MS) presently represents the far most effective tool for high throughput proteomic analysis. MS/MS spectra provide informations that allow the interpretation of the sequence part of the proteins under study, leading to protein identifications.

In this context, Helix and the "Laboratoire Chimie des Protéines", with GENOME express for the PepMap module, designed PepLine: a software pipeline dedicated to high throughput proteomics. If most of the proteomic softwares achieve protein identification through requests on protein databases, PepLine offers moreover to localize genomic regions that putatively code for observed proteins that have not been discovered yet.

At the present time, PepLine is made up of two components: Taggor and PepMap.

The Taggor component allows the partial sequence interpretation of the results produced by MS/MS experiments (MS/MS spectra peak lists) without any reconstruction phase against protein databases. Taggor output is a set of Peptide Sequence Tags.

A set of PSTs represents the input of the PepMap software. PepMap aims at identifying proteins through protein database screening or at localizing genes on a complete unannotated genome of the organism under study (FastA files).

These two software modules have been written in C language. Their validation on experimental data is in progress.

A first prototype of a graphic user interface dedicated to visualization and management of PepMap results has been implemented in Java.

PepMap GUI - 74.9 ko
PepMap GUI
Gene localization on the genome of Arabidopsis thaliana. The left panel of the PepMap GUI main window presents the PST best locations. The right panel displays the genomic location selected in the left panel (7 PSTs found). The optional window shows the nucleic sequence corresponding to the selected location (the nucleotides colored in red/green point out PST sequence/mass correspondances).

	The GenoStar integrated bioinformatics platform for exploratory genomics
	GEB: GenoExpertBacteria
	GNA: Genetic Network Analyzer
	PepLine: high throughput proteomics
	Herbs: checking the consistency of proteome annotations
	ISee: In Silico biology e-learning environment
	BOX: XML specifications of genomic data
	AROM: entity-relationship knowledge modeling

	CEA/LCP, Grenoble
	From MS/MS spectra to PSTs
	From PSTs to protein identification
	PepLine: high throughput proteomics
	From PSTs to gene localization