Helix bioinformatics



From PSTs to gene localization

The PepMap software localizes genes by mapping PSTs (peptide sequence tags) onto raw genomic data (i.e. complete, unannotated chromosomes).

As with protein identification, the PepMap algorithm involves two steps:

A mapping phase locates the PSTs on the six translation frames of the genomic sequences, which gives putative locations for the regions encoding them. By taking partial matches into account (i.e. one of the flanking masses of a PST is not recognized while the other is), PepMap may additionally provide important information about intron/exon boundaries.

Figure: PST matching types
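
The mapping phase can be illustrated with a minimal Python sketch. It is not PepMap's implementation: it only searches for the amino-acid sequence of a tag in the six translation frames and ignores the flanking masses, so partial matches are not handled. The names `translate`, `reverse_complement` and `map_tag` are hypothetical, and the standard genetic code is assumed.

```python
# Illustrative sketch only: exact search of a tag sequence in six frames.
# Flanking masses (and therefore partial matches) are deliberately ignored.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna, offset):
    """Translate dna from the given offset (0, 1 or 2); unknown codons give 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(offset, len(dna) - 2, 3)
    )


def map_tag(genome, tag):
    """Return (strand, frame, start, end) for every occurrence of tag.

    start and end are 0-based, end-exclusive coordinates on the forward strand.
    """
    hits = []
    n = len(genome)
    for strand, seq in (("+", genome), ("-", reverse_complement(genome))):
        for offset in range(3):
            protein = translate(seq, offset)
            pos = protein.find(tag)
            while pos != -1:
                start = offset + 3 * pos
                end = start + 3 * len(tag)
                if strand == "-":
                    # Convert reverse-strand coordinates back to the forward strand.
                    start, end = n - end, n - start
                hits.append((strand, offset, start, end))
                pos = protein.find(tag, pos + 1)
    return hits


# Toy sequence: ATG AAA TGG encodes M, K, W starting at position 2.
forward_hits = map_tag("CCATGAAATGGTT", "MKW")
reverse_hits = map_tag(reverse_complement("CCATGAAATGGTT"), "MKW")
```

On this toy sequence the tag is found once, on the forward strand; on the reverse-complemented sequence the same tag is reported on the "-" strand with forward-strand coordinates.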

A clustering phase then groups the PST matches that belong to the same protein, which helps identify the corresponding gene. We devised several algorithms for this purpose, but a simple single-linkage clustering procedure gives good results: a match is clustered with surrounding matches if they are closer than a given maximum distance (typically 5,000 bp for the Arabidopsis thaliana genome and 15,000 bp for the human genome).

Figure: PepMap gene localization
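
The single-linkage step can be sketched as follows. This is a simplified illustration, not PepMap's code: matches are assumed to be (start, end) intervals on a single chromosome, strand is ignored, and the function name `cluster_matches` is hypothetical. On a line, single-linkage clustering with a distance cutoff reduces to sorting the matches and starting a new cluster whenever the gap to the current cluster reaches the cutoff.

```python
# Illustrative sketch only: 1-D single-linkage clustering with a distance cutoff.
def cluster_matches(matches, max_distance):
    """Group (start, end) matches whose gap is smaller than max_distance (in bp)."""
    clusters = []
    cluster_end = None  # rightmost end coordinate of the current cluster
    for start, end in sorted(matches):
        if clusters and start - cluster_end < max_distance:
            # Close enough to at least one member of the current cluster.
            clusters[-1].append((start, end))
            cluster_end = max(cluster_end, end)
        else:
            # Gap too large (or first match): open a new cluster.
            clusters.append([(start, end)])
            cluster_end = end
    return clusters


# Made-up match positions, clustered with the two cutoffs quoted above.
toy_matches = [(100, 130), (20000, 20030), (3000, 3027), (26000, 26030)]
clusters_5kb = cluster_matches(toy_matches, 5000)
clusters_15kb = cluster_matches(toy_matches, 15000)
```

With the 5,000 bp cutoff the toy matches form three clusters; with the 15,000 bp cutoff the last two matches merge, giving two clusters. A larger cutoff suits genomes with longer introns, at the cost of merging matches from neighbouring genes more often.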
