Rangannan, Vetriselvi and Bansal, Manju (2010) High-quality annotation of promoter regions for 913 bacterial genomes. In: Bioinformatics, 26 (24). pp. 3043-3050.
High-quality.pdf - Published Version
Restricted to Registered users only
Download (957Kb) | Request a copy
Motivation: The number of bacterial genomes being sequenced is increasing very rapidly and hence, it is crucial to have procedures for rapid and reliable annotation of their functional elements such as promoter regions, which control the expression of each gene or each transcription unit of the genome. The present work addresses this requirement and presents a generic method applicable across organisms. Results: Relative stability of the DNA double helical sequences has been used to discriminate promoter regions from non-promoter regions. Based on the difference in stability between neighboring regions, an algorithm has been implemented to predict promoter regions on a large scale over 913 microbial genome sequences. The average free energy values for the promoter regions as well as their downstream regions are found to differ, depending on their GC content. Threshold values to identify promoter regions have been derived using sequences flanking a subset of translation start sites from all microbial genomes and then used to predict promoters over the complete genome sequences. An average recall value of 72% (which indicates the percentage of protein and RNA coding genes with predicted promoter regions assigned to them) and precision of 56% is achieved over the 913 microbial genome dataset.
|Item Type:||Journal Article|
|Additional Information:||Copyright of this article belongs to Oxford University Press.|
|Department/Centre:||Division of Biological Sciences > Molecular Biophysics Unit|
|Date Deposited:||27 Dec 2010 07:41|
|Last Modified:||27 Dec 2010 07:41|
Actions (login required)