Joshi, Y and Vadhiyar, S (2009) Analysis of DNA sequence transformations on grids. In: Journal of Parallel and Distributed Computing, 69 (1). pp. 80-90.
The study of the evolution of species or organisms is essential for various biological applications. Evolution is typically studied at the molecular level by analyzing the mutations of DNA sequences of organisms. Techniques have been developed for building phylogenetic or evolutionary trees for a set of sequences. Though phylogenetic trees capture the overall evolutionary relationships among the sequences, they do not reveal fine-level details of the evolution. In this work, we attempt to resolve various fine-level sequence transformation details associated with a phylogenetic tree using cellular automata. In particular, our work tries to determine the cellular automata rules for neighbor-dependent mutations of segments of DNA sequences. We also determine the number of time steps needed for the evolution of a progeny from an ancestor, as well as the unknown segments of the intermediate sequences in the phylogenetic tree. Because the number of possible cellular automata rules is vast, we have developed a grid system that performs parallel guided explorations of the rules on grid resources. We demonstrate our techniques by conducting experiments on a grid comprising machines in three countries and obtaining potentially useful statistics regarding evolution in three HIV sequences. In particular, our work is able to verify the phenomenon of neighbor-dependent mutations and finds that certain combinations of neighbor-dependent mutations, defined by a cellular automata rule, occur with greater than 90% probability. We also find the average number of time steps for mutations for some branches of the phylogenetic tree over a large number of possible transformations, with standard deviations of less than 2.
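To make the idea of a neighbor-dependent mutation rule concrete, the following is a minimal sketch of a one-dimensional cellular automaton over the DNA alphabet. It is not the authors' implementation. The rule encoding (a dictionary keyed by left neighbor, base, right neighbor), the periodic boundary, the example rule, and the names `apply_rule` and `steps_to_reach` are all assumptions made for illustration. The parallel guided exploration of rules on grid resources described in the abstract is not shown.

```python
def apply_rule(seq, rule):
    """Apply one synchronous time step of a neighbor-dependent rule.

    `rule` maps (left, base, right) to the new base. Neighborhoods that
    are not listed leave the base unchanged (an assumption of this sketch).
    """
    n = len(seq)
    out = []
    for i in range(n):
        left = seq[i - 1]            # periodic boundary (assumption)
        right = seq[(i + 1) % n]
        out.append(rule.get((left, seq[i], right), seq[i]))
    return "".join(out)


def steps_to_reach(ancestor, progeny, rule, max_steps=50):
    """Return the number of time steps after which `ancestor` becomes
    `progeny` under `rule`, or None if that does not happen within
    `max_steps` steps."""
    current = ancestor
    for t in range(max_steps + 1):
        if current == progeny:
            return t
        current = apply_rule(current, rule)
    return None


# Hypothetical rule: a C with A on its left and G on its right mutates to T.
example_rule = {("A", "C", "G"): "T"}

if __name__ == "__main__":
    print(apply_rule("ACGT", example_rule))
    print(steps_to_reach("ACGT", "ATGT", example_rule))
```

In this toy setting, searching for a rule amounts to trying candidate dictionaries and keeping those for which `steps_to_reach` returns a value. The space of such dictionaries grows very quickly with the neighborhood size, which is consistent with the abstract's motivation for distributing the exploration.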
Item Type: Journal Article
Additional Information: Copyright of this Article belongs to Elsevier Science.
Keywords: Bioinformatics; Biology and genetics; Grid computing; Grids; Grid applications; Distributed systems; Client/Server; Distributed applications
Department/Centre: Division of Information Sciences > Supercomputer Education & Research Centre
Date Deposited: 22 Feb 2010 09:14
Last Modified: 19 Sep 2010 05:55