Tikar, Sandip and Vadhiyar, Sathish (2008) Efficient reuse of replicated parallel data segments in computational grids. In: Future Generation Computer Systems, 24 (7). pp. 644-657.
Grids are being used for executing parallel applications over remote resources. To execute a parallel application on a set of grid resources chosen by a user or a grid scheduler, the input data needed by the application is segmented according to the data distribution followed in the application, and the data segments are distributed to the grid resources. The same input data may be used subsequently by different applications, leading to multiple copies (replicas) of parallel data segments on various grid resources. The data needed for a parallel application can then be gathered from the existing replicas onto the computational resources chosen by the grid scheduler for application execution. In this work, we have devised novel algorithms for determining the “nearest” replica sites containing the data segments needed by a parallel application executing on a set of resources, with the objective of minimizing the time needed to transfer the data segments from the replica sites to the resources. We have tested our algorithms on different kinds of experimental setups. We find that the best algorithm varies according to the configuration of data servers and clients. In all cases, our algorithms performed better than the existing algorithms by at least 15%.
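The replica-selection problem described in the abstract can be illustrated with a minimal sketch. This is not one of the paper's algorithms, which the abstract does not specify; it is a simple greedy baseline written for illustration only. The function name, the input dictionaries, and the cost model (each replica site serves its transfers one after another, and transfer time is segment size divided by an estimated bandwidth) are all assumptions.

```python
from collections import defaultdict


def greedy_replica_selection(requests, replicas, bandwidth, segment_size):
    """Illustrative greedy baseline (hypothetical, not the paper's method).

    requests:     list of (client, segment) pairs that must be satisfied
    replicas:     dict mapping segment -> list of sites holding a copy
    bandwidth:    dict mapping (site, client) -> estimated rate in MB/s
    segment_size: dict mapping segment -> size in MB

    Assumed cost model: a site sends its assigned segments sequentially.
    Returns (assignment, makespan), where assignment maps each
    (client, segment) request to the chosen site and makespan is the
    time at which the last transfer finishes.
    """
    busy_until = defaultdict(float)  # time at which each site becomes free
    assignment = {}

    for client, segment in requests:
        best_site, best_finish = None, float("inf")
        for site in replicas.get(segment, []):
            transfer_time = segment_size[segment] / bandwidth[(site, client)]
            finish = busy_until[site] + transfer_time
            if finish < best_finish:
                best_site, best_finish = site, finish
        if best_site is None:
            raise ValueError(f"no replica available for segment {segment!r}")
        busy_until[best_site] = best_finish
        assignment[(client, segment)] = best_site

    makespan = max(busy_until.values(), default=0.0)
    return assignment, makespan
```

Because the sketch accounts for transfers already queued at a site, two clients requesting the same segment may be directed to different replicas even when one site has the highest raw bandwidth to both. This reflects the abstract's observation that the best choice depends on the configuration of data servers and clients.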
|Item Type:||Journal Article|
|Additional Information:||Copyright of this article belongs to Elsevier.|
|Keywords:||Replica selection;Scheduling;Data movement;Grids;Parallel applications;Parallel Algorithms.|
|Department/Centre:||Division of Information Sciences > Supercomputer Education & Research Centre|
|Date Deposited:||04 Aug 2008|
|Last Modified:||19 Sep 2010 04:48|