In this paper we study the problem of redistributing in parallel data between clusters interconnected by a backbone. This problem is a generalization of the well-known redistribution problem that appears in parallelism [9]. We suppose that at most k communications can be performed at the same time (the value of k depending on the characteristics of the platform). We use the knowledge of the application in order to schedule the messages and perform a control of the congestion by ourselves. Previous results [7, 6] show that this problem is NP-Complete. We propose and study two fast and ef.cient algorithms for this problem. We prove that these algorithms are 2-approximation algorithms. Simulation results show that both algorithms perform very well compared to the optimal solution. These algorithms have been implemented using MPI. Experimental results show that both algorithms outperform a brute-force TCP based solution, where no scheduling of the messages is performed.
Citation:
Emmanuel Jeannot, Fr?d?ric Wagner, "Two Fast and Efficient Message Scheduling Algorithms for Data Redistribution through a Backbone," ipdps, vol. 1, pp.3b, 18th International Parallel and Distributed Processing Symposium (IPDPS'04) - Papers, 2004