NCSA Home
Contact Us | Intranet | Search

NCSA NEWS

News Home
Calendar
Images
Subscribe to Our Newsletter

Nug30—Unsolved for More than Three Decades—Cracked on the Grid

released July 18, 2000

 

Contact
Karen Green
Public Information Officer
kareng@ncsa.uiuc.edu
217.265.0748 phone
217.244.7396 fax

CHAMPAIGN, IL — In what one researcher called one of "the largest Grid computing successes for the Alliance" to date, researchers at National Computational Science Alliance partners University of Iowa and Argonne National Laboratory announced in June that they used the Partnerships for Advanced Computational Infrastructure (PACI) Grid to solve the nug30 quadratic assignment problem (QAP). Nug30 had gone unsolved since 1968 when it was first proposed as a test problem in the area of applied mathematics known as location theory.

Kurt Anstreicher, a professor of management sciences at the University of Iowa, and his student Nathan Brixius designed the algorithm that solved the nug30 problem. They worked in collaboration with Jeff Linderoth and Jean-Pierre Goux of Argonne to parallelize their QAP algorithm using a variety of Alliance Grid resources and software tools.

Processors from machines at eight locations around the world using five different computing platforms were harnessed in the seven-day nug30 run-effectively creating a single, integrated parallel computing platform. The QAP solution consumed more than 96,000 CPU-hours. An average of about 650 processors were used to solve the problem at any given time, with a peak of 1,009.

Processors at Argonne, the University of Wisconsin, Georgia Tech's Interactive High Performance Computing Laboratory, the Italian Istituto Nazionale di Fisica Nucleare, the National Center for Supercomputing Applications (NCSA), the Albuquerque High Performance Computing Center, Columbia University, and Northwestern University participated in the nug30 run. SGI Origin2000 supercomputers at Argonne and NCSA also contributed.

"Computing like this, using different systems in different locations to create a single [problem-solving] environment, has opened the door to addressing new, important scientific applications," says Jeff Linderoth, a research scientist in the MetaNEOS project at Argonne. "However, as this sort of heterogenous, distributed computing becomes more prevalent, it has to become easier."

The tools deployed by Alliance partners made the massive nug30 computation easier, even for intrepid early users like the team from Argonne and Iowa. The Condor high-throughput computing system was used to harness the power of desktop computers and commodity clusters by monitoring their status and running jobs on them when they are available. The MetaNEOS project's Master-Worker library was used to implement the algorithm on Condor, coordinating the activities of the large number of processors in solving the problem. Finally, the Globus toolkit was used to setup access to some of the computational resources. Condor was developed at Wisconsin; MetaNEOS's Master-Worker was developed by Argonne and Wisconsin; and Globus development is centered at Argonne and the University of Southern California, with NCSA and others also contributing.

"The most important thing to remember here is that this wasn't a demo," says Miron Livny, a computer science professor at Wisconsin who heads the Condor project. "The nug30 team wasn't working in any sort of bubble. They were using generic grid middle-ware, on production grid resources, in a very dynamic multi-user environment."

"This tremendous achievement demonstrates that the Grid being established within the Alliance can have a real impact on the practice of science," says Ian Foster, a senior scientist at Argonne and computer science professor at the University of Chicago who co-leads the Globus project. "As grid services provided by Globus and Condor enter the infrastructure, we can expect that computations such as these will become easier and breakthrough results commonplace."

In QAPs like nug30, there are a set of n locations and a set of n facilities, and each facility must be assigned a location. To measure the cost of each possible assignment, the flow between each pair of facilities is multiplied by the distance between the pair's assigned locations, and then a sum is taken over all of the pairs. In nug30, for example, 30 facilities must be assigned to 30 fixed locations, and the goal is to find the assignment that minimizes total cost of transferring material between the facilities.

QAP problems like these present themselves in a variety of fields, from designing the layout of hospitals or factories to designing computer chips. Despite the simplicity of the problem statement, QAPs are incredibly difficult to solve to optimality.

"The complexity of a QAP like nug30 is really hard to imagine," says Anstreicher. "Without the use of metacomputing, the solution process would have required about 7 years on a fast workstation, even with our very sophisticated algorithm."

 

Headlines Archive