This article is composed of the following sections:
There are two types of machines in the Valkyrie cluster:
Front end nodes are used for compiling and launching jobs. There is one front end node named: valkyrie.ucsd.edu.
Compute nodes are the machines on which jobs run. There are sixteen (16) compute nodes, compute-0-0 through compute-0-15.
All of the Valkyrie nodes are linked by a special highspeed Myrinet network. They also can communicate via a standard (100mbit) Ethernet which is slower.
MPI is a library specification for message-passing, proposed as a standard by a broadly based committee of vendors, implementors, and users.
MPICH is a freely available, portable implementation of MPI, the Standard for message-passing libraries. MPICH is maintained by Argonne National Laboratory (ANL) and Mississippi State University (MSU).
GM is a message-passing system for Myrinet networks. The GM system includes a driver, Myrinet-interface control program, a network mapping program, and the GM API, library, and header files.
GM features include:
Myricom has implemented MPICH over the MyriNet GM message passing library. This version of MPI is installed under "/opt/mpich".
Set the following environment variable:
export PATH=/opt/mpich/myrinet/gnu/bin:$PATH
set path = ( /opt/mpich/myrinet/gnu/bin $path )
The compiler executable is named mpicc.
For example, compiling the sample cpi.c program which may be copied from /opt/mpich/myrinet/gnu/examples/cpi.c would look like:
valkyrie.ucsd.edu% mpicc cpi.c -o cpi
If for some reason you wish to use the standard (100mbit) Ethernet instead of the highspeed MyriNet for interprocess communication, compile your programs using the /opt/mpich tree.
Set the following environment variable.
export PATH=/opt/mpich/gnu/bin:$PATH
set path = ( /opt/mpich/gnu/bin $path )
Compile your source files using mpicc as described above.
Mpirun on Rocks clusters is used to launch jobs that are linked with the Ethernet device for MPICH.
In this section we will assume that you have compiled the sample "cpi" program.
For example, to interactively launch the "cpi" program on two processors:
compute-0-0
compute-0-1
export PATH=/opt/mpich/gnu/bin:$PATH
set path = ( /opt/mpich/gnu/bin $path )
mpirun -nolocal -np 2 -machinefile machines
For example for user cs260c user:
mpirun -nolocal -np 2 -machinefile machines /home/cs260c/cpi
Mpirun.ch_gm on Rocks clusters is used to launch jobs that are linked with the Myrinet device for MPICH.
For example, to interactively launch the "cpi" program on two processes:
compute-0-0
compute-0-1
Set the PATH:
export PATH=/opt/mpich/myrinet/gnu/bin:$PATH
set path = ( /opt/mpich/myrinet/gnu/bin $path )
mpirun.ch_gm -np 2 -machinefile machines
mpirun.ch_gm -np 2 -machinefile machines /home/cs260c/cpi
cluster-ps and cluster-kill commands can be used on Valkyrie to check the running processes and the owner of the process/es can kill his/her job.
[valkyrie ~]$ cluster-ps
[valkyrie ~]$ cluster-kill (username)