HeaderHeader

Welcome to the Octo Beowulf cluster

last update: June 25, 2003: Michele

Introduction

The PMA Divisional Beowulf (Octo, begun March 2003 as a follow-up to Gordon, begun February 2001) is meant to allow access to a research grade cluster to students as well as research groups. The system is currently being used by many different groups within Physics, and by the students in the ph20/21/22 class. A very special thank you goes to Intel Corporation for supplying most of the computing and networking equipment. Without their generous donation this project never would have been possible.

The Octo Beowulf team is Chris Mach and Michele Vallisneri.

Hardware

Software

Accounts and support

Compiling and running parallel jobs on Octo

Compiling MPI jobs

Running MPI jobs interactively

We discourage running parallel jobs interactively on the cluster. The golden rule of Beowulf systems is one CPU, one process; degraded performance, general user frustration and administrator wrath are the results of violating this rule. So please run your jobs using the PBS queues (see below).

This said, we reserved the server, octo (a dual processor machine) to run MPI test jobs in interactive mode. Maybe you do not want to wait in the queue because you know that your undebugged code is going to crash immediately anyway. Just remember that octo is used by everybody to compile programs and to serve files: do not to hog it.

Running a batch MPI job

This is how a queuing system works: you submit a job to a queue, requesting a certain resources (typically, a certain number of processors for a certain maximum time). When the requested number of processors becomes available (and when fair-share policies are satisfied), your job is started. This queueing system is designed to avoid oversubscribing (so that each available processor will run only one process at a time, maximising performance) and to share the computing power of the cluster equitably between all users.

Feedback and documentation

Please provide your feedback (beowulf@alice.caltech.edu). As for documentation, good places to start are the LAM/MPI and OpenPBS websites.

Valid XHTML 1.0!