<?xml version="1.0" standalone="no"?>

<!DOCTYPE article PUBLIC "-//OASIS//DTD DocBook XML V4.2//EN"
          "http://www.oasis-open.org/docbook/xml/4.2/docbookx.dtd">
<article>
<articleinfo>
  <title>NPACI Rocks at NCSU: Installing and Realmification</title>
  <author><firstname>Jack</firstname><surname>Neely</surname>
    <email>jjneely@pams.ncsu.edu</email>
  </author>
</articleinfo>

<section><title>New Beowulf Software from PAMS Computer Operations</title>

<para>The next generation of software solutions for high performance computing offered by PAMS Computer Operations is a Red Hat Linux based Beowulf distribution called <ulink url="http://www.rocksclusters.org">NPACI Rocks</ulink>.  We are phasing out the very old NCSU Scyld distribution in favor of this newer and more powerful software.  The goal of this short document is to provide some instructions about installing and realmifying the Rocks distribution.</para>

<para>The new software contains many features.  Many of which are much more advanced and mature when compared to the NCSU Scyld Beowulf distribution.  The software contains a web based monitoring tool, MPICH, PVM, PVFS, OpenPBS, the Sun Grid Engine, etc.  One of the major goals of this project is easy maintenance, and it is pretty easy to install and maintain.</para>

</section>

<section><title>Installing NPACI Rocks</title>

<para>Here I will point you to the <ulink url="http://www.rocksclusters.org">NPACI Rocks Documentation</ulink>.  It contains all the instructions for getting the software, installing it, and start computing with it. </para>

<para>Note that it is easier to complete the rest of this document after you have installed the front end machine and before you have installed any of the slave nodes.</para>

</section>

<section><title>Realmification</title>

<para>Realmification of the Rocks distribution is pretty simple.  All you need to do is to install the <emphasis>ncsu-beowulf</emphasis> package on the cluster.  This package will configure kerberos to interface with the kerberos authentication system here at NCSU.  It modifies one of the PAM configuration files.  Finally, it installs the <command>allowuser</command> and <command>rmuser</command> commands.  Which will allow you to let NCSU realm IDs log into the cluster.  Both command accept the <parameter>--help</parameter> argument to display how they are to be used.</para>

<para>The latest version of the <emphasis>ncsu-beowulf</emphasis> package is availible on <systemitem>kickstart.linux.ncsu.edu</systemitem> in the <filename>/pub/contrib/jjneely/NPACI_Rocks</filename> directory.</para>

</section>

<section><title>Notes about Installation</title>

<para>By now you should be able to install your slave nodes and have a working system.  However, you may want to do some other configuration before hand.  The configuration you do is all dependent on how you would like to use your cluster.  The following are some pointers to set up a cluster similar to how PAMS plans to use this software.</para>

<section><title>Sun Grid Engine</title>

<para>You will need to create the file <filename>/etc/USESGE</filename> to switch to the Sun Grid Engine from the default batch system (OpenPBS).  You should do this before you install your slave nodes.  This way the <emphasis>insert-ethers</emphasis> program will be able to do some of the configuration for SGE for you.  Rocks version 2.3 had some bugs in the SGE packages so you will want to at least use Rocks 2.3.1.</para>

</section>

<!--
<section><title>ATLAS and the Linpack Benchmark</title>

<para>Another issue you may run into is running the Linpack benchmark.  The ATLAS libraries that Linpack needs are optimized for the Pentium 4 and sometimes don't work right on other Pentiums.  You may want to recompile ATLAS and then relink the HPL packages so that ATLAS will be optimized for the processors in your system.  I've put up some packages of a recompiled ATLAS for the Pentium 3 and rebuild HPL packages on <systemitem>kickstart.linux.ncsu.edu</systemitem> in the <filename>/pub/contrib/jjneely/NPACI_Rocks</filename> directory.  You will also want to drop these packages in <filename>/export/home/install/contrib/public/i386/RPMS</filename>, then change directory to <filename>/export/home/install</filename>, and finally run the command <command>rocks-dist dist</command>.  That will update the install tree that your slave nodes install from so that they will get the updated packages.</para>

</section>
-->

</section>

</article>
