This wiki is obsolete, see the NorduGrid web pages for up to date information.

Debian

From NorduGrid
Jump to navigationJump to search

Debian is a fine distribution of Linux that is provided by a community of users rather than a commercial institution. More rencently, OpenSuSE (Novell) and Fedora (RedHat) have adopted that form of organisation for themselves.

Quick start

Repositories

Official channels for Debian

All ARC dependencies (Globus, LFC, VOMS, etc) have been part of the official Debian stable release since "squeeze" (Debian 6.0). Backports to the old stable ("lenny") have not yet been addressed via the official backports.org initiative. Instead, it is suggested to refer to NorduGrid's repository as shown below. ARC itself is in the testing and unstable distributions. To include these packages to your current Debian installation, please add

deb http://ftp.[yourcountrycode].debian.org/debian/ testing main contrib non-free

to your /etc/apt/sources.list file. If you don't have a Debian mirror with your countrycode, please cross-check with the Debian Mirror List.

NorduGrid's repository

NorduGrid also maintains its own repository which can be used without enabling testing or unstable repositories, or for older versions of Debian.

GPG Key

The archives are signed by the key available at:

 http://download.nordugrid.org/DEB-GPG-KEY-nordugrid.asc. 

Download the key and then install it in your apt configuration using the command:

sudo apt-key add DEB-GPG-KEY-nordugrid.asc

Client installation

Globus certificate utilities are needed if you need to request a new NorduGrid Grid certificate. Other Certificate Authorities (CAs) may provide different means of issuing Grid certificates.

  sudo apt-get install globus-gsi-cert-utils-progs

Keys of potentially relevant CAs are needed to verify the authenticity of Grid services; a simple approach is to install all of them by using

  sudo apt-get install ca-*

This is however not recommended from the security perspective, and security-conscious people must install necessary certificates one by one.

One can also install CA keys from the original IGTF repository, see section #Installing CA keys from IGTF repository


Then install ARC client itself, which will pull in other necessary packages:

  sudo apt-get install nordugrid-arc-client

In all likelihood, you will need to install Globus plugins as well (as long as GridFTP is around):

  sudo apt-get install nordugrid-arc-plugins-globus


Installing CA keys from EGI repository

EGI has experimental support for Debian when it comes to CA keys:add this repository

  #### EGI Trust Anchor Distribution ####
  deb http://repository.egi.eu/sw/production/cas/1/current egi-igtf core

Get the GPG key

  wget -q -O - \
    https://dist.eugridpma.info/distribution/igtf/current/GPG-KEY-EUGridPMA-RPM-3 \
    | apt-key add -

And install the packages:

  apt-get update
  apt-get install ca-policy-egi-core


Installing CA keys from IGTF repository

When nothing works or you need an urgent update before the packages are in repositories, get the keys directly from the upstream provider:

  wget http://dist.eugridpma.info/distribution/current/igtf-policy-installation-bundle-1.40.tar.gz
  tar xvzf igtf-policy-installation-bundle-1.40.tar.gz
  cd igtf-policy-installation-bundle-1.40
  ./configure --with-profile=classic
  sudo make install

You may need to browse to that repository first and find out what is the most recent version number, instead of 1.40

Status

  • Globus packages are accepted by Debian in June 2009
  • Production ARC packages are in Debian unstable as of June 2011

Motivation to use Debian with Computational Grids

Debian is special in its support of many platforms, across all of which the same set of files is made available at the same locations. Hence, if a software runs on one target, then it should also on all the other Debian systems, if memory and other hardware requirements are met. This is exactly what one needs for Grid Computing when the hardware underneath (PowerPC, Sun, Intel, AMD) may differ.

Communities

  • Bioinformatics is mostly provided via the Debian-Med project, a task of which is the provisioning of a Bioinformatics infrastructure. The KnowARC project has produced an interface to the production ARC middleware from the workflow management tool Taverna, which is very popular among bioinformaticians. See grid.inb.uni-luebeck.de.
  • Physics is partially addressed by the Debian-Science initiative. This group has already provided the following packages that are available in the regular Debian distribution
  • CERN is buffering packages prior to an official upload in their own Debian repository since there are still license issues with the remote distribution for CERN.

With Mattias Wadenstein and Steffen Möller, there are at least two individuals that are very close to the Debian Society and the NorduGrid. Mattias Ellert does some fantastic bits to Globus, albeit more aiming at Fedora than at Debian, possibly :) Anders Wäänänen laid out the packaging of ARC and maintains all the nice regular builds.

Accessories

Debian packages as Runtime Environments

The script debian2runtime was developed within the KnowARC project to convert the description of a Debian package into a script that may serve as a runtime environment with production ARC.

Download and post-processing of external data

Together with the Debian-Med community the script getData was developed that downloads, updates and indexes common public databases. It comes with an extensive description that is available on the Debian Wiki pages.

Detailed notes

Configuration

For the client, the command line tools such as arcsub, arcstat and arcget (counterparts to the old ngsub, ngstat, ngget etc) can be used directly and no specific configuration is necessary.

For services see the official NorduGrid documentation for details. In most cases there is nothing Debian-specific that needs to be done.

Installation of packages

Client installtion

For a client installation, perform as follows

 # execute as root
 apt-get install nordugrid-arc-client 

Presuming that you have certificates that grant Grid access in place already, test it with

 $ arctest -J 2
 $ arcstat <jobid>  <- repeat until job reports "Finished"
 $ arccat "that job id" 

If you can read the "Hello, grid!", then you were successful.

Server installation

For a server installation, execute as root

 apt-get install nordugrid-arc-client nordugrid-arc-server

The client package is only needed to test this installation. If the -libs package is not coming with it, then please install it manually.

The configuration file is expected at

 /etc/arc.conf

as usual for ARC but not placed there by the install script. Upon installation of the host certificates, preparation of the directories, one can start the services

 /etc/init.d/a-rex start
 /etc/init.d/gridftpd start
 /etc/init.d/grid-infosys start

Now start a test job

 f=`hostname --long`
 arctest -J 2 -d INFO -c $f

Error messages are in /var/log/ and in /var/spool/nordugrid/jobstatus.

Installation of Globus (obsolete now)

You are now ready to perform the downloads:

 # execute as root
 apt-get install gsoap voms globus globus-dev globus-doc

should do. You may be asked if these packages should be trusted if some are not signed. Also, you may be informed that some packages are needed but not installed. It is likely that those needed packages are automatically retrieved by the apt-get tool, aptitude does such, just add those missing packages to the command line.

Shared libraries cache refresh

 # echo "/opt/globus/lib" > /etc/ld.so.conf.d/globus.conf
 # ls -l /etc/ld.so.conf.d/globus.conf
 -rw-r--r-- 1 root root 16 2008-07-25 16:44 /etc/ld.so.conf.d/globus.conf
 # ldconfig 
 ldconfig: /opt/globus/lib/libltdl_gcc32dbg.so.3 is not a symbolic link
 ldconfig: /opt/globus/lib/libxmlsec1_openssl_gcc32dbg.so.1 is not a symbolic link

To build Globus packages for Debian or Ubuntu yourself, this Wiki has very simple to follow instructions.

See also