LHCb Software Training: Basics
This hands on tutorial should be followed by all newcomers to the LHCb software environment. It covers the following topics:
Prerequisites
The instructions for these exercises assume that they are being executed on the CERN Linux interactive cluster
lxplus
. However, except for some details of the interactive environment, the exercises should work on any machine where the LHCb software is installed.
To get an account on lxplus, please contact the
LHCb secretariat
. If you already have an lxplus account, but were previously working in another experiment, you should make sure that your account is registered as an LHCb account, in afs group
z5 (see
Exercise 1).
These instructions have last been checked against the
DaVinci v19r3
(and
Gaudi v19r4
) environment. Please use this version of
DaVinci or a more recent version.
Logging in to lxplus
Remote access to
lxplus
is via the
ssh
protocol. Open a connection using the command:
ssh -X <username>@lxplus.cern.ch |
The
-X
option is needed to enable
X11
connections when logging in to lxplus as a remote machine, to be able to open graphics windows and windows for the
emacs
editor. Alternatively, omit the
-X
option and include the following line in the ssh configuration file,
${HOME}/.ssh/config
:
Choice of shell
By default, accounts for LHCb users on lxplus are configured to use the
tcsh
shell, an enhanced version of
csh
, but LHCb setup scripts are also available for the Bourne family of shells:
sh
,
bash
,
ksh
,
zsh
. There are subtle and not-so-subtle differences in the functionality of the different shells, and for scripting there are major differences between (t)csh and Bourne-type shells. A useful short summary can be found in:
A user may redefine the default shell on lxplus by going to
https://cra.cern.ch
and changing the appropriate field. In general, unless you have a strong preference, stick to tcsh as this is better tested.
In what follows,
<script>.[c]sh
is used as shorthand for:
- csh,tcsh:
<script>.csh
- sh,bash,ksh,zsh:
<script>.sh
LHCb software environment
For users logging on to lxplus with an account in the LHCb group (
z5), two scripts are sourced automatically at startup:
-
/afs/cern.ch/group/z5/group_env.[c]sh
-
/afs/cern.ch/group/z5/group_login.[c]sh
The only action of the first group script is to source the script:
-
/afs.cern.ch/lhcb/scripts/lhcbsetup.[c]sh
This sets a number of environment variables, mainly identifying directories relevant for working with LHCb software, and creates some aliases. The main action of the second script is to source the script:
-
/afs.cern.ch/lhcb/scripts/CMT.[c]sh
This defines some additional environment variables and aliases, and performs the setup for the Configuration Management Tool (
CMT
), used in LHCb for building software and for environment setup.
When not working on lxplus, but on a Linux machine where
AFS
is available, the standard LHCb environment may be set using:
source /afs.cern.ch/lhcb/scripts/lhcbsetup.[c]sh source /afs.cern.ch/lhcb/scripts/CMT.[c]sh |
LHCb flavour of emacs editor
Customisations to emacs designed to help LHCb users have been written, and are described in the LHCb
emacs user guide
. Even for users who prefer a different editor (
vim
,
nedit
,
Joe's own editor
,
pico
,
nano
, or
other
), the LHCb flavour of emacs is useful for creating skeleton algorithms and similar.
IMPORTANT : If you are working over a slow network connection, it is sometimes useful to work in a 'terminal mode', where a new X window is not opened. This can be done by running
In this mode, the menu bars are not useful, so it is good to know a few key-strokes :
- Open a file with control-X control-F
- Save a file with control-X control-S
- Exit with control-X control-C
Exercise 1
This is a simple exercise just to check your environment setup.
- Login to lxplus.
- Check that X11 connections are enabled, for example by displaying a clock:
- Check the group associated with your account by giving the command: For an account in the LHCb group the result should be z5.
- Check which shell you are using, for example using: or and optionally change your default shell by going to https://cra.cern.ch
(you will need to login using your CERN NICE
username and password)
- Check that the standard LHCb environment has been set. If it has, then you will have seen a message at login time about your CMT
settings, and the result of the command: should contain a number of variables with prefix LHCB
(LHCBHOME
, LHCBRELEASES
and so on), and with prefix CMT
(CMTROOT
, CMTCONFIG
and so on). If this isn't the case then you need to setup the environment yourself.
- Setup the LHCb flavour of emacs and (optionally) the EDT keyboard emulation by adding the following lines to the
~/.emacs
file: (load (expand-file-name "$EMACSDIR/lhcb")) (load (expand-file-name "$EMACSDIR/edt")) |
or LHCb specific emacs commands are documented in the LHCb emacs user guide
.
Organisation of LHCb software
LHCb software is based on a framework called
Gaudi
. This framework provides functionality useful in a wide range of contexts, such as file access, histogramming, message printing, and run-time configuration. The run-time configuration is based on job options.
Software in LHCb is developed in the context of
CMT packages and
CMT projects.
- A package is a set of files, organised according to some directory structure, which provides some well-defined, circumscribed functionality. It is the basic unit of software development, meaning that all the files in a given package are tagged with the same version number and are released together. Changing just one file in the package implies releasing a new version of the whole package. Most LHCb packages contain code that can be compiled into one or more libraries.
- Packages with related functionality are collected in groups (or are said to be placed under a hat). The purpose of package groups is simply to group together, in the directory tree, all packages that are related in some way (for example all event model packages under the
/Event
hat, or all tracking packages under the /Tr
hat).
- A project is a set of packages that are grouped together according to some functionality, for example all Gaudi packages (
Gaudi
project), most public LHCb header files and base class libraries (LHCb
project), all reconstruction packages (Rec
project), all packages specific to the simulation application (Gauss
project). All packages within a given project are released as a single unit: changing just one package in a project implies a releasing a new version of the whole project. Choosing a given version of a project automatically selects a single version of each of the packages in the project.
Projects and packages have an associated version number, of the form
vXrY
, with
X
and
Y
integers, and this is incremented when changes are made in package contents. The hierarchy of the software organisation is then:
PROJECT/PROJECT_<version>/<package group>/<package>/<package version>
. For example
ls $LHCBRELEASES/DAVINCI/DAVINCI_v19r3/Phys/DaVinci/v19r3/ |
Sub-directories typically found in an LHCb package include:
-
src
: source code;
-
<package>
(Sub-directory with same name as package): header files;
-
options
: job options associated with the package;
-
cmt
: information for CMT
processing;
-
doc
: package documentation.
Package development in LHCb is carried out using the Concurrent Versions System (
CVS
). This defines a code repository, provides dedicated commands for moving code in and out of the repository, and keeps a record of all changes made. With the standard LHCb environment, the repository is identified by the variable
CVSROOT
, and a user can obtain a copy of any package in the repository using the command:
getpack <package group>/<package> <version> |
If no arguments are specified, a list of all available packages is printed, and if the version is omitted a list of all available versions for the specified package is given. The latest version of a package, (not necessarily a working version!) is always available as the package
head. Access to the CVS repository requires
kerberos
authentication. This is is automatic on lxplus, but for access over AFS from another machine a kerberos ticket must be obtained. This can be done using the command:
kinit <username>@cern.ch -tmp |
or on some systems
kinit.krb <username>@cern.ch |
where <username> is the username on lxplus, and the corresponding password will be requested.
Check what tokens you have and when they will expire with the command
or on some systems
(These are usually actually the same command).
New versions of packages, and new packages, are placed in the release area after testing, or may be available in the development area as works in progress. The standard LHCb environment includes the variables:
-
LHCBRELEASES
: release area;
-
LHCBDEV
: development area.
Instructions for making package modifications and new packages available from the release area can be found in the documentation for
How to use Gaudi under CMT
.
Web access is available to both the
CVS repository
and the
release area
.
LHCb applications
The LHCb applications are projects that use the
Gaudi
framework to perform specific tasks. There is a project specific to, and with the same name as, each application. The main applications are:
- Gauss
: event generation, using Pythia
and EvtGen
, and detector simulation based on Geant4
;
- Boole
: simulation of detector response (digitisation)
- Brunel
: event reconstruction
- DaVinci
: event selection and data analysis
- Panoramix
: graphical display of detector and event information
Additional projects contain software that is shared by several applications. The main ones are:
- LHCb
: base classes and public include files
- Lbcom
: components common to all applications
- Hlt
: components for online (HLT) reconstruction
- Rec
: components for online (HLT) and offline reconstruction
- Phys
: components for online (HLT) and offline analysis
- Analysis
: components for offline analysis
Exercise 2
This exercise is designed to give you some familiarity with the LHCb software organisation. It's fairly open-ended, in that you could spend a very long time exploring all of the LHCb packages. Until you're trying to find something - which is when knowing the organisation is a big help - it's probably not useful to spend more than 5-10 minutes looking around.
- Move to the LHCb release area: either or use the web interface to the release area: http://cern.ch/LHCb-release-area/
- Note the different project directories, and enter the project directory for any one of the projects.
- Note the different project versions available, and move to the directory for the latest.
- Note the different package groups, and have a look in two or more of these at what packages are available.
- Choose any two packages, and for each in turn: move to the directory for the latest version, look at which sub-directories are defined, and have a look at the files that are in each.
- Look also at the development area:
Configuration Management Tool
CMT
is used to specify the compilation and execution environment for packages within a project, the dependencies between projects and between packages, and how each package is built (for example, how to produce executables or libraries from source code). To allow this, a CMT package must have a
cmt
sub-directory containing a
requirements
file, where the relevant instructions are given. All LHCb packages developed in the Gaudi framework are CMT packages.
For a package with name
myPackage
, CMT itself defines an environment variable
MYPACKAGEROOT
that contains the package location.
The full set of commands that can be used in a
requirements
file is described in the
CMT manual
. Some of the most useful commands are shown below.
# Lines beginning with the symbol # are treated as comments.
# Blank lines are ignored.
#==================================================================
# The following commands describe a package.
# Package name.
package myPackage
# Package version, should correspond to CVS tag.
version v1r0
# Structure, i.e. directories to process.
# A package must always have:
# a cmt directory (for the requirements file)
# a doc directory (for the release.notes file)
# It may have:
# a src directory for source files (.h, .cpp)
# a options directory for job options (.opts)
branches cmt doc src options
#==================================================================
# The following command specifies packages which the current package
# needs for compilation and linking. Put as many lines as necessary.
# The keyword is followed by package name and version and,
# where applicable, the package group in which the package is placed.
# The version can usually be v* (i.e. any version) because the exact version
# is usually determined by setting the environment in which a package is built
use DaVinciTools v* Phys
#==================================================================
# The following commands tell CMT what to build and how
# Build application with the specified file
application myApp ../src/myApp.cpp
# Build a library with the specified files
library myLib ../src/*.cpp
# Define link options and environment variables for loading component library
apply_pattern component_library library=myLib
# Define link options and environment variables for loading linker library
apply_pattern linker_library library=myLib
#==================================================================
# The following commands define symbols (variables and aliases).
# They consist of a keyword followed by a symbol name, and possibly by a value.
# Previously defined symbols, including environment variables, can be used in
# value assignments by placing brackets - () or {} - around them and a $ in front.
# Treat symbol as a Make macro definition.
# This can be used to define a symbol as a local variable
# (i.e. one that won't be visible from the shell).
macro MYCMTDIR $(HOME)/cmtuser
# Treat symbol as an environment variable.
# The symbol will be visible from the shell after executing SetupProject
set LCGDIR /afs/cern.ch/sw/lcg
# Treat symbol as a path variable.
# The symbol will be visible from the shell after executing SetupProject
# When appending or prepending an item to a path variable not initialised in the
# requirements file, it's a good idea to first remove the item, to ensure it occurs only once.
# Set path.
path PYTHONPATH $(HOME)/python
# Append item to path.
path_append PYTHONPATH $(HOME)/myPythonPackage
# Remove item from path.
path_remove LD_LIBRARY_PATH $(HOME)/lib
# Prepend item to path.
path_prepend LD_LIBRARY_PATH $(HOME)/lib
# Create alias.
# The alias will be visible from the shell after executing SetupProject
alias myApp $(MYPACKAGEROOT)/$(CMTCONFIG)/myApp.exe
#==================================================================
|
Most CMT commands, but not all, operate on
requirements
files, and must be given from a package's
cmt
directory. If a
requirements
file includes lines indicating that other packages are used, then these packages will be searched for in locations specified by the project environment, and the
requirements
files of each used package is processed in turn.
The
CMT manual
should be consulted for the full set of CMT commands, but a useful subset is given below.
- Perform package configuration - that is, create setup files and Make files: Note that when a copy of a package is obtained using
getpack
the package is configured automatically.
- After configuration, Make commands can be used from the
cmt
directory:
- Build the package binaries: The output of the build operation is placed in a package sub-directory. The name of this sub-directory is given by the value of the environment variable
CMTCONFIG
.
- Delete the package binaries:
- Delete all generated files, including copies made to the project
InstallArea
- Show all packages used by the current package:
- Execute
<shell command(s)>
in cmt
directory of current package and all used packages within the same project: cmt broadcast <shell command(s)> |
Multiple commands need to be separated by semicolons and enclosed in inverted commas. This can be useful, for example, to force rebuilding of all used packages in the user's area: cmt broadcast "cmt config ; gmake binclean ; gmake" |
- Execute an executable within the environment of the current project
To release a new package in the LHCb framework it must be imported into CVS and registered so that
getpack
can find it. Follow the instructions
here
.
When executing an application, the following two environment variables are very important:
-
PATH
: search path for executables;
-
LD_LIBRARY_PATH
: search path for shared libraries.
CMT (and the scripts associated to it, as will be seen in the next exercises) remove the need to set these variables explicitly. The environment in which you build and run an application is fully determined by choosing which
CMT project you wish to work with, and the value of the
CMTPROJECTPATH
environment variable.
Exercise 3
This exercise introduces basic CMT functionality and demonstrates the use of the LHCb flavour of emacs to create different kinds of files.
- Create a CMT project directory in which to work. This command creates a directory
~cmtuser/DaVinci_v19r3
, containing a file cmt/project.cmt
. Any directory tree containing this file is called a CMT project.
- Look at the value of
CMTPROJECTPATH
The setenvDaVinci
command also set up the search path for CMT projects
- Look at the
cmt/project.cmt
file that was created The setenvDaVinci v19r3
command placed you inside your new CMT project, and created a file that tells CMT that any CMT command issued from inside this directory tree should use the environment defined by the DAVINCI_v19r3 project, which is found somewhere on the CMTPROJECTPATH
(look for it!). If you want to work with a different application, or with a different version of DaVinci, simply re-issue the setenv<Project>
command, e.g. setenvBrunel v31r7
. Information about CMT projects, including tips and tricks for working with them, can be found here
- Create a new package in your
DaVinci_v19r3
project: cd ~cmtuser/DaVinci_v19r3 mkdir -p MyGroup/MyPackage/v1r0/cmt cd MyGroup/MyPackage/v1r0/cmt |
- Create a cmt requirements file Look at the generated file and add the line Save the file. The
use
statement that you added tells CMT that the files in this package will require to compile and link against files in the GaudiAlg
package.
- Try some basic CMT commands:
cmt show uses ls ls .. cmt config ls ls .. |
- Try creating some different types of files with emacs. Each time, make a small modification to the file and save it
emacs ../doc/release.notes & emacs ../src/MyAlg.h & emacs ../src/MyAlg.cpp & emacs ../src/MyPackage_dll.cpp & |
- Use CMT to compile the files you just created, and make a library out of them
gmake ls .. ls ../slc4_ia32_gcc34 ls ../$CMTCONFIG |
- Similarly, make the debug version of the library (i.e. containing debug symbols for use with a debugger)
setenv CMTCONFIG $CMTDEB gmake ls .. ls ../slc4_ia32_gcc34_dbg ls ../$CMTCONFIG |
Exercise 4
Running LHCb Applications
Exercise 5
Exercise 6
Exercise 7
Exercise 8
--
MarcoCattaneo - 18 Jul 2007