Running the online alignment in run 2

How to run the Alignment from the PVSS panel

Open the panel from an ui or plus machine:

ssh -Y lbgw

ssh -Y ui

/group/online/ecs/Shortcuts315/LHCb/ECS/ECS_UI_FSM.sh

Open the LHCb_Align tree (right click).

The firs thing to do is to write your name in the appropriate field and click on Reserve Alignment so that people know who to contact in case of need.

ALERT! Usually the farm should be taken by the run control in shared mode. if this is not the case you may have to:

  1. take the project: click on the grey open lock and choice "take";
  2. Allocate: click on the State menu and select Allocate.

panel.png

Then select the activity from the menu on the right of the Run Info panel. Normally, alignment activities are performed in this order:

  1. VELO
  2. Tracker
  3. RICH
  4. Muon

Now you should select the run range to use in the alignment.

To select the run range, click on the Choose Runs for Alignmnet button. Select the runs from the list and click Ok. The runs corresponding to the last fill are in the bottom of the tyable . It is possible to select runs from any other type of data for a given alignment activity (by default it proposes the data corresponding to the activity).

runsSelector.png

Now open the error log to monitor what is going on. On a shell, type:

errorLog LHCbA

Useful commands: Ctrl-S to block the screen, Ctrl-Q to resume, Ctrl-C to close.

Tips:

  • this is the error log for the whole LHCb-A online partition, so if errors not containing "Alig" appear, do not panic.
  • the color code is determined according to the content of the message: for example, messages containing contribution to hit error do not represent errors, but are still colored in red.

errorLog_yellow.png

You are now ready to run the alignment: turn on the Auto Pilot and wait that it does the Configure and Start run. This will take several minutes; when it will be done everything should be in ready and the autopilot should be off.

(to check- and modify in case- the function and the conditions that are used see sections "Function to be used" and "Alignment constants")

Running without the autopilot

From the LHCb_Align PVSS panel, click on the menu at the top and choose Configure. This can take a few minutes.

If the HLT subtree goes in Error state, you can open it and exclude the nodes or farms in error using the arrows in the Quick actions panel. To locate the troubling nodes, have a look at the PARTAlign tree.

HLT_running.png PARTAlign_running.png

Once the top tree is in state ready, you can click on Start run. Enjoy!

See and select runs available on the HLT farm to run the alignment

There is a buttom on the main panel "Run for alignment" Clicking on it a new window is open. It shows for the selected activity fill number, run number, number of events collected for that activity in that run ( This number is not precise to 1 event but the order is correct, and it should be read as at least. This list is available only for all the fill starting from 3962)

See which runs are used for the alignment

From the top panel click on RunInfo, then in the panel that appears click on Trigger. In the panel that appears the information can be found under Trigger.DeferredRuns

Software development in the satellite area (online)

AlignmentOnline is installed in the satellite area and a specific branch is devoted to development in there. The suggested workflow is the following

<make changes until sensible workpoint>
git add <files>
git commit -m '<a meaningful message>'
<make changes until sensible workpoint>
git add <files>
git commit -m '<a meaningful message>'
<ready to go to the main repository without breaking the nightlies>
git pull origin master
<fix any conflict (hopefully none) and check that things work as expected>
git push origin satellite
<open merge request from given link or gitlab webpage>

If some modification in Alignment is needed, the corresponding package can be developed with the workflow described in Git4LHCb:

git lb-use Alignment
git lb-checkout Alignment/master Some/Package
<make changes and test>
git add Some/Package/src/MyStuff.cpp
git commit -m 'fixing feature abc (JIRATICKET-123)?
git lb-push Alignment JIRATICKET-123

Donít forget to open a merge request in Alignment after pushing.

Compile the alignment framework online

The code used online for the alignment can be found in /group/online/dataflow/cmtuser/AlignmentRelease/ (Link to the appropriate AlignmentOnlineDev _vXYrZ, e.g. AlignmentOnlineDev _v11r0 for 2016 or AlignmentOnlineDev _v11r3 in 2017,). To recompile the code after having made some changes, login to plus as online (ui does not work) and follow the prescription and troubleshooting help here:

You are now ready to compile (as online)

Function to be used

First, follow the instructions for the installation explained at the step before. The functions that can be used to run the tracker alignment are defined under

/group/online/dataflow/cmtuser/AlignmentRelease/Alignment/TAlignment/python/TAlignment/AlignmentScenarios.py

E.g. configureEarlyDataAlignment (for early 2012 alignment)

configureTrackerAlignment (default 2016/2017 alignment)

configureFirstTrackerAlignment (early runs 2016 alignment)

configureTrackerAlignmentITInternal (additional dof for internal layers of the IT)

The different elements that can be aligned are defined in the script Alignables.py that is present in the same directory. Here, TxTyTz = Translation around the pivot point in X, Y and Z axis (positive or negative), RxRyRz = Rotation around X, Y and Z axis.

The function that one want to run can be found in

/group/online/dataflow/cmtuser/AlignmentRelease/AlignmentOnline/AlignOnline/python/AlignmentConfigurations/TrackerAlignment.py

Just uncomment the related lines.

After modifying any of these scripts, do again do_install.

Alignment constants in LHCbCond.db and ONLINE.db

In addition to the [det]Global and [det]Modules xml files, the IT and TT also support an Elements.xml that contains values taken as constant over time, and thus are not rewritten by the automatic procedure. These parameters were used to be stored in LHCbCond.

Since run 1, the database logic has changed. The alignment constants are no longer stored in LHCbCond but in the Online database that does not have tags and only supports intervals of validity. On top of this, a database named CalibOff.db may be used to hold different tags: its behavior is the same of LHCbCond, but the structure is that of the Online database.

The /group/online/alignment folder holds the latest alignment results used by the HLT.

For the tracker, the important info are contained into

/group/online/alignment/IT/ITModules

/group/online/alignment/IT/ITGlobal

and similarly for TT and OT. If non stated explicitly, the latest version is used. To select a different version it is useful to look into this twiki page, where a (non-up-to-date) list can be found. Important: we do not align for ITModules thus we never update these xml.

To change the xml used, use the script newAlignmentStart.py like

python newAlignmentStart.py --IT versionModules versionGlobal

and similarly for IT and OT. With the option --dry-run, the script only print the version used without changing it.

Check and change the CONDDB and DDDB tags used by the Alignment

The CONDDB (LHCBCOND) and DDDB tags used by the LHCb_Align project are currently (April-May 2016) not automatically propagated from the tags used in the HLT. Therefore when there is a change in the tags used by the HLT, we need to update the tags used by the online Alignment as well.

Check the CONDDB and DDDB tags currently used by Alignment

On the LHCb_Align panel click on the "View.." button next to the Trigger Config

LHCbA_TrigConfView.png

The window that you now visualize, shows you the database tags currently used, but you cannot change them from this panel (see all the fields are grey and no "Save" button at the bottom of the window). To change the tags, see next point.

TrigConfig_view.png

Change the CONDDB and DDDB tags to be used by the Alignment

On the LHCb_Align panel click on "RunInfo"

LHCbA_RunInfo.png

and in the "RunInfo" panel, select Trigger Configuration, Create/Edit

RunInfo_CreateEdit.png

From this panel you can select the cond db and dddb tags needed, and Save the update with the buttom at the bottom of the window. (you will be asked to confirm your choice before the change is saved).

ALERT! NOTE: It is possible that you also have to change also the version on Moore to the latest version.

TrigConfig_edit.png

ALERT! NOTE: Once you have saved the new tags, for the give trigger configuration (PassThrough in this case), you need to select again the Trigger configuration (PassThrough in this case) in the LHCb_Align TOP level panel.

ALERT! NOTE: You can only choose the database tags which appear on the menu when you click on the arrow next to "Tags CONDDB (DDDB)". This is because you can only choose the tags which are already available for the HLT. (The HLT in order to make the tags available on the panels needs to put a snapshot into a specific location). For the Alignment we will always have to select a tag which is already available for the HLT, because we have to be consistent in the tags we use, we will never need a tag not already available for the HLT.

Monitoring the automatic procedure

The results of each iterations are saved, this includes xml files and histograms (more details below). For the moment the monitoring should be performed manually running few scripts.

Xml files location

  • The xml files used in the job while it is running are in /group/online/AlignWork/running
  • At the end of each iteration the results are copied in Iter1 (Iter2....), while the results of last iteration will remain in xml
  • All results present in the directory running are copied into a new directory located under /group/online/AlignWork/ACTIVITY/RunXXXX (ACTIVITY=Velo, Tracker, Muon XXX=run number) when the alignment converged or after the maximum number of iteration (currently 10). If you run the job twice on the same run and the same activity the results are overwritten * In this directory it is possible to find also the logfile of the alignment.
  • For each iteration, various subfolders are created, one per aligned subdetector. For each subdetector two sets of alignment results are created: [det]Global and [det]Modules.

  • If the job converged and the new alignment constants significantly differ by the others, they are copied in /group/online/alignment/XX (XX: Velo, TT, IT, OT) as vN.xml (N=previous version+1)
  • The alignment job for your activity uses the latest version in /group/online/alignment/XX for the subdetectors that you are aligning for.
  • In some panel of PVVS one can see (and set) the version that should be used in hlt. Currently it is done manually, in future will be automatic.
  • For each run all the information about calibration and alignment version used in hlt can be found in /group/online/hlt/conditions/LHCb/2015/XXXX (XXXX=run number)
    • xx.xml (XX: Velo, TT, IT, OT) contains the link to the version used
    • When a run is ready for hlt2, the alignment/calibration constants has to be copied to be used offline. An hlt task is run and a temporary directory /group/online/hlt/conditions/LHCb/2015/XXXX/OfflineConds is created. It contains the xml with the full list of calibration and alignment constants (with the "offline" header/footer). An additional script copy these xml files to the online db.
    • The constants are available in the online.db for all the runs declared ready for hlt2. Time validity time of the start of the run to the next run with valid constants (if no further run, infinite).

    • The location of the Survey info can be found in: /group/online/dataflow/cmtuser/AlignmentRelease/Alignment/SurveyConstraints.py and it is called by AlignmentScenarios.py (in the same directory)

Histograms location

The histograms are produced automatically when running the alignment. We have one root file for each iteration, they are at: the histograms are at /hist/Savesets/"YEAR"/LHCbA/ Nomenclature convention: ex. /hist/Savesets/2015/LHCbA/AligWrk_Muon/07/10/AligWrk_Muon-15691601-20150710T100630-EOR.root

  • Muon is the name of the activity (Muon, Tracker or Velo)
  • 156916 is the first run number in the run list
  • 01 is the number of iteration
  • 20150710T100630 is the time when the file has been generated.

To run root on plus:

/group/online/dataflow/cmtuser/AlignmentRelease/build.x86_64-centos7-gcc62-opt/run bash

or (not working in 2017):

lb-run root root.exe FILENAME.root

Location of Automatically produced plots

A pdf containing monitoring and convergency plots is produced after the alignment (only a pdf is produced per run analysed). If the alignment needs updating or if the variation of the alignment constants is too big the pdf is uploaded to the logbook. If no alignment update is needed the pdf is not attach to the logbook but only a message appears. All the pdfs can be found here. The naming convention is FIRST_RUN_NUMBER.pdf if the update was not triggered or vVERSION_FIRST_RUN_NUMBER.pdf

Make the Automatically produced plots yourself

To make the automatically plroduced plots you can use the scripts in Alignment/AlignmentMonitoring/scripts/.

  • moniPlots.py to produce the plots for the Velo
  • moniPlots_Tracker.py to produce the plots for the Tracker
  • moniPlots_Muon.py to produce the plots for the Muon

To see how to use them call them with the -h option.

E.g. for the Tracker use: /group/online/dataflow/cmtuser/AlignmentRelease/build.x86_64-centos7-gcc62-opt/run bash

/group/online/dataflow/cmtuser/AlignmentDev_v11r4p1/Alignment/AlignmentMonitoring/scripts/moniPlots_Tracker.py -r 194913 --alignlog /group/online/AligWork/Tracker/run194913_v3/alignlog.txt --histoFiles /hist/Savesets/2017/LHCbA/AligWrk_Tracker/07/14/AligWrk_Tracker-19491418-20170714T101919-EOR.root(OLDFILE) /hist/Savesets/2017/LHCbA/AligWrk_Tracker/07/14/AligWrk_Tracker-19491425-20170714T183340-EOR.root(NEWFILE) -o test.pdf

Make trend plots

To make the trend plots you can use the scripts Alignment/AlignmentMonitoring/scripts/trendPlots.py call it with the option -h to see the options available. If you want to produce the "standard plots " for Velo, Tracker and Muon you can use the script Alignment/AlignmentMonitoring/examples/makePublicityPlots.sh, it just calls the previous one with the appropriate options. You may want to open it and change some options like the min and max run number or the date in the label.

lb-dev Alignment v11r0
cd AlignmentDev_v11r0
git lb-use Alignment
git lb-checkout Alignment/master Alignment/AlignmentMonitoring
make configure
make -j8
./run Alignment/AlignmentMonitoring/examples/makePublicityPlots.sh

Update reference histograms in the presenter

Instructions to change the reference histograms in the presenter can be found in this page.

Using the hlt1 conditions instead of the ones in the online.db

If you want to run on plus using the same hlt1 (and hlt2) conditions and not the one in the online.db you should use add the following options (example for a Brunel job). Main reasons: runs not yet processed by hlt2 or check the constants in the online.db are the same ones used in hlt.

#Need to check that the CondDBtag and DDDBtag were the same used during the data taking.
Brunel().DDDBtag = "dddb-20150526"
Brunel().CondDBtag = "cond-20150617"

from Configurables import CondDB
conddb = CondDB()
conddb.IgnoreHeartBeat = True
conddb.EnableRunChangeHandler = True

#The following 2 lines should be uncommented only for data not processed yet by hlt2
#conddb.UseDBSnapshot = True
#conddb.Tags['ONLINE'] = 'fake'

from Configurables import MagneticFieldSvc
MagneticFieldSvc().UseSetCurrent = True

try:
    import AllHlt1
except ImportError:
    import sys
    rd = '/group/online/hlt/conditions/RunChangeHandler'
    sys.path.insert(0, rd)
    print sys.path
    import AllHtl1

conddb.RunChangeHandlerConditions = AllHlt1.ConditionMap

Known issues and workaronds

  • The folder running should not be present in the /group/online/AligWork directory before the start of the alignment. If it remained there from previous unsuccessful tests it should be removed. To remove it one must log in as online user. Changing the name of the directory also works

  • It is possible that the job become stuck with the PARTAlign_Master in running and the various nodes (e.g. HLTA05_A) in ready. In this case from the top tree select again start run

  • If only few nodes are in the "Included Nodes and Removed Nodes" in the panel Quick Actions of the window LHCbA _HLT: TOP, try to DEALLOCATE and then ALLOCATE again from the Top panel

  • The histograms created before 21st July have a bug for all iterations except the first one (the histograms were not reset before filling in the second (or more) iterations, this results in histograms including entry of first iteration and second iteration (and all the previous for the other iterations). This problem was fixed on 21st July, thus re-running would produce the correct histograms.

Create a new release area

A new release area can be created with the following steps.

First a clone of the project is needed:

user@plus $ cd /group/online/dataflow/cmtuser
user@plus $ git clone ssh://git@gitlab.cern.ch:7999/lhcb/AlignmentOnline.git
user@plus $ mv AlignmentOnline AlignmentOnlineDev_vXrY
user@plus $ cd AlignmentOnlineDev_vXrY
user@plus $ git checkout satellite
user@plus $ lb-project-init
user@plus $ git update-index --assume-unchanged CMakeLists.txt
user@plus $ chmod g+rw ../AlignmentOnlineDev_vXrY # to allow colleagues to make changes in the directory

Then modify the CMakeLists.txt by adding the following lines at its top

set(OnlineVersion vUrV) # <-- Change this to the latest version of Online (or the one suggested by the Online team)
set(OnlineDev_DIR /group/online/dataflow/cmtuser/OnlineDev_${OnlineVersion}/InstallArea/$ENV{CMTCONFIG}) 

and the dependency on the OnlineDev release

gaudi_project(AlignmentOnlineDev vXXrY
              USE OnlineDev ${OnlineVersion}
                  AlignmentOnline vXXrY)

You are now ready to compile (as online):

online@plus $ dataflowcmt
online@plus $ cd $User_release_area
online@plus $ export CMTPROJECTPATH=/group/online/dataflow/cmtuser:$CMTPROJECTPATH
online@plus $ cd AlignmentOnlineDev_vXXrY
online@plus $ rm -rf build.x*                                #(carefully)
online@plus $ do_configure
online@plus $ do_install
online@plus $ cmsetup                                        #(one should call this after every installation) 

Beware that this procedure could fail with CMake failing to find OnlineDev project. This may happen if /group/online/dataflow/cmtuser is not in the CMTPROJECTPATH. To fix this

online@plus $ export CMTPROJECTPATH=$CMTPROJECTPATH:/group/online/dataflow/cmtuser

Once finished modify the link to the current and old version of AlignmentOnline:

user@plus $ rm AlignmentRelease_old
user@plus $ mv AlignmentRelease AlignmentRelease_old
user@plus $ ln -s AlignmentOnlineDev_vXrY AlignmentRelease

Troubleshooting

A typical error is due to CMake missing boost dependencies. This happens as an error at configuring time. To solve it, simply add the following lines to searchPath.cmake:

set(CMAKE_PREFIX_PATH "/group/online/dataflow/cmtuser" "/group/online/dataflow/SwData" "/cvmfs/lhcb.cern.ch/lib/lhcb/LBSCRIPTS/LBSCRIPTS_v8r3p1/LbRelease/data/DataPkgEnvs" "/cvmfs/lhcb.cern.ch/lib/lhcb/LBSCRIPTS/LBSCRIPTS_v8r3p1/LbUtils/cmake" "/cvmfs/lhcb.cern.ch/lib/lhcb" "/cvmfs/lhcb.cern.ch/lib/lcg/releases" "/cvmfs/lhcb.cern.ch/lib/lcg/app/releases" "/cvmfs/lhcb.cern.ch/lib/lcg/external" "/cvmfs/lhcb.cern.ch/lib/lhcb" "/cvmfs/lhcb.cern.ch/lib/lcg/releases" "/cvmfs/lhcb.cern.ch/lib/lcg/app/releases" "/cvmfs/lhcb.cern.ch/lib/lcg/external" ${CMAKE_PREFIX_PATH})
list(INSERT CMAKE_PREFIX_PATH 0 "$ENV{User_release_area}")

It may sometimes happen that the above recipe won't work. In that case manual compilation of the opt and dbg executables is possible:

online@plus $ make purge
online@plus $ export OnlineDev_DIR=/group/online/dataflow/cmtuser/OnlineDev_v5r24/InstallArea/x86_64-slc6-gcc48-opt # <-- this should point to the online version in CMakeLists.txt 
online@plus $ export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:${SW_LCG}/releases/LCG_79/Boost/1.55.0_python2.7/x86_64-slc6-gcc48-opt/lib # <-- this solves the boost compilation error
online@plus $ make -j 5 configure
online@plus $ make -j 5 install

n.b.: if SW_LCG is not defined, look for lcg in the configuration output (usually it is /cvmfs/lhcb.cern.ch/lib/lcg)

The same procedure needs to be repeated for the dbg executables by repeating it after logging in with

online@plus $ LbLogin -c $CMTDEB

beware to setup properly the environment variables to point to the dbg install area.

Finally, call

online@plus $ cmsetup

To setup the system.

See the logfiles

The logfile of the day is /clusterlogs/partitions/LHCbA/daq/LHCbA.log. To see all the messages for a particular node do something like: grep hlt02 /clusterlogs/partitions/LHCbA/daq/LHCbA.log Old logs are zipped and moved to the folder old; there you can do something like less /clusterlogs/partitions/LHCbA/daq/old/LHCbA.log.2016-05-11:03h.bz2 | grep hlt02 | less

Warnings and errors to shifters

In AlignmentOnline/AlignOnline/src/AlignOnlineIterator.cpp it's define a DIM service to communicate the outcome of the alignment procedure and raise warning and alarms.

Where in the code

The DIM service publish the content of the string m_align_message that has the following structure: "SeverityLevel: Subdetector message | further informations" where SeverityLevel can be INFO, WARNING or ERROR and the part = | further informations= is optional. In case of WARNING or ERROR an email will be sent to lhcb-onlinealignmentcalibration@cernNOSPAMPLEASE.ch; in case of ERROR a message will also appear in the alarm screen and require immediate action. In the alarm message only the Subdetector message will appear while the email will contain also the further informations. The error message will disappear at the next run of the alignment on the same subdetector when an INFO message is found.

How to see the dim service

From a machine of in the online network do:

did -dns=hlt01

A window will appear, there in the View menu select Servers by Node and select hlt02. If you are running the alignment, among the services you should see LHCbA_HLT02_AligDrv_0, click on it and select Services. From the list select /Publisher/AlignmentMessenger and click on View/Send. A new window will open where you can see the string writen from AlignOnlineIterator.cpp. Click on Subscribe (On Change).

Use the debugger

as online user ssh into hlt02

ssh -Y online@hlt02

Follow the instructions above for running the alignment online until the job is configured. Just after having started the analysis

ps x | grep AligDrv

to find the pid of the iterator. Then

/home/beat/scripts/debug --pid [pid]

When the debugger finished configured press c to make the process continue. In case of crash one can have the information on what caused the crash by typing where.

Avoid to insert the online password many times

To avoid inserting the online password many times it is possible to create a private/public key pair following the instructions in http://cvs.web.cern.ch/cvs/howto.php#accessing-sshlinux

Just once one has to do:

ssh-keygen --help
cp .ssh/id_rsa.pub .ssh/authorized_keys
ssh-agent > ~/.agentinfo
. ~/.agentinfo
ssh-add

and then each time:

eval `ssh-agent`
ssh-add

-- ElenaGraverini - 2015-06-12

  • runs4Align.py.txt: script for knowing which runs for each fill are available on the HLT farm</verbatim>
Topic attachments
I Attachment History Action Size Date Who Comment
PNGpng HLT_running.png r1 manage 133.3 K 2015-06-12 - 15:33 ElenaGraverini Cropped screenshots
PNGpng LHCbA_RunInfo.png r1 manage 101.0 K 2016-05-24 - 15:23 LuciaGrillo LHCbA TOP panel with the "RunInfo" button highlighted
PNGpng LHCbA_TrigConfView.png r1 manage 102.9 K 2016-05-24 - 15:11 LuciaGrillo LHCbA TOP panel with the button to view the Trigger Config highlighted
PNGpng LHCbAlign_active.png r2 r1 manage 156.9 K 2015-07-16 - 16:01 SilviaBorghi Cropped screenshots
PNGpng PARTAlign_running.png r1 manage 55.1 K 2015-06-12 - 15:33 ElenaGraverini Cropped screenshots
PNGpng RunInfo_CreateEdit.png r1 manage 70.8 K 2016-05-24 - 15:31 LuciaGrillo LHCbA RunInfo panel with Trigger Config create/edit button highlighted
PNGpng TrigConfig_edit.png r1 manage 127.0 K 2016-05-24 - 15:32 LuciaGrillo LHCbA Trigger Config Create/Edit panel
PNGpng TrigConfig_view.png r1 manage 113.8 K 2016-05-24 - 15:22 LuciaGrillo LHCbA Trigger Config View pannel
PNGpng errorLog_red.png r1 manage 40.6 K 2015-06-12 - 15:33 ElenaGraverini Cropped screenshots
PNGpng errorLog_yellow.png r1 manage 112.0 K 2015-06-12 - 15:33 ElenaGraverini Cropped screenshots
PNGpng panel.png r3 r2 r1 manage 77.1 K 2015-07-29 - 11:28 GiulioDujany first panel
Texttxt runs4Align.py.txt r1 manage 2.4 K 2015-06-18 - 16:22 GiulioDujany script for knowing which runs for each fill are available on the HLT farm
PNGpng runsSelector.png r2 r1 manage 38.0 K 2015-09-11 - 11:05 SilviaBorghi panel to select runs
Edit | Attach | Watch | Print version | History: r72 < r71 < r70 < r69 < r68 | Backlinks | Raw View | Raw edit | More topic actions...
Topic revision: r69 - 2017-07-14 - FrancescaDordei
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    LHCb All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright &© 2008-2019 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback