Sites | Explanation |
---|---|
IN2P3-CC |
CRITICAL: CRITICAL METRIC FAILED [org.sam.WN-RepFree-ops]: CRITICAL: bad GlueSAFreeOnlineSize\n----------\nSA ops :\n GlueSAFreeOnlineSize = GB\n GlueSAStateAvailableSpace = 1048576000 KByte\n ACBRs for VO ops = VO:ops,ops\nERROR: bad GlueSAFreeOnlineSize: \n\n |
INFN-T1 |
CRITICAL: CRITICAL METRIC FAILED [org.sam.WN-RepFree-ops]: CRITICAL: bad GlueSAFreeOnlineSize\n----------\nSA ops :\n GlueSAFreeOnlineSize = GB\n GlueSAStateAvailableSpace = 116455098023 KByte\n ACBRs for VO ops = VO:ops,ops\nERROR: bad GlueSAFreeOnlineSize: \n----------\nSA ops:replica:online :\n GlueSAFreeOnlineSize = 116455 GB\n GlueSAStateAvailableSpace = 116455098023 KByte\n ACBRs for VO ops = VO:ops,ops\n\n |
SARA-MATRIX |
am91-46.gina.sara.nl: CRITICAL: CRITICAL METRIC FAILED [org.sam.WN-RepFree-ops]: CRITICAL: bad GlueSAFreeOnlineSize |
Taiwan-LCG2 |
w-wn0989: CRITICAL: CRITICAL METRIC FAILED [org.sam.WN-RepFree-ops]: CRITICAL: bad GlueSAFreeOnlineSize |
TRIUMF-LCG2 |
wn205.triumf.lcg: CRITICAL: CRITICAL METRIC FAILED [org.sam.WN-RepFree-ops]: CRITICAL: bad GlueSAFreeOnlineSize |
Sites | Explanation |
---|---|
FZK-LCG2 & pic |
The sBDII sanity check used in Nagios is more strict compared to the one in SAM. On Wednesday midday we asked Laurence F. to reduce the level of this test to raise error status and make it more similar to the one in SAM. He did it, and since then, it shows green in Nagios. |
NDGF-T1 |
There were two bugs in NCG: 1) certificate & key had to be provided to the check (missing in original version of Hash.pm) 2) LDAP discovery part of NCG was incorrectly extracting ports for SRMv2 endpoints (thus, wrong port was contacted and check was issuing CRITICAL). Fixed both issues in Nagios. |
INFN-T1 |
Nagios had a huge number of services configured (>3000) and was skipping some of them. We changed the profile to contain only the critical metrics and the problem was solved |