Gridpack files and the Frontier infrastructure



On the weekend of September 7th-8th, 2013, a workflow requested a large file (692 MB) many times, often aborting and restarting the requests. This put a heavy load on the CMS Launchpads and caused them to collapse. A study of the events follows.

Plots (images not reproduced here): Max Threads at Transaction Start; Ratio of Max size to Max Threads; Data Transferred [MiB]; Response Time; Transfer Rates.

Since every Launchpad is connected through a 1 Gbit/s link, each one could serve at most about 119 MiB/s in theory. In practice, protocol overhead reduces the effective bandwidth, and the plots show that none of the Launchpads reached that transfer rate during the weekend.
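The 119 MiB/s figure can be checked with a short calculation. The sketch below is illustrative only: it assumes the link rate is exactly 10^9 bit/s, that the 692 MB file size uses decimal megabytes, and it ignores all protocol overhead. The function and constant names are made up for this example.

```python
# Rough check of the theoretical link capacity quoted above.
# Assumptions (not from the original study): 1 Gbit/s = 10**9 bit/s,
# 692 MB = 692 * 10**6 bytes, and no protocol overhead.

LINK_BITS_PER_SECOND = 10**9   # one Launchpad's network link
MIB = 1024 ** 2                # bytes per MiB
FILE_BYTES = 692 * 10**6       # the large file requested by the workflow


def link_capacity_mib_per_s(bits_per_second: int) -> float:
    """Upper bound on throughput in MiB/s for a link of the given bit rate."""
    return bits_per_second / 8 / MIB


def min_transfer_seconds(file_bytes: int, bits_per_second: int) -> float:
    """Shortest possible time to move file_bytes over the link, in seconds."""
    return file_bytes * 8 / bits_per_second


capacity = link_capacity_mib_per_s(LINK_BITS_PER_SECOND)
seconds = min_transfer_seconds(FILE_BYTES, LINK_BITS_PER_SECOND)

print(f"Theoretical capacity: {capacity:.1f} MiB/s")
print(f"Minimum time for one full transfer: {seconds:.2f} s")
```

Under these assumptions a single complete transfer of the file would occupy a saturated link for several seconds, which gives a sense of scale for repeated requests; aborted requests may transfer less than the full file.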

Reset events

The timeline of machine restarts, as recorded in the Tomcat and Squid logs, is: To be included

Messages in the logs include:

INFO: The APR based Apache Tomcat Native library which allows optimal performance in production environments was not found on the java.library.path: /u
Sep 7, 2013 3:31:16 PM org.apache.coyote.http11.Http11Protocol init
INFO: Initializing Coyote HTTP/1.1 on http-8080
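A restart timeline could be assembled by pulling the timestamps of connector initialization lines like the one above out of the Tomcat log. The following is a minimal sketch. It assumes that an `Http11Protocol init` line marks a Tomcat start, that the timestamps use the format shown in the excerpt, and that month names are in English; `restart_times` is a hypothetical helper, not part of any existing tool.

```python
import re
from datetime import datetime

# Matches header lines such as:
#   Sep 7, 2013 3:31:16 PM org.apache.coyote.http11.Http11Protocol init
# Assumption: this line appears once per Tomcat start.
INIT_LINE = re.compile(
    r"^(\w{3} \d{1,2}, \d{4} \d{1,2}:\d{2}:\d{2} [AP]M) (\S+) init$"
)


def restart_times(lines):
    """Return the timestamps of HTTP connector init lines, in log order."""
    times = []
    for line in lines:
        match = INIT_LINE.match(line.strip())
        if match and match.group(2).endswith("Http11Protocol"):
            stamp = datetime.strptime(match.group(1), "%b %d, %Y %I:%M:%S %p")
            times.append(stamp)
    return times


sample = [
    "Sep 7, 2013 3:31:16 PM org.apache.coyote.http11.Http11Protocol init",
    "INFO: Initializing Coyote HTTP/1.1 on http-8080",
]
found = restart_times(sample)
print(found)
```

The same approach would need a separate pattern for the Squid logs, whose format is not shown here.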

