Project Road Map

A rough roadmap for the Inspire project, as envisioned by Tim Smith, Annette Holtkamp, Salvatore Mele, and Travis Brooks at SLAC on 5/12/2007 (as interpreted and added to by TB; the original notes are attached for the historical record). TB has added some further-out phases with less specific time goals.

1. May-August ComparisonSlacCernRecordMarkup: Compare the internal record structure of SPIRES and Invenio records for bibliographic items.

  • Leads on this are almost certainly Tibor and Mike.
  • Consider "external" files like Jobs and Inst, which are needed for authority control but might not migrate.
  • Work starts now, with a possible face-to-face meeting in early July.
    • Back and forth on twisting Invenio to store our data structures
  • Note the major decision point early on: if Invenio cannot store SPIRES records, we must consider other platforms. (9/2007) A transformation has been defined and works for most data/records, but not all. However, Invenio appears to be flexible enough to store SPIRES records; see: RecordComparison

2. August-October DevelopmentRecordLoading: Test loading of a large chunk (0.5M records) to test scalability. Not to be mistaken for any sort of test/demonstration system. Requires at least one new Invenio instance ready to store data. One instance is already available at SLAC, and this might be the most convenient.

  • Consider at this point the multiple mirror/master architecture (possibly just 2 test sites).
  • The purpose of this is to catch the case where Invenio modifications to accommodate SPIRES records do not scale appropriately. Again, this is a major decision point regarding the appropriateness of Invenio as the underlying system.

(9/2007) 2K records have been loaded and function in Invenio installations at SLAC and CERN. More might be possible, but more work is needed on the transformations in some special cases; see: RecordLoading

3. October ProjectWorkingMeetings to agree on the appropriateness of records (Oct 8-12).

4. Sept-Jan: Define desired inputs from repositories (journals, arXiv, ...) and determine their influence on data structures. Leads: Annette/Travis? + Invenio person?

5. Jan 08-Mar SearchSyntax: Can the SPIRES (fussy) search syntax be reproduced by Invenio?

  • Certainly not completely; however, the most used parts might be acceptably translated to Invenio search.
  • This is necessary for branding purposes, to make users comfortable with the new system (a choice will be offered).
    • Also note that survey results show that comments about SPIRES syntax were 70% negative and 40% positive (yes, some were both)
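
As a rough illustration of what "translating the most used parts" could look like, here is a minimal Python sketch. It assumes a tiny subset of SPIRES syntax (`find` followed by short keywords such as `a` for author and `t` for title, joined by `and`/`or`/`not`) and rewrites it as `field:value` terms of the kind Invenio search accepts. The keyword table, the function name `spires_to_invenio`, and the quoting of multi-word values are assumptions made for this example; they are not an agreed mapping or an existing Invenio feature.

```python
import re

# Hypothetical keyword table: SPIRES search keyword -> Invenio-style field name.
SPIRES_TO_INVENIO_FIELD = {
    "a": "author",
    "author": "author",
    "t": "title",
    "title": "title",
    "j": "journal",
    "k": "keyword",
}

# Boolean operators that separate clauses. Note the limitation: a value that
# itself contains " and " or " or " will be split in the wrong place.
BOOLEAN_SPLIT = re.compile(r"\s+(and\s+not|and|or|not)\s+", re.IGNORECASE)


def spires_to_invenio(query):
    """Translate a simple SPIRES 'find ...' query into field:value terms.

    Raises ValueError for anything outside the small supported subset.
    """
    match = re.match(r"(?:find|fin|f)\s+(.*)", query.strip(), re.IGNORECASE)
    if not match:
        raise ValueError("not a SPIRES find query: %r" % query)

    # With a capturing group, re.split returns clause, operator, clause, ...
    parts = BOOLEAN_SPLIT.split(match.group(1))
    terms = []
    for index, part in enumerate(parts):
        if index % 2 == 1:
            # Odd positions hold the boolean operators; normalize spacing/case.
            terms.append(" ".join(part.lower().split()))
            continue
        pieces = part.split(None, 1)
        if len(pieces) != 2 or pieces[0].lower() not in SPIRES_TO_INVENIO_FIELD:
            raise ValueError("unsupported clause: %r" % part)
        field = SPIRES_TO_INVENIO_FIELD[pieces[0].lower()]
        value = pieces[1].strip()
        if " " in value:
            # Assumption: multi-word values are passed as a quoted phrase.
            value = '"%s"' % value
        terms.append("%s:%s" % (field, value))
    return " ".join(terms)


print(spires_to_invenio("find a ellis and t quark"))
```

Raising an error for unsupported clauses, rather than guessing, fits the idea that users are offered a choice: a query the translator cannot handle could fall back to the native Invenio search box.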

6. Jan 08 -> late 2008 ComparisonSlacFermilabDesyCernEnrichmentScripts: The SPIRES Collaboration currently has many (separate) scripts for enriching data, both fully automatic and manually assisted. Both the functionality and the ease of use of the user interface (in manual cases) should be reproduced in Invenio.

  • Manual (manually assisted) scripts must function well or data quality will suffer immediately!
  • Catalog of existing scripts (Perl and SPIRES protocol) and functionality
    • May be useful for SPIRES groups to consolidate scripts and especially SPIRES interfaces
    • Should consider whether SPIRES->Perl is a worthwhile intermediate step, since a SPIRES API exists in Perl.
  • Perl->Python translation (SPIRES->Invenio API) (or consider Perl Invenio API?) (Possible MC work)
  • SPIRES Protocol -> Python can only be done by a few cognoscenti...
  • CDS scripts must go through similar process

7. Jan 08-July 08 DevelopmentRecordDisplayFormats: Catalog and compare output formats from SPIRES and CDS. Invenio must reproduce SPIRES output formats. Use weblogs to determine heavily used formats.

  • Here no translation is possible; must go SPIRES -> functionality -> Invenio
Some initial work has already been done here, and Invenio can reproduce a few of the heavily used SPIRES formats.
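
To make the "use weblogs" step concrete, the following Python sketch counts how often each output format is requested. It assumes the requested format appears in the request URL as a query parameter named `FORMAT`; that parameter name, the function name `count_formats`, and the sample URLs are illustrative assumptions, not taken from the actual SPIRES logs.

```python
from collections import Counter
from urllib.parse import parse_qs, urlsplit


def count_formats(request_urls, param="FORMAT"):
    """Count how often each output format is requested.

    `request_urls` is an iterable of request URLs or paths taken from a web
    log. The parameter name is an assumption about how the format is encoded.
    Requests without the parameter are ignored.
    """
    counts = Counter()
    for url in request_urls:
        query = parse_qs(urlsplit(url).query)
        for value in query.get(param, []):
            # Fold case so "WWW" and "www" count as the same format.
            counts[value.lower()] += 1
    return counts


# Made-up sample requests, for illustration only.
sample = [
    "/spires/find/hep/www?rawcmd=find+a+ellis&FORMAT=WWW",
    "/spires/find/hep/www?rawcmd=find+t+quark&FORMAT=wwwbrief",
    "/spires/find/hep/www?rawcmd=find+a+smith&FORMAT=WWW",
    "/spires/find/hep/www?rawcmd=find+j+prl",
]
print(count_formats(sample).most_common())
```

Sorting by frequency gives a natural priority order for which SPIRES formats to reproduce in Invenio first.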

8. Mid-late 08: Create multiple mirror site architecture, distributed among 4 labs, redundantly, so that all can contribute/access data.

  • Load records from SPIRES, SPIRES formats

9. Late 08/early 09: Real test system. Begin to run concurrently with SPIRES, reading only from SPIRES.

  • Slowly moving maintenance tasks over to Inspire (rather than feeding from SPIRES)
  • Compare side-by-side, real world functionality

10. 2009: Release new system

  • Many bug fixes
  • Begin improvements to new system

-- TravisBrooks - 13 Jun 2007

