UC-VO-ILDG-search

From EGI Knowledge Base

Use Case title: Search for scientific data

Short description: An LQCD researcher is looking for an ensemble of gauge configurations that exhibit specific scientific properties.

Actors involved: An LQCD ‘researcher’


Prerequisites: The researcher is a registered member of the ILDG VO.

Steps:

  1. Researcher opens the ILDG Browser.
  2. Researcher creates an XPath query that expresses the scientific properties they are looking for (following the QCDML schema for an ensemble). The ILDG Browser includes a query constructor module for users who are not familiar with XPath.
  3. Researcher submits query to regional grid Metadata Catalogues and waits for results.
  4. Researcher browses the query results and identifies an interesting ensemble.
  5. Researcher uses the ILDG Browser to retrieve a list of the logical file names (LFNs) of the configurations within that ensemble.
  6. Researcher generates a proxy certificate and then uses the ILDG ‘getURL’ client to contact the regional grid file catalogues and obtain a SURL (storage URL) for each of the configurations.
  7. Researcher uses the SURLs to download the configuration data to a local computer, for example using srmcp, globus-url-copy, or wget for the SRM, GSIFTP, and HTTP protocols, respectively. Note that, because of the size and number of the datasets and potential bandwidth constraints, this download step may take some time.
  8. Researcher performs analysis on retrieved datasets.
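
To illustrate the kind of XPath query built in step 2, the following is a minimal sketch using Python's standard library. The XML document and its element names (catalogue, ensemble, name, numberOfFlavours, beta) are hypothetical simplifications chosen for illustration; they are not taken from the QCDML schema, and a real query would be evaluated by the regional Metadata Catalogues rather than locally.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified metadata. The element names are
# illustrative only and are NOT taken from the QCDML schema.
CATALOGUE_XML = """
<catalogue>
  <ensemble>
    <name>ensemble-A</name>
    <numberOfFlavours>2</numberOfFlavours>
    <beta>5.29</beta>
  </ensemble>
  <ensemble>
    <name>ensemble-B</name>
    <numberOfFlavours>3</numberOfFlavours>
    <beta>5.29</beta>
  </ensemble>
  <ensemble>
    <name>ensemble-C</name>
    <numberOfFlavours>2</numberOfFlavours>
    <beta>5.40</beta>
  </ensemble>
</catalogue>
"""


def find_ensembles(xml_text, xpath):
    """Return the names of all ensembles matched by the XPath expression."""
    root = ET.fromstring(xml_text)
    return [element.findtext("name") for element in root.findall(xpath)]


# Two chained predicates: an ensemble matches only if both conditions hold.
QUERY = "./ensemble[numberOfFlavours='2'][beta='5.29']"

matches = find_ensembles(CATALOGUE_XML, QUERY)
print(matches)
```

ElementTree supports only a subset of XPath, which is enough for equality predicates like these; a query against real QCDML metadata would also need to account for the schema's actual element names and namespaces.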

Middleware/applications involved:

  • Web service containers (hosting regional grid MDCs and FCs).
  • Regional grid file catalogues (e.g. gLite File Catalogue or Globus Replica Location Service).
  • File transfer services/clients – SRM-compliant, GridFTP, and so on.
  • Bespoke ILDG clients (e.g. the ILDG Browser and the ‘getURL’ client used in the steps above).

Notable items

  • Different regional grids provide different data transfer protocols, as dictated by resource providers and historic decisions.
  • At the time of writing, no common standards exist for interfacing with file catalogues from different middleware stacks (most notably, gLite File Catalogue and Globus RLS).
  • At the time of writing, users have experienced poor bandwidth between Europe and Australasia. The level of network performance has, in some instances, made downloading ensembles of data impossible.
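
Because regional grids expose different transfer protocols, the client used in step 7 depends on the scheme of each returned URL. The sketch below shows one way to make that choice in Python. The mapping reflects only the three protocol/client pairs named in step 7; the function name client_for and the example.org URLs are hypothetical, and no command-line options are shown because they vary between client versions and sites.

```python
from urllib.parse import urlparse

# Scheme-to-client mapping, as listed in step 7 of the use case.
CLIENT_FOR_SCHEME = {
    "srm": "srmcp",
    "gsiftp": "globus-url-copy",
    "http": "wget",
}


def client_for(url):
    """Return the name of the transfer client suited to the URL's scheme."""
    scheme = urlparse(url).scheme.lower()
    if scheme not in CLIENT_FOR_SCHEME:
        raise ValueError(f"no transfer client known for scheme {scheme!r}")
    return CLIENT_FOR_SCHEME[scheme]


# Hypothetical URLs, one per protocol.
for url in (
    "srm://se.example.org/ildg/ensemble-A/cfg.0001",
    "gsiftp://se.example.org/ildg/ensemble-A/cfg.0001",
    "http://se.example.org/ildg/ensemble-A/cfg.0001",
):
    print(url, "->", client_for(url))
```

Raising an error for unrecognised schemes, rather than falling back to a default client, makes it obvious when a regional grid returns a URL type that the local setup cannot handle.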