User Tools

Site Tools


igsn:namespaces

This is an old revision of the document!


Namespaces

Proposal for a hierarchical namespace governance model

Proposal

  1. Top-level namespaces shall be governed and assigned by IGSN e.V.
  2. Assigning Agents may apply for the allocation of three-character namespaces by IGSN e.V.
  3. Assigning Agents may extend their allocated namespaces into sub-namespaces by adding characters or numbers. The allocated namespaces and corresponding sub-namespaces shall be governed by the respective Assigning Agents.

Explanation

A governance model on the level of IGSN must meet three conditions:

  1. Flexibility to accommodate the needs of diverse scientific communities
  2. Assure uniqueness of assigned IGSN names
  3. Minimal human and technical communication overhead

These requirements are best met by hierarchical namespace governance 1)

Flexibility

A hierachical namespace governance model, as is now used by http://www.crossref.org/ and http://www.datacite.org, would satisfy all tree requirements. In fact, DataCite is already a sub-namespace of CrossRef, which in turn is a sub-namespace of http://www.handle.net. At present, IGSN has been assigned to the namespace 10273 of the Handle namespace. CNRI, the operator of handle.net, has delegated the governance of the sub-namespace 10273 to IGSN e.V.

Uniqueness and Namespace Exhaustion

In the context of IGSN namespaces it is required to support the 9-character syntax of SESAR and alternative naming conventions used by the major core repositories. At the current state of the discussion the standard prefix for a collection is a string of three characters. This rule allows for potentially 17576 (26^3) namespaces. Using the SESAR format of 3+6 characters, the following string of six characters (a-z, 0-9, excluding i and o) allows 1.55 x 10^9 samples per namespace, which should be sufficient for most purposes.

To assure unique names for long term operation of the system and to accommodate the requests from the core repositories a few points should be considered:

  1. Many collections are much smaller than 1,500 million samples. In most cases four characters would be sufficient, even if the name suffixes are only numbers.
  2. Using longer collection prefixes as sub-namespaces would multiply the number of collection namespaces by orders of magnitude while still providing a sufficient number of names for small collections of about 10,000 samples.

The proposed hierarchical namespace governance model would be similar in structure to the IP address space (IPv4) or to telephone numbers in Europe.

The image below shows the geographical distribution of the first digit of telephone area codes in Germany. Large cities with many subscriber lines have short area codes (e.g. Berlin = 030, Munich = 089, Hamburg = 040), while smaller cities have longer area codes (Potsdam = 0331, Freiburg = 0761), whereas small towns have long area codes (e.g. Groß Pankow = 033983). In turn, the number of digits of a local subscriber line is long (up to eight digits) in large cities, but may be much shorter in small towns.

Geographical distribution of telephone area codes in Germany

Source http://de.wikipedia.org/wiki/Telefonvorwahl

Communication Overhead

In a hierarchical namespace governance model the Allocating Agent does not need to negociate the allocation of sub-namespaces with IGSN e.V. but may solely negociate with its clients. This is analogous to the current practice in assigning DOI names, or handle names. By delegating parts of the namespace governance to Allocating Agents the communication overhead between Allocating Agents and IGSN e.V. will be minimised.

Mnemonic Names

Principal investigators applying to an Assigning Agent for a namespace might prefer mnemonic names. However, it is foreseeable that many mnemonic namespaces will soon be allocated. When this point is reached only non-mnemonic namespaces can be allocated. This situation is similar to “vanity plates” in car licence plates. For example, in Berlin all car licences in the namespace B-MW nnnn have already been allocated, this particular namespace has been exhausted. An Assigning Agent should take care to reserve “valuable” namespaces for their major clients (e.g. ODP, IODP, ICDP, …).

Legacy Namespaces

The current number of namespaces already assigned by SESAR is small enough to leave already assigned namespaces (legacy namespaces) in place.

Back to overview

1)
Bechtold, S. (2003), Governance in Namespaces, Loy. L.A. L. Rev., 36(3), 1239–1320, http://dx.doi.org/10.2139/ssrn.413681
igsn/namespaces.1350296243.txt.gz · Last modified: 2012/10/15 10:17 by jklump