IGSN ID Metadata Recommendations
IGSN IDs are registered with metadata encoded in the DataCite Metadata Schema. The following guidance for IGSN ID metadata in the DataCite Metadata Schema was developed by the IGSN–DataCite Crosswalk and Metadata Management Working Group and sourced from the IGSN–DataCite Crosswalk Recommendation. DataCite, the IGSN e.V., and the Working Group will continue to expand and refine metadata best practices for IGSN IDs over time.
IGSN ID recommendations for mandatory properties in DataCite Metadata Schema
IGSN IDs must be registered with at least the six mandatory properties in the DataCite Metadata Schema.
1 Identifier
The identifier
property will be automatically populated with a DOI upon the creation of an IGSN ID. The system will assign a random DOI suffix unless a specific suffix is supplied using Fabrica or DataCite APIs.
2 Creator
The creator
property contains a list of “the main researcher(s) involved...in priority order.” For IGSN IDs, this could be the sample collector/creator, chief scientist, curator, or even the person who deposited the sample into a repository.
As a norm, IGSN ID registrants are expected to collect information about the sample owner, Principle Investigator, and/or otherwise. However, if no appropriate name is available, the property will be populated with the name of the IGSN ID Repository registrant or an appropriate standard value for unknown information from Table 11 of the DataCite Metadata Schema.
3 Title
The title
property should include appropriate elements that would help find and distinguish a sample. The exact syntax is at the discretion of the IGSN ID registrant. Appropriate elements might include:
- The basic form of the object that is registered. For example, polished section, core, pulp, solution, dredge haul in a box, lot, or piece of material.
- The material or materials that compose the sample. For example, water, granite, or tissue.
- Local sample identifiers.
We strongly recommend populating the title
property with an appropriate value to enhance discoverability. If no title information is available, there is the option to fill this property with an appropriate standard value for unknown information from Table 11 of the DataCite Metadata Schema.
4 Publisher
The publisher
property contains “the name of the entity that holds, archives, publishes prints, distributes, releases, issues, or produces the resource.” For IGSN IDs, this should be the organization registering the IGSN ID for the physical sample.
5 PublicationYear
The publicationYear
property should contain the year when the sample was first made available to the research community. This is likely to be the year at the time the physical sample is registered unless the sample was released before registration of its metadata record.
The DataCite Metadata Schema provides the following additional guidance for the publicationYear
property that is relevant to material samples:
- If the date of public availability cannot be determined, use the date of registration.
- If an embargo period has been in effect, use the date when the embargo period ends.
- If there is no standard publication year value, use the date that would be preferred from a citation perspective.
10 ResourceType
The resourceType
property may be populated with resource types from external ontologies or shared vocabularies. In the absence of an agreed vocabulary, the use of the terms “material sample” or “‘feature-of-interest” are strongly recommended to at least distinguish between these sampling concepts. A material sample is a specialization of a larger feature-of-interest, which is typically the collection site. For example, in the Geosciences, a feature-of-interest might be a lake, tree, cross-section, transect, or borehole.
10.a resourceTypeGeneral
The resourceTypeGeneral
property is “PhysicalObject” for all IGSN IDs.
IGSN ID recommendations for other properties in DataCite Metadata Schema
6 Subject
The free-text subject
property contains the "subject, keyword, classification code, or key phrase describing the resource." For IGSN IDs, this is the materials that compose the sample. Since materials may be categorized under different schemata, the sub-properties subjectScheme
and schemeURI
should also be included whenever possible.
7 Contributor
All institutions and people involved in a sample’s workflow—from collection to archival (or discarding/destruction)—may be captured in the free-text contributor
property. If contributor
is used, the sub-properties contributorName
and contributorType
are mandatory.
Note that to be included in the reference to a resource, a person or organization must be listed in a creator
property (and can then be additionally listed in a contributor
property). People and organizations listed only in a contributor
property are not included in the resource reference.
8 Date
The date
property may be used to log events relevant to the physical object. The date
property requires the dateType
sub-property, which must be entered from a controlled list. The "Collected" dateType
may be used to record when a sample was collected. At this time, there is no controlled list value for “Destroyed”, which may be pertinent to your material samples metadata. Pending the potential addition of an equivalent value, we advise using a date
property with dateType
“Other” and “Destroyed” in the dateInformation
field.
11 AlternateIdentifier
The alternateIdentifier
property may be used to enter any other identifiers for the material sample, including local sample identifiers. For alternate identifiers that were assigned by a researcher or within a project, “local” is the default recommended alternateIdentifierType.
12 RelatedIdentifier
Connecting samples to one another, and to research based on them, is a primary goal of the IGSN ID. Such connections are captured through relatedIdentifier
properties, which list the globally unique Identifiers assigned to related resources. It is therefore recommended that relatedIdentifier
is used to the maximum extent possible and is updated on a regular basis.
The mandatory relationType
sub-property describes relationships between the material sample for which the IGSN ID is being registered and related resources (features-of-interest, parent samples, subsamples, datasets, publications, etc.).
The relatedIdentifier
property can be used to make connections that mirror sample hierarchies. Because the parent IGSN ID is a key element in IGSN ID metadata, it is recommended that a child (sub)sample identifies its parent using the relationType
"IsPartOf" or "IsDerivedFrom". Vice versa, a parent sample can identify its children using "HasPart" or "IsSourceOf". Of these four relationTypes
, only "IsPartOf" and "HasPart" currently create Event Data visible in the REST API and GraphQL API. See Connections to Works.
Relationships to other IGSN IDs registered with DataCite services should use relatedIdentifierType
DOI
.
17 Description
It is valuable to include additional information about a sample, particularly about its "birth," in the free-text description
property. If the description
property is used, then the sub-property descriptionType
is mandatory. Values for the latter are selected from a controlled list, with the most relevant for IGSN IDs being:
- Abstract – Brief description of the resource and the context in which it was created.
- Methods – The methodology employed for the study or research. For IGSN IDs, this is the collection method.
Both of these are important for discovery purposes.
18 GeoLocation
The geoLocation
property is used to encode information on the "spatial region or named place where the data was gathered or about which the data is focused." The property can be repeated to indicate a number of different locations, and can express a location as a point, bounding box, or polygon, or simply as a free-text description through its (respective) sub-properties: geoLocationPoint
, geoLocationBox
, geoLocationPolygon
, and geoLocationPlace
.
For IGSN IDs, this property will contain where a sample was acquired relative to the Earth or another astronomical object. Note that it may not be relevant for samples that are ‘non-geographic’ (e.g., a synthetic material).
Example DataCite metadata following IGSN ID recommendations
DataCite XML
<?xml version="1.0" encoding="UTF-8"?>
<resource
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4/metadata.xsd">
<identifier identifierType="DOI">10.21384/SP0001</identifier>
<creators>
<creator>
<creatorName nameType="Personal">Miller, Elizabeth</creatorName>
<givenName>Elizabeth</givenName>
<familyName>Miller</familyName>
<nameIdentifier nameIdentifierScheme="ORCID" schemeURI="https://orcid.org">https://orcid.org/0000-0001-5000-0007</nameIdentifier>
</creator>
</creators>
<titles>
<title>PL04, Lapilli tuff, Rock</title>
</titles>
<publisher>Institute of Materials Science</publisher>
<publicationYear>2022</publicationYear>
<resourceType resourceTypeGeneral="PhysicalObject">Material sample</resourceType>
<subjects>
<subject subjectScheme="iSamples Material Sample Type Vocabulary" schemeURI="https://w3id.org/isample/vocabulary/specimentype/0.9">Physical specimen</subject>
</subjects>
<contributors>
<contributor contributorType="HostingInstitution">
<contributorName nameType="Organizational">DataCite</contributorName>
<nameIdentifier nameIdentifierScheme="ROR" schemeURI="https://ror.org">https://ror.org/04wxnsj81</nameIdentifier>
</contributor>
</contributors>
<dates>
<date dateType="Collected">2022-06-01</date>
<date dateType="Other" dateInformation="Destroyed">2022-06-15</date>
<date dateType="Issued">2022</date>
</dates>
<relatedIdentifiers>
<relatedIdentifier relatedIdentifierType="DOI" relationType="IsPartOf" resourceTypeGeneral="PhysicalObject">10.21384/sp0002</relatedIdentifier>
<relatedIdentifier relatedIdentifierType="DOI" relationType="IsReferencedBy" resourceTypeGeneral="Dataset">10.26022/ieda/112166</relatedIdentifier>
<relatedIdentifier relatedIdentifierType="URL" relationType="HasMetadata">https://data.datacite.org/application/citeproc+json/10.5072/example-full</relatedIdentifier>
</relatedIdentifiers>
<sizes/>
<formats/>
<version/>
<descriptions>
<description xml:lang="en" descriptionType="Abstract">Poorly sorted volcaniclastic deposit.</description>
<description xml:lang="en" descriptionType="Methods">Collected using rock corer.</description>
</descriptions>
<geoLocations>
<geoLocation>
<geoLocationPlace>Gulf of California</geoLocationPlace>
<geoLocationPoint>
<pointLongitude>-67.302</pointLongitude>
<pointLatitude>31.233</pointLatitude>
</geoLocationPoint>
<geoLocationBox>
<westBoundLongitude>-71.032</westBoundLongitude>
<eastBoundLongitude>-68.211</eastBoundLongitude>
<southBoundLatitude>41.090</southBoundLatitude>
<northBoundLatitude>42.893</northBoundLatitude>
</geoLocationBox>
<geoLocationPolygon>
<polygonPoint>
<pointLatitude>41.991</pointLatitude>
<pointLongitude>-71.032</pointLongitude>
</polygonPoint>
<polygonPoint>
<pointLatitude>42.893</pointLatitude>
<pointLongitude>-69.622</pointLongitude>
</polygonPoint>
<polygonPoint>
<pointLatitude>41.991</pointLatitude>
<pointLongitude>-68.211</pointLongitude>
</polygonPoint>
<polygonPoint>
<pointLatitude>41.090</pointLatitude>
<pointLongitude>-69.622</pointLongitude>
</polygonPoint>
<polygonPoint>
<pointLatitude>41.991</pointLatitude>
<pointLongitude>-71.032</pointLongitude>
</polygonPoint>
</geoLocationPolygon>
</geoLocation>
</geoLocations>
</resource>
DataCite JSON
{
"id": "https://doi.org/10.21384/sp0001",
"doi": "10.21384/SP0001",
"url": "https://app.geosamples.org/sample/igsn/3210000DH&header=1",
"types": {
"ris": "GEN",
"bibtex": "misc",
"citeproc": "article",
"schemaOrg": "CreativeWork",
"resourceType": "Material sample",
"resourceTypeGeneral": "PhysicalObject"
},
"creators": [
{
"name": "Miller, Elizabeth",
"nameType": "Personal",
"givenName": "Elizabeth",
"familyName": "Miller",
"nameIdentifiers": [
{
"schemeUri": "https://orcid.org",
"nameIdentifier": "https://orcid.org/0000-0001-5000-0007",
"nameIdentifierScheme": "ORCID"
}
]
}
],
"titles": [
{
"title": "PL04, Lapilli tuff, Rock"
}
],
"publisher": "Institute of Materials Science",
"container": {
"type": "Series",
"identifier": "10.82206/sp0002",
"identifierType": "DOI"
},
"subjects": [
{
"subject": "Physical specimen",
"schemeUri": "https://w3id.org/isample/vocabulary/specimentype/0.9",
"subjectScheme": "iSamples Material Sample Type Vocabulary"
}
],
"contributors": [
{
"name": "DataCite",
"nameType": "Organizational",
"affiliation": [],
"contributorType": "HostingInstitution",
"nameIdentifiers": [
{
"schemeUri": "https://ror.org",
"nameIdentifier": "https://ror.org/04wxnsj81",
"nameIdentifierScheme": "ROR"
}
]
}
],
"dates": [
{
"date": "2022-06-01",
"dateType": "Collected"
},
{
"date": "2022-06-15",
"dateType": "Other",
"dateInformation": "Destroyed"
},
{
"date": "2022",
"dateType": "Issued"
}
],
"publicationYear": 2022,
"identifiers": [],
"sizes": [],
"formats": [],
"rightsList": [],
"descriptions": [
{
"lang": "en",
"description": "Poorly sorted volcaniclastic deposit.",
"descriptionType": "Abstract"
},
{
"lang": "en",
"description": "Collected using rock corer.",
"descriptionType": "Methods"
}
],
"geoLocations": [
{
"geoLocationBox": {
"eastBoundLongitude": "-68.211",
"northBoundLatitude": "42.893",
"southBoundLatitude": "41.090",
"westBoundLongitude": "-71.032"
},
"geoLocationPlace": "Gulf of California",
"geoLocationPoint": {
"pointLatitude": "31.233",
"pointLongitude": "-67.302"
},
"geoLocationPolygon": [
{
"polygonPoint": {
"pointLatitude": "41.991",
"pointLongitude": "-71.032"
}
},
{
"polygonPoint": {
"pointLatitude": "42.893",
"pointLongitude": "-69.622"
}
},
{
"polygonPoint": {
"pointLatitude": "41.991",
"pointLongitude": "-68.211"
}
},
{
"polygonPoint": {
"pointLatitude": "41.090",
"pointLongitude": "-69.622"
}
},
{
"polygonPoint": {
"pointLatitude": "41.991",
"pointLongitude": "-71.032"
}
}
]
}
],
"fundingReferences": [],
"relatedIdentifiers": [
{
"relationType": "IsPartOf",
"relatedIdentifier": "10.21384/sp0002",
"resourceTypeGeneral": "PhysicalObject",
"relatedIdentifierType": "DOI"
},
{
"relationType": "IsReferencedBy",
"relatedIdentifier": "10.26022/ieda/112166",
"resourceTypeGeneral": "Dataset",
"relatedIdentifierType": "DOI"
},
{
"relationType": "HasMetadata",
"relatedIdentifier": "https://data.datacite.org/application/citeproc+json/10.5072/example-full",
"relatedIdentifierType": "URL"
}
],
"relatedItems": [],
"schemaVersion": "http://datacite.org/schema/kernel-4",
"providerId": "datacite",
"clientId": "datacite.qyacaz",
"agency": "datacite",
"state": "registered"
}
Updated about 1 year ago