Data Package Quality Report
PackageId: edi.114.8
Report Date/Time: 2017-11-16T05:11:30
Dataset Report
Columns: #, Identifier, Status, Quality Check Name, Description, Expected, Found, Explanation, Suggestion, Reference
1 packageIdPattern
Status: valid
Type: metadata
System: lter
On failure: error
Quality Check Name: packageId pattern matches "scope.identifier.revision"
Description: Check against LTER requirements for scope.identifier.revision
Expected: 'scope.n.m', where 'n' and 'm' are integers and 'scope' is one of an allowed set of values
Found: edi.114.8
2 emlVersion
Status: valid
Type: metadata
System: lter
On failure: error
Quality Check Name: EML version 2.1.0 or beyond
Description: Check the EML document declaration for version 2.1.0 or higher
Expected: eml://ecoinformatics.org/eml-2.1.0 or eml://ecoinformatics.org/eml-2.1.1
Found: eml://ecoinformatics.org/eml-2.1.1
Explanation: Validity of this quality report is dependent on this check being valid.
3 schemaValid
Status: valid
Type: metadata
System: knb
On failure: error
Quality Check Name: Document is schema-valid EML
Description: Check document schema validity
Expected: schema-valid
Found: Document validated for namespace: 'eml://ecoinformatics.org/eml-2.1.1'
Explanation: Validity of this quality report is dependent on this check being valid.
4 parserValid
Status: valid
Type: metadata
System: knb
On failure: error
Quality Check Name: Document is EML parser-valid
Description: Check document using the EML IDs and references parser
Expected: Validates with the EML IDs and references parser
Found: EML IDs and references parser succeeded
Explanation: Validity of this quality report is dependent on this check being valid.
5 schemaValidDereferenced
Status: valid
Type: metadata
System: lter
On failure: error
Quality Check Name: Dereferenced document is schema-valid EML
Description: References are dereferenced, and the resulting file validated
Expected: schema-valid
Found: Dereferenced document validated for namespace: 'eml://ecoinformatics.org/eml-2.1.1'
Explanation: Validity of this quality report is dependent on this check being valid.
6 keywordPresent
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: keyword element is present
Description: Checks to see if at least one keyword is present
Expected: Presence of one or more keyword elements
Found: 9 'keyword' element(s) found
7 methodsElementPresent
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: A 'methods' element is present
Description: All datasets should contain a 'methods' element, at a minimum a link to a separate methods doc.
Expected: presence of 'methods' at one or more xpaths.
Found: 1 'methods' element(s) found
Reference: EML Best Practices, p. 28
8 coveragePresent
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: coverage element is present
Description: At least one coverage element should be present in a dataset.
Expected: At least one of geographicCoverage, taxonomicCoverage, or temporalCoverage is present in the EML.
Found: 1 'coverage' element(s) found
9 geographicCoveragePresent
Status: info
Type: metadata
System: lter
On failure: info
Quality Check Name: geographicCoverage is present
Description: Check that geographicCoverage exists in EML at the dataset level, or at least one entity's level, or at least one attribute's level.
Expected: geographicCoverage at least at the dataset level.
Found: 1 'geographicCoverage' element(s) found
Explanation: Many but not all datasets are appropriate to have spatial coverage.
Suggestion: If sampling EML is used within methods, does that obviate geographicCoverage? Or should those sites be repeated or referenced?
Reference: EML Best Practices v.2, p. 22-23. "One geographicCoverage element should be included, whose boundingCoordinates describe the extent of the data.... Additional geographicCoverage elements may be entered at the dataset level if there are significant distances between study sites and it would be confusing if they were grouped into one bounding box." 6 decimal places.
10 taxonomicCoveragePresent
Status: info
Type: metadata
System: lter
On failure: info
Quality Check Name: taxonomicCoverage is present
Description: Check that taxonomicCoverage exists in EML at the dataset level, or at least one entity's level, or at least one attribute's level.
Expected: taxonomicCoverage at least at the dataset level.
Found: 0 'taxonomicCoverage' element(s) found
Explanation: Only when taxa are pertinent to the dataset will they have taxonomicCoverage.
Suggestion: Could search title, abstract, keywords for any taxonomic name (huge). Could search keywordType="taxonomic".
Reference: EML Best Practices v.2, p. 25
11 temporalCoveragePresent
Status: info
Type: metadata
System: lter
On failure: info
Quality Check Name: temporalCoverage is present
Description: Check that temporalCoverage exists in EML at the dataset level, or at least one entity's level, or at least one attribute's level.
Expected: temporalCoverage at least at the dataset level.
Found: 1 'temporalCoverage' element(s) found
Explanation: LTER wants to search datasets by time; the best place to search is the dataset level temporal coverage.
Suggestion: Most datasets have a temporal range.
Reference: EML Best Practices v.2, p. 24
12 pastaDoiAbsent
Status: valid
Type: metadata
System: lter
On failure: error
Quality Check Name: An alternateIdentifier with a DOI system attribute that looks like it is generated by PASTA should not be present
Description: Reject the data package if it contains an alternateIdentifier DOI that looks like PASTA generated it.
Expected: No PASTA DOIs are expected to be found in the uploaded data package
Found: No PASTA DOI alternateIdentifier elements found
Explanation: PASTA DOI values might appear in an uploaded data package (by various mechanisms). PASTA will assign a DOI after the upload has completed successfully, so an initial one should not be there.
13 titleLength
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: Dataset title length is at least 5 words.
Description: If the title is shorter than 5 words, it might be insufficient. Title word count between 7 and 20 including prepositions and numbers.
Expected: Between 7 and 20 words
Found: 153 words found.
Reference: EML Best Practices, v.2, p. 13
14 pubDatePresent
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: 'pubDate' element is present
Description: Check for presence of the pubDate element
Expected: The date that the dataset was submitted for publication in PASTA must be included. (The EML schema does not require this element, but when present, it does constrain its format to YYYY-MM-DD or just YYYY. Citation format uses only the YYYY portion even if a full date is entered.)
Found: 2017-10-24
Reference: EML Best Practices v.2, p. 17
15 datasetAbstractLength
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: Dataset abstract element is a minimum of 20 words
Description: Check the length of a dataset abstract and warn if less than 20 words.
Expected: An abstract is 20 words or more.
Found: 310 words found.
Reference: EML Best Practices
16 duplicateEntityName
Status: valid
Type: metadata
System: lter
On failure: error
Quality Check Name: There are no duplicate entity names
Description: Checks that content is not duplicated by other entityName elements in the document
Expected: entityName is not a duplicate within the document
Found: No duplicates found
Explanation: Data Manager requires a non-empty, non-duplicate entityName value for every entity
Suggestion: Declare a non-empty entityName and ensure that there are no duplicate entityName values in the document
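Illustration (not part of the generated report): the packageIdPattern check in the Dataset Report expects a value of the form 'scope.n.m', with integer 'n' and 'm' and a scope from an allowed set. A minimal Python sketch of such a test follows. The regular expression, the function name check_package_id, and the ALLOWED_SCOPES set are assumptions made for this example; the report only confirms that "edi" was accepted, and PASTA's actual implementation may differ.

```python
import re

# Hypothetical allow-list: the report only shows that the scope "edi" passed.
ALLOWED_SCOPES = {"edi"}

# scope.identifier.revision, where identifier and revision are integers.
PACKAGE_ID_RE = re.compile(r"([A-Za-z][A-Za-z0-9-]*)\.(\d+)\.(\d+)")


def check_package_id(package_id, allowed_scopes=ALLOWED_SCOPES):
    """Return True if package_id looks like 'scope.n.m' with an allowed scope."""
    match = PACKAGE_ID_RE.fullmatch(package_id)
    if match is None:
        return False
    return match.group(1) in allowed_scopes


if __name__ == "__main__":
    print(check_package_id("edi.114.8"))
```

Using fullmatch rather than search rejects values with trailing text, such as a package ID followed by a file extension.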
Entity Report
Entity Name: SBCMBON cloned 16S genes used to construct the mock community
Entity Identifier:
Columns: #, Identifier, Status, Quality Check Name, Description, Expected, Found, Explanation, Suggestion, Reference
1 entityNameLength
Status: valid
Type: metadata
System: knb
On failure: warn
Quality Check Name: Length of entityName is not excessive (less than 100 char)
Description: length of entity name is less than 100 characters
Expected: entityName value is 100 characters or less
Found: 61
2 entityDescriptionPresent
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: An entity description is present
Description: Check for presence of an entity description.
Expected: EML Best practices pp. 32-33, "...should have enough information for a user..."
Found: true
Explanation: With entityName sometimes serving as a file name rather than a title, it is important to be very descriptive here.
3 numHeaderLinesPresent
Status: info
Type: metadata
System: knb
On failure: info
Quality Check Name: 'numHeaderLines' element is present
Description: Check for presence of the 'numHeaderLines' element.
Expected: Document contains 'numHeaderLines' element.
Found: No 'numHeaderLines' element found
Explanation: If data file contains header lines, 'numHeaderLines' must be specified.
Suggestion: Add 'numHeaderLines' element if needed.
4 numFooterLinesPresent
Status: info
Type: metadata
System: knb
On failure: info
Quality Check Name: 'numFooterLines' element is present
Description: Check for presence of the 'numFooterLines' element.
Expected: Document contains 'numFooterLines' element.
Found: No 'numFooterLines' element found
Explanation: If data file contains footer lines, 'numFooterLines' must be specified.
Suggestion: Add 'numFooterLines' element if needed.
5 fieldDelimiterValid
Status: info
Type: metadata
System: knb
On failure: error
Quality Check Name: Field delimiter is a single character
Description: Field delimiters should be one character only
Expected: A single character is expected
Found: null
Explanation: A fieldDelimiter value is not checked for binary entities or entities with an externally defined format
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-physical.html#fieldDelimiter
6 recordDelimiterPresent
Status: info
Type: metadata
System: knb
On failure: warn
Quality Check Name: Record delimiter is present
Description: Check presence of record delimiter. Check that the record delimiter is one of the suggested values.
Expected: A record delimiter from a list of suggested values: \n, \r, \r\n, #x0A, #x0D, #x0D#x0A
Found: null
Explanation: A recordDelimiter value is not checked for binary entities or entities with an externally defined format
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-physical.html#recordDelimiter
7 integrityChecksumPresence
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: A physical/authentication element is present and specifies a method attribute with a value of MD5 or SHA-1
Description: Check for presence of a physical/authentication element containing a checksum or hash value for an integrity check (e.g. MD5, SHA-1). Warn if an entity does not have a physical/authentication element, or if none of the physical/authentication elements specify a method attribute with a value of MD5 or SHA-1.
Expected: At least one physical/authentication element with a method attribute specifying MD5 or SHA-1 and containing a checksum or hash value that can be used to perform an integrity check on the data.
Found: true
Explanation: PASTA will use this value to check the integrity of the data it downloads from your site. In addition, PASTA is planning to compare the contributor-supplied checksum/hash value documented in the physical/authentication element to the checksum/hash value of this entity downloaded from previous revisions of this data package. If PASTA already has a copy of this entity, it will be able to avoid an unnecessary download of the entity from your site, resulting in faster processing of the new data package revision when you update it in PASTA.
Suggestion: Add a physical/authentication element and store the entity checksum or hash value in it using a method such as MD5 or SHA-1.
8 displayDownloadData
Status: info
Type: data
System: knb
On failure: info
Quality Check Name: Display downloaded data
Description: Display the first kilobyte of data that is downloaded
Expected: Up to one kilobyte of data should be displayed
Found: *** BINARY DATA ***
9 urlReturnsData
Status: valid
Type: congruency
System: knb
On failure: error
Quality Check Name: URL returns data
Description: Checks whether a URL returns data. Unless the URL is specified to be function="information", the URL should return the resource for download.
Expected: A data entity that matches the metadata
Found: true
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-resource.html#UrlType
10 onlineURLs
Status: valid
Type: congruency
System: knb
On failure: error
Quality Check Name: Online URLs are live
Description: Check that online URLs return something
Expected: true
Found: true
Explanation: Succeeded in accessing URL: https://pasta.lternet.edu/package/data/eml/edi/114/1/5f7618903c0f1acb22ecbfbf46b647fe
11 integrityChecksum
Status: valid
Type: congruency
System: lter
On failure: error
Quality Check Name: Compare the metadata checksum for an entity to the checksum of the downloaded entity
Description: Two possible responses: valid if checksums match; error if checksums do not match.
Expected: 5b1e5710ebc634df769602c487f7ac1c
Found: 5b1e5710ebc634df769602c487f7ac1c
Explanation: Matching checksums will ensure data integrity during upload to the repository.
Suggestion: If the found integrity hash value does not match the expected integrity hash value, there may have been a loss of integrity in the data download. Check that the hash method and hash value documented in the metadata are the correct values.
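Illustration (not part of the generated report): the integrityChecksum check compares the checksum documented in the metadata with the checksum of the downloaded entity, and reports valid on a match and error otherwise. The Python sketch below shows one way to reproduce that comparison locally. It assumes the documented method is MD5 (the 32-character hexadecimal values in this report are consistent with MD5, but the report does not name the method), and the function names are hypothetical.

```python
import hashlib


def md5_hex(data: bytes) -> str:
    """Hex-encoded MD5 digest of the entity bytes (assumed hash method)."""
    return hashlib.md5(data).hexdigest()


def integrity_status(expected_hex: str, data: bytes) -> str:
    """Return 'valid' if the checksums match and 'error' otherwise,
    mirroring the two responses described by the check."""
    expected = expected_hex.strip().lower()
    return "valid" if md5_hex(data) == expected else "error"


if __name__ == "__main__":
    # The MD5 digest of zero bytes is a well-known constant.
    print(integrity_status("d41d8cd98f00b204e9800998ecf8427e", b""))
```

For large entities, reading the file in chunks and calling update() on a single hashlib.md5() object avoids loading the whole file into memory.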
Entity Report
Entity Name: SBCMBON mock community sequencing results
Entity Identifier:
Columns: #, Identifier, Status, Quality Check Name, Description, Expected, Found, Explanation, Suggestion, Reference
1 entityNameLength
Status: valid
Type: metadata
System: knb
On failure: warn
Quality Check Name: Length of entityName is not excessive (less than 100 char)
Description: length of entity name is less than 100 characters
Expected: entityName value is 100 characters or less
Found: 41
2 entityDescriptionPresent
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: An entity description is present
Description: Check for presence of an entity description.
Expected: EML Best practices pp. 32-33, "...should have enough information for a user..."
Found: true
Explanation: With entityName sometimes serving as a file name rather than a title, it is important to be very descriptive here.
3 numHeaderLinesPresent
Status: info
Type: metadata
System: knb
On failure: info
Quality Check Name: 'numHeaderLines' element is present
Description: Check for presence of the 'numHeaderLines' element.
Expected: Document contains 'numHeaderLines' element.
Found: No 'numHeaderLines' element found
Explanation: If data file contains header lines, 'numHeaderLines' must be specified.
Suggestion: Add 'numHeaderLines' element if needed.
4 numFooterLinesPresent
Status: info
Type: metadata
System: knb
On failure: info
Quality Check Name: 'numFooterLines' element is present
Description: Check for presence of the 'numFooterLines' element.
Expected: Document contains 'numFooterLines' element.
Found: No 'numFooterLines' element found
Explanation: If data file contains footer lines, 'numFooterLines' must be specified.
Suggestion: Add 'numFooterLines' element if needed.
5 fieldDelimiterValid
Status: info
Type: metadata
System: knb
On failure: error
Quality Check Name: Field delimiter is a single character
Description: Field delimiters should be one character only
Expected: A single character is expected
Found: null
Explanation: A fieldDelimiter value is not checked for binary entities or entities with an externally defined format
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-physical.html#fieldDelimiter
6 recordDelimiterPresent
Status: info
Type: metadata
System: knb
On failure: warn
Quality Check Name: Record delimiter is present
Description: Check presence of record delimiter. Check that the record delimiter is one of the suggested values.
Expected: A record delimiter from a list of suggested values: \n, \r, \r\n, #x0A, #x0D, #x0D#x0A
Found: null
Explanation: A recordDelimiter value is not checked for binary entities or entities with an externally defined format
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-physical.html#recordDelimiter
7 integrityChecksumPresence
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: A physical/authentication element is present and specifies a method attribute with a value of MD5 or SHA-1
Description: Check for presence of a physical/authentication element containing a checksum or hash value for an integrity check (e.g. MD5, SHA-1). Warn if an entity does not have a physical/authentication element, or if none of the physical/authentication elements specify a method attribute with a value of MD5 or SHA-1.
Expected: At least one physical/authentication element with a method attribute specifying MD5 or SHA-1 and containing a checksum or hash value that can be used to perform an integrity check on the data.
Found: true
Explanation: PASTA will use this value to check the integrity of the data it downloads from your site. In addition, PASTA is planning to compare the contributor-supplied checksum/hash value documented in the physical/authentication element to the checksum/hash value of this entity downloaded from previous revisions of this data package. If PASTA already has a copy of this entity, it will be able to avoid an unnecessary download of the entity from your site, resulting in faster processing of the new data package revision when you update it in PASTA.
Suggestion: Add a physical/authentication element and store the entity checksum or hash value in it using a method such as MD5 or SHA-1.
8 displayDownloadData
Status: info
Type: data
System: knb
On failure: info
Quality Check Name: Display downloaded data
Description: Display the first kilobyte of data that is downloaded
Expected: Up to one kilobyte of data should be displayed
Found: *** BINARY DATA ***
9 urlReturnsData
Status: valid
Type: congruency
System: knb
On failure: error
Quality Check Name: URL returns data
Description: Checks whether a URL returns data. Unless the URL is specified to be function="information", the URL should return the resource for download.
Expected: A data entity that matches the metadata
Found: true
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-resource.html#UrlType
10 onlineURLs
Status: valid
Type: congruency
System: knb
On failure: error
Quality Check Name: Online URLs are live
Description: Check that online URLs return something
Expected: true
Found: true
Explanation: Succeeded in accessing URL: https://pasta.lternet.edu/package/data/eml/edi/114/1/d83bd888c6de0c011a324b8235d94faf
11 integrityChecksum
Status: valid
Type: congruency
System: lter
On failure: error
Quality Check Name: Compare the metadata checksum for an entity to the checksum of the downloaded entity
Description: Two possible responses: valid if checksums match; error if checksums do not match.
Expected: c42613bfa3182c610046a0dadd224bda
Found: c42613bfa3182c610046a0dadd224bda
Explanation: Matching checksums will ensure data integrity during upload to the repository.
Suggestion: If the found integrity hash value does not match the expected integrity hash value, there may have been a loss of integrity in the data download. Check that the hash method and hash value documented in the metadata are the correct values.
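Illustration (not part of the generated report): the recordDelimiterPresent check in the entity reports expects a record delimiter drawn from the suggested values \n, \r, \r\n, #x0A, #x0D, #x0D#x0A, and is skipped (found value null) for binary entities or entities with an externally defined format. The Python sketch below models that behaviour under two assumptions: the delimiter is compared as the literal text written in the metadata, and a skipped check is represented by None. The function name is hypothetical.

```python
# Suggested values exactly as listed in the report, treated as literal
# metadata text (an assumption), hence the escaped backslashes.
SUGGESTED_RECORD_DELIMITERS = {
    "\\n", "\\r", "\\r\\n", "#x0A", "#x0D", "#x0D#x0A",
}


def record_delimiter_ok(value):
    """Return None when there is nothing to check (value is None, as for
    binary entities), otherwise True if value is a suggested delimiter."""
    if value is None:
        return None
    return value in SUGGESTED_RECORD_DELIMITERS


if __name__ == "__main__":
    print(record_delimiter_ok("\\r\\n"))
    print(record_delimiter_ok(None))
```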
Entity Report
Entity Name: SBCMBON field sequencing results
Entity Identifier:
Columns: #, Identifier, Status, Quality Check Name, Description, Expected, Found, Explanation, Suggestion, Reference
1 entityNameLength
Status: valid
Type: metadata
System: knb
On failure: warn
Quality Check Name: Length of entityName is not excessive (less than 100 char)
Description: length of entity name is less than 100 characters
Expected: entityName value is 100 characters or less
Found: 32
2 entityDescriptionPresent
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: An entity description is present
Description: Check for presence of an entity description.
Expected: EML Best practices pp. 32-33, "...should have enough information for a user..."
Found: true
Explanation: With entityName sometimes serving as a file name rather than a title, it is important to be very descriptive here.
3 numHeaderLinesPresent
Status: info
Type: metadata
System: knb
On failure: info
Quality Check Name: 'numHeaderLines' element is present
Description: Check for presence of the 'numHeaderLines' element.
Expected: Document contains 'numHeaderLines' element.
Found: No 'numHeaderLines' element found
Explanation: If data file contains header lines, 'numHeaderLines' must be specified.
Suggestion: Add 'numHeaderLines' element if needed.
4 numFooterLinesPresent
Status: info
Type: metadata
System: knb
On failure: info
Quality Check Name: 'numFooterLines' element is present
Description: Check for presence of the 'numFooterLines' element.
Expected: Document contains 'numFooterLines' element.
Found: No 'numFooterLines' element found
Explanation: If data file contains footer lines, 'numFooterLines' must be specified.
Suggestion: Add 'numFooterLines' element if needed.
5 fieldDelimiterValid
Status: info
Type: metadata
System: knb
On failure: error
Quality Check Name: Field delimiter is a single character
Description: Field delimiters should be one character only
Expected: A single character is expected
Found: null
Explanation: A fieldDelimiter value is not checked for binary entities or entities with an externally defined format
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-physical.html#fieldDelimiter
6 recordDelimiterPresent
Status: info
Type: metadata
System: knb
On failure: warn
Quality Check Name: Record delimiter is present
Description: Check presence of record delimiter. Check that the record delimiter is one of the suggested values.
Expected: A record delimiter from a list of suggested values: \n, \r, \r\n, #x0A, #x0D, #x0D#x0A
Found: null
Explanation: A recordDelimiter value is not checked for binary entities or entities with an externally defined format
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-physical.html#recordDelimiter
7 integrityChecksumPresence
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: A physical/authentication element is present and specifies a method attribute with a value of MD5 or SHA-1
Description: Check for presence of a physical/authentication element containing a checksum or hash value for an integrity check (e.g. MD5, SHA-1). Warn if an entity does not have a physical/authentication element, or if none of the physical/authentication elements specify a method attribute with a value of MD5 or SHA-1.
Expected: At least one physical/authentication element with a method attribute specifying MD5 or SHA-1 and containing a checksum or hash value that can be used to perform an integrity check on the data.
Found: true
Explanation: PASTA will use this value to check the integrity of the data it downloads from your site. In addition, PASTA is planning to compare the contributor-supplied checksum/hash value documented in the physical/authentication element to the checksum/hash value of this entity downloaded from previous revisions of this data package. If PASTA already has a copy of this entity, it will be able to avoid an unnecessary download of the entity from your site, resulting in faster processing of the new data package revision when you update it in PASTA.
Suggestion: Add a physical/authentication element and store the entity checksum or hash value in it using a method such as MD5 or SHA-1.
8 displayDownloadData
Status: info
Type: data
System: knb
On failure: info
Quality Check Name: Display downloaded data
Description: Display the first kilobyte of data that is downloaded
Expected: Up to one kilobyte of data should be displayed
Found: *** BINARY DATA ***
9 urlReturnsData
Status: valid
Type: congruency
System: knb
On failure: error
Quality Check Name: URL returns data
Description: Checks whether a URL returns data. Unless the URL is specified to be function="information", the URL should return the resource for download.
Expected: A data entity that matches the metadata
Found: true
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-resource.html#UrlType
10 onlineURLs
Status: valid
Type: congruency
System: knb
On failure: error
Quality Check Name: Online URLs are live
Description: Check that online URLs return something
Expected: true
Found: true
Explanation: Succeeded in accessing URL: https://pasta.lternet.edu/package/data/eml/edi/114/1/62cfb69039a0951b9f90271a28d28278
11 integrityChecksum
Status: valid
Type: congruency
System: lter
On failure: error
Quality Check Name: Compare the metadata checksum for an entity to the checksum of the downloaded entity
Description: Two possible responses: valid if checksums match; error if checksums do not match.
Expected: 06bffef34b36d59a7fae8b46d13296cb
Found: 06bffef34b36d59a7fae8b46d13296cb
Explanation: Matching checksums will ensure data integrity during upload to the repository.
Suggestion: If the found integrity hash value does not match the expected integrity hash value, there may have been a loss of integrity in the data download. Check that the hash method and hash value documented in the metadata are the correct values.
Entity Report
Entity Name: SBCMBON metagenome results
Entity Identifier:
Columns: #, Identifier, Status, Quality Check Name, Description, Expected, Found, Explanation, Suggestion, Reference
1 entityNameLength
Status: valid
Type: metadata
System: knb
On failure: warn
Quality Check Name: Length of entityName is not excessive (less than 100 char)
Description: length of entity name is less than 100 characters
Expected: entityName value is 100 characters or less
Found: 26
2 entityDescriptionPresent
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: An entity description is present
Description: Check for presence of an entity description.
Expected: EML Best practices pp. 32-33, "...should have enough information for a user..."
Found: true
Explanation: With entityName sometimes serving as a file name rather than a title, it is important to be very descriptive here.
3 numHeaderLinesPresent
Status: info
Type: metadata
System: knb
On failure: info
Quality Check Name: 'numHeaderLines' element is present
Description: Check for presence of the 'numHeaderLines' element.
Expected: Document contains 'numHeaderLines' element.
Found: No 'numHeaderLines' element found
Explanation: If data file contains header lines, 'numHeaderLines' must be specified.
Suggestion: Add 'numHeaderLines' element if needed.
4 numFooterLinesPresent
Status: info
Type: metadata
System: knb
On failure: info
Quality Check Name: 'numFooterLines' element is present
Description: Check for presence of the 'numFooterLines' element.
Expected: Document contains 'numFooterLines' element.
Found: No 'numFooterLines' element found
Explanation: If data file contains footer lines, 'numFooterLines' must be specified.
Suggestion: Add 'numFooterLines' element if needed.
5 fieldDelimiterValid
Status: info
Type: metadata
System: knb
On failure: error
Quality Check Name: Field delimiter is a single character
Description: Field delimiters should be one character only
Expected: A single character is expected
Found: null
Explanation: A fieldDelimiter value is not checked for binary entities or entities with an externally defined format
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-physical.html#fieldDelimiter
6 recordDelimiterPresent
Status: info
Type: metadata
System: knb
On failure: warn
Quality Check Name: Record delimiter is present
Description: Check presence of record delimiter. Check that the record delimiter is one of the suggested values.
Expected: A record delimiter from a list of suggested values: \n, \r, \r\n, #x0A, #x0D, #x0D#x0A
Found: null
Explanation: A recordDelimiter value is not checked for binary entities or entities with an externally defined format
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-physical.html#recordDelimiter
7 integrityChecksumPresence
Status: valid
Type: metadata
System: lter
On failure: warn
Quality Check Name: A physical/authentication element is present and specifies a method attribute with a value of MD5 or SHA-1
Description: Check for presence of a physical/authentication element containing a checksum or hash value for an integrity check (e.g. MD5, SHA-1). Warn if an entity does not have a physical/authentication element, or if none of the physical/authentication elements specify a method attribute with a value of MD5 or SHA-1.
Expected: At least one physical/authentication element with a method attribute specifying MD5 or SHA-1 and containing a checksum or hash value that can be used to perform an integrity check on the data.
Found: true
Explanation: PASTA will use this value to check the integrity of the data it downloads from your site. In addition, PASTA is planning to compare the contributor-supplied checksum/hash value documented in the physical/authentication element to the checksum/hash value of this entity downloaded from previous revisions of this data package. If PASTA already has a copy of this entity, it will be able to avoid an unnecessary download of the entity from your site, resulting in faster processing of the new data package revision when you update it in PASTA.
Suggestion: Add a physical/authentication element and store the entity checksum or hash value in it using a method such as MD5 or SHA-1.
8 displayDownloadData
Status: info
Type: data
System: knb
On failure: info
Quality Check Name: Display downloaded data
Description: Display the first kilobyte of data that is downloaded
Expected: Up to one kilobyte of data should be displayed
Found: *** BINARY DATA ***
9 urlReturnsData
Status: valid
Type: congruency
System: knb
On failure: error
Quality Check Name: URL returns data
Description: Checks whether a URL returns data. Unless the URL is specified to be function="information", the URL should return the resource for download.
Expected: A data entity that matches the metadata
Found: true
Reference: http://knb.ecoinformatics.org/software/eml/eml-2.1.0/eml-resource.html#UrlType
10 onlineURLs
Status: valid
Type: congruency
System: knb
On failure: error
Quality Check Name: Online URLs are live
Description: Check that online URLs return something
Expected: true
Found: true
Explanation: Succeeded in accessing URL: https://pasta.lternet.edu/package/data/eml/edi/114/1/567ef51962203f504d68281e1f09093d
11 integrityChecksum
Status: valid
Type: congruency
System: lter
On failure: error
Quality Check Name: Compare the metadata checksum for an entity to the checksum of the downloaded entity
Description: Two possible responses: valid if checksums match; error if checksums do not match.
Expected: d35f76416c887f928d296d5121c89454
Found: d35f76416c887f928d296d5121c89454
Explanation: Matching checksums will ensure data integrity during upload to the repository.
Suggestion: If the found integrity hash value does not match the expected integrity hash value, there may have been a loss of integrity in the data download. Check that the hash method and hash value documented in the metadata are the correct values.