The above table is based on
5,853 processable files.
There are also
1 files in
the collection that could not be processed.
See record details table for more information.
All per file averages (except in the Files section) are based on the number of processable
files
Files
General information on the number of files and the file size.
Number of files:
5,854
Number of processable files:
5,853
Total size:
72,235,294
B
Average size:
12,339
B
Minimal file size:
5,669
B
Maximal file size:
22,527
B
Header
The header section shows information on the availibilty of attribute schemaLocation
as well as the
elements
MdSelfLink, MdProfile and MdCollectionDisplayName.
Number of files with schemaLocation:
290
Number of files where schemaLocation is CR resident:
290
Number of files with MdProfile:
5,853
Number of files with MdSelfLink:
5,853
Number of files with MdCollectionDisplayName:
5,842
Profile usage
The profile usage section shows information shows which profiles are used how oftenly
in a collection.
collection.
ID
Is public
Score
Count
Total number of profiles:
6
clarin.eu:cr1:p_1386164908461
true
2.70
2
clarin.eu:cr1:p_1357720977528
true
2.69
4
clarin.eu:cr1:p_1320657629644
true
2.47
23
clarin.eu:cr1:p_1562754657370
true
2.07
5,253
clarin.eu:cr1:p_1353678848745
false
1.64
8
clarin.eu:cr1:p_1381926654438
false
0.94
563
Facets
The facet section shows the facet coverage within the
collection. A facet can be covered by the instance
even when it is not covered by the profile when cross facet mapping is used.
name
coverage
average facet-coverage:
70.8%
languageCode
99.6%
collection
99.8%
resourceClass
0.5%
modality
88.4%
format
99.5%
keywords
0.0%
genre
0.0%
subject
0.0%
country
99.6%
organisation
99.6%
name
100.0%
description
95.6%
license
89.9%
availability
89.7%
temporalCoverage
99.5%
Resource proxy
The resource proxy section shows information on the number of
resource proxies on the kind (the mime type) of resources.
A resource proxy is a link to an external resource, described by
the CMD file.
Total number of resource proxies:
17,897
Average number of resource proxies:
3.06
Total number of resource proxies with MIME:
12,070
Average number of resource proxies with MIME:
2.06
Total number of resource proxies with reference:
17,897
Average number of resource proxies with references:
3.06
XML validation
The XML validation section shows the result of a simple
validation of each CMD file against its profile.
Number of XML valid Records:
5,415
Ratio XML valid Records:
92.5%
XML population
The XML population section shows information on the number of xml
elements and the fact if these elements are conatining data.
Total number of XML elements:
898,647
Average number of XML elements:
153.54
Total number of simple XML elements:
559,036
Average number of simple XML elements:
95.51
Total number of empty XML elements:
85,686
Average number of empty XML elements:
N/A
Average rate of populated elements:
84.7%
Link validation
The link validation section shows information on the number of
links and the results of link checking for the links which
have been checked so far.
Total number of links:
23,750
Average number of links:
4.06
Total number of unique links:
23,750
Average number of unique links:
4.06
Total number of checked links:
23,750
Ratio of valid links:
88.4%
Link Checking Results
Category
Count
Average Response Duration(ms)
Max Response Duration(ms)
Ok
21,000
435.7
6,848.0
Blocked_By_Robots_txt
2,347
N/A
N/A
Broken
403
16.9
423.0
Record details:
The record details section shows the particalarities of each record as far as they're
of
importance for the data provider.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 208, col: 29 - cvc-complex-type.2.4.a: Invalid content was found starting with
element '{"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1381926654438":langUsage}'.
One of '{"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1381926654438":abstract}'
is expected.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 218, col: 29 - cvc-complex-type.2.4.a: Invalid content was found starting with
element '{"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1381926654438":langUsage}'.
One of '{"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1381926654438":abstract}'
is expected.
Severity: WARNING,
Segment: header,
Message:
Attribute schemaLocation is missing. clarin.eu:cr1:p_1562754657370 is assumed
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 118 - cvc-datatype-valid.1.2.1: 'Dr. Johann Gottfried Dingler: Polytechnisches
Journal. Bd. 57. Stuttgart, 1835..xml' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 118 - cvc-attribute.3: The value 'Dr. Johann Gottfried Dingler: Polytechnisches
Journal. Bd. 57. Stuttgart, 1835..xml' of attribute 'id' on element 'cmd:ResourceProxy'
is not valid with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 127 - cvc-datatype-valid.1.2.1: 'Dr. Johann Gottfried Dingler: Polytechnisches
Journal. Bd. 57. Stuttgart, 1835..landing_page' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 127 - cvc-attribute.3: The value 'Dr. Johann Gottfried Dingler: Polytechnisches
Journal. Bd. 57. Stuttgart, 1835..landing_page' of attribute 'id' on element 'cmd:ResourceProxy'
is not valid with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 121 - cvc-datatype-valid.1.2.1: 'Dr. Johann Gottfried Dingler: Polytechnisches
Journal. Bd. 57. Stuttgart, 1835..search' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 121 - cvc-attribute.3: The value 'Dr. Johann Gottfried Dingler: Polytechnisches
Journal. Bd. 57. Stuttgart, 1835..search' of attribute 'id' on element 'cmd:ResourceProxy'
is not valid with respect to its type, 'ID'.
Severity: WARNING,
Segment: header,
Message:
Attribute schemaLocation is missing. clarin.eu:cr1:p_1562754657370 is assumed
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 121 - cvc-datatype-valid.1.2.1: 'Johann Zeman, Dr. Ferd. Fischer: Polytechnisches
Journal. Bd. 217. Augsburg, 1875..xml' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 121 - cvc-attribute.3: The value 'Johann Zeman, Dr. Ferd. Fischer:
Polytechnisches Journal. Bd. 217. Augsburg, 1875..xml' of attribute 'id' on element
'cmd:ResourceProxy' is not valid with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 130 - cvc-datatype-valid.1.2.1: 'Johann Zeman, Dr. Ferd. Fischer: Polytechnisches
Journal. Bd. 217. Augsburg, 1875..landing_page' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 130 - cvc-attribute.3: The value 'Johann Zeman, Dr. Ferd. Fischer:
Polytechnisches Journal. Bd. 217. Augsburg, 1875..landing_page' of attribute 'id'
on element 'cmd:ResourceProxy' is not valid with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 124 - cvc-datatype-valid.1.2.1: 'Johann Zeman, Dr. Ferd. Fischer: Polytechnisches
Journal. Bd. 217. Augsburg, 1875..search' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 124 - cvc-attribute.3: The value 'Johann Zeman, Dr. Ferd. Fischer:
Polytechnisches Journal. Bd. 217. Augsburg, 1875..search' of attribute 'id' on element
'cmd:ResourceProxy' is not valid with respect to its type, 'ID'.
Severity: WARNING,
Segment: header,
Message:
Attribute schemaLocation is missing. clarin.eu:cr1:p_1562754657370 is assumed
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 160 - cvc-datatype-valid.1.2.1: 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 93. Stuttgart und Tübingen,
1844..xml' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 160 - cvc-attribute.3: The value 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 93. Stuttgart und Tübingen,
1844..xml' of attribute 'id' on element 'cmd:ResourceProxy' is not valid with respect
to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 169 - cvc-datatype-valid.1.2.1: 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 93. Stuttgart und Tübingen,
1844..landing_page' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 169 - cvc-attribute.3: The value 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 93. Stuttgart und Tübingen,
1844..landing_page' of attribute 'id' on element 'cmd:ResourceProxy' is not valid
with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 163 - cvc-datatype-valid.1.2.1: 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 93. Stuttgart und Tübingen,
1844..search' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 163 - cvc-attribute.3: The value 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 93. Stuttgart und Tübingen,
1844..search' of attribute 'id' on element 'cmd:ResourceProxy' is not valid with respect
to its type, 'ID'.
Severity: WARNING,
Segment: header,
Message:
Value for CMD/Header/MdCollectionDisplayName is missing
Severity: ERROR,
Segment: xml-validation,
Message:
line: 13, col: 57 - cvc-datatype-valid.1.2.1: 'WebLichtWebServices:12' is not a valid
value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 13, col: 57 - cvc-attribute.3: The value 'WebLichtWebServices:12' of attribute
'id' on element 'cmd:ResourceProxy' is not valid with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 145, col: 66 - cvc-complex-type.2.4.a: Invalid content was found starting with
element '{"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1320657629644":AllowManualSelectionFallback}'.
One of '{"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1320657629644":Description,
"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1320657629644":MIMEType, "http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1320657629644":DataType,
"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1320657629644":isConfigurationParameter,
"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1320657629644":DataCategory,
"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1320657629644":SemanticType,
"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1320657629644":RefInputParameter,
"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1320657629644":Values}' is expected.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 171, col: 11 - cvc-identity-constraint.4.3: Key 'PayloadResourceRef' with value
's101' not found for identity constraint of element 'CMD'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 171, col: 11 - cvc-id.1: There is no ID/IDREF binding for IDREF 's101'.
Severity: WARNING,
Segment: header,
Message:
Attribute schemaLocation is missing. clarin.eu:cr1:p_1562754657370 is assumed
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 144 - cvc-datatype-valid.1.2.1: 'D. Philipp, Dr. Jul. Philipp: Real-Index
zu Dr. Dinglers polytechnischem Journal. Bd. 4. Stuttgart, 1871..xml' is not a valid
value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 144 - cvc-attribute.3: The value 'D. Philipp, Dr. Jul. Philipp: Real-Index
zu Dr. Dinglers polytechnischem Journal. Bd. 4. Stuttgart, 1871..xml' of attribute
'id' on element 'cmd:ResourceProxy' is not valid with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 153 - cvc-datatype-valid.1.2.1: 'D. Philipp, Dr. Jul. Philipp: Real-Index
zu Dr. Dinglers polytechnischem Journal. Bd. 4. Stuttgart, 1871..landing_page' is
not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 153 - cvc-attribute.3: The value 'D. Philipp, Dr. Jul. Philipp: Real-Index
zu Dr. Dinglers polytechnischem Journal. Bd. 4. Stuttgart, 1871..landing_page' of
attribute 'id' on element 'cmd:ResourceProxy' is not valid with respect to its type,
'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 147 - cvc-datatype-valid.1.2.1: 'D. Philipp, Dr. Jul. Philipp: Real-Index
zu Dr. Dinglers polytechnischem Journal. Bd. 4. Stuttgart, 1871..search' is not a
valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 147 - cvc-attribute.3: The value 'D. Philipp, Dr. Jul. Philipp: Real-Index
zu Dr. Dinglers polytechnischem Journal. Bd. 4. Stuttgart, 1871..search' of attribute
'id' on element 'cmd:ResourceProxy' is not valid with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 218, col: 29 - cvc-complex-type.2.4.a: Invalid content was found starting with
element '{"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1381926654438":langUsage}'.
One of '{"http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1381926654438":abstract}'
is expected.
Severity: WARNING,
Segment: header,
Message:
Attribute schemaLocation is missing. clarin.eu:cr1:p_1562754657370 is assumed
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 148 - cvc-datatype-valid.1.2.1: 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 121. Stuttgart, 1851..xml' is
not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 148 - cvc-attribute.3: The value 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 121. Stuttgart, 1851..xml' of
attribute 'id' on element 'cmd:ResourceProxy' is not valid with respect to its type,
'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 157 - cvc-datatype-valid.1.2.1: 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 121. Stuttgart, 1851..landing_page'
is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 157 - cvc-attribute.3: The value 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 121. Stuttgart, 1851..landing_page'
of attribute 'id' on element 'cmd:ResourceProxy' is not valid with respect to its
type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 151 - cvc-datatype-valid.1.2.1: 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 121. Stuttgart, 1851..search'
is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 151 - cvc-attribute.3: The value 'Dr. Johann Gottfried Dingler, Dr.
Emil Maximilian Dingler: Polytechnisches Journal. Bd. 121. Stuttgart, 1851..search'
of attribute 'id' on element 'cmd:ResourceProxy' is not valid with respect to its
type, 'ID'.
Severity: WARNING,
Segment: header,
Message:
Attribute schemaLocation is missing. clarin.eu:cr1:p_1562754657370 is assumed
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 121 - cvc-datatype-valid.1.2.1: 'Johann Zeman, Dr. Ferd. Fischer: Polytechnisches
Journal. Bd. 214. Augsburg, 1874..xml' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 15, col: 121 - cvc-attribute.3: The value 'Johann Zeman, Dr. Ferd. Fischer:
Polytechnisches Journal. Bd. 214. Augsburg, 1874..xml' of attribute 'id' on element
'cmd:ResourceProxy' is not valid with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 130 - cvc-datatype-valid.1.2.1: 'Johann Zeman, Dr. Ferd. Fischer: Polytechnisches
Journal. Bd. 214. Augsburg, 1874..landing_page' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 19, col: 130 - cvc-attribute.3: The value 'Johann Zeman, Dr. Ferd. Fischer:
Polytechnisches Journal. Bd. 214. Augsburg, 1874..landing_page' of attribute 'id'
on element 'cmd:ResourceProxy' is not valid with respect to its type, 'ID'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 124 - cvc-datatype-valid.1.2.1: 'Johann Zeman, Dr. Ferd. Fischer: Polytechnisches
Journal. Bd. 214. Augsburg, 1874..search' is not a valid value for 'NCName'.
Severity: ERROR,
Segment: xml-validation,
Message:
line: 23, col: 124 - cvc-attribute.3: The value 'Johann Zeman, Dr. Ferd. Fischer:
Polytechnisches Journal. Bd. 214. Augsburg, 1874..search' of attribute 'id' on element
'cmd:ResourceProxy' is not valid with respect to its type, 'ID'.