Changes for page 13 Structure Mapping
Last modified by Artur on 2025/09/10 11:19
Summary
-
Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
-
- Tags
-
... ... @@ -1,1 +1,0 @@ 1 -Component|Data set|Dimension|Observation value|Series key|Statistical data and metadata exchange - Content
-
... ... @@ -4,94 +4,97 @@ 4 4 5 5 == 13.1 Introduction == 6 6 7 -The purpose of [[SDMX>>doc:sdmx:Glossary.Statisticaldata and metadata exchange.WebHome]]structure mapping is to transform[[datasets>>doc:sdmx:Glossary.Dataset.WebHome]]from one dimensionality to another. In practice, this means that the input and output[[datasets>>doc:sdmx:Glossary.Dataset.WebHome]]conform to different Data Structure Definition.7 +The purpose of SDMX structure mapping is to transform datasets from one dimensionality to another. In practice, this means that the input and output datasets conform to different Data Structure Definition. 8 8 9 -Structure mapping does not alter the [[observation values>>doc:sdmx:Glossary.Observationvalue.WebHome]] and is not intended to perform any aggregations or calculations.9 +Structure mapping does not alter the observation values and is not intended to perform any aggregations or calculations. 10 10 11 -An input series (% style="color:#e74c3c" %)maps(%%)to:11 +An input series maps to: 12 12 13 13 1. Exactly one output series; or 14 -1. Multiple output series with different [[Series Keys>>doc:sdmx:Glossary.Series key.WebHome]], but the same[[observation values>>doc:sdmx:Glossary.Observation value.WebHome]]; or15 -1. Zero output series where no source rule matches the input [[Component>>doc:sdmx:Glossary.Component.WebHome]]values.14 +1. Multiple output series with different Series Keys, but the same observation values; or 15 +1. Zero output series where no source rule matches the input Component values. 16 16 17 17 Typical use cases include: 18 18 19 19 * Transforming received data into a common internal structure; 20 20 * Transforming reported data into the data collector's preferred structure; 21 -* Transforming unidimensional [[datasets>>doc:sdmx:Glossary.Dataset.WebHome]]{{footnote}}Unidimensional datasets are those with a single 'indicator' or 'series code' dimension.{{/footnote}}to multi-dimensional; and22 -* Transforming internal [[datasets>>doc:sdmx:Glossary.Dataset.WebHome]]with a complex structure to a simpler structure with fewer[[dimensions>>doc:sdmx:Glossary.Dimension.WebHome]]suitable for dissemination.21 +* Transforming unidimensional datasets[[(% class="wikiinternallink" %)^^~[1~]^^>>path:#_ftn1]](%%) to multi-dimensional; and 22 +* Transforming internal datasets with a complex structure to a simpler structure with fewer dimensions suitable for dissemination. 23 23 24 24 == 13.2 1-1 structure maps == 25 25 26 -1-1 (pronounced 'one to one') mappings support the simple use case where the value of a [[Component>>doc:sdmx:Glossary.Component.WebHome]]in the source structure is translated to a different value in the target, usually where different classification schemes are used for the same Concept.26 +1-1 (pronounced 'one to one') mappings support the simple use case where the value of a Component in the source structure is translated to a different value in the target, usually where different classification schemes are used for the same Concept. 27 27 28 -In the example below, ISO 2-character country [[codes>>doc:sdmx:Glossary.Code.WebHome]]are(% style="color:#e74c3c" %)mapped(%%)to their ISO 3character equivalent.28 +In the example below, ISO 2-character country codes are mapped to their ISO 3character equivalent. 29 29 30 -(% style="width:585.294px" %) 31 -|(% style="width:173px" %)**Country**|(% style="width:180px" %)**Alpha-2 code**|(% style="width:229px" %)**Alpha-3 code** 32 -|(% style="width:173px" %)Afghanistan|(% style="width:180px" %)AF|(% style="width:229px" %)AFG 33 -|(% style="width:173px" %)Albania|(% style="width:180px" %)AL|(% style="width:229px" %)ALB 34 -|(% style="width:173px" %)Algeria|(% style="width:180px" %)DZ|(% style="width:229px" %)DZA 35 -|(% style="width:173px" %)American Samoa|(% style="width:180px" %)AS|(% style="width:229px" %)ASM 36 -|(% style="width:173px" %)Andorra|(% style="width:180px" %)AD|(% style="width:229px" %)AND 37 -|(% style="width:173px" %)etc…|(% style="width:180px" %) |(% style="width:229px" %) 30 +|Country|Alpha-2 code|Alpha-3 code 31 +|Afghanistan|AF|AFG 32 +|Albania|AL|ALB 33 +|Algeria|DZ|DZA 34 +|American Samoa|AS|ASM 35 +|Andorra|AD|AND 36 +|etc…| | 38 38 39 -Different source values can also (% style="color:#e74c3c" %)map(%%)to the same target value, for example when deriving regions from country[[codes>>doc:sdmx:Glossary.Code.WebHome]].38 +Different source values can also map to the same target value, for example when deriving regions from country codes. 40 40 41 -(% style="width:490.294px" %) 42 -|(% style="width:260px" %)Source Component: REF_AREA|(% style="width:227px" %)Target Component: REGION 43 -|(% style="width:260px" %)FR|(% style="width:227px" %)EUR 44 -|(% style="width:260px" %)DE|(% style="width:227px" %)EUR 45 -|(% style="width:260px" %)IT|(% style="width:227px" %)EUR 46 -|(% style="width:260px" %)ES|(% style="width:227px" %)EUR 47 -|(% style="width:260px" %)BE|(% style="width:227px" %)EUR 40 +|Source Component: REF_AREA|Target Component: REGION 41 +|FR|EUR 42 +|DE|EUR 43 +|IT|EUR 44 +|ES|EUR 45 +|BE|EUR 48 48 49 49 == 13.3 N-n structure maps == 50 50 51 -N-n (pronounced 'N to N') mappings describe rules where a specified combination of values in multiple source [[Components>>doc:sdmx:Glossary.Component.WebHome]](% style="color:#e74c3c" %)map(%%)to specified values in one or more target[[Components>>doc:sdmx:Glossary.Component.WebHome]].For example, when mapping a partial[[Series Key>>doc:sdmx:Glossary.Serieskey.WebHome]]from a highly multidimensional cube (like Balance of Payments) to a single 'Indicator'[[Dimension>>doc:sdmx:Glossary.Dimension.WebHome]]in a target Data Structure.49 +N-n (pronounced 'N to N') mappings describe rules where a specified combination of values in multiple source Components map to specified values in one or more target Components. For example, when mapping a partial Series Key from a highly multidimensional cube (like Balance of Payments) to a single 'Indicator' Dimension in a target Data Structure. 52 52 53 53 Example: 54 54 55 -(% style="width:964.294px" %) 56 -|(% style="width:65px" %)Rule|(% style="width:519px" %)Source|(% style="width:378px" %)Target 57 -|(% style="width:65px" %)1|(% style="width:519px" %)((( 53 +|Rule|Source|Target 54 +|1|((( 58 58 If 59 -FREQUENCY=A; and 60 -ADJUSTMENT=N; and 61 -MATURITY=L. 62 -)))|(% style="width:378px" %)((( 56 + 57 +FREQUENCY=A; and ADJUSTMENT=N; and MATURITY=L. 58 +)))|((( 63 63 Set 60 + 64 64 INDICATOR=A_N_L 65 65 ))) 66 -| (% style="width:65px" %)2|(% style="width:519px" %)(((63 +|2|((( 67 67 If 68 -FREQUENCY=M; and 69 -ADJUSTMENT=S_A1; and 70 -MATURITY=TY12. 71 -)))|(% style="width:378px" %)((( 65 + 66 +FREQUENCY=M; and ADJUSTMENT=S_A1; and MATURITY=TY12. 67 +)))|((( 72 72 Set 69 + 73 73 INDICATOR=MON_SAX_12 74 74 ))) 75 75 76 76 N-n rules can also set values for multiple source Components. 77 77 78 -(% style="width:965.294px" %) 79 -|(% style="width:73px" %)Rule|(% style="width:506px" %)Source|(% style="width:383px" %)Target 80 -|(% style="width:73px" %)1|(% style="width:506px" %)((( 75 +|Rule|Source|Target 76 +|1|((( 81 81 If 78 + 82 82 FREQUENCY=A; and ADJUSTMENT=N; and MATURITY=L. 83 -)))|( % style="width:383px" %)(((80 +)))|((( 84 84 Set 82 + 85 85 INDICATOR=A_N_L, STATUS=QXR15, 84 + 86 86 NOTE="Unadjusted". 87 87 ))) 88 -| (% style="width:73px" %)2|(% style="width:506px" %)(((87 +|2|((( 89 89 If 89 + 90 90 FREQUENCY=M; and ADJUSTMENT=S_A1; and MATURITY=TY12. 91 -)))|( % style="width:383px" %)(((91 +)))|((( 92 92 Set 93 + 93 93 INDICATOR=MON_SAX_12, 95 + 94 94 STATUS=MPM12, 97 + 95 95 NOTE="Seasonally Adjusted" 96 96 ))) 97 97 ... ... @@ -101,90 +101,93 @@ 101 101 102 102 A simple example mapping a source dataset with a single dimension to one with multiple dimensions is shown below: 103 103 104 -(% style="width:972.294px" %) 105 -|(% style="width:257px" %)Source|(% style="width:315px" %)Target|(% style="width:397px" %)Output Series Key 106 -|(% style="width:257px" %)SERIES_CODE=XMAN_Z_21|(% style="width:315px" %)((( 107 +|Source|Target|Output Series Key 108 +|SERIES_CODE=XMAN_Z_21|((( 107 107 Dimensions 110 + 108 108 INDICATOR=XM 109 109 FREQ=A 110 110 ADJUSTMENT=N 111 111 Attributes 112 112 UNIT_MEASURE=_Z 116 + 113 113 COMP_ORG=21 114 -)))| (% style="width:397px" %)XM:A:N115 -|( % style="width:257px" %)(((118 +)))|XM:A:N 119 +|((( 116 116 SERIES_CODE=XMAN_Z_34 117 117 118 118 119 -)))|( % style="width:315px" %)(((123 +)))|((( 120 120 Dimensions 125 + 121 121 INDICATOR=XM 127 + 122 122 FREQ=A 129 + 123 123 ADJUSTMENT=N 131 + 124 124 Attributes 133 + 125 125 UNIT_MEASURE=_Z 135 + 126 126 COMP_ORG=34 127 -)))| (% style="width:397px" %)XM:A:N137 +)))|XM:A:N 128 128 129 -The above behaviour can be okay if the series XMAN_Z_21 contains observations for different periods of time then the series XMAN_Z_34. If however both series contain observations for the same point in time, the output for this mapping will be two observations with the same [[series key>>doc:sdmx:Glossary.Series key.WebHome]], for the same period in time.139 +The above behaviour can be okay if the series XMAN_Z_21 contains observations for different periods of time then the series XMAN_Z_34. If however both series contain observations for the same point in time, the output for this mapping will be two observations with the same series key, for the same period in time. 130 130 131 131 == 13.5 Representation maps == 132 132 133 - [[Representation>>doc:sdmx:Glossary.Representation.WebHome]](% style="color:#e74c3c" %)Maps(%%)replace the[[SDMX>>doc:sdmx:Glossary.Statisticaldata and metadata exchange.WebHome]]2.1 Codelist(% style="color:#e74c3c" %)Maps(%%)and are used describe explicit mappings between source and target[[Component>>doc:sdmx:Glossary.Component.WebHome]]values.143 +Representation Maps replace the SDMX 2.1 Codelist Maps and are used describe explicit mappings between source and target Component values. 134 134 135 -The source and target of a [[Representation>>doc:sdmx:Glossary.Representation.WebHome]](% style="color:#e74c3c" %)Map(%%)can reference any of the following:145 +The source and target of a Representation Map can reference any of the following: 136 136 137 137 1. Codelist 138 138 1. Free Text (restricted by type, e.g String, Integer, Boolean) 139 139 1. Valuelist 140 140 141 -A [[Representation>>doc:sdmx:Glossary.Representation.WebHome]](% style="color:#e74c3c" %)Map(%%)mapping ISO 2-character to ISO 3-character Codelists would take the following form:151 +A Representation Map mapping ISO 2-character to ISO 3-character Codelists would take the following form: 142 142 143 -(% style="width:356.294px" %) 144 -|(% style="width:167px" %)CL_ISO_ALPHA2|(% style="width:186px" %)CL_ISO_ALPHA3 145 -|(% style="width:167px" %)AF|(% style="width:186px" %)AFG 146 -|(% style="width:167px" %)AL|(% style="width:186px" %)ALB 147 -|(% style="width:167px" %)DZ|(% style="width:186px" %)DZA 148 -|(% style="width:167px" %)AS|(% style="width:186px" %)ASM 149 -|(% style="width:167px" %)AD|(% style="width:186px" %)AND 150 -|(% style="width:167px" %)etc…|(% style="width:186px" %) 153 +|CL_ISO_ALPHA2|CL_ISO_ALPHA3 154 +|AF|AFG 155 +|AL|ALB 156 +|DZ|DZA 157 +|AS|ASM 158 +|AD|AND 159 +|etc…| 151 151 152 -A [[Representation>>doc:sdmx:Glossary.Representation.WebHome]](% style="color:#e74c3c" %)Map(%%)mapping free text country names to an ISO 2-character Codelist could be similarly described:161 +A Representation Map mapping free text country names to an ISO 2-character Codelist could be similarly described: 153 153 154 -(% style="width:364.294px" %) 155 -|(% style="width:169px" %)Text|(% style="width:192px" %)CL_ISO_ALPHA2 156 -|(% style="width:169px" %)"Germany"|(% style="width:192px" %)DE 157 -|(% style="width:169px" %)"France"|(% style="width:192px" %)FR 158 -|(% style="width:169px" %)"United Kingdom"|(% style="width:192px" %)GB 159 -|(% style="width:169px" %)"Great Britain"|(% style="width:192px" %)GB 160 -|(% style="width:169px" %)"Ireland"|(% style="width:192px" %)IE 161 -|(% style="width:169px" %)"Eire"|(% style="width:192px" %)IE 162 -|(% style="width:169px" %)etc…|(% style="width:192px" %) 163 +|Text|CL_ISO_ALPHA2 164 +|"Germany"|DE 165 +|"France"|FR 166 +|"United Kingdom"|GB 167 +|"Great Britain"|GB 168 +|"Ireland"|IE 169 +|"Eire"|IE 170 +|etc…| 163 163 164 -Valuelists, introduced in [[SDMX>>doc:sdmx:Glossary.Statisticaldata and metadata exchange.WebHome]]3.0, are equivalent to Codelists but allow the maintenance of non-[[SDMX>>doc:sdmx:Glossary.Statisticaldata and metadata exchange.WebHome]]identifiers. Importantly, their IDs do not need to conform to IDType, but as a consequence are not Identifiable.172 +Valuelists, introduced in SDMX 3.0, are equivalent to Codelists but allow the maintenance of non-SDMX identifiers. Importantly, their IDs do not need to conform to IDType, but as a consequence are not Identifiable. 165 165 166 -When used in [[Representation>>doc:sdmx:Glossary.Representation.WebHome]](% style="color:#e74c3c" %)Maps(%%), Valuelists allow Non-[[SDMX>>doc:sdmx:Glossary.Statisticaldata and metadata exchange.WebHome]]identifiers containing characters like £, $, % to be(% style="color:#e74c3c" %)mapped(%%)to[[Code>>doc:sdmx:Glossary.Code.WebHome]]IDs, or[[Codes>>doc:sdmx:Glossary.Code.WebHome]](% style="color:#e74c3c" %)mapped(%%)to non-[[SDMX>>doc:sdmx:Glossary.Statisticaldata and metadata exchange.WebHome]]identifiers.174 +When used in Representation Maps, Valuelists allow Non-SDMX identifiers containing characters like £, $, % to be mapped to Code IDs, or Codes mapped to non-SDMX identifiers. 167 167 168 168 In common with Codelists, each item in a Valuelist has a multilingual name giving it a human-readable label and an optional description. For example: 169 169 170 -(% style="width:435.294px" %) 171 -|(% style="width:126px" %)Value|(% style="width:133px" %)Locale|(% style="width:173px" %)Name 172 -|(% style="width:126px" %)$|(% style="width:133px" %)en|(% style="width:173px" %)United States Dollar 173 -|(% style="width:126px" %)%|(% style="width:133px" %)En|(% style="width:173px" %)Percentage 174 -|(% style="width:126px" %) |(% style="width:133px" %)fr|(% style="width:173px" %)Pourcentage 178 +|Value|Locale|Name 179 +|$|en|United States Dollar 180 +|%|En|Percentage 181 +| |fr|Pourcentage 175 175 176 -Other characteristics of [[Representation>>doc:sdmx:Glossary.Representation.WebHome]](% style="color:#e74c3c" %)Maps(%%):183 +Other characteristics of Representation Maps: 177 177 178 -* Support the (% style="color:#e74c3c" %)mapping(%%) of multiple source [[Component>>doc:sdmx:Glossary.Component.WebHome]] values to multiple Target [[Component>>doc:sdmx:Glossary.Component.WebHome]] values as described in section 13.3 on n-to-n mappings; this covers also the case of (% style="color:#e74c3c" %)mapping(%%) an [[Attribute>>doc:sdmx:Glossary.Attribute.WebHome]] with an array [[representation>>doc:sdmx:Glossary.Representation.WebHome]] to (% style="color:#e74c3c" %)map(%%) combinations of values to a single target value; 179 -* Allow source or target mappings for an Item to be optional allowing rules such as 'A (% style="color:#e74c3c" %)maps(%%) to nothing' or 'nothing (% style="color:#e74c3c" %)maps(%%) to A'; and 180 -* Support for (% style="color:#e74c3c" %)mapping(%%) rules where regular expressions or substrings are used to match source [[Component>>doc:sdmx:Glossary.Component.WebHome]] values. Refer to section 13.6 for more on this topic. 185 +* Support the mapping of multiple source Component values to multiple Target Component values as described in section 13.3 on n-to-n mappings; this covers also the case of mapping an Attribute with an array representation to map combinations of values to a single target value; 186 +* Allow source or target mappings for an Item to be optional allowing rules such as 'A maps to nothing' or 'nothing maps to A'; and 187 +* Support for mapping rules where regular expressions or substrings are used to match source Component values. Refer to section 13.6 for more on this topic. 188 +*1. Regular expression and substring rules 181 181 182 - ==13.6Regular expressionandsubstringrules==190 +It is common for classifications to contain meanings within the identifier, for example the code Id 'XULADS' may refer to a particular seasonality because it starts with the letters XU. 183 183 184 - Itiscommonforclassificationstocontain meanings withintheidentifier, forexample the[[code>>doc:sdmx:Glossary.Code.WebHome]]Id'XULADS'may refer toaparticularseasonalitybecause it startswiththelettersXU.192 +With SDMX 2.1 each code that starts with XU had to be individually mapped to the same seasonality, and additional mappings added when new Codes were added to the Codelists. This led to many hundreds or thousands of mappings which can be more efficiently summarised in a single conceptual rule: 185 185 186 -With [[SDMX>>doc:sdmx:Glossary.Statistical data and metadata exchange.WebHome]] 2.1 each [[code>>doc:sdmx:Glossary.Code.WebHome]] that starts with XU had to be individually (% style="color:#e74c3c" %)mapped(%%) to the same seasonality, and additional mappings added when new [[Codes>>doc:sdmx:Glossary.Code.WebHome]] were added to the Codelists. This led to many hundreds or thousands of mappings which can be more efficiently summarised in a single conceptual rule: 187 - 188 188 //If starts with 'XU' map to 'Y'// 189 189 190 190 These rules are described using either regular expressions, or substrings for simpler use cases. ... ... @@ -274,7 +274,7 @@ 274 274 275 275 Date and time formats are specified by date and time pattern strings based on Java's Simple Date Format. Within date and time pattern strings, unquoted letters from 'A' to 'Z' and from 'a' to 'z' are interpreted as pattern letters representing the components of a date or time string. Text can be quoted using single quotes (') to avoid interpretation. "''" represents a single quote. All other characters are not interpreted; they're simply copied into the output string during formatting or matched against the input string during parsing. 276 276 277 -Due to the fact that dates may differ per locale, an optional property, defining the locale of the pattern, is provided. This would assist processing of source dates, according to the given locale {{footnote}}A list ofcommonly used localescan be foundintheJava supported locales: https://www.oracle.com/java/technologies/javase/jdk8-jre8-suported-locales.html{{/footnote}}. An indicative list of examples is presented in the following table:283 +Due to the fact that dates may differ per locale, an optional property, defining the locale of the pattern, is provided. This would assist processing of source dates, according to the given locale[[(% class="wikiinternallink" %)^^~[2~]^^>>path:#_ftn2]](%%). An indicative list of examples is presented in the following table: 278 278 279 279 |English (en)|Australia (AU)|en-AU 280 280 |English (en)|Canada (CA)|en-CA ... ... @@ -317,7 +317,7 @@ 317 317 318 318 |Letter|Date or Time Component|Presentation|Examples 319 319 |G|Era designator|[[Text>>url:https://docs.oracle.com/javase/7/docs/api/java/text/SimpleDateFormat.html#text]][[url:https://docs.oracle.com/javase/7/docs/api/java/text/SimpleDateFormat.html#text]]|AD 320 -|yy|Year short (upper case is Year of Week {{footnote}}yyyyrepresents thecalendar year while YYYY representsthe year of theweek, whichis only relevant for 53 weekyears{{/footnote}})|[[Year>>url:https://docs.oracle.com/javase/7/docs/api/java/text/SimpleDateFormat.html#year]][[url:https://docs.oracle.com/javase/7/docs/api/java/text/SimpleDateFormat.html#year]]|96326 +|yy|Year short (upper case is Year of Week[[(% class="wikiinternallink" %)^^~[3~]^^>>path:#_ftn3]](%%))|[[Year>>url:https://docs.oracle.com/javase/7/docs/api/java/text/SimpleDateFormat.html#year]][[url:https://docs.oracle.com/javase/7/docs/api/java/text/SimpleDateFormat.html#year]]|96 321 321 |yyyy|Year Full (upper case is Year of Week)|Year|1996 322 322 |MM|Month number in year starting with 1|Month|07 323 323 |MMM|Month name short|Month|Jul ... ... @@ -592,4 +592,10 @@ 592 592 593 593 ---- 594 594 595 -{{putFootnotes/}} 601 +[[~[1~]>>path:#_ftnref1]] Unidimensional datasets are those with a single 'indicator' or 'series code' dimension. 602 + 603 +[[~[2~]>>path:#_ftnref2]] A list of commonly used locales can be found in the Java supported locales: 604 + 605 +[[https:~~/~~/www.oracle.com/java/technologies/javase/jdk8>>url:https://www.oracle.com/java/technologies/javase/jdk8-jre8-suported-locales.html]][[->>url:https://www.oracle.com/java/technologies/javase/jdk8-jre8-suported-locales.html]][[jre8>>url:https://www.oracle.com/java/technologies/javase/jdk8-jre8-suported-locales.html]][[->>url:https://www.oracle.com/java/technologies/javase/jdk8-jre8-suported-locales.html]][[suported>>url:https://www.oracle.com/java/technologies/javase/jdk8-jre8-suported-locales.html]][[->>url:https://www.oracle.com/java/technologies/javase/jdk8-jre8-suported-locales.html]][[locales.html>>url:https://www.oracle.com/java/technologies/javase/jdk8-jre8-suported-locales.html]][[ >>url:https://www.oracle.com/java/technologies/javase/jdk8-jre8-suported-locales.html]] 606 + 607 +[[~[3~]>>path:#_ftnref3]] yyyy represents the calendar year while YYYY represents the year of the week, which is only relevant for 53 week years