Changes for page 12 Validation and Transformation Language (VTL)
Last modified by Artur on 2025/09/10 11:19
Summary
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
- Content
@@ -198,7 +198,7 @@
 
 The possible mapping options are described in more detail in the following sections.
 
-=== 12.3. 2Mapping from SDMX to VTL data structures ===
+=== 12.3.3 Mapping from SDMX to VTL data structures ===
 
 ==== 12.3.3.1 Basic Mapping ====
 
@@ -206,11 +206,12 @@
 
 When transforming **from SDMX to VTL**, this method consists in leaving the components unchanged and maintaining their names and roles, according to the following table:
 
-|**SDMX**|**VTL**
-|Dimension|(Simple) Identifier
-|TimeDimension|(Time) Identifier
-|Measure|Measure
-|DataAttribute|Attribute
+(% style="width:468.294px" %)
+|(% style="width:196px" %)**SDMX**|(% style="width:269px" %)**VTL**
+|(% style="width:196px" %)Dimension|(% style="width:269px" %)(Simple) Identifier
+|(% style="width:196px" %)TimeDimension|(% style="width:269px" %)(Time) Identifier
+|(% style="width:196px" %)Measure|(% style="width:269px" %)Measure
+|(% style="width:196px" %)DataAttribute|(% style="width:269px" %)Attribute
 
 The SDMX DataAttributes, in VTL they are all considered "at data point / observation level" (i.e. dependent on all the VTL Identifiers), because VTL does not have the SDMX AttributeRelationships, which defines the construct to which the DataAttribute is related (e.g. observation, dimension or set or group of dimensions, whole data set).
 
@@ -220,10 +220,8 @@
 
 An alternative mapping method from SDMX to VTL is the **Pivot **mapping, which makes sense and is different from the Basic method only for the SDMX data structures that contain a Dimension that plays the role of measure dimension (like in SDMX 2.1) and just one Measure. Through this method, these structures can be mapped to multimeasure VTL data structures.
 Besides that, a user may choose to use any Dimension acting as a list of Measures (e.g., a Dimension with indicators), either by considering the “Measure” role of a Dimension, or at will using any coded Dimension. Of course, in SDMX 3.0, this can only work when only one Measure is defined in the DSD.
 
-In SDMX 2.1 the MeasureDimension was a subclass of DimensionComponent like Dimension and TimeDimension. In the current SDMX version, this subclass does not exist anymore, however a Dimension can have the role of measure dimension (i.e. a Dimension that contributes to the identification of the measures). In SDMX 2.1 a DataStructure could have zero or one MeasureDimensions, in the current version of the standard, from zero to many Dimension may have the role of measure dimension. Hereinafter a Dimension that plays the role of measure dimension is referenced for simplicity as “MeasureDimension“, i.e. maintaining the capital letters and the courier font even if the MeasureDimension is not anymore a class in the SDMX Information Model of the current SDMX version. For the sake of simplicity, the description below considers just one Dimension having the role of MeasureDimension (i.e., the more simple and common case). Nevertheless, it maintains its validity also if in the DataStructure there are more dimension with the role of MeasureDimensions: in this case what is said about the MeasureDimension must be applied to the combination of all the
+In SDMX 2.1 the MeasureDimension was a subclass of DimensionComponent like Dimension and TimeDimension. In the current SDMX version, this subclass does not exist anymore, however a Dimension can have the role of measure dimension (i.e. a Dimension that contributes to the identification of the measures). In SDMX 2.1 a DataStructure could have zero or one MeasureDimensions, in the current version of the standard, from zero to many Dimension may have the role of measure dimension.
+Hereinafter a Dimension that plays the role of measure dimension is referenced for simplicity as “MeasureDimension“, i.e. maintaining the capital letters and the courier font even if the MeasureDimension is not anymore a class in the SDMX Information Model of the current SDMX version. For the sake of simplicity, the description below considers just one Dimension having the role of MeasureDimension (i.e., the more simple and common case). Nevertheless, it maintains its validity also if in the DataStructure there are more dimension with the role of MeasureDimensions: in this case what is said about the MeasureDimension must be applied to the combination of all the MeasureDimensions considered as a joint variable{{footnote}}E.g., if in the data structure there exist 3 Dimensions C,D,E having the role of MeasureDimension, they should be considered as a joint MeasureDimension Z=(C,D,E); therefore when the description says “each possible value Cj of the MeasureDimension …” it means “each possible combination of values (Cj, Dk, Ew) of the joint MeasureDimension Z=(C,D,E)”.{{/footnote}}.
 
-MeasureDimensions considered as a joint variable{{footnote}}E.g., if in the data structure there exist 3 Dimensions C,D,E having the role of MeasureDimension, they should be considered as a joint MeasureDimension Z=(C,D,E); therefore when the description says “each possible value Cj of the MeasureDimension …” it means “each possible combination of values (Cj, Dk, Ew) of the joint MeasureDimension Z=(C,D,E)”.{{/footnote}}.
-
 Among other things, the Pivot method provides also backward compatibility with the SDMX 2.1 data structures that contained a MeasureDimension.
 
 If applied to SDMX structures that do not contain any MeasureDimension, this method behaves like the Basic mapping (see the previous paragraph).
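Editorial illustration (not part of the changed page): the Basic mapping table in the hunks above can be sketched in a few lines of Python. This is only a minimal sketch; the `Component` tuple, the function name `basic_sdmx_to_vtl` and the sample component names are assumptions made for the example, while the role correspondence is copied from the table.

```python
from typing import NamedTuple


class Component(NamedTuple):
    """Hypothetical stand-in for a data structure component (name + role)."""
    name: str
    role: str


# Role correspondence copied from the Basic mapping table (SDMX -> VTL).
SDMX_TO_VTL_ROLE = {
    "Dimension": "(Simple) Identifier",
    "TimeDimension": "(Time) Identifier",
    "Measure": "Measure",
    "DataAttribute": "Attribute",
}


def basic_sdmx_to_vtl(components):
    """Keep every component name unchanged and translate only its role."""
    return [Component(c.name, SDMX_TO_VTL_ROLE[c.role]) for c in components]


# Sample SDMX structure; the component names are illustrative only.
sdmx_structure = [
    Component("COUNTRY", "Dimension"),
    Component("TIME_PERIOD", "TimeDimension"),
    Component("OBS_VALUE", "Measure"),
    Component("OBS_STATUS", "DataAttribute"),
]

vtl_structure = basic_sdmx_to_vtl(sdmx_structure)
for component in vtl_structure:
    print(component.name, "->", component.role)
```

The sketch deliberately ignores the SDMX AttributeRelationship: as the text states, every resulting VTL Attribute is treated as dependent on all the VTL Identifiers.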
@@ -236,16 +236,18 @@
 * The SDMX Measure is not mapped to VTL as well (it disappears in the VTL Data Structure);
 * An SDMX DataAttribute is mapped in different ways according to its AttributeRelationship:
 ** If, according to the SDMX AttributeRelationship, the values of the DataAttribute do not depend on the values of the MeasureDimension, the SDMX DataAttribute becomes a VTL Attribute having the same name. This happens if the AttributeRelationship is not specified (i.e. the DataAttribute does not depend on any DimensionComponent and therefore is at data set level), or if it refers to a set (or a group) of dimensions which does not include the MeasureDimension;
-** Otherwise, if, according to the SDMX AttributeRelationship, the values of the DataAttribute depend on the MeasureDimension, the SDMX DataAttribute is mapped to one VTL Attribute for each possible Code of the SDMX MeasureDimension. By default, the names of the VTL Attributes are obtained by concatenating the name of the SDMX DataAttribute and the names of the correspondent Code of the MeasureDimension separated by underscore. For example, if the SDMX DataAttribute is named DA and the possible Codes of the SDMX MeasureDimension are named C1, C2, …, Cn, then the corresponding VTL Attributes will be named DA_C1, DA_C2, …, DA_Cn (if different names are desired, they can be achieved afterwards by renaming the Attributes through VTL operators). o Like in the Basic mapping, the resulting VTL Attributes are considered as dependent on all the VTL identifiers (i.e. "at data point / observation level"), because VTL does not have the SDMX notion of Attribute Relationship.
+** Otherwise, if, according to the SDMX AttributeRelationship, the values of the DataAttribute depend on the MeasureDimension, the SDMX DataAttribute is mapped to one VTL Attribute for each possible Code of the SDMX MeasureDimension.
+By default, the names of the VTL Attributes are obtained by concatenating the name of the SDMX DataAttribute and the names of the correspondent Code of the MeasureDimension separated by underscore. For example, if the SDMX DataAttribute is named DA and the possible Codes of the SDMX MeasureDimension are named C1, C2, …, Cn, then the corresponding VTL Attributes will be named DA_C1, DA_C2, …, DA_Cn (if different names are desired, they can be achieved afterwards by renaming the Attributes through VTL operators).
+** Like in the Basic mapping, the resulting VTL Attributes are considered as dependent on all the VTL identifiers (i.e. "at data point / observation level"), because VTL does not have the SDMX notion of Attribute Relationship.
 
 The summary mapping table of the "pivot" mapping from SDMX to VTL for the SDMX data structures that contain a MeasureDimension is the following:
 
-|**SDMX**|**VTL**
-|Dimension|(Simple) Identifier
-|TimeDimension|(Time) Identifier
-|MeasureDimension & one Measure|One Measure for each Code of the SDMX MeasureDimension
-|DataAttribute not depending on the MeasureDimension|Attribute
-|DataAttribute depending on the MeasureDimension|(((
+(% style="width:739.294px" %)
+|(% style="width:335px" %)**SDMX**|(% style="width:400px" %)**VTL**
+|(% style="width:335px" %)Dimension|(% style="width:400px" %)(Simple) Identifier
+|(% style="width:335px" %)TimeDimension|(% style="width:400px" %)(Time) Identifier
+|(% style="width:335px" %)MeasureDimension & one Measure|(% style="width:400px" %)One Measure for each Code of the SDMX MeasureDimension
+|(% style="width:335px" %)DataAttribute not depending on the MeasureDimension|(% style="width:400px" %)Attribute
+|(% style="width:335px" %)DataAttribute depending on the MeasureDimension|(% style="width:400px" %)(((
 One Attribute for each Code of the
 SDMX MeasureDimension
 )))
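Editorial illustration (not part of the changed page): a minimal Python sketch of the Pivot mapping at structure level, following the summary table above and the default `DA_Cj` naming rule. The function name `pivot_structure`, its parameters and the sample names are hypothetical; naming each VTL Measure after the corresponding Code is how the page itself refers to "the VTL Measure Cj".

```python
def pivot_structure(components, measure_dimension, codes, dependent_attributes):
    """Sketch of the Pivot mapping for a structure with one MeasureDimension.

    components: list of (name, sdmx_role) pairs.
    measure_dimension: name of the Dimension playing the measure-dimension role.
    codes: Codes of that MeasureDimension.
    dependent_attributes: names of the DataAttributes whose
        AttributeRelationship includes the MeasureDimension.
    """
    vtl = []
    for name, role in components:
        if name == measure_dimension or role == "Measure":
            # Neither the MeasureDimension nor the single SDMX Measure
            # is carried over as such.
            continue
        if role == "Dimension":
            vtl.append((name, "(Simple) Identifier"))
        elif role == "TimeDimension":
            vtl.append((name, "(Time) Identifier"))
        elif role == "DataAttribute":
            if name in dependent_attributes:
                # One VTL Attribute per Code, named DA_Cj by default.
                vtl.extend((f"{name}_{code}", "Attribute") for code in codes)
            else:
                vtl.append((name, "Attribute"))
        else:
            raise ValueError(f"unexpected SDMX role: {role}")
    # One VTL Measure for each Code of the MeasureDimension.
    vtl.extend((code, "Measure") for code in codes)
    return vtl


# Sample structure; all names are illustrative only.
sdmx_components = [
    ("COUNTRY", "Dimension"),
    ("INDICATOR", "Dimension"),       # plays the measure-dimension role
    ("TIME_PERIOD", "TimeDimension"),
    ("OBS_VALUE", "Measure"),
    ("UNIT", "DataAttribute"),        # does not depend on INDICATOR
    ("OBS_STATUS", "DataAttribute"),  # depends on INDICATOR
]

pivoted = pivot_structure(
    sdmx_components, "INDICATOR", ["POP", "GDP"], {"OBS_STATUS"}
)
for name, role in pivoted:
    print(name, "->", role)
```

With several Dimensions in the measure-dimension role, `codes` would instead hold the combinations of the joint MeasureDimension; how such combinations are turned into names is not covered by this sketch.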
@@ -255,19 +255,14 @@
 At observation / data point level, calling Cj (j=1, … n) the j^^th^^ Code of the MeasureDimension:
 
 * The set of SDMX observations having the same values for all the Dimensions except than the MeasureDimension become one multi-measure VTL Data Point, having one Measure for each Code Cj of the SDMX MeasureDimension;
-* The values of the SDMX simple Dimensions, TimeDimension and DataAttributes not depending on the MeasureDimension (these components by definition have always the same values for all the observations of the set above) become the values of the corresponding VTL (simple)
-
-Identifiers, (time) Identifier and Attributes.
-
+* The values of the SDMX simple Dimensions, TimeDimension and DataAttributes not depending on the MeasureDimension (these components by definition have always the same values for all the observations of the set above) become the values of the corresponding VTL (simple) Identifiers, (time) Identifier and Attributes.
 * The value of the Measure of the SDMX observation belonging to the set above and having MeasureDimension=Cj becomes the value of the VTL Measure Cj
 * For the SDMX DataAttributes depending on the MeasureDimension, the value of the DataAttribute DA of the SDMX observation belonging to the set above and having MeasureDimension=Cj becomes the value of the VTL Attribute DA_Cj
 
 ==== 12.3.3.3 From SDMX DataAttributes to VTL Measures ====
 
-* In some cases, it may happen that the DataAttributes of the SDMX DataStructure need to be managed as Measures in VTL. Therefore, a variant of both the methods above consists in transforming all the SDMX DataAttributes in VTL Measures. When DataAttributes are converted to Measures, the two methods above are called Basic_A2M and Pivot_A2M (the suffix "A2M" stands for Attributes to Measures).
-Obviously, the resulting VTL data structure is, in general, multi-measure and does not contain
+* In some cases, it may happen that the DataAttributes of the SDMX DataStructure need to be managed as Measures in VTL. Therefore, a variant of both the methods above consists in transforming all the SDMX DataAttributes in VTL Measures. When DataAttributes are converted to Measures, the two methods above are called Basic_A2M and Pivot_A2M (the suffix "A2M" stands for Attributes to Measures). Obviously, the resulting VTL data structure is, in general, multi-measure and does not contain Attributes.
 
-Attributes.
-
 The Basic_A2M and Pivot_A2M behaves respectively like the Basic and Pivot methods, except that the final VTL components, which according to the Basic and Pivot methods would have had the role of Attribute, assume instead the role of Measure.
 
 Proper VTL features allow changing the role of specific attributes even after the SDMX to VTL mapping: they can be useful when only some of the DataAttributes need to be managed as VTL Measures.
@@ -284,11 +284,12 @@
 
 Mapping table:
 
-|**VTL**|**SDMX**
-|(Simple) Identifier|Dimension
-|(Time) Identifier|TimeDimension
-|Measure|Measure
-|Attribute|DataAttribute
+(% style="width:470.294px" %)
+|(% style="width:262px" %)**VTL**|(% style="width:205px" %)**SDMX**
+|(% style="width:262px" %)(Simple) Identifier|(% style="width:205px" %)Dimension
+|(% style="width:262px" %)(Time) Identifier|(% style="width:205px" %)TimeDimension
+|(% style="width:262px" %)Measure|(% style="width:205px" %)Measure
+|(% style="width:262px" %)Attribute|(% style="width:205px" %)DataAttribute
 
 If the distinction between simple identifier and time identifier is not maintained in the VTL environment, the classification between Dimension and TimeDimension exists only in SDMX, as declared in the relevant DataStructureDefinition.
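Editorial illustration (not part of the changed page): the VTL to SDMX mapping table above, together with the remark on time identifiers, can be sketched as follows. It is assumed here that the VTL environment does not distinguish simple and time identifiers, so the name of the TimeDimension is taken from the relevant DataStructureDefinition; the function name `basic_vtl_to_sdmx` and the parameter `dsd_time_dimension` are hypothetical.

```python
def basic_vtl_to_sdmx(vtl_components, dsd_time_dimension):
    """Sketch of the Basic VTL -> SDMX mapping.

    vtl_components: list of (name, vtl_role) pairs with roles
        "Identifier", "Measure" or "Attribute" (no simple/time distinction).
    dsd_time_dimension: name of the TimeDimension as declared in the
        relevant SDMX DataStructureDefinition.
    """
    role_map = {"Measure": "Measure", "Attribute": "DataAttribute"}
    result = []
    for name, role in vtl_components:
        if role == "Identifier":
            # The Dimension / TimeDimension split is known only on the SDMX side.
            sdmx_role = "TimeDimension" if name == dsd_time_dimension else "Dimension"
        else:
            sdmx_role = role_map[role]
        result.append((name, sdmx_role))
    return result


# Sample VTL structure; the component names are illustrative only.
vtl_components = [
    ("COUNTRY", "Identifier"),
    ("TIME_PERIOD", "Identifier"),
    ("OBS_VALUE", "Measure"),
    ("OBS_STATUS", "Attribute"),
]

sdmx_components = basic_vtl_to_sdmx(vtl_components, dsd_time_dimension="TIME_PERIOD")
for name, role in sdmx_components:
    print(name, "->", role)
```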
 
@@ -316,11 +316,12 @@
 
 The summary mapping table of the **unpivot** mapping method is the following:
 
-|**VTL**|**SDMX**
-|(Simple) Identifier|Dimension
-|(Time) Identifier|TimeDimension
-|All Measure Components|MeasureDimension (having one Code for each VTL measure component) & one Measure
-|Attribute|DataAttribute depending on all SDMX Dimensions including the TimeDimension and except the MeasureDimension
+(% style="width:638.294px" %)
+|(% style="width:200px" %)**VTL**|(% style="width:435px" %)**SDMX**
+|(% style="width:200px" %)(Simple) Identifier|(% style="width:435px" %)Dimension
+|(% style="width:200px" %)(Time) Identifier|(% style="width:435px" %)TimeDimension
+|(% style="width:200px" %)All Measure Components|(% style="width:435px" %)MeasureDimension (having one Code for each VTL measure component) & one Measure
+|(% style="width:200px" %)Attribute|(% style="width:435px" %)DataAttribute depending on all SDMX Dimensions including the TimeDimension and except the MeasureDimension
 
 At observation / data point level:
 
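Editorial illustration (not part of the changed page): a structure-level sketch of the unpivot mapping summarised in the table above. The names given to the generated MeasureDimension and to the single SDMX Measure (`MEASURE_DIM`, `OBS_VALUE`) are assumptions, as are the function name `unpivot_structure` and the shape of the returned dictionary; the page excerpt does not prescribe them.

```python
def unpivot_structure(vtl_components, measure_dimension="MEASURE_DIM", measure="OBS_VALUE"):
    """Sketch of the unpivot mapping (VTL -> SDMX) at structure level.

    vtl_components: list of (name, vtl_role) pairs.
    measure_dimension, measure: hypothetical names for the generated
        MeasureDimension and for the single SDMX Measure.
    """
    dimensions, codes, attributes = [], [], []
    for name, role in vtl_components:
        if role == "(Simple) Identifier":
            dimensions.append((name, "Dimension"))
        elif role == "(Time) Identifier":
            dimensions.append((name, "TimeDimension"))
        elif role == "Measure":
            codes.append(name)  # each VTL measure becomes one Code
        elif role == "Attribute":
            attributes.append(name)
        else:
            raise ValueError(f"unexpected VTL role: {role}")

    # Each DataAttribute depends on all Dimensions, including the
    # TimeDimension, but not on the generated MeasureDimension.
    related = [name for name, _ in dimensions]
    return {
        "components": dimensions
        + [(measure_dimension, "MeasureDimension"), (measure, "Measure")]
        + [(name, "DataAttribute") for name in attributes],
        "measure_dimension_codes": codes,
        "attribute_relationship": {name: list(related) for name in attributes},
    }


# Sample multi-measure VTL structure; all names are illustrative only.
vtl_structure = [
    ("COUNTRY", "(Simple) Identifier"),
    ("TIME_PERIOD", "(Time) Identifier"),
    ("POP", "Measure"),
    ("GDP", "Measure"),
    ("OBS_STATUS", "Attribute"),
]

unpivoted = unpivot_structure(vtl_structure)
print(unpivoted["components"])
print(unpivoted["measure_dimension_codes"])
print(unpivoted["attribute_relationship"])
```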
@@ -342,12 +342,13 @@
 
 The mapping table is the following:
 
-|VTL|SDMX
-|(Simple) Identifier|Dimension
-|(Time) Identifier|TimeDimension
-|Some Measures|Measure
-|Other Measures|DataAttribute
-|Attribute|DataAttribute
+(% style="width:467.294px" %)
+|(% style="width:214px" %)VTL|(% style="width:250px" %)SDMX
+|(% style="width:214px" %)(Simple) Identifier|(% style="width:250px" %)Dimension
+|(% style="width:214px" %)(Time) Identifier|(% style="width:250px" %)TimeDimension
+|(% style="width:214px" %)Some Measures|(% style="width:250px" %)Measure
+|(% style="width:214px" %)Other Measures|(% style="width:250px" %)DataAttribute
+|(% style="width:214px" %)Attribute|(% style="width:250px" %)DataAttribute
 
 Even in this case, the resulting SDMX definitions must be compliant with the SDMX consistency rules. For example, the SDMX DSD must have the attributeRelationship for the DataAttributes, which does not exist in VTL.
 
@@ -385,11 +385,11 @@
 
 Therefore, the generic name of this kind of VTL datasets would be:
 
-'DF(1.0.0)/INDICATORvalue.COUNTRYvalue'
+> 'DF(1.0.0)/INDICATORvalue.COUNTRYvalue'
 
 Where DF(1.0.0) is the Dataflow and //INDICATORvalue// and //COUNTRYvalue //are placeholders for one value of the INDICATOR and COUNTRY dimensions. Instead the specific name of one of these VTL datasets would be:
 
-‘DF(1.0.0)/POPULATION.USA’
+> ‘DF(1.0.0)/POPULATION.USA’
 
 In particular, this is the VTL dataset that contains all the observations of the Dataflow DF(1.0.0) for which //INDICATOR// = POPULATION and //COUNTRY// = USA.
 
@@ -403,13 +403,11 @@
 
 SDMX Dataflow having INDICATOR=//INDICATORvalue //and COUNTRY=// COUNTRYvalue//.
 For example, the VTL dataset ‘DF1(1.0.0)/POPULATION.USA’ would contain all the observations of DF1(1.0.0) having INDICATOR = POPULATION and COUNTRY = USA.
 
-In order to obtain the data structure of these VTL Data Sets from the SDMX one, it is assumed that the SDMX DimensionComponents on which the mapping is based are dropped, i.e. not maintained in the VTL data structure; this is possible because their values are fixed for each one of the invoked VTL Data Sets{{footnote}}If these DimensionComponents would not be dropped, the various VTL Data Sets resulting from this kind of mapping would have non-matching values for the Identifiers corresponding to the mapping Dimensions (e.g. POPULATION and COUNTRY). As a consequence, taking into account that the typical binary VTL operations at dataset level (+, -, *, / and so on) are executed on the observations having matching values for the identifiers, it would not be possible to compose the resulting VTL datasets one another (e.g. it would not be possible to calculate the population ratio between USA and CANADA).{{/footnote}}. After that, the mapping method from SDMX to VTL specified for the Dataflow DF1(1.0.0) is applied (i.e.
+In order to obtain the data structure of these VTL Data Sets from the SDMX one, it is assumed that the SDMX DimensionComponents on which the mapping is based are dropped, i.e. not maintained in the VTL data structure; this is possible because their values are fixed for each one of the invoked VTL Data Sets{{footnote}}If these DimensionComponents would not be dropped, the various VTL Data Sets resulting from this kind of mapping would have non-matching values for the Identifiers corresponding to the mapping Dimensions (e.g. POPULATION and COUNTRY).
+As a consequence, taking into account that the typical binary VTL operations at dataset level (+, -, *, / and so on) are executed on the observations having matching values for the identifiers, it would not be possible to compose the resulting VTL datasets one another (e.g. it would not be possible to calculate the population ratio between USA and CANADA).{{/footnote}}. After that, the mapping method from SDMX to VTL specified for the Dataflow DF1(1.0.0) is applied (i.e. basic, pivot …).
 
-basic, pivot …).
-
 In the example above, for all the datasets of the kind
 
-‘DF1(1.0.0)///INDICATORvalue//.//COUNTRYvalue//’, the dimensions INDICATOR and COUNTRY would be dropped so that the data structure of all the resulting VTL Data Sets would have the identifier TIME_PERIOD only.
+> ‘DF1(1.0.0)///INDICATORvalue//.//COUNTRYvalue//’, the dimensions INDICATOR and COUNTRY would be dropped so that the data structure of all the resulting VTL Data Sets would have the identifier TIME_PERIOD only.
 
 It should be noted that the desired VTL Data Sets (i.e. of the kind ‘DF1(1.0.0)/// INDICATORvalue//.//COUNTRYvalue//’) can be obtained also by applying the VTL operator “**sub**” (subspace) to the Dataflow DF1(1.0.0), like in the following VTL expression:
 