Changes for page 12 Validation and Transformation Language (VTL)

Last modified by Artur K. on 2026/05/29 14:29

From 6.3 to 6.4 From 6.9 to 6.10

From version 6.4

edited by Helena K.
on 2025/05/16 12:31

Change comment: There is no comment for this version

To version 6.9

edited by Helena K.
on 2025/05/16 12:37

Change comment: There is no comment for this version

Raw
Rendered

Summary

Page properties (1 modified, 0 added, 0 removed)
Attachments (0 modified, 6 added, 0 removed)

Details

Page properties

Content

@@ -348,7 +348,7 @@
  The mapping table is the following:
  (% style="width:689.294px" %)
--|(% style="width:344px" %)VTL|(% style="width:341px" %)SDMX
++|(% style="width:344px" %)**VTL**|(% style="width:341px" %)**SDMX**
  |(% style="width:344px" %)(Simple) Identifier|(% style="width:341px" %)Dimension
  |(% style="width:344px" %)(Time) Identifier|(% style="width:341px" %)TimeDimension
  |(% style="width:344px" %)Some Measures|(% style="width:341px" %)Measure
@@ -408,26 +408,14 @@
  SDMX Dataflow having INDICATOR=//INDICATORvalue //and COUNTRY=// COUNTRYvalue//. For example, the VTL dataset ‘DF1(1.0.0)/POPULATION.USA’ would contain all the observations of DF1(1.0.0) having INDICATOR = POPULATION and COUNTRY = USA.
--In order to obtain the data structure of these VTL Data Sets from the SDMX one, it is assumed that the SDMX DimensionComponents on which the mapping is based are dropped, i.e. not maintained in the VTL data structure; this is possible because their values are fixed for each one of the invoked VTL Data Sets{{footnote}}If these DimensionComponents would not be dropped, the various VTL Data Sets resulting from this kind of mapping would have non-matching values for the Identifiers corresponding to the mapping Dimensions (e.g. POPULATION and COUNTRY). As a consequence, taking into account that the typical binary VTL operations at dataset level (+, -, *, / and so on) are executed on the observations having matching values for the identifiers, it would not be possible to compose the resulting VTL datasets one another (e.g. it would not be possible to calculate the population ratio between USA and CANADA).{{/footnote}}. After that, the mapping method from SDMX to VTL specified for the Dataflow DF1(1.0.0) is applied (i.e.
++In order to obtain the data structure of these VTL Data Sets from the SDMX one, it is assumed that the SDMX DimensionComponents on which the mapping is based are dropped, i.e. not maintained in the VTL data structure; this is possible because their values are fixed for each one of the invoked VTL Data Sets{{footnote}}If these DimensionComponents would not be dropped, the various VTL Data Sets resulting from this kind of mapping would have non-matching values for the Identifiers corresponding to the mapping Dimensions (e.g. POPULATION and COUNTRY). As a consequence, taking into account that the typical binary VTL operations at dataset level (+, -, *, / and so on) are executed on the observations having matching values for the identifiers, it would not be possible to compose the resulting VTL datasets one another (e.g. it would not be possible to calculate the population ratio between USA and CANADA).{{/footnote}}. After that, the mapping method from SDMX to VTL specified for the Dataflow DF1(1.0.0) is applied (i.e. basic, pivot …).
--basic, pivot …).
++In the example above, for all the datasets of the kind ‘DF1(1.0.0)///INDICATORvalue//.//COUNTRYvalue//’, the dimensions INDICATOR and COUNTRY would be dropped so that the data structure of all the resulting VTL Data Sets would have the identifier TIME_PERIOD only.
--In the example above, for all the datasets of the kind
--
--‘DF1(1.0.0)///INDICATORvalue//.//COUNTRYvalue//’, the dimensions INDICATOR and COUNTRY would be dropped so that the data structure of all the resulting VTL Data Sets would have the identifier TIME_PERIOD only.
--
  It should be noted that the desired VTL Data Sets (i.e. of the kind ‘DF1(1.0.0)/// INDICATORvalue//.//COUNTRYvalue//’) can be obtained also by applying the VTL operator “**sub**” (subspace) to the Dataflow DF1(1.0.0), like in the following VTL expression:
--‘DF1(1.0.0)/POPULATION.USA’ :=
++[[image:1747388275998-621.png]]
--DF1(1.0.0) [ sub INDICATOR=“POPULATION”, COUNTRY=“USA” ];
--
--‘DF1(1.0.0)/POPULATION.CANADA’ :=
--
--DF1(1.0.0) [ sub INDICATOR=“POPULATION”, COUNTRY=“CANADA” ];
--
--… … …
--
  In fact the VTL operator “sub” has exactly the same behaviour. Therefore, mapping different parts of a SDMX Dataflow to different VTL Data Sets in the direction from SDMX to VTL through the ordered concatenation notation is equivalent to a proper use of the operator “**sub**” on such a Dataflow.{{footnote}}In case the ordered concatenation notation is used, the VTL Transformation described above, e.g. ‘DF1(1.0)/POPULATION.USA’ := DF1(1.0) [ sub INDICATOR=“POPULATION”, COUNTRY=“USA”], is implicitly executed. In order to test the overall compliance of the VTL program to the VTL consistency rules, it has to be considered as part of the VTL program even if it is not explicitly coded.{{/footnote}}
  In the direction from SDMX to VTL it is allowed to omit the value of one or more
@@ -438,10 +438,8 @@
  This is equivalent to the application of the VTL “sub” operator only to the identifier //INDICATOR//:
--‘DF1(1.0.0)/POPULATION.’ :=
++[[image:1747388244829-693.png]]
--DF1(1.0.0) [ sub INDICATOR=“POPULATION” ];
--
  Therefore the VTL Data Set ‘DF1(1.0.0)/POPULATION.’ would have the identifiers COUNTRY and TIME_PERIOD.
  Heterogeneous invocations of the same Dataflow are allowed, i.e. omitting different
@@ -467,54 +467,18 @@
  Some examples follow, for some specific values of INDICATOR and COUNTRY:
--‘DF2(1.0.0)/GDPPERCAPITA.USA’ <- expression11; ‘DF2(1.0.0)/GDPPERCAPITA.CANADA’ <- expression12;
++[[image:1747388222879-916.png]]
--… … …
++[[image:1747388206717-256.png]]
--‘DF2(1.0.0)/POPGROWTH.USA’ <- expression21;
--
--‘DF2(1.0.0)/POPGROWTH.CANADA’ <- expression22;
--
--… … …
--
  As said, it is assumed that these VTL derived Data Sets have the TIME_PERIOD as the only identifier. In the mapping from VTL to SMDX, the Dimensions INDICATOR and COUNTRY are added to the VTL data structure on order to obtain the SDMX one, with the following values respectively:
--VTL dataset INDICATOR value COUNTRY value
++[[image:1747388148322-387.png]]
--‘DF2(1.0.0)/GDPPERCAPITA.USA’ GDPPERCAPITA USA
--
--‘DF2(1.0.0)/GDPPERCAPITA.CANADA’ GDPPERCAPITA CANADA … … …
--
--‘DF2(1.0.0)/POPGROWTH.USA’ POPGROWTH USA
--
--‘DF2(1.0.0)/POPGROWTH.CANADA’ POPGROWTH CANADA
--
--… … …
--
  It should be noted that the application of this many-to-one mapping from VTL to SDMX is equivalent to an appropriate sequence of VTL Transformations. These use the VTL operator “calc” to add the proper VTL identifiers (in the example, INDICATOR and COUNTRY) and to assign to them the proper values and the operator “union” in order to obtain the final VTL dataset (in the example DF2(1.0.0)), that can be mapped oneto-one to the homonymous SDMX Dataflow. Following the same example, these VTL Transformations would be:
--DF2bis_GDPPERCAPITA_USA := ‘DF2(1.0.0)/GDPPERCAPITA.USA’ [calc identifier INDICATOR := ”GDPPERCAPITA”, identifier COUNTRY := ”USA”];
++[[image:1747388179021-814.png]]
--DF2bis_GDPPERCAPITA_CANADA := ‘DF2(1.0.0)/GDPPERCAPITA.CANADA’ [calc identifier INDICATOR:=”GDPPERCAPITA”, identifier COUNTRY:=”CANADA”]; … … …
--
--DF2bis_POPGROWTH_USA := ‘DF2(1.0.0)/POPGROWTH.USA’
--
--[calc identifier INDICATOR := ”POPGROWTH”, identifier COUNTRY := ”USA”];
--
--DF2bis_POPGROWTH_CANADA’ := ‘DF2(1.0.0)/POPGROWTH.CANADA’ [calc identifier INDICATOR := ”POPGROWTH”, identifier COUNTRY := ”CANADA”]; … … …
--
--DF2(1.0) <- UNION (DF2bis_GDPPERCAPITA_USA’,
--
--DF2bis_GDPPERCAPITA_CANADA’,
--
--… ,
--
--DF2bis_POPGROWTH_USA’,
--
--DF2bis_POPGROWTH_CANADA’
--
--…);
--
  In other words, starting from the datasets explicitly calculated through VTL (in the example ‘DF2(1.0)/GDPPERCAPITA.USA’ and so on), the first step consists in calculating other (non-persistent) VTL datasets (in the example
  DF2bis_GDPPERCAPITA_USA and so on) by adding the identifiers INDICATOR and COUNTRY with the desired values (//INDICATORvalue// and //COUNTRYvalue)//. Finally, all these non-persistent Data Sets are united and give the final result DF2(1.0){{footnote}}The result is persistent in this example but it can be also non persistent if needed.{{/footnote}}, which can be mapped one-to-one to the homonymous SDMX Dataflow having the dimension components TIME_PERIOD, INDICATOR and COUNTRY.

1747388148322-387.png

Author

...	...	@@ -1,0 +1,1 @@
	1	+XWiki.helena

Size

...	...	@@ -1,0 +1,1 @@
	1	+23.0 KB

Content

1747388179021-814.png

Author

...	...	@@ -1,0 +1,1 @@
	1	+XWiki.helena

Size

...	...	@@ -1,0 +1,1 @@
	1	+66.2 KB

Content

1747388206717-256.png

Author

...	...	@@ -1,0 +1,1 @@
	1	+XWiki.helena

Size

...	...	@@ -1,0 +1,1 @@
	1	+10.3 KB

Content

1747388222879-916.png

Author

...	...	@@ -1,0 +1,1 @@
	1	+XWiki.helena

Size

...	...	@@ -1,0 +1,1 @@
	1	+9.9 KB

Content

1747388244829-693.png

Author

...	...	@@ -1,0 +1,1 @@
	1	+XWiki.helena

Size

...	...	@@ -1,0 +1,1 @@
	1	+7.4 KB

Content

1747388275998-621.png

Author

...	...	@@ -1,0 +1,1 @@
	1	+XWiki.helena

Size

...	...	@@ -1,0 +1,1 @@
	1	+19.2 KB

Content