Changes for page 12 Validation and Transformation Language (VTL)
Last modified by Helena on 2025/09/10 11:19
Summary
-
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
-
- Content
-
... ... @@ -531,9 +531,9 @@ 531 531 532 532 In other words, starting from the datasets explicitly calculated through VTL (in the example ‘DF2(1.0)/GDPPERCAPITA.USA’ and so on), the first step consists in calculating other (non-persistent) VTL datasets (in the example 533 533 534 -DF2bis_GDPPERCAPITA_USA and so on) by adding the identifiers INDICATOR and COUNTRY with the desired values (//INDICATORvalue// and //COUNTRYvalue)//. Finally, all these non-persistent Data Sets are united and give the final result DF2(1.0) ^^[[(% class="wikiinternallink wikiinternallink wikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink" %)^^37^^>>path:#sdfootnote37sym||name="sdfootnote37anc"]](%%)^^, which can be mapped one-to-one to the homonymous SDMX Dataflow having the dimension components TIME_PERIOD, INDICATOR and COUNTRY.534 +DF2bis_GDPPERCAPITA_USA and so on) by adding the identifiers INDICATOR and COUNTRY with the desired values (//INDICATORvalue// and //COUNTRYvalue)//. Finally, all these non-persistent Data Sets are united and give the final result DF2(1.0){{footnote}}The result is persistent in this example but it can be also non persistent if needed.{{/footnote}}, which can be mapped one-to-one to the homonymous SDMX Dataflow having the dimension components TIME_PERIOD, INDICATOR and COUNTRY. 535 535 536 -Therefore, mapping different VTL datasets having the same data structure to different parts of a SDMX Dataflow, i.e. in the direction from VTL to SDMX, through the ordered concatenation notation is equivalent to a proper use of the operators “calc” and “union” on such datasets. ^^[[(% class="wikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallink"%)^^38^^>>path:#sdfootnote38sym||name="sdfootnote38anc"]](%%)[[(%class="wikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallink"%)^^39^^>>path:#sdfootnote39sym||name="sdfootnote39anc"]](%%)^^536 +Therefore, mapping different VTL datasets having the same data structure to different parts of a SDMX Dataflow, i.e. in the direction from VTL to SDMX, through the ordered concatenation notation is equivalent to a proper use of the operators “calc” and “union” on such datasets.{{footnote}}In case the ordered concatenation notation from VTL to SDMX is used, the set of Transformations described above is implicitly performed; therefore, in order to test the overall compliance of the VTL program to the VTL consistency rules, these implicit Transformations have to be considered as part of the VTL program even if they are not explicitly coded.{{/footnote}} 537 537 538 538 It is worth noting that in the direction from VTL to SDMX it is mandatory to specify the value for every Dimension on which the mapping is based (in other word, in the name of the calculated VTL dataset is __not__ possible to omit the value of some of the Dimensions). 539 539 ... ... @@ -541,52 +541,51 @@ 541 541 542 542 With reference to the VTL “model for Variables and Value domains”, the following additional mappings have to be considered: 543 543 544 -|VTL|SDMX 545 -|**Data Set Component**|Although this abstraction exists in SDMX, it does not have an explicit definition and correspond to a Component (either a DimensionComponent or a Measure or a DataAttribute) belonging to one specific Dataflow^^43^^ 546 -|**Represented Variable**|((( 544 +(% style="width:1170.29px" %) 545 +|**VTL**|(% style="width:754px" %)**SDMX** 546 +|**Data Set Component**|(% style="width:754px" %)Although this abstraction exists in SDMX, it does not have an explicit definition and correspond to a Component (either a DimensionComponent or a Measure or a DataAttribute) belonging to one specific Dataflow{{footnote}}Through SDMX Constraints, it is possible to specify the values that a Component of a Dataflow can assume.{{/footnote}} 547 +|**Represented Variable**|(% style="width:754px" %)((( 547 547 **Concept** with a definite 548 548 549 549 Representation 550 550 ))) 551 -|**Value Domain**|((( 552 +|**Value Domain**|(% style="width:754px" %)((( 552 552 **Representation** (see the Structure 553 553 554 554 Pattern in the Base Package) 555 555 ))) 556 -|**Enumerated Value Domain / Code List**|**Codelist** 557 -|**Code**|((( 557 +|**Enumerated Value Domain / Code List**|(% style="width:754px" %)**Codelist** 558 +|**Code**|(% style="width:754px" %)((( 558 558 **Code** (for enumerated 559 559 560 560 DimensionComponent, Measure, DataAttribute) 561 561 ))) 562 -|**Described Value Domain**|((( 563 -non-enumerated** Representation**563 +|**Described Value Domain**|(% style="width:754px" %)((( 564 +non-enumerated** Representation** 564 564 565 565 (having Facets / ExtendedFacets, see the Structure Pattern in the Base Package) 566 566 ))) 567 -|**Value**|Although this abstraction exists in SDMX, it does not have an explicit definition and correspond to a **Code** of a Codelist (for enumerated Representations) or 568 -| |((( 569 -to a valid **value **(for non-enumerated** ** 570 - 571 -Representations) 568 +|**Value**|(% style="width:754px" %)Although this abstraction exists in SDMX, it does not have an explicit definition and correspond to a **Code** of a Codelist (for enumerated Representations) or 569 +| |(% style="width:754px" %)((( 570 +to a valid **value **(for non-enumerated** **Representations) 572 572 ))) 573 -|**Value Domain Subset / Set**|This abstraction does not exist in SDMX 574 -|**Enumerated Value Domain Subset / Enumerated Set**|This abstraction does not exist in SDMX 575 -|**Described Value Domain Subset / Described Set**|This abstraction does not exist in SDMX 576 -|**Set list**|This abstraction does not exist in SDMX 572 +|**Value Domain Subset / Set**|(% style="width:754px" %)This abstraction does not exist in SDMX 573 +|**Enumerated Value Domain Subset / Enumerated Set**|(% style="width:754px" %)This abstraction does not exist in SDMX 574 +|**Described Value Domain Subset / Described Set**|(% style="width:754px" %)This abstraction does not exist in SDMX 575 +|**Set list**|(% style="width:754px" %)This abstraction does not exist in SDMX 577 577 578 578 The main difference between VTL and SDMX relies on the fact that the VTL artefacts for defining subsets of Value Domains do not exist in SDMX, therefore the VTL features for referring to predefined subsets are not available in SDMX. These artefacts are the Value Domain Subset (or Set), either enumerated or described, the Set List (list of values belonging to enumerated subsets) and the Data Set Component (aimed at defining the set of values that the Component of a Data Set can take, possibly a subset of the codes of Value Domain). 579 579 580 -Another difference consists in the fact that all Value Domains are considered as identifiable objects in VTL either if enumerated or not, while in SDMX the Codelist (corresponding to a VTL enumerated Value Domain) is identifiable, while the SDMX non-enumerated Representation (corresponding to a VTL non-enumerated Value Domain) is not identifiable. As a consequence, the definition of the VTL Rulesets, which in VTL can refer either to enumerated or non-enumerated value domains, in SDMX can refer only to enumerated Value Domains (i.e. to SDMX Codelists). As for the mapping between VTL variables and SDMX Concepts, it should be noted that these artefacts do not coincide perfectly. In fact, the VTL variables are represented variables, defined always on the same Value Domain (“Representation” in SDMX) independently of the data set / data structure in which they appear ^^[[(% class="wikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallinkwikiinternallink wikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallink wikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallink"%)^^40^^>>path:#sdfootnote40sym||name="sdfootnote40anc"]](%%)^^, while the SDMX Concepts can have different Representations in different DataStructures.^^[[(% class="wikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallink wikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink wikiinternallink wikiinternallink wikiinternallinkwikiinternallinkwikiinternallink wikiinternallinkwikiinternallinkwikiinternallinkwikiinternallinkwikiinternallink" %)^^41^^>>path:#sdfootnote41sym||name="sdfootnote41anc"]](%%)^^This means that one SDMX Concept can correspond to many VTL Variables, one for each representation the Concept has.579 +Another difference consists in the fact that all Value Domains are considered as identifiable objects in VTL either if enumerated or not, while in SDMX the Codelist (corresponding to a VTL enumerated Value Domain) is identifiable, while the SDMX non-enumerated Representation (corresponding to a VTL non-enumerated Value Domain) is not identifiable. As a consequence, the definition of the VTL Rulesets, which in VTL can refer either to enumerated or non-enumerated value domains, in SDMX can refer only to enumerated Value Domains (i.e. to SDMX Codelists). As for the mapping between VTL variables and SDMX Concepts, it should be noted that these artefacts do not coincide perfectly. In fact, the VTL variables are represented variables, defined always on the same Value Domain (“Representation” in SDMX) independently of the data set / data structure in which they appear{{footnote}}By using represented variables, VTL can assume that data structures having the same variables as identifiers can be composed one another because the correspondent values can match.{{/footnote}}, while the SDMX Concepts can have different Representations in different DataStructures.{{footnote}}A Concept becomes a Component in a DataStructureDefinition, and Components can have different LocalRepresentations in different DataStructureDefinitions, also overriding the (possible) base representation of the Concept.{{/footnote}} This means that one SDMX Concept can correspond to many VTL Variables, one for each representation the Concept has. 581 581 582 582 Therefore, it is important to be aware that some VTL operations (for example the binary operations at data set level) are consistent only if the components having the same names in the operated VTL Data Sets have also the same representation (i.e. the same Value Domain as for VTL). For example, it is possible to obtain correct results from the VTL expression 583 583 584 -DS_c := DS_a + DS_b (where DS_a, DS_b, DS_c are VTL Data Sets) if the matching components in DS_a and DS_b (e.g. ref_date, geo_area, sector …) refer to the same general representation. In simpler words, DS_a and DS_b must use the same values/codes (for ref_date, geo_area, sector … ), otherwise the relevant values would not match and the result of the operation would be wrong.583 +DS_c := DS_a + DS_b (where DS_a, DS_b, DS_c are VTL Data Sets) 585 585 585 +if the matching components in DS_a and DS_b (e.g. ref_date, geo_area, sector …) refer to the same general representation. In simpler words, DS_a and DS_b must use the same values/codes (for ref_date, geo_area, sector … ), otherwise the relevant values would not match and the result of the operation would be wrong. 586 + 586 586 As mentioned, the property above is not enforced by construction in SDMX, and different representations of the same Concept can be not compatible one another (for example, it may happen that geo_area is represented by ISO-alpha-3 codes in DS_a and by ISO alpha-2 codes in DS_b). Therefore, it will be up to the definer of VTL 587 587 588 -[[image:SDMX 3-0-0 SECTION 6 FINAL-1.0_en_59eee18f.gif||alt="Shape5" height="1" width="192"]] 589 - 590 590 Transformations to ensure that the VTL expressions are consistent with the actual representations of the correspondent SDMX Concepts. 591 591 592 592 It remains up to the SDMX-VTL definer also the assurance of the consistency between a VTL Ruleset defined on Variables and the SDMX Components on which the Ruleset is applied. In fact, a VTL Ruleset is expressed by means of the values of the Variables (i.e. SDMX Concepts), i.e. assuming definite representations for them (e.g. ISOalpha-3 for country). If the Ruleset is applied to SDMX Components that have the same name of the Concept they refer to but different representations (e.g. ISO-alpha-2 for country), the Ruleset cannot work properly. ... ... @@ -601,7 +601,8 @@ 601 601 602 602 [[image:SDMX 3-0-0 SECTION 6 FINAL-1.0_en_e3df33ae.png||height="543" width="483"]] 603 603 604 -==== Figure 22 – VTL Data Types ==== 603 +(% class="wikigeneratedid" id="HFigure222013VTLDataTypes" %) 604 +**Figure 22 – VTL Data Types** 605 605 606 606 The VTL scalar types are in turn subdivided in basic scalar types, which are elementary (not defined in term of other data types) and Value Domain and Set scalar types, which are defined in terms of the basic scalar types. 607 607 ... ... @@ -608,131 +608,12 @@ 608 608 The VTL basic scalar types are listed below and follow a hierarchical structure in terms of supersets/subsets (e.g. "scalar" is the superset of all the basic scalar types): 609 609 610 610 611 +**Figure 23 – VTL Basic Scalar Types** 611 611 612 612 ((( 613 -//n// 614 - 615 -//a// 616 - 617 -//e// 618 - 619 -//l// 620 - 621 -//o// 622 - 623 -//o// 624 - 625 -//B// 626 - 627 -//n// 628 - 629 -//o// 630 - 631 -//i// 632 - 633 -//t// 634 - 635 -//a// 636 - 637 -//r// 638 - 639 -//u// 640 - 641 -//D// 642 - 643 -//d// 644 - 645 -//o// 646 - 647 -//i// 648 - 649 -//r// 650 - 651 -//e// 652 - 653 -//p// 654 - 655 -//_// 656 - 657 -//e// 658 - 659 -//m// 660 - 661 -//i// 662 - 663 -//T// 664 - 665 -//e// 666 - 667 -//t// 668 - 669 -//a// 670 - 671 -//D// 672 - 673 -//e// 674 - 675 -//m// 676 - 677 -//i// 678 - 679 -//T// 680 - 681 -//r// 682 - 683 -//e// 684 - 685 -//g// 686 - 687 -//e// 688 - 689 -//t// 690 - 691 -//n// 692 - 693 -//I// 694 - 695 -//r// 696 - 697 -//e// 698 - 699 -//b// 700 - 701 -//m// 702 - 703 -//u// 704 - 705 -//N// 706 - 707 -//g// 708 - 709 -//n// 710 - 711 -//i// 712 - 713 -//r// 714 - 715 -//t// 716 - 717 -//S// 718 - 719 -//r// 720 - 721 -//a// 722 - 723 -//l// 724 - 725 -//a// 726 - 727 -//c// 728 - 729 -//S// 730 - 731 -[[image:SDMX 3-0-0 SECTION 6 FINAL-1.0_en_82d45833.gif||alt="Shape6" height="231" width="184"]] 614 + 732 732 ))) 733 733 734 -==== Figure 23 – VTL Basic Scalar Types ==== 735 - 736 736 === 12.4.2 VTL basic scalar types and SDMX data types === 737 737 738 738 The VTL assumes that a basic scalar type has a unique internal representation and can have more external representations. ... ... @@ -1046,7 +1046,7 @@ 1046 1046 |N|fixed number of digits used in the preceding textual representation of the month or the day 1047 1047 | | 1048 1048 1049 -The default conversion, either standard or customized, can be used to deduce automatically the representation of the components of the result of a VTL Transformation. In alternative, the representation of the resulting SDMX Dataflow can be given explicitly by providing its DataStructureDefinition. In other words, the representation specified in the DSD, if available, overrides any default conversion^^[[(% class="wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink" %)^^42^^>>path:#sdfootnote42sym||name="sdfootnote42anc"]](%%)^^. 930 +The default conversion, either standard or customized, can be used to deduce automatically the representation of the components of the result of a VTL Transformation. In alternative, the representation of the resulting SDMX Dataflow can be given explicitly by providing its DataStructureDefinition. In other words, the representation specified in the DSD, if available, overrides any default conversion^^[[(% class="wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink wikiinternallink" %)^^42^^>>path:#sdfootnote42sym||name="sdfootnote42anc"]](%%)^^. 1050 1050 1051 1051 === 12.4.5 Null Values === 1052 1052