Changes for page SDMX 2.1 Standards. Section 6. Technical Notes

Last modified by Artur on 2025/08/19 10:43

From 4.1 to 4.2 From 4.6 to 4.7

From version 4.2

edited by Helena
on 2025/05/21 21:28

Change comment: There is no comment for this version

To version 4.6

edited by Helena
on 2025/05/21 21:32

Change comment: There is no comment for this version

Raw
Rendered

Summary

Page properties (1 modified, 0 added, 0 removed)

Details

Page properties

Content

@@ -63,18 +63,12 @@
  The following section provides a brief overview of the differences between the various SDMX formats.
--Version 2.0 was characterised by 4 data messages, each with a distinct format: Generic, Compact, Cross-Sectional and Utility. Because of the design, data in some formats could not always be related to another format. In version 2.1, this issue has been addressed by merging some formats and eliminating others. As a result, in
++Version 2.0 was characterised by 4 data messages, each with a distinct format: Generic, Compact, Cross-Sectional and Utility. Because of the design, data in some formats could not always be related to another format. In version 2.1, this issue has been addressed by merging some formats and eliminating others. As a result, in SDMX 2.1 there are just two types of data formats: //GenericData// and //StructureSpecificData// (i.e. specific to one Data Structure Definition).
--SDMX 2.1 there are just two types of data formats: //GenericData// and
--
--//StructureSpecificData// (i.e. specific to one Data Structure Definition).
--
  Both of these formats are now flexible enough to allow for data to be oriented in series with any dimension used to disambiguate the observations (as opposed to only time or a cross sectional measure in version 2.0). The formats have also been expanded to allow for ungrouped observations.
--To allow for applications which only understand time series data, variations of these formats have been introduced in the form of two data messages;
++To allow for applications which only understand time series data, variations of these formats have been introduced in the form of two data messages; //GenericTimeSeriesData// and //StructureSpecificTimeSeriesData//. It is important to note that these variations are built on the same root structure and can be processed in the same manner as the base format so that they do NOT introduce additional processing requirements.
--//GenericTimeSeriesData// and //StructureSpecificTimeSeriesData//. It is important to note that these variations are built on the same root structure and can be processed in the same manner as the base format so that they do NOT introduce additional processing requirements.
--
  === //Structure Definition// ===
  The SDMX-ML Structure Message supports the use of annotations to the structure, which is not supported by the SDMX-EDI syntax.
@@ -83,10 +83,8 @@
  === //Validation// ===
--SDMX-EDI – as is typical of EDIFACT syntax messages – leaves validation to dedicated applications (“validation” being the checking of syntax, data typing, and adherence of the data message to the structure as described in the structural
++SDMX-EDI – as is typical of EDIFACT syntax messages – leaves validation to dedicated applications (“validation” being the checking of syntax, data typing, and adherence of the data message to the structure as described in the structural definition.)
--definition.)
--
  The SDMX-ML Generic Data Message also leaves validation above the XML syntax level to the application.
  The SDMX-ML DSD-specific messages will allow validation of XML syntax and datatyping to be performed with a generic XML parser, and enforce agreement between the structural definition and the data to a moderate degree with the same tool.
@@ -97,17 +97,13 @@
  === //Character Encodings// ===
--All SDMX-ML messages use the UTF-8 encoding, while SDMX-EDI uses the ISO 8879-1 character encoding. There is a greater capacity with UTF-8 to express some character sets (see the “APPENDIX: MAP OF ISO 8859-1 (UNOC) CHARACTER
++All SDMX-ML messages use the UTF-8 encoding, while SDMX-EDI uses the ISO 8879-1 character encoding. There is a greater capacity with UTF-8 to express some character sets (see the “APPENDIX: MAP OF ISO 8859-1 (UNOC) CHARACTER SET (LATIN 1 OR “WESTERN”) in the document “SYNTAX AND DOCUMENTATION VERSION 2.0”.) Many transformation tools are available which allow XML instances with UTF-8 encodings to be expressed as ISO 8879-1-encoded characters, and to transform UTF-8 into ISO 8879-1. Such tools should be used when transforming SDMX-ML messages into SDMX-EDI messages and vice-versa.
--SET      (LATIN     1     OR     “WESTERN”)     in     the     document     “SYNTAX     AND
--
--DOCUMENTATION VERSION 2.0”.) Many transformation tools are available which allow XML instances with UTF-8 encodings to be expressed as ISO 8879-1-encoded characters, and to transform UTF-8 into ISO 8879-1. Such tools should be used when transforming SDMX-ML messages into SDMX-EDI messages and vice-versa.
--
  === //Data Typing// ===
  The XML syntax and EDIFACT syntax have different data-typing mechanisms. The section below provides a set of conventions to be observed when support for messages in both syntaxes is required. For more information on the SDMX-ML representations of data, see below.
--==== 3.3.2        Data Types ====
++==== 3.3.2 Data Types ====
  The XML syntax has a very different mechanism for data-typing than the EDIFACT syntax, and this difference may create some difficulties for applications which support both EDIFACT-based and XML-based SDMX data formats. This section provides a set of conventions for the expression in data in all formats, to allow for clean interoperability between them.
@@ -123,7 +123,8 @@
 *. Maximum 70 characters.
 *. From ISO 8859-1 character set (including accented characters)
 . **Descriptions **are:
--1*. Maximum 350 characters;             From ISO 8859-1 character set.
++1*. Maximum 350 characters;
++1*. From ISO 8859-1 character set.
 . **Code values** are:
 *. Maximum 18 characters;
 *. Any of A..Z (upper case alphabetic), 0..9 (numeric), _ (underscore), / (solidus, slash), = (equal sign), - (hyphen);
@@ -132,21 +132,25 @@
  A..Z (upper case alphabetic), 0..9 (numeric), _ (underscore)
--1. **Observation values** are:
--1*. Decimal numerics (signed only if they are negative);
--1*. The maximum number of significant figures is:
--1*. 15 for a positive number
--1*. 14 for a positive decimal or a negative integer
--1*. 13 for a negative decimal
--1*. Scientific notation may be used.
--1. **Uncoded statistical concept** text values are:
--1*.
--1**. Maximum 1050 characters;
--1**. From ISO 8859-1 character set.
--1. **Time series keys**:
++**5. Observation values** are:
--In principle, the maximum permissible length of time series keys used in a data exchange does not need to be restricted. However, for working purposes, an effort is made to limit the maximum length to 35 characters; in this length, also (for SDMXEDI) one (separator) position is included between all successive dimension values; this means that the maximum length allowed for a pure series key (concatenation of dimension values) can be less than 35 characters.  The separator character is a colon (“:”) by conventional usage.
++* Decimal numerics (signed only if they are negative);
++* The maximum number of significant figures is:
++* 15 for a positive number
++* 14 for a positive decimal or a negative integer
++* 13 for a negative decimal
++* Scientific notation may be used.
++**6. Uncoded statistical concept** text values are:
++
++*
++** Maximum 1050 characters;
++** From ISO 8859-1 character set.
++
++**7. Time series keys**:
++
++In principle, the maximum permissible length of time series keys used in a data exchange does not need to be restricted. However, for working purposes, an effort is made to limit the maximum length to 35 characters; in this length, also (for SDMXEDI) one (separator) position is included between all successive dimension values; this means that the maximum length allowed for a pure series key (concatenation of dimension values) can be less than 35 characters. The separator character is a colon (“:”) by conventional usage.
++
  == 3.4 SDMX-ML and SDMX-EDI Best Practices ==
  === 3.4.1        Reporting and Dissemination Guidelines ===