Changes for page SDMX 3.1 Standards. Section 1. Framework
Last modified by Artur on 2025/09/30 12:30
Summary
-
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
-
- Content
-
... ... @@ -8,13 +8,13 @@ 8 8 |DRAFT 1.0|December 2024|Draft release updated for SDMX 3.1 for public consultation 9 9 |1.0|May 2025|Public release for SDMX 3.1 10 10 11 -= {{id name="_Toc56630"/}}1 Introduction =11 += 1 Introduction = 12 12 13 13 The Statistical Data and Metadata Exchange (SDMX) initiative (https:~/~/www.sdmx.org) sets standards that can facilitate the exchange of statistical data and metadata using modern information technology. 14 14 15 15 The SDMX Technical Specifications are organised into several discrete sections. 16 16 17 -The following are published on the SDMX website ([[__https:~~/~~/www.sdmx.org__>> url:https://www.sdmx.org/]][[)>>url:https://www.sdmx.org/]]:17 +The following are published on the SDMX website ([[__https:~~/~~/www.sdmx.org__>>https://https:www.sdmx.org]]): 18 18 19 19 **Section 1** **Framework for SDMX Technical Standards** – this document providing an introduction to the technical standards. 20 20 ... ... @@ -50,7 +50,7 @@ 50 50 51 51 In July 2020 the SDMX 2.1 specifications were revised to add support for the Validation and Transformation Language (VTL). For 3.0, the VTL specification has been updated to align with changes to the information model and other modifications to the Standard such as the introduction of Semantic Versioning for the versioning of structural metadata artefacts. Section 2 (Information Model) sets out details of the ‘Transformation and Expressions’ package for defining and managing VTL 2.0 programs and Section 6 (Technical Notes) provides detailed guidance on implementing and using VTL with SDMX. 52 52 53 -= {{id name="_Toc56631"/}}2 Change History =53 += 2 Change History = 54 54 55 55 The 2.0 version of this standard represented a significant increase in scope, and also provided more complete support in those areas covered in the version 1.0 specification. Version 2.0 of this standard is backward-compatible with version 1.0, so that existing implementations can be easily migrated to conformance with version 2.0. 56 56 ... ... @@ -60,13 +60,13 @@ 60 60 61 61 The 3.1 version provides supports for data models to increase dimensionality over time without impacting existing data collections. The Data Constraint model was adjusted to separate concerns of data reporting and data dissemination. 62 62 63 -== {{id name="_Toc56632"/}}2.1 Major Changes from 1.0 to 2.0 ==63 +== 2.1 Major Changes from 1.0 to 2.0 == 64 64 65 65 * **Reference Metadata**: In addition to describing and specifying data structures and formats (along with related structural metadata), the version 2.0 specification also provides for the exchange of metadata which is distinct from the structural metadata in the 1.0 version. This category includes “reference” metadata (regarding data quality, methodology, and similar types – it can be configured by the user to include whatever concepts require reporting); metadata related to data provisioning (release calendar information, description of the data and metadata provided, etc.); and metadata relevant to the exchange of categorization schemes. 66 66 * **SDMX Registry**: Provision is made in the 2.0 standard for standard communication with registry services, to support a data-sharing model of statistical exchange. These services include registration of data and metadata, querying of registered data and metadata, and subscription/notification. 67 67 * **Structural Metadata**: The support for exchange of statistical data and related structural metadata has been expanded. Some support is provided for qualitative data; data cube structures are described; hierarchical code lists are supported; relationships between data structures can be expressed, providing support for extensibility of data structures; and the description of functional dependencies within cubes are supported. 68 68 69 -== {{id name="_Toc56633"/}}2.2 Major Changes from 2.0 to 2.1 ==69 +== 2.2 Major Changes from 2.0 to 2.1 == 70 70 71 71 * **Web-Services-Oriented Changes:** Several organizations have been implementing web services applications using SDMX, and these implementations have resulted in several changes to the specifications. Because the nature of SDMX web services could not be anticipated at the time of the original drafting of the specifications, the web services guidelines have been completely re-developed. 72 72 * **Presentational Changes: **Much work has gone into using various technologies for the visualization of SDMX data and metadata, and some changes have been proposed as a result, to better leverage this graphical visualization. These changes are largely to leverage the Cross-domain Concepts of the Content Oriented Guidelines. ... ... @@ -80,11 +80,12 @@ 80 80 * **Simplification and better support for the metadata structure: **New use cases have been reported and these are now supported by a re-modelled metadata structure definition. 81 81 * **Support for partial item schemes such as a code list: **The concept of a partial (subset) item scheme such as a partial code list for use in exchange scenarios has been introduced**.** 82 82 83 -== {{id name="_Toc56634"/}}2.3 Major Changes from 2.1 to 3.0 ==83 +== 2.3 Major Changes from 2.1 to 3.0 == 84 84 85 85 SDMX version 3.0 introduces new features, improvements and changes to the Standard in the following key areas: 86 86 87 -==== Information Model ==== 87 +(% class="wikigeneratedid" id="HInformationModel" %) 88 +**Information Model** 88 88 89 89 * Simplification and improvement of the reference metadata model 90 90 * Support for microdata ... ... @@ -94,11 +94,13 @@ 94 94 * Improvements to code hierarchies for data discovery 95 95 * Improvements to constraints 96 96 97 -==== Versioning of Structural Metadata Artefacts ==== 98 +(% class="wikigeneratedid" id="HVersioningofStructuralMetadataArtefacts" %) 99 +**Versioning of Structural Metadata Artefacts** 98 98 99 -• Adoption of the three-number semantic versioning standard for structural metadata artefacts [[(>>url:https://semver.org/]][[__https:~~/~~/semver.org__>>url:https://semver.org/]][[)>>url:https://semver.org/]]101 +• Adoption of the three-number semantic versioning standard for structural metadata artefacts ([[__https:~~/~~/semver.org__>>https://https:semver.org]]) 100 100 101 -==== REST Web Services Application Programming Interface (API) ==== 103 +(% class="wikigeneratedid" id="HRESTWebServicesApplicationProgrammingInterface28API29" %) 104 +**REST Web Services Application Programming Interface (API)** 102 102 103 103 * Change to a single ‘structure’ resource for structure queries simplifying the REST API specification by reducing the number of resources to five 104 104 * Improvements to data queries ... ... @@ -105,11 +105,13 @@ 105 105 * Improvements to reference metadata queries 106 106 * Support for structural metadata maintenance using HTTP PUT, POST and DELETE verbs 107 107 108 -==== SOAP Web Services API ==== 111 +(% class="wikigeneratedid" id="HSOAPWebServicesAPI" %) 112 +**SOAP Web Services API** 109 109 110 -• The SOAP web services API has been deprecated with version 3.0 standardising on REST ** **114 +• The SOAP web services API has been deprecated with version 3.0 standardising on REST 111 111 112 -==== XML, JSON, CSV and EDI Transmission formats ==== 116 +(% class="wikigeneratedid" id="HXML2CJSON2CCSVandEDITransmissionformats" %) 117 +**XML, JSON, CSV and EDI Transmission formats** 113 113 114 114 * The SDMX-ML, SDMX-JSON and SDMX-CSV specifications have been extended and modified where needed to support the new features and changes such as reference metadata and microdata 115 115 * Obsolete SDMX-ML data message variants including Generic, Compact, Utility and Cross-sectional have been deprecated standardising on Structure Specific Data as the sole XML format for data exchange ... ... @@ -131,24 +131,26 @@ 131 131 132 132 The SDMX 3.0 Major Changes document provides more information including an analysis of the breaking changes. 133 133 134 -== {{id name="_Toc56635"/}}2.4 Major Changes from 3.0 to 3.1 ==139 +== 2.4 Major Changes from 3.0 to 3.1 == 135 135 136 -==== Information Model ==== 141 +(% class="wikigeneratedid" id="HInformationModel-1" %) 142 +**Information Model** 137 137 138 -* Addition of Dimension Constraint property to a Dataflow // //139 -* Addition of evolving structure property to a Data Structure Definition // //140 -* Remove version property on Categorisation // //141 -* Simplification of Constraints o Removal of Advanced Release Calendar // //144 +* Addition of Dimension Constraint property to a Dataflow 145 +* Addition of evolving structure property to a Data Structure Definition 146 +* Remove version property on Categorisation 147 +* Simplification of Constraints o Removal of Advanced Release Calendar 142 142 143 143 o Removal of Role, Data Constraints only restrict data that can be reported// //o Restrict constraint targets to Identifiable structures (not URLs) o Addition of Availability Constraint to define actual data 144 144 145 -==== Documentation ==== 151 +(% class="wikigeneratedid" id="HDocumentation" %) 152 +**Documentation** 146 146 147 147 • Registering Reference Metadata removed from documentation, to align with XML Registration object which is unable to reference a Metadata Provision, and REST API which is unable to query for registered reference metadata sources. 148 148 149 -= {{id name="_Toc56636"/}}3Processes and Business Scope =156 += 3 Processes and Business Scope = 150 150 151 -== {{id name="_Toc56637"/}}3.1 Process Patterns ==158 +== 3.1 Process Patterns == 152 152 153 153 SDMX identifies three basic process patterns regarding the exchange of statistical data and metadata. These can be described as follows: 154 154 ... ... @@ -168,7 +168,7 @@ 168 168 169 169 It is important to note that SDMX is primarily focused on the //exchange// and //dissemination// of statistical data and metadata. There may also be many uses for the standard model and formats specified here in the context of internal processing of data that are not concerned with the exchange between organizations and users, however. It is felt that a clear, standard formatting of data and metadata for the purposes of exchange and dissemination can also facilitate internal processing by organizations and users, but this is not the focus of the specification. 170 170 171 -== {{id name="_Toc56638"/}}3.2 SDMX and Process Automation ==178 +== 3.2 SDMX and Process Automation == 172 172 173 173 Statistical data and metadata exchanges employ many different automated processes, but some are of more general interest than others. There are some common information technologies that are nearly ubiquitous within information systems today. SDMX aims to provide standards that are most useful for these automated processes and technologies. 174 174 ... ... @@ -176,15 +176,12 @@ 176 176 177 177 1. //Batch Exchange of Data and Metadata~:// The transmission of whole or partial databases between counterparties, including incremental updating. 178 178 1. //Provision of Data and Metadata on the Internet~:// Internet technology - including its use in private or semi-private TCP/IP networks - is extremely common. This technology includes XML, JSON and REST web services as primary mechanisms for automating data and metadata provision, as well as the more traditional static HTML and database-driven publishing. 179 -1. //Generic Processes~:// While many applications and processes are specific to some set of data and metadata, other types of automated services and processes are designed 180 - 181 -to handle any type of statistical data and metadata whatsoever. This is particularly true in cases where portal sites and data feeds are made available on the Internet. 182 - 186 +1. //Generic Processes~:// While many applications and processes are specific to some set of data and metadata, other types of automated services and processes are designed to handle any type of statistical data and metadata whatsoever. This is particularly true in cases where portal sites and data feeds are made available on the Internet. 183 183 1. //Presentation and Transformation of Data~:// In order to make data and metadata useful to consumers, they must support automated processes that transform them into application-specific processing formats, other standard formats, and presentational formats. Although not strictly an aspect of exchange, this type of automated processing represents a set of requirements that must be supported if the information exchange between counterparties is itself to be supported. 184 184 185 185 The SDMX standards specified here are designed to support the requirements of all of these automation processes and technologies. 186 186 187 -== {{id name="_Toc56639"/}}3.3 Statistical Data and Metadata ==191 +== 3.3 Statistical Data and Metadata == 188 188 189 189 To avoid confusion about which "data" and "metadata" are the intended content of the SDMX formats specified here, a statement of scope is offered. Statistical "data" are sets of often numeric observations which typically have time associated with them. They are associated with a set of metadata values, representing specific concepts, which act as identifiers and descriptors of the data. These metadata values and concepts can be understood as the named dimensions of a multi-dimensional co-ordinate system, describing what is often called a "cube" of data. 190 190 ... ... @@ -206,7 +206,7 @@ 206 206 207 207 **Figure 1: High Level Schematic of Major Artefacts in the SDMX 3.0 Information Model** 208 208 209 -== {{id name="_Toc56640"/}}3.4 The SDMX View of Statistical Exchange ==213 +== 3.4 The SDMX View of Statistical Exchange == 210 210 211 211 Version 1.0 of ISO/TS 17369 SDMX covered statistical data sets and the metadata related to the structure of these data sets. This scope was useful in supporting the different models of statistical exchange (bilateral exchange, gateway exchange, and data-sharing) but was not by itself sufficient to support them completely. Versions 2.0 and 2.1 provide a much more complete view of statistical exchange, so that an open data-sharing model can be fully supported, and other models of exchange can be more completely automated. In order to produce technical standards that will support this increased scope, the SDMX Information Model provides a broader set of formal objects which describe the actors, processes, and resources within statistical exchanges. 212 212 ... ... @@ -236,7 +236,7 @@ 236 236 * //**Dataflow Definition:**// In SDMX, data sets are reported or disseminated according to a data flow definition. The data flow definition identifies the data structure definition and may be associated with one or more subject matter domains via a Categorisation (this facilitates the search for data according to organised category schemes). Constraints, in terms of reporting periodicity or sub set of possible keys that are allowed in a data set, may be attached to the data flow definition. 237 237 * //**Metadataflow Definition:**// A metadata flow definition is very similar to a data flow definition, but describes, categorises, and constrains metadata sets. 238 238 * //**Data Provider: **//An organization which produces data is termed a data provider. 239 -* //**Metadata Provider: **//An organization which produces reference metadata is termed a metadata provider. // //243 +* //**Metadata Provider: **//An organization which produces reference metadata is termed a metadata provider. 240 240 * //**Provision Agreement (Metadata Provision Agreement):**// The set of information which describes the way in which data sets and metadata sets are provided by a data/metadata provider. A provision agreement can be constrained in much the same way as a data or metadata flow definition. Thus, a data provider can express the fact that it provides a particular data flow covering a specific set of countries and topics, Importantly, the actual source of registered data or metadata is attached to the provision agreement (in terms of a URL). The term “agreement” is used because this information can be understood as the basis of a “service-level agreement”. In SDMX, however, this is informational metadata to support the technical systems, as opposed to any sort of contractual information (which is outside the scope of a technical specification). In version 3.0, metadata provision agreement and data provision agreement are two separate artefacts. 241 241 * //**Data Constraint:**// Used to restrict content (such as enumerations) and are used by provision agreements, data flows, data structure definitions in order to provide a set of reporting restrictions in the context of a collection 242 242 * //**Metadata Constraint:**// Used to restrict content (such as enumerations) and are used by metadata provision agreements, metadata flows, metadata structure definitions in order to provide a set of reporting restrictions in the context of a collection ... ... @@ -259,7 +259,7 @@ 259 259 260 260 • //**Transformation Scheme:**// A transformation scheme is a set of Validation and Transformation Language (VTL) transformations aimed at obtaining some meaningful results for the user (e.g., the validation of one or more data sets). The set of transformations is meant to be executed together (in the same run) and may contain 597 any number of transformations in order to produce any number of results. Thus, a transformation scheme can be considered as a VTL ‘program’. 261 261 262 -== {{id name="_Toc56641"/}}3.5 SDMX Registry Services ==266 +== 3.5 SDMX Registry Services == 263 263 264 264 In order to provide visibility into the large amount of data and metadata which exists within the SDMX model of statistical exchange, it is felt that an architecture based on a set of registry services is potentially useful. A “registry” – as understood in webservices terminology – is an application which maintains and stores metadata for querying, and which can be used by any other application in the network with sufficient access privileges (though note that the mechanism of access control is outside of the scope of the SDMX standard). It can be understood as the index of a distributed database or metadata repository which is made up of all the data provider’s data sets and reference metadata sets within a statistical community, located across the Internet or similar network. 265 265 ... ... @@ -274,7 +274,7 @@ 274 274 * //**Querying: **//The registry services have interfaces for querying the metadata contained in a registry, so that applications and users can discover the existence of data sets and reference metadata sets, structural metadata, the providers/agencies associated with those objects, and the provider agreements which describe how the data and metadata are made available, and how they are categorized. 275 275 * //**Subscription/Notification:**// It is possible to “subscribe” to specific objects in a registry, so that a notification will be sent to all subscribers whenever the registry objects are updated. 276 276 277 -== {{id name="_Toc56642"/}}3.6 RESTful Web services ==281 +== 3.6 RESTful Web services == 278 278 279 279 Web services allow computer applications to exchange data directly over the Internet, essentially allowing modular or distributed computing in a more flexible fashion than ever before. In order to allow web services to function, however, many standards are required: for requesting and supplying data; for expressing the enveloping data which is used to package exchanged data; for describing web services to one another, to allow for easy integration into applications that use other web services as data resources. 280 280 ... ... @@ -289,7 +289,7 @@ 289 289 290 290 The following conceptual example uses the ‘data’ resource to query a data repository for a series identified by the key ‘M.USD.EUR.SP00.A’ in the EXR (ECB exchange rates) Dataflow: https:~/~/ws-entry-point/data/dataflow/ECB/EXR/1.0.0/M.USD.EUR.SP00.A 291 291 292 -= {{id name="_Toc56643"/}}4The SDMX Information Model =296 += 4 The SDMX Information Model = 293 293 294 294 SDMX provides a way of modelling statistical data, and defines the set of metadata constructs used for this purpose. Because SDMX specifies a number of transmission formats for expressing data and structural metadata, the model is used as a mechanism for guaranteeing that transformation between the different formats is lossless. In this sense, all of the formats are syntax-bound expressions of the common information model. 295 295 ... ... @@ -305,9 +305,9 @@ 305 305 306 306 A full UML conceptual design of the information model is set out in Section 2 of the Technical Specifications. 307 307 308 -= {{id name="_Toc56644"/}}5The SDMX Transmission Formats =312 += 5 The SDMX Transmission Formats = 309 309 310 -== {{id name="_Toc56645"/}}5.1 SDMX-ML ==314 +== 5.1 SDMX-ML == 311 311 312 312 SDMX-ML is the XML transmission format specification for exchanging structural metadata, data and reference metadata, and interacting with SDMX registry services. It is designed as a general-purpose format for all automation and data / metadata exchange tasks, and provides the most complete coverage. 313 313 ... ... @@ -335,7 +335,7 @@ 335 335 1. //Data: //For the exchange of data. Unlike SDMX-ML, the structure of a SDMX-JSON data message is not specific to the DSDs of the data sets so schema validation will not check for compliance of the data with the DSDs. 336 336 1. //Metadata//: For the exchange of reference metadata sets. 337 337 338 -== {{id name="_Toc56647"/}}5.3 SDMX-CSV ==342 +== 5.3 SDMX-CSV == 339 339 340 340 SDMX-CSV is the CSV transmission format specification for exchanging data and reference metadata only. 341 341 ... ... @@ -346,7 +346,7 @@ 346 346 1. //Data//: For the exchange of data. Like SDMX-JSON, SDMX-CSV can include both code IDs and labels which is helpful when using the data to create human readable charts and dashboards. 347 347 1. //Metadata//: For the exchange of reference metadata sets. 348 348 349 -== {{id name="_Toc56648"/}}5.4 Formats and Messages Deprecated in Version 3.0 ==353 +== 5.4 Formats and Messages Deprecated in Version 3.0 == 350 350 351 351 The following formats and messages have been deprecated in version 3.0 to simplify, modernise and rationalise the standard. 352 352 ... ... @@ -363,17 +363,17 @@ 363 363 * SDMX-ML Query messages 364 364 * SDMX-ML Submit Structure Request messages 365 365 366 -= {{id name="_Toc56649"/}}6Dependencies on SDMX content-oriented guidelines =370 += 6 Dependencies on SDMX content-oriented guidelines = 367 367 368 368 The technical standards proposed here are designed so that they can be used in conjunction with other SDMX guidelines which are more closely tied to the content and semantics of statistical data exchange. The SDMX Information Model works equally well with any statistical concept, but to encourage interoperability, it is also necessary to standardize and harmonize the use of specific concepts and terminology. To achieve this goal, SDMX creates and maintains guidelines for cross-domain concepts, terminology, and structural definitions. There are three major parts to this effort. 369 369 370 -== {{id name="_Toc56650"/}}6.1 Cross-Domain Concepts ==374 +== 6.1 Cross-Domain Concepts == 371 371 372 372 The SDMX Cross-Domain Concepts is a content guideline concerning concepts which are used across statistical domains. This list is expected to grow and to be subject to revision as SDMX is used in a growing number of domains. The use of the SDMX Cross-Domain Concepts, where appropriate, provides a framework to further promote interoperability among organisations using the technical standards presented here. The harmonization of statistical concepts includes not only the definitions of the concepts, and their names, but also, where appropriate, their representation with standard code lists, and the role they play within data structure definitions and metadata structure definitions. 373 373 374 374 The intent of this guideline is two-fold: to provide a core set of concepts which can be used to structure statistical data and metadata, to promote interoperability between systems (“structural metadata”, as described above); and to promote the exchange of metadata more widely, with a set of harmonized concept names and definitions for other types of metadata (“reference metadata”, as defined above.) 375 375 376 -== {{id name="_Toc56651"/}}6.2 Metadata Common Vocabulary ==380 +== 6.2 Metadata Common Vocabulary == 377 377 378 378 The Metadata Common Vocabulary is an SDMX guideline which provides definition of terms to be used for the comparison and mapping of terminology found in data structure definitions and in other aspects of statistical metadata management. Essentially, it provides ISOcompliant definitions for a wide range of statistical terms, which may be used directly, or against which other terminology systems may be mapped. This set of terms is inclusive of the terminology used within the SDMX Technical Standards. 379 379 ... ... @@ -381,17 +381,17 @@ 381 381 382 382 Concepts work is built. 383 383 384 -== {{id name="_Toc56652"/}}6.3 Statistical Subject-Matter Domains ==388 +== 6.3 Statistical Subject-Matter Domains == 385 385 386 386 The Statistical Subject-Matter Domains is a listing of the breadth of statistical information for the purposes of organizing widespread statistical exchange and categorization. It acts as a standard scheme against which the categorization schemes of various counterparties can be mapped, to facilitate interoperable data and metadata exchange. It serves another useful purpose, however, which is to allow an organization of corresponding “domain groups”, each of which could define standard data structure definitions, concepts, etc. within their domains. Such groups already exist within the international community. SDMX would use the Statistical Subject-Matter Domains list to facilitate the efforts of these groups to develop the kinds of content standards which could support the interoperation of SDMX-conformant technical systems within and across statistical domains. The organisation of the content of such schemes is supported in SDMX as a Category Scheme. 387 387 388 388 SDMX Statistical Subject-Matter Domains will be listed and maintained by the SDMX Initiative and will be subject to adjustment. 389 389 390 -== {{id name="_Toc56653"/}}6.4 SDMX Concept Roles ==394 +== 6.4 SDMX Concept Roles == 391 391 392 392 These guidelines define the standard set of SDMX Concept Roles and their use. This set of standard SDMX Concepts are implemented as a cross-domain Concept Scheme that defines the set of concept roles and gives examples on concept role implementation in SDMX 2.0, 2.1 and 3.0. A concept role gives a particular context to a concept for easy and systematic interpretation by machine processing and visualization tools. For example, the concepts REPORTING_AREA and COUNTERPART_AREA are different concepts but they are both geographical characteristics, therefore they can be associated with the same concept role ID: "GEO". This allows visualization systems to interpret these concepts as geographical data in order to generate maps. The implementation of concept roles is different in versions 2.0 and 2.1/3.0 of the SDMX technical standard. Specifically for SDMX 3.0, this set of roles is considered a normative list that must be interpreted in the same way by all organisations. Additional roles may be provided via the standard roles’ mechanism in SDMX 3.0, i.e., via Concept Schemes; the semantics of these roles have to be agreed bilateraly in data exchanges. The Concept Roles are available as an SDMX Concept Scheme on the SDMX Global Registry. 393 393 394 -= {{id name="_Toc56654"/}}7 Validation and Transformation Language =398 += 7 Validation and Transformation Language = 395 395 396 396 For many years the SDMX initiative has been fostering and supporting the development of a standard calculation language, called Validation and Transformation Language (VTL). A blueprint for defining calculations was already described in the original SDMX 2.1 specifications (package 13 of the Information Model - “Transformations and Expressions”). It was just a basic framework that required further developments to became operational in order to achieve a calculation language able to manipulate SDMX artefacts. 397 397