Version 1.1 by Helena K. on 2026/01/27 12:51

Hide last authors
Helena K. 1.1 1 {{box title="**Contents**"}}
2 {{toc/}}
3 {{/box}}
4
5 = Document History =
6
7 |Version|Date|Comment
8 |1.0|21/8/2015|Initial version.
9 |2.0|7/3/2018|(((
10 Replaced the “Embargo: Privileged access” use case confidentiality status to use CONF_STATUS:E instead of CONF_STATUS:N. When this guideline is implemented, the CONF_STATUS:N can no longer be used for this use case (the embargo time is ignored if the CONF_STATUS is N).
11
12
13 Clarified the document text, removed superfluous text.
14
15 Added use of time zone is recommended.
16 )))
17
18 = Introduction =
19
20 This paper presents use case scenarios related to confidentiality and embargo in SDMX data exchanges, and provides recommendations on how to represent these elements in the SDMX model. The aim is to provide a consistent and practical way to represent these aspects in SDMX artefacts in order to promote cross-domain consistency, and harmonise methodology and processes.
21
22
23 Confidentiality aims at protecting data from unauthorised disclosure that could be prejudicial or harmful to the interest of the source or other relevant parties.
24
25
26 Embargo means that data may become public only after expiry of a pre-defined date and time.
27
28
29 Embargo establishes a relationship between a set of data (e.g. an observation), a date/time and a group of privileged data recipients.
30
31
32 Disclosure of data marked as confidential or under embargo is not permitted. Procedures should be in place to prevent such disclosure, including rules for staff, aggregation rules when disseminating data, provision of unit records, etc.
33
34
35 There needs to be a formal agreement between organisations involved in the exchange of confidential data in order to prepare systems and workflows.
36
37
38 Data exchange partners are advised to agree up front on the usage of the embargo mechanism(s) for specific data messages.
39
40
41 The embargo CONF_STATUS value “E” is not recommended for final dissemination to users but only for data exchange.
42
43 = Use Cases =
44
45 This section describes the confidentiality and embargo use cases that are addressed by these guidelines.  The use cases and embargo SDMX representations are summarised in annex 1:
46
47 == Use case 1: Non-confidential data ==
48
49 Data is available to the public immediately, meaning that data is not confidential and there is no embargo.
50
51
52 The data’s CONF_STATUS attribute should be set to “Free (free for publication)”.
53
54 |(((
55 === SDMX representation ===
56
57 * **CONF_STATUS**: F
58 )))
59
60 == Use case 2: Confidential data ==
61
62 === Exchange of confidential data without embargo nor forwarding to secondary recipients ===
63
64 One or more observations in the data message are confidential. Embargo does not play a role in this scenario. Depending on arrangements between data exchange partners, this data can be made available to privileged data users.
65
66
67 The observation’s CONF_STATUS attribute should use a specific code denoting the confidential character of the information.  Below are some examples of such confidentiality statuses[[~[1~]>>path:#_ftn1]]:
68
69 * **N**:   Not for publication, restricted for internal use only.  Used to denote observations that are restricted for internal use only within organisations
70 * **C**:   Confidential statistical information (primary confidentiality) due to identifiable respondents
71 * **D**:   Secondary confidentiality set by the sender, not for publication
72 * **A**:   Primary confidentiality due to small counts
73
74
75
76 === Forwarding confidential data to secondary recipients ===
77
78 A sender sends confidential data to certain primary recipients, and allows those to forward the confidential data to a restricted and pre-defined set of secondary recipients.
79
80
81 The observation’s CONF_STATUS attribute should be marked as “Not for publication, restricted for internal use only”.  An additional observation-level attribute: CONF_REDIST, defines the secondary recipient(s) to whom the sender allows the primary recipient to forward confidential data[[~[2~]>>path:#_ftn2]].  See section **Use of the CONF_REDIST attribute** for the appropriate coding of this attribute.
82
83
84 The forwarding of confidential data is represented as follows in SDMX:
85
86 |(((
87 === SDMX representation ===
88
89 * **CONF_STATUS**: N;
90 * **CONF_REDIST **(Observation, Conditional): [Organisation(s)];
91 )))
92
93 === Adding embargo information to a data message ===
94
95 Following the definition of embargo, the recipient must keep the data confidential until a pre-defined point in time (embargo) when it becomes public.
96
97 Two cases can be distinguished:
98
99 * Allowing privileged access to embargoed data
100 * Enabling the frontloading of data into systems
101
102
103
104 **//Allowing privileged access to embargoed data//**
105
106 If the goal is to allow the data recipient to have privileged access to embargoed observations in a data message (message), the embargoed observation’s CONF_STATUS attribute should be coded as “E: Not for publication until the embargo time expires; free for publication after the embargo time expires.” with an observation level attribute EMBARGO_TIME (date/time/time zone).
107
108 |(((
109 === SDMX representation ===
110
111 * **CONF_STATUS**: E;
112 * **EMBARGO**_**TIME** (Observation, Conditional): [timestamp]
113 )))
114
115 Including a time zone is strongly recommended and the best case is to use the UTC (Coordinated Universal Time) time standard. However, if no time zone is provided then the time zone of the recipient is assumed.
116
117 These two examples represent the same time for a recipient established in the Central European time zone (e.g. Germany, Norway, Gibraltar):
118
119 * (Recommended) With UTC indicator: 2017-12-15T14:02:29Z
120 * With timezone indicator: 2017-12-15T15:02:29+01:00
121
122
123
124 **//Enabling the frontloading of data into systems//**
125
126 If the goal is to allow frontloading of a whole data message into systems so that the data can be made visible to users at the expiry of the embargo date/time, the header section of the message should contain an embargo date/time attribute.  This implies that all information in the data message is under the embargo date/time set in the header.  The header attribute EmbargoDate with format date/time/time zone indicates until when the whole data message received cannot be shared with any recipient users. 
127
128
129 Once the EmbargoDate in the header elapses, each observation’s confidentiality status becomes that which is marked in the CONF_STATUS attributes.
130
131
132 Note that this scenario presumes that all data in the message cannot be viewed before the header EmbargoDate, and that there is no privileged access before this time.  However, observations may be marked with any other confidentiality status that is valid after the frontloading EmbargoDate elapses.
133
134 |(((
135 === SDMX Representation ===
136
137 * **CONF_STATUS**: <Set to the required confidentiality status after the embargo time elapses>; <Header>\<EmbargoDate>: [timestamp]
138 )))
139
140
141
142 The two ways of representing embargoed data exist to provide efficiency in the exchange, allow for differentiating data intended to be frontloaded and data aimed to be provided in advance to a restricted audience, and provide flexibility when few observations need to be embargoed in a large data message.  The trade-off is the complication of system implementation to support the two representations of embargo, which has to be done locally on a case-by-case basis.
143
144 = Additional recommendations and examples =
145
146 In data flows that feature confidential data, CONF_STATUS is highly recommended to be a mandatory attribute.  However, if CONF_STATUS is optional in the DSD and missing from an observation, it is always implied to be “F” (free).
147
148 === Use of the CONF_REDIST attribute ===
149
150
151 The CONF_REDIST attribute defines the secondary recipient(s) to whom the sender allows the primary recipient to forward confidential data.  It is recommended to be an optional attribute at observation level. Ideally it should reference a shared code list containing standard organisation codes. To allow several secondary recipients there are these possibilities:
152
153
154 1. Use a code that represents multiple organisations, or;
155
156
157
158 1. Use several CONF_REDIST attributes to portray the multiple recipients.  Each attribute represents one recipient and references the same codelist.  This implementation is cleaner than the above point 1, though this will require adding as many attributes to your DSD as there are potential recipients of the redistributed confidential data.
159
160
161
162 If the EMBARGO_TIME and CONF_REDIST attributes are both used:
163
164 1. Data is available only to the organisations in CONF_REDIST until EMBARGO_TIME
165 1. Data is available to the public after EMBARGO_TIME
166
167 |(% colspan="3" %)(((
168 = Privileged Access =
169 )))
170 |**Use case**|**No forwarding**|**Forwarding**
171 |**Embargo**|(((
172 CONF_STATUS: E
173
174 EMBARGO_TIME
175 )))|(((
176 CONF_STATUS: E
177
178 EMBARGO_TIME
179
180 CONF_REDIST
181 )))
182 |**No embargo**|CONF_STATUS: N|(((
183 CONF_STATUS:N
184
185 CONF_REDIST
186 )))
187
188 === An example of sending data for privileged access with data forwarding information ===
189
190 This example describes a case where data needs to be embargoed until a certain date and time, and may be sent to certain other organisations in a single transmission without modification of the data or attributes.
191
192 This example is based on the exchange of sector accounts statistics within the European statistical system.
193
194 * The national statistical institutes send data to Eurostat, and allow the data to be shared with the ECB for statistical coproduction
195 * The data may only be shared with the public on the next day
196
197
198
199 **CONF_STATUS:**E**;**
200
201 **CONF_REDIST: **ECB**;**
202
203 **EMBARGO_TIME=<**T+1 day**, **e.g.** **2017-12-15T10:00:00Z>
204
205
206 The solutions suggested above aim at covering the most common confidentiality and embargo use cases within a single transmission from the primary reporter to the primary recipient. However, for some more complex scenarios it might still be required to make multiple transmissions.
207
208
209 It is strongly recommended that use cases are specified in an agreement between organisations involved in regular transmissions up-front in order to avoid unnecessary delay in data publication or – much worse – confidentiality breaches.
210
211 **Annex 1: SDMX Representation of the confidentiality use cases**
212
213 |(((
214 == Use case ==
215 )))|(((
216 == CONF_STATUS (Observation) ==
217 )))|(((
218 == Additional attributes ==
219 )))|(((
220 == Remarks ==
221 )))
222 |(((
223 == Non-confidential data ==
224 )))|(((
225 == F ==
226 )))|(((
227 == ==
228 )))|(((
229 == ==
230 )))
231 |(((
232 == Confidential data with no embargo ==
233 )))|(((
234 == C;D;S;A;O;T;G;M;N ==
235 )))|(((
236 == ==
237 )))|(((
238 == CONF_STATUS will usually be C but may also be D;S;A;O;T;G;M;N depending on the required status and confidentiality reason.  See the CL_CONF_STATUS code list for details[[~[3~]>>path:#_ftn3]] ==
239 )))
240 |**Forwarding of confidential data**|N|(((
241 CONF_REDIST: (Observation, Conditional)
242
243
244 )))|CONF_REDIST may represent multiple organisations
245 |(((
246 == Embargo: Privileged access ==
247 )))|(((
248 == E ==
249 )))|(((
250 == EMBARGO_TIME (Observation, Conditional) ==
251
252
253 )))|Only the observations with an EMBARGO_TIME attribute are embargoed. After the embargo time elapses, the data are free for publication (equivalent to F status).
254 |(((
255 == Embargo: Privileged access with forwarding ==
256 )))|(((
257 == E ==
258 )))|(((
259 EMBARGO_TIME (Observation, Conditional)
260
261 == CONF_REDIST: (Observation, Conditional) ==
262 )))|(((
263 == Only the observations with an EMBARGO_TIME attribute are embargoed. After the embargo time elapses, the data are free for publication (equivalent to F status). ==
264
265 == CONF_REDIST may represent multiple organisations ==
266 )))
267 |(((
268 == Embargo: Frontloading ==
269 )))|(((
270 == Set to the required confidentiality status after the embargo time elapses. ==
271 )))|(((
272 == <Header\EmbargoDate>: [timestamp] ==
273
274
275 )))|There is no EMBARGO_TIME attribute as the whole message is embargoed with no privileged access.
276
277
278
279
280 ----
281
282 [[~[1~]>>path:#_ftnref1]] For a full list of confidentiality statuses, see [[https:~~/~~/sdmx.org/wp-content/uploads/CL_CONF_STATUS_1_2_2018.docx>>url:https://sdmx.org/wp-content/uploads/CL_CONF_STATUS_1_2_2018.docx]].
283
284 [[~[2~]>>path:#_ftnref2]] Example: National statistical institute XX reporting data to Eurostat indicates that Eurostat can forward those data to the ECB, IMF and OECD.  More complex use case: The reporting organization specifies that Eurostat can forward those data only to the ECB Statistics Department, thus excluding all other organisations as well as all other ECB departments.
285
286 [[~[3~]>>path:#_ftnref3]] [[https:~~/~~/sdmx.org/wp-content/uploads/CL_CONF_STATUS_1_2_2018.docx>>url:https://sdmx.org/wp-content/uploads/CL_CONF_STATUS_1_2_2018.docx]]
© Semantic R&D Group, 2026