This is a set of guidelines for defining new data models.
Before creating a new data model, explore the existing ones or the quick finder to be sure there is already a data model that covers your use case. Recall to use synonyms in your search. E.g. what you might call Public Transport already exists under UrbanMobility.
If you are looking for guidelines on adoption of existing data models, please refer to How to use Smart Data Models in your projects section.
- Use English terms, preferably American English.
- Use camel case syntax for attribute names (
camelCase
). - Entity Type names must start with a Capital letter, for instance,
WasteContainer
. - Use names and not verbs for Attributes of type Property, ex.
name
, qualifying it when necessary, ex.totalSpotNumber
ordateIssued
. - Avoid plurals in Attribute names, but state clearly when a list of items
fits. Ex.
category
. - All first level properties defined in the json schema has to have a description attribute
- Check for the existence of the same Attribute on any of the other models and reuse it, in this resource if pertinent.
- When you find a confict it is recommended to extend the name. Example category would be replaced by deviceCategory.
- It is also available a resource for looking into the descriptions of the different properties. Access to this resource.
- Have a look at schema.org trying to find a similar term with the same semantics.
- Try to find common used ontologies or existing standards well accepted by the Community, or by goverments, agencies, etc. For instance, Open311 for civic issue tracking or Datex II for transport systems.
- The schemas defined here are aimed to validate key-values representation of the payloads
- It means that it is possible to have arrays of relationships (something which is not allowed in NGSI-LD, although it could be implemented through the datasetId)
- When possible reuse schema.org data types (
Text
,Number
,DateTime
,StructuredValue
, etc.). - Remember that
null
is not allowed in NGSI-LD and therefore should be avoided as a value.
-
Enumerate the allowed values for each attribute. Generally speaking it is a good idea to leave it open for applications to extend the list, provided the new value is not semantically covered by any of the existing ones.
-
State clearly what attributes are mandatory and what are optional. Remember that
null
value should be avoided as it is prohibited in NGSI-LD. The minimum required attributes will make the data models more flexible for other to use them. -
Whenever you create a list of enumerated values, use camelCase coding and strictly avoid spaces within the values (only allowed when mapping existing enumerated values)
-
Internal attributes. In NGSIv2 there are two special attributes created by the system:
- dateCreated
- dateModified
Similarly in NGSI-LD there are two different:
- createdAt
- modifiedAt
those attributes must NOT be included into the definition of the data model (schema.json) but they can appear in the payloads of the examples included.
- When a value is a number, when relevant include range limits if they exist. Otherwise do not do it.
-
Define a default unit for magnitudes. Normally it will be the unit as stated by the International System of Units.
-
If a quantity is expressed in a different unit than the default one, use the unitCode metadata attribute in NGSI v2.
-
In NGSI-LD the Property
unitCode
is already defined and available to be used. -
The list of UN/CEFACT Common Code (3 characters) can be download from this page.
-
The list is available directly from here.
- Use values between
0
and1
for relative quantities, which represent attribute values such asrelativeHumidity
,precipitationProbability
, etc.
-
Use
address
attribute for civic locations as per schema.org. You can read the location-commons -
Use the
location
Attribute for geographical coordinates. GeoJSON must be used for encoding geospatial properties. -
There is a shared resource including both at https://github.com/smart-data-models/data-models/blob/master/common-schema.json
- When an Entity Attribute is used as a link (relationship) to other entities name
the attribute using a verb (plus optionally an object) such as
hasStop
,operatedBy
,hasTrip
, etc. This option is the one advocated by NGSI-LD, as in NGSI-LD URNs are used to identify entities, and NGSI-LD URNs already convey the type of the target entity, for instanceurn:ngsi-ld:gtfs:Stop:S123
.
-
In NGSI v2 the Attribute type must be
DateTime
. -
In NGSI-LD, please check the date and time encoding at the NGSI-LD FAQ.
-
Use the
date
prefix for naming entity attributes representing dates (or complete timestamps). Ex.dateLastEmptying
. -
dateCreated
in NGSIv2 (createdAt
in NGSI-LD) must not be used as long as they are internal attributes of the NGSI specification. -
dateModified
in NGSIv2 (modifiedAt
in NGSI-LD) must not be used as long as they are internal attributes of the NGSI specification. -
dateCreated
anddateModified
are special read-only Entity Attributes provided off-the-shelf by NGSI implementations. Be careful because they can be different than the actual creation or update date of the real world entity represented by its corresponding digital entity. -
When necessary define additional Attributes to capture precisely all the details about dates. For instance, to denote the date at which a weather forecast was delivered an attribute named
dateIssued
can be used. In that particular case just reusing the internal attributedateCreated
would be incorrect because the latter would be the creation date of the (digital) entity representing the weather forecast which typically might have a delay.
-
In NGSI v2 use a metadata attribute named
timestamp
for capturing the last update timestamp of a dynamic attribute. Please note that this is the actual date at which the measured value was obtained (from a sensor, by visual observation, etc.), and that date might be different than the date (metadata attribute nameddateModified
as per NGSI v2) at which the attribute of the digital entity was updated, as typically there might be delay, specially on IoT networks which deliver data only at specific timeslots. -
In NGSI-LD use the
observedAt
Property to convey timestamps.
There can be certain entity attributes which content is subject to be internationalized. For instance, the description of a Point of Interest. The internationalization (i18N) guidelines for the Smart Data Models are defined as follows:
-
By default, the value of an attribute subject to be internationalized should be expressed in American English (
en-US
). However there can be situations where an English term is not the most common one, for instance, the English exonym for the city of Livorno (Italy) is a very obscure term,Leghorn
. In such situations, the common international name (Livorno
in our example) in latin script should be used. -
There shall always be a term for the original attribute, i.e. it is not allowed to have Entity representations which only contain terms associated to language variants.
-
[Under revision] For each language variant of an internationalized attribute, there shall be an additional Entity Attribute which name shall be in the form:
<AttributeName>_<LanguageTag>
where AttributeName
is the original attribute
name and LanguageTag
shall be a language tag as mandated by
RFC 5646. W3C provides guidelines
on
how to use language tags.
JSON-LD can facilitate developers to parse internationalized Entity representations, thus Context Data Producers are encouraged to use JSON-LD (provided that the backing implementations support it).
When parsing plain JSON content, developers should validate that the corresponding JSON terms are actually conveying a language variant of an attribute. For instance, by validating that the term's suffix actually corresponds to a valid language tag and by checking that the corresponding original attribute is contained in the entity.
[Under review] Example:
An entity may contain an attribute named description
. The value of such
attribute shall be expressed in American English. Additionally, it might exist an
attribute named description_es
used to convey the value of such a
description
attribute in Spanish.
In case of doubt check the existing data models. The full list can be got in the attributes search tool by using an empty string.
name
alternateName
description
serialNumber
category
features
source
temperature
FIWARE Foundation, TMForum, OASC and IUDX Smart Data Models Project aim to maintain backwards compatibility, however some
incompatibilities will inevitably occur over time. Data providers may choose to
tag Entities with an additional schemaVersion
Attribute so that Data Consumers
can behave accordingly. This aligns with the
https://schema.org/schemaVersion Property
definition.
Smart Data Models follows the six principles of agile standardization Thus, many existing, adopted and open standards are mapped into the SDM (GTFS, GBFS, DCAT-AP, schema.org, etc) those cases we include in the schemas defining the data model an attribute 'derivedFrom' linking to the origin and specifying the source in the description. Besides, although the recommendation for the codification of attributes is to use cameCase notation, those cases the original ones are preserved.
In some specific cases like schema.org we only map those attributes that we have information of being used. Why? Because schema.org have hundreds of attributes, and we only include those actually used. Hw could we document new attributes used and not available. Just rise an issue or make a PR on the repository and a new version will be generated extending the current data model.
Contributions should come in the form of pull requests. Fork the repository, Create a branch containing your changes, and proceed with a Pull Request.
Pull Request should be easy to review, so if the model, or the changes you are proposing are wide, please create different pull requests.
New data models should be added under a folder structured as follows:
- `NewModel/`
- `notes.yaml`: An optional file with customization contents for the specification. Optional
- `LICENSE.md`: file with the legal permission of use of th data model. It always grant free user, free modification and free share of modifications.
- `ADOPTERS.yaml`: A file containing use cases of the data models. Optional
- `schema.json`: The JSON Schema definition, which includes the descriptons of attributes, e.g.
[schema.json of WeatherObserved](./Weather/WeatherObserved/schema.json)
- `examples/`
- `example.json`: One JSON key-values for NGSI v2 example file, e.g.
[example.json of WeatherObserved](https://github.com/smart-data-models/dataModel.Weather/blob/master/WeatherObserved/examples/example.json)
- `example.jsonld`: One JSON key-values for NGSI-LD example file, e.g.
[example.json of WeatherObserved](https://github.com/smart-data-models/dataModel.Weather/blob/master/WeatherObserved/examples/example.jsonld)
- `example-normalized.json`: One JSON example file in NGSI v2
normalized format, e.g.
[example-normalized.json of WeatherObserved](https://github.com/smart-data-models/dataModel.Weather/blob/master/WeatherObserved/examples/example-normalized.json)
- `example-normalized-ld.jsonld`: One JSON example file in
**NGSI-LD** normalized format, e.g.
[example-normalized-ld.jsonld of WeatherObserved](https://github.com/smart-data-models/dataModel.Weather/blob/master/WeatherObserved/examples/example-normalized.jsonld)
- `resources/`. folder with additional contents for customization a data model in case notes.yaml is not enough. i.e. images. Optional
New Subjects containing data models should be added under a folder structured as follows:
- `Subject/`
- `README.md`. Contains links and descriptions to the different data models contained in the subject. Generated automatically
- `CONTRIBUTORS.yaml`. Contains data of the authors to the different data models contained in the subject. Optional
- `notes.yaml`. Contents for the customization of the Subject README.md. Optional
- `Subject-schema.json`. Schema containing objects used across differente data models. Referenced from there. Optional
- `DataModel1`. Folder containing all the assets for a data model
- `DataModel_incubated`. Folder with a link to where this new data model is being developing. Soon to be available. Optional
The section definitions will be included into the subject-schema.json name of the subject.
Whenever possible they will be absolute references in order to provide the ability to use the data models isolated from the rest of documents
For a clear explanation on the current use of the data models. Check the Going through the data models