Smart Data Models - Extending DataServiceDCAT-AP and DataServiceRun

In the Agenda Analytics use case of the SALTED project, developing and using data services is practically a must for collecting and processing data stored on the Internet.

Services such as web crawlers or data linkers can be represented by the DataServiceDCAT-AP Smart Data Model, following the principles of linked data, in addition to being fully compliant with the DCAT-AP specification. However, in the project context, certain attributes have been identified for describing these services that were not included in this data model, and thus not included in the DCAT-AP specification.

On the other hand, the traceability requirement described in SALTED makes it essential to describe the individual executions of these data services, in order to specify their inputs, configuration parameters and final outputs, among other properties. This need has led to a new data model called DataServiceRun.

Putting these two needs together, our partner Kybeidos has extended the DataServiceDCAT-AP data model with these new properties and contributed the new DataServiceRun model to the initiative. Both contributions should encourage wider use of the subject that contains them. Moreover, in line with the agile standardization philosophy of the initiative, they stand a better chance of being submitted to the original standardization group (DCAT-AP) for adoption.

Both the new data model for DataServiceRun and the extension for the DataServiceDCAT-AP data model can be found in the official Smart Data Models repository: DCAT-AP.
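
As a rough illustration of the traceability idea described above, the following Python sketch assembles an NGSI-LD-style entity that records one execution of a data service with its inputs, configuration and outputs. The property names (`refDataService`, `inputs`, `configuration`, `outputs`, `recordedAt`) and all identifiers are hypothetical placeholders chosen for this sketch; they are not taken from the published DataServiceRun schema, which should be consulted for the actual attributes.

```python
import json
from datetime import datetime, timezone


def build_data_service_run(run_id, service_id, inputs, config, outputs):
    """Assemble an NGSI-LD-style entity describing one data service execution.

    All property names are hypothetical placeholders for this sketch,
    not the attributes of the published DataServiceRun schema.
    """
    return {
        "id": f"urn:ngsi-ld:DataServiceRun:{run_id}",
        "type": "DataServiceRun",
        # Link back to the service description (hypothetical attribute name).
        "refDataService": {"type": "Relationship", "object": service_id},
        "inputs": {"type": "Property", "value": inputs},
        "configuration": {"type": "Property", "value": config},
        "outputs": {"type": "Property", "value": outputs},
        # Timestamp of when this record was assembled, in UTC.
        "recordedAt": {
            "type": "Property",
            "value": datetime.now(timezone.utc).isoformat(),
        },
    }


run = build_data_service_run(
    run_id="crawler-001-run-42",
    service_id="urn:ngsi-ld:DataServiceDCAT-AP:crawler-001",
    inputs=["https://example.org/agenda"],
    config={"maxDepth": 2},
    outputs=["urn:ngsi-ld:Event:example-1"],
)
print(json.dumps(run, indent=2))
```

Keeping the run as a separate entity that points to the service, rather than embedding runs inside the service description, lets each execution be traced independently.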

AgriParcel drone image

One of the SALTED use cases is Smart Agriculture, which aims to analyse the CO2 levels at several kinds of parcels with different types of soil and/or crops. The parcels used for this assessment are located near the city of Madrid.

Taking advantage of the Smart Data Models initiative, several data models have been chosen for their suitability to the specifics of the Smart Agriculture use case. These are: AgriParcel, to describe the parcels of land; AgriCrop, to describe the crops; and finally, AirQualityObserved, to characterise the levels of CO2 above the parcels assessed. Nevertheless, no relationship between the AgriParcel and AirQualityObserved models existed until now, making the full characterisation of the use case unfeasible.

To address this shortcoming, partners Grupo Amper and Universidad de Cantabria have contributed to the initiative by adding a new relationship (named hasAirQualityObserved) to the AgriParcel data model, linking it to the air quality observations gathered above this parcel, modelled as AirQualityObserved entities. Thanks to the relation between these two models, the main goal of SALTED can be fulfilled: the generation of enriched data which encourage the development of innovative value-added services.

The extension of the data model AgriParcel can be found in the official Smart Data Models repository.
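
The hasAirQualityObserved relationship described above can be sketched in Python as follows. This is a minimal illustration, assuming the standard NGSI-LD encoding of a relationship (`"type": "Relationship"` plus an `object` URI). The entity identifiers, the `co2` property name and its value, and the `follow_relationship` helper are assumptions made for this example; a real payload would also carry an `@context`.

```python
import json

# Illustrative entities; identifiers and the CO2 value are made up for this sketch.
observation = {
    "id": "urn:ngsi-ld:AirQualityObserved:example-obs-1",
    "type": "AirQualityObserved",
    "co2": {"type": "Property", "value": 415.0},
}

parcel = {
    "id": "urn:ngsi-ld:AgriParcel:example-parcel-1",
    "type": "AgriParcel",
    # The relationship added to AgriParcel, pointing at one observation.
    "hasAirQualityObserved": {"type": "Relationship", "object": observation["id"]},
}


def follow_relationship(entity, relationship, store):
    """Return the entity that `relationship` points to, or None if absent."""
    attribute = entity.get(relationship)
    if not attribute or attribute.get("type") != "Relationship":
        return None
    return store.get(attribute["object"])


# A plain dict stands in for a context broker in this sketch.
store = {observation["id"]: observation, parcel["id"]: parcel}
linked = follow_relationship(parcel, "hasAirQualityObserved", store)
print(json.dumps(linked, indent=2))
```

With the relationship in place, a consumer holding only the parcel can reach the CO2 reading gathered above it without knowing the observation identifier in advance.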

D3.1 deliverable image

One of the key goals of SALTED is the publication of the NGSI-LD high-value data generated within the project in the European Data Portal (EDP). This document describes the required connectors that interconnect the SALTED architecture to the EDP, by leveraging a Comprehensive Knowledge Archive Network (CKAN) public instance deployed in between.
The document details the implementation and deployment of the concerned connectors, their internal components, and the data models they use. The resulting platform, which includes the Scorpio Broker, the CKAN instance, and the connectors, is aligned with the main output expected from Work Package 3 in general, and Task 3.1 in particular.
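
To give a feel for what such a connector has to do, the Python sketch below maps an NGSI-LD entity to a request body suitable for CKAN's `package_create` action. It is not the SALTED connector described in the deliverable: the field mapping, the `entity_to_ckan_package` helper, the `ngsi_ld_id` extra key and the organization name are assumptions made for illustration. The only CKAN facts relied on are that dataset names are restricted to lowercase alphanumerics, `-` and `_`, and that `extras` is a list of key/value pairs.

```python
import json
import re


def entity_to_ckan_package(entity, owner_org):
    """Map an NGSI-LD entity to a body for CKAN's package_create action.

    The mapping is a hypothetical choice for this sketch.
    """
    # CKAN dataset names allow only lowercase alphanumerics, '-' and '_'.
    name = re.sub(r"[^a-z0-9_-]+", "-", entity["id"].lower()).strip("-")
    # Use the entity's name property as title when present, else its id.
    title = entity.get("name", {}).get("value", entity["id"])
    return {
        "name": name,
        "title": title,
        "notes": f"NGSI-LD entity of type {entity['type']}",
        "owner_org": owner_org,
        # Keep the original URN so the dataset can be traced back.
        "extras": [{"key": "ngsi_ld_id", "value": entity["id"]}],
    }


entity = {
    "id": "urn:ngsi-ld:AgriParcel:Example-1",
    "type": "AgriParcel",
    "name": {"type": "Property", "value": "Example parcel"},
}
package = entity_to_ckan_package(entity, owner_org="example-org")
print(json.dumps(package, indent=2))
```

The resulting dictionary would be sent as JSON to the CKAN instance's `/api/3/action/package_create` endpoint together with an API token; that step is omitted here so the sketch runs without a server.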

The deliverable can be found in the publications section.

D2.2. Image - Report on Data Modelling and Linking

This document is the D2.2 deliverable on data modelling and linking. It focuses on a thorough definition of the architecture implementation and deployment, including details about how security is handled. Furthermore, D2.2 includes an update on the data modelling introduced in D1 and D2.1, and a review of the linking modules currently implemented.

The deliverable can be found in the publications section of the webpage.

Data Quality Smart Data Models

SALTED aims to provide datasets and data-streams enriched with new properties and relationships, which creates a need to harmonize these data. One of the most relevant initiatives pursuing the standardization of data modelling is the Smart Data Models initiative, with more than 800 data models available to use.

Within the SALTED project, our partner Universidad de Cantabria has contributed to the improvement of the Smart Data Models initiative by releasing a new Smart Data Model for the Data Quality Assessment. The main goal of this new Smart Data Model is to characterize the quality properties of several entities or measurements.

The new data model for DataQualityAssessment can be found in the official Smart Data Models repository: DataQuality.

SALTED D1 deliverable

This document is the D1.1 deliverable on semantic feature extraction for the contextualization of heterogeneous data sources. It introduces the different data sources used in SALTED, as well as the smart data models used to convert them to NGSI-LD. Furthermore, D1.1 defines the injection chains of the different sources and their contributions to the SALTED architecture, with a few possible use cases.

This document has a twofold goal. On the one hand, it is intended for internal use, to drive the design and implementation of subsequent phases of the DET (i.e. the data linking and enrichment toolchain) and of application use cases, both for the project and within the partner organizations. On the other hand, it is publicly available in order to increase awareness of the SALTED activities and their relation with the FIWARE ecosystem, and to highlight the important issue of data treatment and consolidation for interoperable applications and domains.

The deliverable can be found in the publications and resources section of the webpage.

Enabling data spaces: existing developments and challenges

This paper focuses on the concept of data spaces, which can serve as a basis for the future data economy. In data spaces, applicable to various business domains, stakeholders will be able to share data with each other in a controlled way. First, the paper describes the real motivations and needs for enabling data spaces. Second, it highlights the major technical developments in the area of data spaces in the light of open ecosystems and standards. Lastly, it focuses on two key challenges for enabling data spaces: 1) Data interoperability, 2) Data value generation. As a concrete data spaces solution example, this paper proposes the "Green Twin" use case that can be developed as a carbon neutrality solution in the domains of mobility and smart cities.

The paper "Enabling data spaces: existing developments and challenges" can be found in the proceedings of the 1st International Workshop on Data Economy.

The NGSI-LD Scorpio Context Broker has become part of Amazon Web Services, as one of the selected components of their Smart Territory Framework (STF). STF provides a set of tools intended to implement solutions based on the FIWARE ecosystem components. More precisely, thanks to the inclusion of Scorpio, the provision of context information following the NGSI-LD standard from the European Telecommunications Standards Institute (ETSI) is guaranteed.

Developed by our partner NEC Laboratories Europe, Scorpio Context Broker is the first FIWARE component which is fully compliant with the NGSI-LD specification v1.3.1, enabling near-real-time information from different data sources, including Internet of Things objects.

Ernö Kovacs, Senior Manager of Data Ecosystems and Standards at NEC Laboratories Europe, said, “Using Scorpio Broker with an advanced cloud solution like AWS STF, you can build smart services that link different IoT devices and services in very little time, while ensuring semantic interoperability with other devices. Whether it’s a neighborhood connected by IoT devices or a farm optimizing crop yields – the building blocks and the data models remain the same. Scorpio Broker is already being used to advance smart solutions in different regions of the world.”

The Scorpio Context Broker is one of the core components of the SALTED architecture, working as a broker between the apps and the different Data Toolchain Components from SALTED, providing enriched and linked data.
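
As an illustration of how an app might ask an NGSI-LD broker such as Scorpio for entities, the Python sketch below prepares a query against the standard `/ngsi-ld/v1/entities` endpoint. The broker address, the context URL, the entity type and the `build_entity_query` helper are assumptions made for this example; the request is only built, not sent, so the sketch runs without a broker.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_entity_query(broker_url, entity_type, context_url, limit=10):
    """Prepare a GET request listing entities of one type from an NGSI-LD broker."""
    query = urlencode({"type": entity_type, "limit": limit})
    url = f"{broker_url.rstrip('/')}/ngsi-ld/v1/entities?{query}"
    headers = {
        "Accept": "application/ld+json",
        # NGSI-LD lets the client supply the JSON-LD @context via a Link header.
        "Link": (
            f'<{context_url}>; rel="http://www.w3.org/ns/json-ld#context"; '
            'type="application/ld+json"'
        ),
    }
    return Request(url, headers=headers, method="GET")


# Hypothetical broker address and context URL, for illustration only.
request = build_entity_query(
    broker_url="http://localhost:9090",
    entity_type="AgriParcel",
    context_url="https://example.org/context.jsonld",
)
print(request.get_method(), request.full_url)
```

Passing the prepared request to `urllib.request.urlopen` would perform the actual call against a running broker.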

Scorpio in STF from AWS

More information about the inclusion of Scorpio Context Broker as part of the STF from AWS can be found here.

Water Semantic Data

As the main goal of the SALTED initiative is to provide a comprehensive set of tools for the linking and enrichment of data to foster the European digital data market, data harmonization becomes an essential task. Therefore, using and contributing to existing data model standards is of utmost importance to guarantee data interoperability and to ease experimentation and the creation of applications that leverage existing data sources.

Among the initiatives that pursue the standardization of data modelling, we can highlight Smart Data Models, a collaborative program to provide multidomain, standardized, free and open-licensed data models based on real use cases and existing open standards. This initiative is supported by the FIWARE Foundation, IUDX, TM Forum, and OASC, among others, and receives contributions from more than 70 entities.

In this sense, as part of SALTED, our partner Kybeidos has contributed to the improvement of the Smart Data Models initiative by releasing a new Smart Data Model for the Organization data type, mapping the existing model.

The new data model for Organization can be found in the official Smart Data Models repository.


The aim of Situation-Aware Linked heTerogeneous Enriched Data (SALTED) is to add value to existing datasets and data-streams by enriching them through the application of the principles of linked-data, semantics and Artificial Intelligence (AI) and publish the enriched data sets in NGSI-LD as Open Data. This document D2.1 provides an overview of the architecture of the Data Linking and Enrichment processes thus defining the Data Enrichment Toolchain Architecture (short DET architecture). Through that it describes the main project contributions that will be developed and evaluated during the lifetime of the project.

The deliverable can be found in the publications and resources section of the webpage.