Data has become one of the most valuable assets driving the digital transformation. Current data mining solutions are optimized to deal with specific data requirements but fail to cope as the data characteristics become extreme.

To fill this technological gap, EXTRACT will deliver a data-driven open-source software platform integrating the most relevant technologies, to facilitate the development of trustworthy, accurate, fair and green data mining workflows able to generate high-quality actionable knowledge.

The EXTRACT platform will improve the complete lifecycle of extreme data mining workflows, significantly enhancing performance, energy-efficiency, scalability and security, while fulfilling the extreme data characteristics in a holistic way.

Moreover, multiple computing technologies, from edge to cloud to HPC, will be integrated into a unified and secure compute continuum.

Specifically, the platform will feauture:

Enhanced data infrastructures and AI & big-data frameworks

Novel data-driven orchestration and distributed monitoring mechanisms

A unified continuum abstraction and cybersecurity

Novel data-driven orchestration and distributed monitoring mechanisms

The EXTRACT platform will be validated in two real-world use-cases with different extreme data requirements:

Process extreme data from 2000 radio-telescopes for the real-time assessment of solar activity, generating knowledge for further scientific exploitation.

Integrates data from the European data sources, Copernicus and Galileo, with 5G localization signals and smart city IoT sensors for civilian-centric crisis management.