Delivering a data-driven open-source platform integrating cloud, edge and HPC technologies for trustworthy, accurate, fair and green data mining workflows for high-quality actionable knowledge
Objectives
Enable the development of complex and secure data mining workflows
Develop novel data-driven orchestration mechanisms to efficiently deploy and execute data mining workflows
Deliver the EXTRACT software platform and demonstrate its benefits in two use cases
Fully exploit the performance capabilities of the compute continuum to effectively address extreme data characteristics (high volume, variety, velocity, veracity) holistically
Foster the adoption of EXTRACT technology by industrial and academic communitie
Use cases
Personalized Evacuation Routing (PER) System
A Personalized Evacuation Routing (PER) System will serve to guide citizens in an urban environment (the city of Venice) through a safe route in real time.
The EXTRACT platform will be used to develop, deploy and execute a data-mining workflow to generate personalized evacuation routes for each citizen, displayed in a mobile phone app, by processing and analysing extreme data composed of Copernicus and Galileo satellite data, IoT sensors installed across the city, 5G mobile signal, and a semantic data lake fusing all this information.
Transient Astrophysics with a Square Kilometer Array Pathfinder (TASKA)
The Transient Astrophysics with a Square Kilometer Array Pathfinder (TASKA) case will use EXTRACT technology to develop data mining workflows that effectively reduce the huge amount of raw data produced by NenuFAR radio-telescopes by a factor of 100. This will allow the populating of high-quality datasets that will be openly accessible to the astronomy community (through the EOSC portal) to be leveraged for multiple research activities.
Sustainability in the EXTRACT project
n the EXTRACT project, sustainability is a key driver for scientific, technical and business decisions. On the technical and scientific levels, Open Science principles are applied to facilitate the use of results by other parties. Adequation with current and future needs is also essential to ensure the sustainable development of the project.
Advancing the compute continuum at EXTRACT-organized WS at HiPEAC
EXTRACT had a strong presence at HiPEAC 2025, through the co-organized workshop “Advancing the compute continuum: intelligent solutions for extreme data and 5G-enabled ecosystems” with presentations given by Barelona Supercomputing Center and Observatoire de Paris.
TASKA use case presented at key astronomy events
The EXTRACT project is leveraging cutting-edge data processing technology across the compute continuum (edge, cloud and HPC) that is being tested in...
EXTRACT INVITED TALK on the Compute Continnum @ 2024 AEiC
The 28th Ada-Europe International Conference on Reliable Software Technologies (AEiC 2024) was held in Barcelona from 11-14 June 2024. The...
Digital Booth in ADR Exhibition
This year's European Convergence Summit 2024 will include a very special co-located Digital Exhibition on AI, Data and Robotics Technology. EXTRACT...
URV Presents “Data Plug” at CCGRID’24
Aitor Arjona from URV presented in the session on Distributed and Parallel Storate Systems 1#, at the 24th IEEE/ACM International Sypoisum on Cluster, Cloud and Internet Computing (CCGRID), held in Philadelphia, USA from 6-9 May 2024.





