A distributed data-mining
software platform for
EXTReme dAta Across the
Compute conTinuum

Delivering a data-driven open-source platform integrating cloud, edge and HPC technologies for trustworthy, accurate, fair and green data mining workflows for high-quality actionable knowledge

Objectives

Enable the development of complex and secure data mining workflows

Develop novel data-driven orchestration mechanisms to efficiently deploy and execute data mining workflows

Deliver the EXTRACT software platform and demonstrate its benefits in two use cases

Fully exploit the performance capabilities of the compute continuum to effectively address extreme data characteristics (high volume, variety, velocity, veracity)  holistically

Foster the adoption of EXTRACT technology by industrial and academic communitie

Use cases

Personalized Evacuation Routing (PER) System

A Personalized Evacuation Routing (PER) System will serve to guide citizens in an urban environment (the city of Venice) through a safe route in real time.

The EXTRACT platform will be used to develop, deploy and execute a data-mining workflow to generate personalized evacuation routes for each citizen, displayed in a mobile phone app, by processing and analysing extreme data composed of Copernicus and Galileo satellite data, IoT sensors installed across the city, 5G mobile signal, and a semantic data lake fusing all this information.

Transient Astrophysics with a Square Kilometer Array Pathfinder  (TASKA)

The Transient Astrophysics with a Square Kilometer Array Pathfinder (TASKA) case will use EXTRACT technology to develop data mining workflows that effectively reduce the huge amount of raw data produced by NenuFAR radio-telescopes by a factor of 100. This will allow the populating of high-quality datasets that will be openly accessible to the astronomy community (through the EOSC portal) to be leveraged for multiple research activities.

Year 2 Wraps Up with Geneva F2F

Year 2 Wraps Up with Geneva F2F

This is the project’s fifth in-person meeting since the project’s inception. With a favorable mid-term review halfway through the project and several dissemination activities showcasing project results, we are confident that 2025 will help us reach our final goals and yield exciting results.

EXTRACT deliverables now available

EXTRACT deliverables now available

Nine deliverables detailing the project's work have been published in the results section of the website. The EXTRACT deliverables offer essential...

Digital Booth in ADR Exhibition

Digital Booth in ADR Exhibition

This year's European Convergence Summit 2024 will include a very special co-located Digital Exhibition on AI, Data and Robotics Technology. EXTRACT...

URV Presents “Data Plug” at CCGRID’24

URV Presents “Data Plug” at CCGRID’24

Aitor Arjona from URV presented in the session on Distributed and Parallel Storate Systems 1#, at the 24th IEEE/ACM International Sypoisum on Cluster, Cloud and Internet Computing (CCGRID), held in Philadelphia, USA from 6-9 May 2024.