A distributed data-mining software platform for EXTReme dAta Across the Compute conTinuum

Delivering a data-driven open-source platform that integrates cloud, edge and HPC technologies to support trustworthy, accurate, fair and green data mining workflows that produce high-quality actionable knowledge

Objectives

Enable the development of complex and secure data mining workflows

Develop novel data-driven orchestration mechanisms to efficiently deploy and execute data mining workflows

Deliver the EXTRACT software platform and demonstrate its benefits in two use cases

Fully exploit the performance capabilities of the compute continuum to holistically address extreme data characteristics (high volume, variety, velocity, veracity)

Foster the adoption of EXTRACT technology by industrial and academic communities

Use cases

Personalized Evacuation Routing (PER) System

A Personalized Evacuation Routing (PER) System will serve to guide citizens in an urban environment (the city of Venice) through a safe route in real time.

The EXTRACT platform will be used to develop, deploy and execute a data-mining workflow to generate personalized evacuation routes for each citizen, displayed in a mobile phone app, by processing and analysing extreme data composed of Copernicus and Galileo satellite data, IoT sensors installed across the city, 5G mobile signal, and a semantic data lake fusing all this information.

Transient Astrophysics with a Square Kilometer Array Pathfinder (TASKA)

The Transient Astrophysics with a Square Kilometer Array Pathfinder (TASKA) case will use EXTRACT technology to develop data mining workflows that reduce the huge amount of raw data produced by NenuFAR radio-telescopes by a factor of 100. This will make it possible to populate high-quality datasets that are openly accessible to the astronomy community (through the EOSC portal) and can be leveraged for multiple research activities.

Optimizing Compute Continuum Workflows with COMPSs Orchestration

The EXTRACT project tests its technology on the TASKA and PER use cases, which make use of the entire compute continuum (edge, cloud, and High-Performance Computing (HPC) resources). Workflow orchestration is a key project activity that ensures that tasks are executed...

Highlighting DataNexus cluster at EBDVF 2024

The EXTRACT project joined fellow EU-funded projects as part of the DATANEXUS cluster at the European Big Data Value Forum (EBDVF) 2024. From 2 to 4 October, partners Mathema, Binare and Barcelona Supercomputing Center shared information on the project during the...

EXTRACT at the MWC24

The 2024 Mobile World Congress (MWC) was held in Barcelona from 26 to 29 February. Partner Barcelona Supercomputing Center (BSC) attended and presented EXTRACT project technology during two days of the event. Hosted by the GSMA, the MWC Barcelona convenes leaders from...

HiPEAC 2024: Workshop on CompContinuum

Partner Barcelona Supercomputing Center will take part in the HiPEAC 2024 Workshop "CompContinuum: Computing Continuum of Cloud, Edge, and IoT Technologies" on 19 January 2024, from 14:00 to 17:30. This afternoon workshop examines how the computing landscape is rapidly...

URV presents at MIDDLEWARE 2023

BSC partner Universitat Rovira i Virgili (URV) presented its paper, "Glider: Serverless Ephemeral Stateful Near-Data Computation", on 13 December 2023. Researcher Daniel Pons delivered the presentation as part of Session 1A: Serverless. The publication is available here...