asterisk labs

Asterisk Labs and Alberto Arribas from National Oceanography Centre are taking part in a 6 month residency funded by ARIA through their FROST UK joint programme, where we are developing a proposal on strategies for resilient, globally distributed infrastructure, making it possible for everybody, everywhere to easily access and analyse all available environmental data.

The critical bottleneck limiting high-impact environmental research is how difficult it has become to process increasingly huge and complex datasets. To overcome this bottleneck we will build open source, AI-powered software infrastructure and data-as-AI models that are trustworthy and publicly-owned. We will not only build technology, but establish a multi-institutional cooperative, reducing current fragmentation and complexity to massively increase the number of organisations that can access and manipulate Earth-scale environmental data.

Our software infrastructure will simultaneously empower data producers (so they can easily create Earth Embedding models) and data users (so they can easily access and manipulate them). Therefore, we will enable a transformative shift, massively reducing the compute costs and complexity for all.

Our software infrastructure has three key components:

  • Novel architecture enabling collaboration at global scale;
  • Compressing Technology enabling reduction of data volumes for Earth Observations, model data and video/audio data;
  • Protocol and standards enabling PB-scale data use and AI training;

By combining these three key components our software infrastructure will collapse today’s atomised, bespoke and excessively complex approaches, enabling the delivery of high-impact research not only for underpinning environmental monitoring/prediction (e.g. high-resolution data-driven weather and climate models) but for critical, real-world applications, such as:

  • Protecting critical infrastructure;
  • Ensuring food security and human health;
  • Underpinning energy transition;
  • Enabling ecosystem monitoring and protection;

We will work with a wide ecosystem of public institutions, private organisations and researchers to ensure global interoperability and transparent governance. Our FRO will have a not-for-profit structure (limited by guarantee) and we will incentivise collaboration by establishing a multi-institution co-operative that we expect to become financially sustainable by providing training, verification, consultancy, and enterprise services to data providers and users.

By building novel technology and establishing shared ownership we will not only significantly accelerate progress, but create resilience - a topical and pragmatic real-world use of our infrastructure would be data rescue. For example, to create efficient and low-cost proxy AI representations of unique and critically valuable datasets at risk of disappearance due to funding cuts, e.g. Saving USA climate data | BBC - and empower the global community to tackle the triple climate, biodiversity and pollution planetary crisis.