Skip to main content

How PSI's Materials Software and Data Group Uses Renku to Lower the Barrier to Entry for Data Reuse

· 4 min read
Laura Kinkead
Renku Product Manager
Julian Geiger
Materials Software and Data Group

From technical local setup to one-click data exploration: How Renku simplifies researcher access to computational materials science data on the Materials Cloud Archive.

intro-image

The Challenge: Valuable Data Behind Technical Barriers

Computational science generates valuable data that can accelerate research— as long as it’s accessible to the community at large. This is where the Materials Cloud Archive (MCA) was facing a challenge: while the platform was hosting plenty of computational (materials science) research data, accessing it required a rather technical local setup, not accessible to everyone.

This is because the data consists of archives created using the AiiDA workflow manager, and, "exploring AiiDA archives historically required users to install AiiDA locally with complex dependencies like PostgreSQL and RabbitMQ," the AiiDA team explains. In addition, "users also needed prior knowledge of AiiDA's API to explore the data."

The complexity meant valuable research data was effectively locked away from researchers who weren't already familiar with AiiDA. And even experienced users had to manually download archives, configure environments, and set up profiles before beginning the meaningful work of data discovery.

Due to their shared commitment to research data provenance and shared Swiss origins, the AiiDA and Renku development teams have been in touch as early as 2019. The first integration was achieved by creating a template Renku project with AiiDA pre-installed. The connection between the two platforms was then achieved via parameterized launch links that pass specific MCA dataset archive IDs directly into Renku sessions.

mca.png

Now, researchers simply click the Renku logo next to any AiiDA archive on MCA and are automatically launched into a fully configured environment with their selected data ready for exploration.

session.png

Evolution to Renku 2.0: The recent upgrade to Renku 2.0 brought significant improvements to the integration. The new implementation features:

  • True one-click access without intermediate Renku project creation steps
  • Faster startup times by deferring downloads until needed
  • Intelligent customization using archive metadata
  • Enhanced notebooks with AiiDA API examples

You can explore the current implementation on RenkuLab, and learn more about the technical details of the integration on the AiiDA website.

The Impact: Lowering the Barrier to Access

mca-project-page.jpg

What once required complex local installations, prior AiiDA knowledge, and manual configuration steps is now accessible with a single click. Researchers can discover data on MCA and immediately explore it interactively without any local setup or risk of modifying their existing installations.

"Without the infrastructure provided by Renku, the first working online deployment would likely have been achieved much later," notes the development team. While they've also been working on similar infrastructure like an AiiDAlab demo server, the Renku integration was established five years ago and continues to evolve— demonstrating the value of leveraging existing platforms rather than building from scratch.

Looking Forward: Expanding the Partnership

The AiiDA team is exploring further expansions using Renku 2.0's new capabilities:

Expanding data access via Renku Data Connectors: The AiiDA team is exploring Renku 2.0's new data connectors to mount MCA archives directly in project filesystems, which would eliminate download times and storage limitations— crucial for large archives.

Supporting the broader AiiDA App Ecosystem: With Renku 2.0's support for arbitrary Docker images, they're exploring expanding the integration to other AiiDA applications like AiiDAlab. With this, users would be able to use the graphical user interface (GUI) provided by AiiDAlab, rather than just a single jupyter landing notebook.

The Power of Collaboration

"Why reinvent the wheel when a fitting solution already exists? Often (but not always, of course) it can be the quicker path to a result to invest the time finding and familiarizing oneself with existing infrastructure rather than recreating it by yourself from scratch- a classical case of KISS (keep it simple, stupid)." says the AiiDA team. "Renku has been very stable throughout the various steps of the integration. Their team has always been responsive and super helpful with any concerns or technical difficulties."

The Renku-Materials Cloud Archive integration demonstrates how the seamless integration of data repositories and compute platforms enables more researchers to engage with and reuse computational datasets, promoting scientific collaboration.


Want to learn more about how Renku can help democratize access to your research data? Explore RenkuLab or contact our team to discuss your use case.