NSF POSE Phase II proposal submitted 9/2/2025.

Data Access

Datasets can be local or remote (e.g., in cloud storage), and a wide variety of file formats are supported. The large, multidimensional datasets required for modern scientific analysis are supported natively. Adding a completely new data format often requires only a few lines of code.

Data can be anywhere

Local data

Data on your machine can be loaded directly into memory or, for very large files, lazy-loaded from disk.
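The difference between the two local loading modes can be illustrated with a minimal sketch. It uses NumPy's memory mapping as a stand-in for lazy loading; this is an assumption chosen for illustration, not necessarily the mechanism used here, and the file name and array size are made up.

```python
import os
import tempfile

import numpy as np

# Write a small example array to disk so the sketch is self-contained.
# (Illustrative file; a real dataset would be far larger.)
path = os.path.join(tempfile.mkdtemp(), "example.npy")
np.save(path, np.arange(1_000_000, dtype=np.float32).reshape(1000, 1000))

# Eager: the whole array is read into memory at once.
in_memory = np.load(path)

# Lazy: the file is memory-mapped, so data is read from disk
# only for the regions that are actually accessed.
lazy = np.load(path, mmap_mode="r")

# Slicing the memory map pulls in just the requested region.
tile = np.asarray(lazy[10:12, 0:3])
```

Both objects support the same indexing, so downstream code does not need to know which mode was used. The lazy form keeps memory use roughly proportional to what is viewed rather than to the file size.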

Remote data

Remote datasets can either be streamed on demand or downloaded and loaded into memory.
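As a rough sketch of the two remote access patterns, the snippet below uses only the Python standard library. The helper names `download` and `stream_chunks` are hypothetical, and a `file://` URL stands in for a real remote address so the example runs without a network connection. Production tools often use HTTP range requests or cloud-storage libraries instead.

```python
import shutil
import tempfile
import urllib.request
from pathlib import Path


def download(url, dest):
    """Copy the whole resource to a local file and return its path."""
    with urllib.request.urlopen(url) as response, open(dest, "wb") as out:
        shutil.copyfileobj(response, out)
    return Path(dest)


def stream_chunks(url, chunk_size=1 << 16):
    """Yield the resource piece by piece without holding it all in memory."""
    with urllib.request.urlopen(url) as response:
        while True:
            chunk = response.read(chunk_size)
            if not chunk:
                break
            yield chunk


# Stand-in for a remote dataset: a local file addressed by URL.
workdir = Path(tempfile.mkdtemp())
source = workdir / "remote.bin"
source.write_bytes(bytes(range(256)) * 1024)  # 256 KiB of example data
url = source.as_uri()

local_copy = download(url, workdir / "copy.bin")           # download first
streamed_bytes = sum(len(c) for c in stream_chunks(url))   # stream in chunks
```

Downloading pays the full transfer cost up front but makes later access fast. Streaming starts immediately and suits datasets that are larger than local storage or only partly needed.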

Data can be BIG

🧬 A large (~60 GB) multi-resolution image can be interactively panned and zoomed while remaining entirely on disk.

Most common data formats are already supported

Many common scientific file formats are supported, along with domain-specific datasets, by leveraging open-source libraries that read a wide variety of underlying formats.

🪐 Astro Favorites

🧬 Bio Favorites

New data loaders are easy

If a Python library exists to read your data file format, it is often just a few lines of code to define a custom data loader.
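To give a sense of scale, here is a minimal sketch of what a loader registration pattern can look like. The registry, the `register_loader` decorator, and the `load` function are hypothetical names invented for this illustration, not this project's API. NumPy's `genfromtxt` plays the role of the existing Python library that already reads the format.

```python
import tempfile
from pathlib import Path

import numpy as np

LOADERS = {}  # hypothetical registry: file extension -> loader function


def register_loader(extension):
    """Decorator that registers a function as the loader for an extension."""
    def decorator(func):
        LOADERS[extension.lower()] = func
        return func
    return decorator


# The custom loader itself: a few lines wrapping an existing reader.
@register_loader(".csv")
def load_csv(path):
    table = np.genfromtxt(path, delimiter=",", names=True)
    return {name: table[name] for name in table.dtype.names}


def load(path):
    """Dispatch to whichever loader is registered for the file's extension."""
    path = Path(path)
    try:
        loader = LOADERS[path.suffix.lower()]
    except KeyError:
        raise ValueError(f"No loader registered for {path.suffix!r}") from None
    return loader(path)


# Example usage with a small illustrative file.
example = Path(tempfile.mkdtemp()) / "points.csv"
example.write_text("x,y\n1,2\n3,4\n")
data = load(example)
```

The design choice is that the loader only converts a file into named arrays. Everything else (dispatch, display, analysis) stays generic, which is why supporting a new format can stay this small.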

New: LLM (Chatbot) Data Loaders

Thanks to rapid advances in AI chatbot capabilities, LIVE collaborators are experimenting with AI-enhanced data discovery and loading tools. Watch this space for updates!
