Data Sources

PSDI provides access to many databases and repositories of physical sciences data. PSDI and our partners also provide a range of other data sources of interest to the physical science community that can be accessed through the links on this page.

You can narrow down the list of data sources displayed on this page by selecting from the filters on the left. Click on a data source of interest to find out more about it, access the resource landing page, and to access related resources in the same Resource Theme for those that have one.

BenchmarkSet1500 logo

BenchmarkSet1500: High-Accuracy Excited-State Reference Benchmark Dataset for Organic Semiconductors

BenchmarkSet1500 is an open-access multireference excited-state database established to provide the first dedicated high-accuracy benchmark set for organic semiconductor research. The repository comprises 1,500 small organic molecules with consistently computed vertical excited-state properties obtained using state-averaged complete active space self-consistent field (SA-CASSCF) and strongly contracted N-electron valence state second-order perturbation theory (SC-NEVPT2), alongside the full reproducible workflow code used to generate the dataset. The dataset focuses on systems where single-reference approaches (e.g. TD-DFT) are known to fail, including molecules exhibiting strong static correlation and inverted singlet-triplet gaps. BenchmarkSet1500 is designed to support rigorous method benchmarking, systematic assessment of theory-level performance, development of predictive models, and screening for technologically relevant organic semiconductors. This dataset is available in two forms: (1) a data collection with one entry per molecule which contains curated metadata, optimised geometries (at B3LYP/6-31g* level of theory), complete electronic-structure output files, and computed excited-state energies and oscillator strengths for low-lying singlet and triplet states and (2) a consolidated machine-learning-ready CSV file which aggregates all molecules with their structural descriptors and excited-state properties to enable immediate integration into data-driven workflows.

Flow Battery logo

Flow Battery Experimental Metadata Repository

The Flow Battery Experimental Metadata Repository provides a structured place for contributing and viewing metadata about experimental practices, equipment choices, and procedural decisions made during multi-institution flow battery reproducibility studies. These studies bring together several of the world's leading research groups, national laboratories, and industrial partners to understand how experimental practices influence the repeatability, replicability, and reproducibility of flow battery electrochemical testing. Structured metadata collection allows researchers to compare approaches, identify factors affecting performance variation, and support the development of more consistent testing methodology. In the longer term, the study aims to provide benchmark datasets for use by the wider community and help new entrants to the field by lowering the barrier to consistent and reliable experimental practices. At present, the catalogue is populated with metadata gathered through the multi-institution "Phase 2" studies conducted across 2025/26, with contributions from over approximately 40 participants worldwide. The resource is openly accessible for browsing, allowing the broader community to explore experimental set-ups and methodological variability. In future phases, the catalogue will be open to submissions from the wider community and anticipate running further benchmarking studies to expand the collection. The repository is supported and hosted by PSDI partners on behalf of the flow battery research community.