BenchmarkSet1500: High-Accuracy Excited-State Reference Benchmark Dataset for Organic Semiconductors
BenchmarkSet1500 is an open-access multireference excited-state database established to provide the first dedicated high-accuracy benchmark set for organic semiconductor research. The repository comprises 1,500 small organic molecules with consistently computed vertical excited-state properties obtained using state-averaged complete active space self-consistent field (SA-CASSCF) and strongly contracted N-electron valence state second-order perturbation theory (SC-NEVPT2), alongside the full reproducible workflow code used to generate the dataset. The dataset focuses on systems where single-reference approaches (e.g. TD-DFT) are known to fail, including molecules exhibiting strong static correlation and inverted singlet-triplet gaps. BenchmarkSet1500 is designed to support rigorous method benchmarking, systematic assessment of theory-level performance, development of predictive models, and screening for technologically relevant organic semiconductors. This dataset is available in two forms: (1) a data collection with one entry per molecule which contains curated metadata, optimised geometries (at B3LYP/6-31g* level of theory), complete electronic-structure output files, and computed excited-state energies and oscillator strengths for low-lying singlet and triplet states and (2) a consolidated machine-learning-ready CSV file which aggregates all molecules with their structural descriptors and excited-state properties to enable immediate integration into data-driven workflows.
To use this resource go to the resource landing page.
This resource is part of the High-Accuracy Excited-State Reference Benchmark Dataset for Organic Semiconductors resource theme.
Linked Resources
Qualified Attribution
Further Information
Publisher
Access
Open Access
License
Contact
Citation
0

