BenchmarkSet1500: High-Accuracy Excited-State Reference Benchmark Dataset for Organic Semiconductors

BenchmarkSet1500 logo

BenchmarkSet1500 is an open-access multireference excited-state database established to provide the first dedicated high-accuracy benchmark set for organic semiconductor research. The repository comprises 1,500 small organic molecules with consistently computed vertical excited-state properties obtained using state-averaged complete active space self-consistent field (SA-CASSCF) and strongly contracted N-electron valence state second-order perturbation theory (SC-NEVPT2), alongside the full reproducible workflow code used to generate the dataset. The dataset focuses on systems where single-reference approaches (e.g. TD-DFT) are known to fail, including molecules exhibiting strong static correlation and inverted singlet-triplet gaps. BenchmarkSet1500 is designed to support rigorous method benchmarking, systematic assessment of theory-level performance, development of predictive models, and screening for technologically relevant organic semiconductors. This dataset is available in two forms: (1) a data collection with one entry per molecule which contains curated metadata, optimised geometries (at B3LYP/6-31g* level of theory), complete electronic-structure output files, and computed excited-state energies and oscillator strengths for low-lying singlet and triplet states and (2) a consolidated machine-learning-ready CSV file which aggregates all molecules with their structural descriptors and excited-state properties to enable immediate integration into data-driven workflows.

To use this resource go to the resource landing page.

Linked Resources

Qualified Attribution

Further Information

Publisher

Access

Open Access

License

Contact

Citation

0

Keywords and Subjects

PSDI-funding-call-FC1-2025
organic semiconductors
excited-state
multireference
time-dependent density functional theory
TD-DFT
complete active space self-consistent field
CASSCF
N-electron valence state second-order perturbation theory
NEVPT2
inverted singlet-triplet gap
INVEST
organic light-emitting diode
OLED
thermally activated delayed fluorescence
TADF
singlet-triplet gap