ESG-SASB Label Stability: A Curated Benchmark and Reproducible Pipeline for Reusing Sentence-Level Sustainability Disclosure Labels

Yufei Li, Tianhao Chen, Wei Ke et al.

July 3, 2026

Abstract

Annotated text datasets are increasingly reused as classifier targets, annotation candidates, and inputs to aggregate profiles, yet their labels often circulate without enough information about how they were produced. This article presents a reproducible benchmark and validation workflow for the public SASB-Aligned ESG Sentences corpus, a sentence-level sustainability disclosure dataset organized around standards-based categories such as those used in Sustainability Accounting Standards Board (SASB) analytics. Using the downloaded 6460-row version of the corpus, we construct fixed train/validation/test splits, map released child labels to parent categories, and evaluate label reuse through supervised classifiers, prompted GPT-4o classification, blind and candidate-visible Claude annotation, and Monte Carlo aggregation into ESG/Non-ESG category profiles. The reproducibility artifacts provide split metadata, label mappings, prompt templates, model predictions, LLM annotation outputs, profile sensitivity outputs, figure inputs, and scripts for reproducing the reported tables and figures. Results show that label reproduction is strongest at coarser label levels, blind annotation flags 40.3% of held-out sentences as ambiguous, candidate-visible annotation increases agreement while changing the task format, and aggregate profiles remain sensitive to label source. The benchmark supports transparent reuse of sentence-level ESG labels by reporting label source, annotation condition, prompt family, and aggregation level.
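The workflow summarized above (fixed splits, child-to-parent label mapping, and Monte Carlo aggregation into category profiles) can be illustrated with a minimal sketch. It is not the authors' released code. The label names in `CHILD_TO_PARENT`, the 80/10/10 split proportions, the seeds, and the use of bootstrap resampling as the Monte Carlo step are all assumptions made for illustration; the actual mapping, split metadata, and aggregation procedure are those shipped with the reproducibility artifacts.

```python
import random
from collections import Counter

# Hypothetical child-to-parent mapping, for illustration only.
# The real mapping is provided with the paper's artifacts.
CHILD_TO_PARENT = {
    "GHG Emissions": "Environment",
    "Water Management": "Environment",
    "Data Security": "Social Capital",
    "Non-ESG": "Non-ESG",
}


def fixed_split(n_rows, seed=0, train_pct=80, val_pct=10):
    """Return deterministic train/validation/test index lists.

    The percentages and seed are assumed values, not the paper's.
    """
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)  # seeded, so the split is repeatable
    n_train = n_rows * train_pct // 100
    n_val = n_rows * val_pct // 100
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]
    return train, val, test


def to_parent(child_labels):
    """Map released child labels to their parent categories."""
    return [CHILD_TO_PARENT[label] for label in child_labels]


def monte_carlo_profile(labels, n_draws=200, seed=0):
    """Estimate mean category shares by resampling labels with replacement.

    Bootstrap resampling stands in here for the paper's Monte Carlo
    aggregation, whose exact procedure the abstract does not specify.
    """
    rng = random.Random(seed)
    totals = Counter()
    for _ in range(n_draws):
        sample = rng.choices(labels, k=len(labels))
        for category, count in Counter(sample).items():
            totals[category] += count / len(labels)
    return {category: total / n_draws for category, total in totals.items()}


if __name__ == "__main__":
    train, val, test = fixed_split(6460)
    print(len(train), len(val), len(test))
    parents = to_parent(["GHG Emissions", "Data Security", "Non-ESG", "Non-ESG"])
    print(monte_carlo_profile(parents))
```

Running the same aggregation on labels from different sources (released labels, classifier predictions, LLM annotations) and comparing the resulting profiles is one way to see the label-source sensitivity the abstract reports.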

Metadata

DOI: 10.3390/informatics13070106
License: CC BY 4.0

IPC Classification

G06

Keywords

esg-sasb, label, stability, curated, benchmark, reproducible, pipeline, reusing, sentence-level, sustainability, disclosure, labels, informatics, annotated, text, datasets, increasingly, reused, classifier, targets, annotation, candidates, inputs, aggregate
