Home About the data

About the data

By Euan

• 29 articles

About the SDG categories in Overton

*Describes how policy documents are linked to different SDGs Please note: this functionality was updated and improved in March 2025. If you would like more information on how we previously linked policy documents to SDGs there is more information here. *As well as topics and subject areas Overton tries to map policy documents to one or more Sustainable Development Goals, which are a set of 17 goals set up by the United Nations to serve as a framework for global development. The SDGs are often used as a quick way to group policy or research relating to a particular problem area e.g. climate change, poverty or gender inequality. Each SDG is accompanied by targets which provide specific pathways to achieving the overall goal of a fairer and more sustainable world. Overton uses an advanced multi-label approach, which allows a single classifier to predict multiple categories at the same time. This means our classifier can simultaneously categorize policy documents into multiple Sustainable Development Goals (SDGs) and their corresponding targets. As input it uses the new document descriptions and uses ModernBERT, a powerful language model known for its ability to understand the context of text, and organises the classification process hierarchically. Each SDG is associated with a set of targets. For example SDG 8 “Decent work and economic growth” has 12 targets (see https://en.wikipedia.org/wiki/Sustainable_Development_Goal_8 ) including “Diversify, innovate and upgrade for economic productivity” (target 8.2) and “Promote policies to support job creation and growing enterprises” (target 8.3). Our classifier first matches policy documents to these “targets” and then rolls them up to the parent SDG. We created a large set of training data to ensure coverage of all targets, even for categories with limited data. The performance of our classifier was evaluated using precision and recall, which measure how accurate and consistent the classifier’s predictions are. The results were very promising.

About the data

About the SDG categories in Overton

About topics, entities, subject areas and COFOG

Documents in different languages

Duplicate policy documents in Overton

Funding data in Overton

How Overton collects and displays institution data

How Overton defines policy documents

How are journal subjects assigned?

How are scholarly references matched in policy documents?

How does Overton classify policy sources?

How does Overton find citation contexts?

How does Overton find people mentioned in policy documents?

How does Overton generate document descriptions?

How does Overton know about author affiliations?

How does Overton know who authored a scholarly article?

How far back does the database go?

How international are your sources?

How is OpenAlex used in Overton?

How to reference Overton

How we disambiguate policy documents

Requesting a new policy source

The ari.org.uk dataset

What are data notes?

What are overrepresented topics and how are they found?

What are your criteria for adding new sources?

What is Overton’s coverage and how does it compare to other systems?

What sources does Overton track?

Why am I seeing “unknown date” instead of a publication date?

Why are some authors not appearing in the People tab?