Introduction

The European Green Deal (EGD) is the growth plan for a climate-neutral Europe by 2050. It covers a wide range of areas, including climate action, energy, agriculture, industry, infrastructure, environment and biodiversity, transportation, finance and development, and research and innovation1. The EGD serves as both an environmental, social, and economic blueprint, covering all sectors through policy areas and initiatives that collectively aim to transform the European Union (EU) into a more sustainable and climate-resilient society while fostering economic growth and social equity.

The United Nations (UN) conceptualize “Sustainability,” as the condition of meeting the needs of the present without compromising the ability of future generations to meet their own needs2, which encompasses an equilibrium between environmental, economic, and social dimensions, ensuring a harmonious relationship between human activities and our planet’s ecosystems. Sustainability, in this context, can refer to all efforts trying to ensure the well-being of current and future generations, within the limits of the natural world. This requires integrated and cross-disciplinary knowledge materialized by policy frameworks3,4.

The UN Agenda 2030, signed by 193 countries, is the global agenda towards the “future we want”, focused on the sustainable development at a global scale by 20305. Its 17 Sustainable Development Goals (SDGs) are the plan for building national, continental, and global investment programs for sustainable development. In particular, the SDGs provide a comprehensive universal framework with tangible and measurable indicators (covering multiple sectors with social, economic, and environmental components) that encourages countries to work towards their achievement6,7. However, the progress on achieving the SDGs (globally and per country) is still slow, and recent evidence suggests only limited transformative political impact of the SDGs8.

In an attempt to provide a broader and at the same time a more concise picture of the necessary actions towards sustainability, Sachs et al.9 presented the Six Transformations as a means to operationalize the SDGs. They practically grouped the 17 goals into six key areas of transformation that societies need to undertake for sustainable development. These are: 1. Education, Gender, and Inequality, recognizing that sustainable development requires a foundation of educated citizens, with equal opportunities for all genders and marginalized groups; 2. Health, Wellbeing, and Demography, addressing demographic changes that can impact societal structures and sustainability outcomes; 3. Energy Decarbonization and Sustainable Industry, calling for a shift to sustainable, renewable energy sources and the decarbonization of industries to mitigate climate change effects and promote economic growth that does not overexploit natural resources; 4. Sustainable Food, Land, Water, and Oceans, aiming to manage natural resources responsibly, and ensure food security, sustainable agriculture, freshwater availability, and marine conservation; 5. Sustainable Cities and Communities, including urban planning, infrastructure, and innovation to support community well-being and environmental health; and 6. Digital Revolution for Sustainable Development, which recognizes the role of technology and digital innovation in accelerating progress towards the SDGs.

Motivated by the ongoing efforts from the EGD, the SDGs, and the research spaces towards sustainability, we explore how the EGD is aligned to the SDG UN Agenda 2030. The aim of this paper is to and quantify how much each SDG and also each of the Six Transformations are supported by the EDG-related policies. By thoroughly assessing the relation of all EGD’s central policies and strategies published during 2019–2023 we reveal the major and most overlooked areas of the UN Agenda 2030, in terms of the political attention they have received. A novel machine-learning-based text-mining method (ML), validated and enhanced by a human-based textual analysis is applied to 74 EGD related policy documents. The alignment of the EGD policies with the six necessary transformations for Sustainable Development proposed by Sachs et al.9 is also quantified to provide policymakers with more insights into the policy areas that should have progressed more, given their presence in EU policy, and others that should be further supported.

Since the introduction of the EGD in December 2019, the European Commission has launched a plethora of policies, regulations, recommendations, and strategy documents to support the actions required by the EU Member States to achieve its goals. The Commission President, Ursula von der Leyen, is committed to integrating the SDGs into European Semester, the EU’s budgeting processes10, while the Annual Growth Strategy outlines that “this economic agenda must transform the Union into a sustainable economy, helping the EU and its Member States to achieve the United Nations Sustainable Development Goals”11. Arguably, achieving the sustainability agenda is a difficult and ambitious task. But this statement clearly indicates that although the EGD is focused on climate-neutrality, it is an opportunity to achieve a broader vision, that of sustainable development. This rationale is backed up by research in the field arguing that sustainability goes beyond decarbonization and climate-neutrality, and that all SDG areas are complementary and necessary requirements to achieve each one of them (e.g., we cannot talk about a sustainable world even we use green energy, but we experience wars, inequalities, poverty, etc.)12,13,14. Since all SDGs are interconnected, and a truly sustainable future passes through their complementary achievement15, the EGD should take the opportunity and evolve EU’s development with a holistically sustainable manner.

However, evidence increasingly shows that the progress towards the achievement of the SDGs is poor16. EU still faces several challenges for the achievement of SDGs17,18. After the COVID-19 pandemic, the war in Ukraine have further slowed down the successful implementation of several EGD policies and the SDG progress in Europe, on top of other issues such as outsourcing and its considerable environmental footprint19,20. So, urgent action is required, and improved understanding of the underlying policy mechanisms is key to implement successful measures21,22. Monitoring the impact of the SDGs, along with tracking their progress are expected to be cornerstones for the sustainable development18,23,24. Our paper builds towards this direction by assessing the integration of SDGs in the EGD-related policies.

Although researchers have studied the progress of the SDGs in EU18,25, as well as the impact of the EGD on multiple sectors (e.g.,26,27,28), the interlinkages between the EGD and the SDGs are an overlooked issue. Despite research on specific countries29 and the exploration of specific SDGs under certain policies, or methodological contributions aiming to improve their monitoring30, an overview of all SDGs into the EGD policies is missing. The alternative grouping of the SDGs as six broad necessary transformations towards sustainability, as proposed by Sachs et al.9, still remains unexplored in terms of policy-coverage. The literature has been slow to examine how these domains have been considered into the policy agendas, such as the EGD, and this paper tries to fill this gap, in order to reveal which policy areas, need further support. Having this information is crucial, as it can directly influence future policies and EU Member-States fund planning, to speed up the sustainability transition. Another very important aspect is the addition of the time-dimension. In particular, according to several studies, assessing sustainable development and policy influence over time, towards the 2030 Agenda’s objectives is a key research area, that remains under-explored4,18,23,31. This paper contributes to this aspect as the integration of SDGs in the EGD-related policies is assessed over time, for the period 2019–2023.

Our assessment combines a machine-learning (ML) language processing approach, with a human-based approach. Artificial Intelligence (AI) and ML have the potential to contribute with various ways to a sustainable development32. ML language processing models can identify hidden elements and patterns in complex policy documents33, and have been used for detecting sustainability-related objectives. Matsui et al.34 used ML natural language processing modeling for translating semantics, visualizing nexus patterns, and connecting stakeholders using the SDGs. Hajikhani and Suominen35 have used ML to explore the presence of SDGs in publications and patents. Porciello et al.36 highlight the advantages of using ML approaches for synthesizing evidence from the literature on SDGs, compared to the quite time-consuming evidence synthesis by human work, in terms of speed and accuracy. There are fewer applications on identifying the SDG policy-related influence, with the exception of Smith et al.37 who used ML language processing analysis to detect policy and scientific discourse around SDGs, focusing on identifying overlaps in international policies. To our knowledge, an analysis quantifying the presence of SDGs and/or the Six Transformations in EGD has not been performed so far.

This paper contributes to the existing literature by assessing the EGD-SDG alignment, revealing the degree to which SDGs are represented by different EU policy areas. Thus, the results are expected to provide useful insights with respect to the areas that are need to be strengthened. Such insights are of direct interest to researchers and policymakers, as they contribute to an improved understanding of the hidden elements and interconnections among the different policies and SDGs, and outline priorities for action towards more inclusive and sustainable policies. To further enhance our work’s policy implications, we reflect on our findings and provide strategic recommendations for addressing the significant issues and current gaps that are likely to arise through the implementation of the ambitious sustainability agenda.

Results

Initial alignment evaluation

The ML model provided the similarity scores representing the probability of a given EGD policy document being relevant to a specific SDG. These scores are presented as percentages, with higher percentages suggesting a stronger likelihood of the policy being relevant to the corresponding SDG. Figure 1 provides a representative selection of these policies, showing indicatively four well-known policies with their respective scores. For a comprehensive view of all 74 policies and their SDG - similarity scores, please refer to the Supplementary file (Supplementary Table 2).

Fig. 1: Initial alignment scores of selected EGD Policies with SDGs.
figure 1

Represents the preliminary alignment probabilities of selected EGD policies with the SDGs, visualized through a color gradient from red (low alignment) to green (high alignment).

These initial results reveal that each policy primarily has impact mainly on one or two SDGs. Figure 1 shows a heatmap visualization of the initial similarity scores between EGD and the SDGs, prior to rescaling for four selected policies. This is an excerpt from the comprehensive list of policies available at the supplementary file (Supplementary Table 2). For instance, EU Energy Integration Strategy has an almost perfect alignment (99.12%) with SDG7 (Affordable and clean energy). Similarly, the EU Hydrogen Strategy also exhibits a substantial link to SDG7, with a score of 94.07%. However, some policies have a more diverse impact. Particularly, Circular Economy Plan contributes significantly to three SDGs: SDG17 (Partnerships for the Goals), SDG12 (Responsible consumption and production), and SDG16 (Peace, justice and strong institutions) with similarity scores of 40.15, 23.45, and 7.75% respectively. The Sustainable Growth Strategy 2021 largely impacts SDG9 (Industry, innovation and infrastructure), SDG17 (Partnerships for the Goals), and SDG7 (Affordable and clean energy), with similarity scores of 22.86, 19.22, and 11.62%, respectively.

Rescaled similarity scores

In order to reveal and highlight the deeper shades of the EGD policy-SDG interplay, we performed a data transformation based on these initial findings. The algorithm allows rescaling the similarity scores of the secondary SDGs, adding up to 100% after omitting the prevailing one with the highest score, provides a more nuanced understanding of the data by de-emphasizing the most dominant connections and focusing more on secondary and tertiary linkages. This enables a more detailed exploration of the policies’ impacts across a broader range of SDGs, their ranking, and reveals potential indirect effects and interactions that might have been overshadowed in the initial results. Figure 2 provides a representative selection of the same indicative four well-known policies and their respective final scores. Table S3 of the Supplementary file includes the detailed list with all 74 policies with their SDG - similarity scores.

Fig. 2: Detailed alignment scores of EGD Policies with SDGs.
figure 2

Provides a detailed view of the alignment scores between selected EGD policies and the SDGs, using the same color gradient for visualization, as in Fig. 1.

SDG representation in EGD Policies

These results show that the Circular Economy Plan aligns particularly strongly with SDG12 (Responsible Consumption and Production) at 39.18%, which is reasonable as the plan focuses on sustainability in resource use and waste reduction. The EU Energy Integration Strategy and the EU Hydrogen Strategy have substantial alignment with SDG12 and SDG13 (Climate action), reflecting the focus on integrating energy systems and addressing climate change. The Sustainable Growth Strategy 2021 shows significant alignment with SDG17 (Partnerships for the Goals), suggesting an emphasis on collaborative efforts to drive sustainable growth. The relations reported in Fig. 1 are expected, given the nature of the policies examined, indicating their association to a certain SDG, but the analysis allowed us to quantify the similarity scores for each SDG. The results of Fig. 2, however, were not so obvious from the beginning, indicating the value of the ML approach with respect to a deeper interpretability of the results, spanning across more than one main SDG target.

The results summary for all the 74 EGD policies are depicted in Fig. 3. Notably, SDG7 (Affordable and Clean Energy), SDG12 (Responsible Consumption and Production), SDG13 (Climate Action) and SDG17 (Partnerships for the Goals) emerge as the most prominent in terms of their alignment with EGD policies.

Fig. 3: Overview of EGD Policy Contributions to SDGs Over Time.
figure 3

Displays the evolution of EGD policies’ contributions to SDGs from 2019 to 2023, using similarity scores and percentage shares.

The prominence of SDGs 7, 12, and 13 in the alignment with EGD policies could be attributed to several factors. The EGD inherently emphasizes climate-neutrality, clean energy, and climate action, directly correlating with the themes of SDG7, SDG12, and SDG13. These areas were identified as urgent priorities within the EU’s strategic goals. Additionally, the high score of SDG17 indicates that partnerships are considered vital for a continent-wide initiative like the EGD, as achieving sustainability goals requires cooperation between member states, private sector actors, and international bodies.

These targets are followed by other important SDGs, namely SDG 9 (Industry, Innovation & Infrastructure), SDG11 (Sustainable Cities and Communities), SDG14 (Life Below Water), and SDG15 (Life on Land). These rankings underscore the emphasis on biodiversity and sustainable urban and industrial development within the EGD.

The SDGs that appear to have received ‘medium’ attention in the EGD context include SDG2 (Zero Hunger), SDG3 (Good Health and Well-being), SDG6 (Clean Water and Sanitation), SDG8 (Decent Work and Economic Growth), and SDG16 (Peace, Justice, and Strong Institutions). This indicates a moderate emphasis on crucial aspects of social welfare and environmental health.

The less represented SDGs in the EGD policies include SDG1 (No Poverty), SDG4 (Quality education), SDG5 (Gender equality), and SDG10 (Reduced inequalities). These SDGs showing the least alignment with EGD initiatives, might be due to their more multidimensional nature involving complex social factors that are harder to address directly through environmental policy frameworks. While environmental policies can be more sector-specific, social goals require cross-sectoral efforts that integrate various policy domains, which may be less developed within the current scope of the EGD. Another explanation might be a stronger political will for immediate decarbonization and renewable energy action, compared to the more systemic and long-term societal changes needed for a more sustainable world across all domains.

These results from Fig. 3 showing the degree of the representation of each SDG in the EGD, are also summarized in Table 1.

Table 1 SDG representation in EGD Policies Ranked

Examining the attention that each SDG received over time by comparing the percentage shares of the similarity scores (Fig. 3), it is obvious that in 2019 only SDG7 (Affordable and Clean Energy) and SDG13 (Climate Action) were mobilized by the EGD, signaling the significance and prioritization of these two areas in the EU’s sustainability agenda. Nevertheless, from 2020 onwards, the policy contributions become more diversified across different SDGs, indicating an expansion of the EGD’s policy scope. SDG9 (Industry, Innovation & Infrastructure) had the comparatively larger ‘attention’ increase in 2020 (around 30%). This trend for SDG9 continued in 2021, along with SDG8 (Decent Work and Economic Growth), SDG13 (Climate Action), and SDG11 (Sustainable Cities and Communities) (all spanning from around 45–55%). A significant increase in the alignment of EGD policies with SDG6 (Clean Water and Sanitation) was observed in 2022, more than any other SDG (around 45%), suggesting a strengthened focus on environmental health. However, this was the case only for 2022 (e.g., SDG6’s share in 2023 is around 10%). Some generally ‘overlooked’ SDGs have received the most attention in 2023. In particular, SDG5’s (Gender Equality) alignment with EGD policies peaks in 2023 (43%). Same applied for SDG2 (No Hunger) (41%), and SDG14 (Life Below Water) (38%), which received their highest attention in 2023.

These results of Fig. 3 showing the evolution of the representation of each SDG in the EGD over time, are presented in Table 2 below.

Table 2 Comparative SDG alignment with EGD Policies

It is important to note that the SDGs that were traditionally central in EGD policy, received their comparably less attention over the past two years (2022–23), such as SDG7 and SDG13. The declines in the alignment of EGD policy with SDG7 from 2021 to 2023 indicates potential shifts in policy focus that may necessitate strategic adjustments to sustain progress. The SDGs that have overall received less attention, having consistently low similarity scores, exhibit similar distribution of percentage shares across the years, suggesting that there was not any trend of shift in priority for addressing these gaps.

Human textual analysis validation

The human textual analysis was performed as a complementary process for the validation of the ML text-mining results. A comprehensive review of all 74 policy documents was undertaken to ensure the accuracy and plausibility of the ML findings. The outcome confirmed the validity of the ML approach’s results, in terms of the prevalence of SDGs 7, 13, and 9, as well as the gaps with respect to the SDGs 5, 10, 4, 1, 2, 3, 6, and 16. It is noteworthy that SDG3 (Health and Well-being) received little attention, even during the peak of the COVID-19 pandemic. Similarly, another continuously overlooked area is the SDG16 (Peace, Justice, and Strong Institutions) with increasing and ongoing wars since 2022.

Six transformations analysis

As mentioned earlier, the same analysis was conducted to illustrate the relations of the EGD policies with the broader sustainability concept in terms of the Six Transformations. The results are summarized in Fig. 4. In line with the results presented above, the main focus areas have been: no.3 Energy Decarbonization and Sustainable Industry, no.4 Sustainable Food, Land, Water, and Oceans, and no.6 Digital Revolution for Sustainable Development. A medium attention is observed in the areas no.5 Sustainable Cities and Communities, and no.1 Education, Gender, and Inequality, while the less represented area is still no.2 Health, Wellbeing and Demography. According to Fig. 4, the annual distribution of policy influence of the Six Transformations has been consistent over the years, with the most areas receiving their highest attention in 2021.

Fig. 4: Impact of six transformations on EGD Policies’ Evolution.
figure 4

Illustrates the impact of the Six Transformations on EGD policies from 2019 to 2023, showing changes in similarity scores and percentage shares. More details on the data processing and extensive results can be found in the Supplementary file (Supplementary Tables 4, 5, 6, Supplementary Fig. 1).

Discussion

In this paper we explored whether and how the SDGs, and the Six Transformations, fit inside the EGD’s policies, by combining ML and human text-mining approaches. By identifying policy areas that are most and least represented in terms of their sustainability themes we can provide to policymakers a tangible set of SDGs/ areas that need to be strengthened in terms of policies and investments, respectively.

The main findings suggest that EGD policies are heavily focused on specific areas so far, such as decarbonization and clean energy, and growth (production-consumption). Consistently overlooked areas pertain to social sustainability components, namely tackling inequalities, poverty, hunger, improving health and well-being, education, and achieving peace, justice, and decent work.

These findings are more evident in the detailed analysis considering the SDGs, where the specific areas can be assessed, compared to the more high-level, “aggregated” view of the Six Transformations. For instance, the relatively high scores of the Six Transformations’ areas such as no.1 Education, Gender, and Inequality and no.4 Sustainable Food, Land, Water, and Oceans, are not representative of all the sub-areas included. From the more detailed analysis presented by assessing the SDGs-related policies, it is clear that all the social-related SDGs and certain environmental ones (e.g., clean water) are in principle overlooked. In general, as noted by Di Lucia et al.38, “decision-makers prioritize methods that are simple and flexible to apply and able to provide directly actionable and understandable results. They are less concerned with the accuracy, precision, completeness or quantitative nature of the knowledge”. However, in the context of multi-dimensional and complex policies, we demonstrate the crucial need to examine the detailed progress across specific areas (as e.g., reflected by the SDGs) for more efficient and targeted policy improvements.

The EGD is inherently focused on climate-neutrality, so decarbonization and clean energy without significant production losses are expected to be its core. However, this is not necessarily sustainable, as previous research has pointed out39, neither efficient, since it cannot address persisting social issues40. As Fankhauser et al.41 explain, the climate neutrality concepts have emerged from physical climate science, but cannot be operationalized through social, political and economic systems, if broader sustainable development objectives such as equitable net-zero transition, socio-ecological sustainability and the pursuit of broad economic opportunities, are missing. This concern is being increasingly expressed, by questioning the justice of climate politics42,43, and their social legitimacy44. Considering the policy areas that have received less attention within the EGD policies, it would be naïve to assume that they were reasonably under-represented because their progress is satisfactory. On 24 February 2022 the ongoing war in Ukraine begun, with multiple sustainability impacts45. Nevertheless, “Peace, Justice, and Strong Institutions” (SDG16), “No Poverty” (SDG1), “Zero-Hunger” (SDG2), and “Good Health and Well-being” (SDG3) continue to be overlooked and have represented missed opportunities, even by the Recovery and Resilience Fund. However, such social aspects of the SDGs are necessary for reducing inequalities and contribute to a truly sustainable future. We believe that future EU policies should have a special funding focus on these areas.

Besides economically efficient climate-neutrality, EGD is and should be regarded as an opportunity for the achievement of a broader sustainability vision, which cannot be achieved only by focusing on specific SDGs. A world with wars, recessions and inequality cannot be considered to be sustainable, even if it is climate-neutral and the industry maintains high productivity rates. Sustainability emanates from all areas, as outlined in the SDGs, and these should be regarded as complementary policy actions, that should be reflected in the future policies. Future EU policies should distribute the necessary funds to multiple, cross-SDG objectives, particularly focusing on social and environmental aspects, that will facilitate the achievement of the current EGD’s targets.

Methods

In our attempt to reveal the sustainability components of the EGD policies, a mapping of those policies from 2019–2023 was performed. These sustainability components were found in: (a) the SDGs, as they represent a holistic sustainability measure; (b) the Six Transformations proposed by Sachs et al.9, as they can provide the policymakers with something broader (or conceptually more high-level) than the many SDGs. By assessing both the SDGs and the Six Transformations, we provide a thorough and holistic evaluation capturing all elements of the different policy documents, aiming to identify overlooked areas in sustainability terms. These will be key areas of focus for further policy support and orienting the necessary funds to enhance their sustainability angle.

Policy documents selection

The sample of policies selected included all policy and strategy documents derived from the EGD, spanning from its inception in December 2019 to the latest policies up to November 2023, ensuring an up-to-date dataset. This allowed us to assess the progress of each policy area by year (Table 3). The full list of policies along with a short description for each one and their detailed list of references, according to the EUR-Lex46 are available in the Supplementary File (Supplementary Table 1).

Table 3 Analysis of EGD policy publication frequency by year

Detecting connections with text-mining

To present an integrated mapping of the policies of Table 3 and their joint implementation with the 17 SDGs and the Six Transformations, a methodology based on Machine Learning (ML) text-mining was applied, and in specifically, on Deep Learning (DL) techniques. DL refers to extensive neural networks with many layers (deep) that “allow computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction”47. Simply put, machines learn from experience by representations that are expressed in terms of other, more straightforward representations, and this is particularly useful for text-mining48. The model chosen for the analysis was BERT, a pre-trained transformer-based model. BERT stands for “Bidirectional Encoder Representations from Transformers” and was first introduced by Google Research in 201849. The BERT model is pretrained using a combination of the Toronto Book Corpus and Wikipedia, totaling around 2.5 billion words. The “bert-base-uncased” variant, with 12 layers, 768 hidden dimensions, 12 heads, and 110 million parameters, was chosen for this study. BERT’s architecture focuses on learning deep bidirectional representations from text, considering context from both sides. The ‘base’ version contains 12 transformer layers, each outputting a 768-dimensional hidden state, and is trained on lowercase text (uncased). This model is widely used in NLP tasks like question answering and sentiment analysis. BERT was fine-tuned by employing the OSDG Community Dataset, incorporating targets and indicators for each goal50. For goals lacking text, relevant expressions from policies were used. The dataset is curated by over 1400 citizen scientists from over 140 countries, labeling sentences related to each SDG51. More information about the model’s specifications can be found in the Supplementary file.

The dataset used in this paper was formed after text excerpts of paragraphs deriving from public documents, such as reports, policies, and publication abstracts. Furthermore, some documents originate from UN-related sources e.g.,52, and SDG Library53. The dataset includes over 42,000 labeled document excerpts and it contains the referred SDG, the number of volunteers participating in the OSDG’s classification process that classified the connection to the SDG as negative, the number of volunteers that classified the connection to the SDG as positive and the agreement score based on Eq. 1:

$${\rm{agreement}}=\frac{\left|{{labels}}_{{positive}}-{{labels}}_{{negative}}\right|\,}{{{labels}}_{{positive}}+{{labels}}_{{negative}}}$$
(1)

The data used were pre-selected using the following criteria:

  1. 1.

    labelspositive > labelsnegative, as we needed only to use data related to an SDG,

  2. 2.

    agreement > 0.6, as we needed to be sure that the volunteers agreed to the labeling.

For fine-tuning BERT, we incorporated additional descriptions and indicators for each of the 17 SDGs, as sourced from sdg-tracker.org54, since it was not included in the OSDG-CD. To ensure the credibility of the labeled text-excerpts, the dataset was further refined by applying criteria like a positive labeling agreement above 0.6, ensuring consensus among the volunteers in the dataset. In total, 35,001 text excerpts were used to train the model.

The model was developed in Python, using PyTorch and Scikit-Learn55. 80% of the text excerpts were used as training data and 20% as testing data. As an optimizer, the Adam Optimizer method was employed, while the optimization criterion was the Cross-Entropy Loss56. The model was trained for 10 epochs with a learning rate of 10−5, resulting in an accuracy score of 0.85. Considering that the training data used for each SDG are not of the same size and the fact that a text excerpt is most probably linked to more than one SDG, the accuracy score is acceptable.

In other words, the algorithm effectively answers the question ‘how probable is the X policy document to be related to the Y SDG’. Therefore, the algorithm’s score provides a numerical estimation (similarity score) of the probability that a given policy document is relevant to a specific SDG. Then, the algorithm allows a second-level evaluation, by examining only the rest of the SDGs (except the prevailing one(s) with the highest similarity scores), to identify the less obvious relations, as well.

Human textual analysis

The ML approach enabled us to assess the policy documents thoroughly and fast, revealing connections and linkages between EGD policies and the Six Transformations with SDGs. The advantage of this approach is that many of these connections and patterns are not always evident by the human-eye, in a consistent way when reading the documents (especially for our case where the subject was to 74 policy documents). Although the ML approach was found to be very insightful, offering efficiency and speed at processing large volumes of text and identifying ‘hidden’ relationships within the data that may not be immediately apparent to human readers, ML models can sometimes lack the nuanced understanding that comes from human interpretation. In our effort to validate this ML approach and provide robust results, we complemented it with the human manual reading of the policy documents. We leveraged the expertize of individuals familiar with the sustainability policy landscape to interpret results and assess their plausibility. The human textual analysis involved 18 experts on sustainability science, across 5 countries, including the author team, that read manually the policy documents, identifying phrases that were semantically linked to the objectives of one or more SDGs. This served as a validation to the ML approach. This is a complementarily process to the ML text-mining, which could confirm the relations or reveal further links. Unlike algorithms, humans have an intrinsic ability to detect nuanced semantic connections between policies and SDGs, thereby providing a more in-depth and thoughtful interpretation.

Mapping EGD policies to the six transformations

The same analysis was also carried out for the Six Transformations as described by Sachs et al.9, to illuminate the broader links of various policies on these six core areas required to achieve the 17 SDGs. These include: (1) Education, Gender, and Inequality; (2) Health, Wellbeing, and Demography; (3) Energy Decarbonization and Sustainable Industry; (4) Sustainable Food, Land, Water, and Oceans; (5) Sustainable Cities and Communities; (6) Digital Revolution for Sustainable Development. Their complex interaction highlights the need for comprehensive, cross-sectoral efforts in the policy-making process.

This complementary analysis aims to provide policymakers with insights into how existing EGD policies contribute to these transformative areas and to highlight where additional focus might be required. Initially, based on the estimates by Sachs et al.9, the SDG contributions (the average influence each SDG) on the Six Transformations were estimated. We then multiplied these average contributions by the similarity scores of EGD policies to SDGs obtained from the ML approach, to formulate a matrix indicating the degree to which each policy contributes to the Six Transformations. By summing the results from step two across each year (2019–2023), we conducted a temporal analysis to detect trends and shifts in the EGD’s influence on the Six Transformations over time.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.