1. Introduction
Sustainability is a universal concern [1,2]. Environment pollution inequity, injustice, and poverty continue to be faced by millions of people across the world [3]. While sustainability is gaining attention at the macro level (e.g., at the United Nations SDG, regional, and country levels), large companies are still focused on maximizing wealth based on shareholder theory, rather than taking the negative impacts of climate change into consideration [4]. Heavy industrial development is still a threat to sustainable development in many developing countries [5]. However, on the brighter side, corporate sustainability is gaining increased attention due to pressure from various stakeholders [6]. While corporate sustainability has been variously defined, the idea generally includes the pledge of a business to contribute to sustainable execution of business operations, working with multiple stakeholders, such as customers, suppliers, employees and their families, the government, nonprofits, NGOs, watchdogs, the community in which it does business, and the greater society, to enhance the quality of living [6]. An outgrowth of corporate engagement is sustainability is that companies are facing increasing tension on two issues—the demand for more disclosure and transparency and the need to adopt good corporate governance practices [7]. This stakeholder- and activist-fueled ultimatum for business answerability and clarity [8,9] has arisen out of the growing concern for rapid environmental degradation and the adverse effects of climate change. The negative tide and across-the-board apprehension regarding the future state of mankind due to the potential adverse effects of business endeavors have caught the attention of global citizenry. In the glare of such tension, businesses have elevated environment, social, and governance (ESG) to the front of the business mission [10,11]. The executive suite is starting to recognize the need for greater clarity regarding business ESG initiatives and their implications for the planet [12]. To enlighten the concerned parties, corporate social responsibility (CSR) and ESG data are being incorporated into business revelations (e.g., annual reports, 10-K and other filings), and corporate road-map strategies are being redrawn to have superior CSR and ESG outcomes [11]. CSR encompasses a gamut of perspectives, such as promoting economic expansion, shareholder benefit, image and standing, customer and supplier linkages, and the caliber of its products and services [13]; substantiating the proactive and routine incorporation of ESG aspects in corporate practices [14]; addressing stakeholders’ current needs without yielding the capability to manage the future requirements [15]; and generating everlasting stakeholder benefit by embracing opportunities while overcoming challenges in ESG [16]. Despite a surplus of definitions, the unifying theme is the expected business allegiance to CSR and ESG initiatives [17,18]. Businesses have shown their dedication to sustainability by embracing various initiatives and communicating these typically via disclosures and reporting [19]. Most companies, especially publicly traded ones, must practice mandatory financial compliance and reporting. However, sustainability-related disclosures are typically voluntary [20,21]. The triple-bottom-line view of Elkington (1998) with respect to ESG [22] pays attention to three components, namely economic, environmental, and social [23]. This perspective argues that businesses owe it to stakeholders to not only generate profits and earnings but also reckon with CSR and ESG issues [24]. Thereupon, the number of corporations that disseminate particulars on their CSR/ESG is increasing exponentially [25]. To reiterate, a key method by which these corporations disseminate the CSR/ESG information is via the CSR/ESG (sustainability) report, typically appended to the financial filings and reports [26,27]. In this research, we adopt a data-driven approach to eliciting corporate climate change and environmental concerns by applying machine learning text analytic methods to the SEC filings by corporations in the United States.
For the past many years, researchers have attempted to glean insight into CSR/ESG or various companies from these filings and reports [28]. While some researchers have focused on the timing and other meta-level data from the disclosures (filings/reports), others have paid attention to interpreting the narrative and extracting the key elements of the CSR/ESG practices [25,29]; yet others have applied traditional qualitative methods, such as content analysis, to delve into the disclosures [24]. More recently, studies have started to examine the content using formal computing methods, such as machine learning and text analytics. The goal is to elicit keywords and topics that describe CSR/ESG. For example, a simple word count may indicate the popular themes [24,26,27,30,31,32]. In general, most of these studies have analyzed fewer disclosures and applied basic analytic techniques, thereby limiting their findings. More recently, Szekely and vom Brocke (2017) used advanced text analytic methods, such as topic modeling, on approximately 9514 sustainability disclosure reports that were submitted between 1999 and 2015 [26]. They used latent Dirichlet allocation (LDA) to identify the primary themes that emerged from their large sample of disclosures [33]. Simultaneously, a new phenomenon has emerged in which large companies are including more sustainability-related information in their mandatory and voluntary disclosures, such as in SEC filings [28,34,35]. This is in response to a demand by the numerous stakeholders, particularly activist shareholders, and institutional investors for more formal information [36,37]. Activists are relentless in their criticism of major companies, such as BP, Chevron, Exxon, and Shell, typically in the oil and gas industry for perceived negative impact on the environment [38]. Given the tension to ameliorate the environment [39] and the high cost of environmental wrongdoing, e.g., via litigation [40], it is no wonder current disclosures emphasize the environment [28,41,42,43]. Climate change, for example, escalates health threats and creates public health concerns, such as respiratory and cardiovascular diseases, temperature-related death and illness, and threats to mental health [44,45,46]. In this era of heightened awareness of climate change, a new epoch of sustainability is spreading across the globe as citizens are becoming environmentally conscious and asserting their determination through their lives. Businesses are also becoming mindful of their CSS/ESG practices on the larger communities in which they do business [14,15,47].
Our impetus for this study comes from several perspectives: First, sustainability has permeated business practices, while, at the same time, society wants to be engaged and informed; second, what follows is businesses are enthusiastically responding to CSR/ESG; third, companies are more actively communicating their engagement via several media, including social media, such as Twitter and Facebook, and through voluntary and mandatory disclosures especially filed with the Securities and Exchange Commission (SEC), a US government regulatory agency; and fourth, research in sustainability disclosure (in general) and sustainability reports (in particular) is still scarce and far from mature [27]. Surfacing out corporate sustainability issues and practices from this perceived authentic source (SEC) would shed light on the disclosures.
Our empirical study builds on prior studies that have applied advanced machine learning and text analytics (e.g., [26,27]) to elicit and scrutinize the more significant environment and climate change aspects disclosed in the corporate SEC filings. While past studies have, in general, explored sustainability reports, we examine climate change and environmental disclosures contained in the various filings (10-K, 20-F, and 40-F) with the SEC. It is important to note that while disclosure in sustainability reports is primarily subjective and voluntary on the part of the company, disclosure in the filings with the SEC is typically mandated and subject to audit. Pulling corporate sustainability issues and practices from this highly regarded source would shed light on the disclosures. In analyzing sustainability disclosures, we focused primarily on climate change, environmental issues, and the variances in sustainability issues among the sectors/industries [26,27].
We add to current research on the environment and climate change in several ways. First, our research examines the disclosures submitted to a government regulatory agency, namely the SEC. Therefore, the disclosures and our findings are more credible. Second, we examine more recent data in disclosures, the most recent year being 2020, and include an entire decade, resulting in a substantial volume of data. By extending the period studied as well as the quantity of disclosures, we expand the scope of the study to run analyses at the industry and sector levels. Further, this empowered us to focus the spotlight on the progression of topics over time, as well as investigate inter-industry and inter-sector comparisons. Third, we contribute to the advancement of the ever-increasing application of machine learning and text analytics to understanding large corpuses of unstructured text data (tens of thousands of disclosures) absent predetermined keywords. Fourth, this study examines the environment and climate change issues through the lens of the company. When companies use the findings of this study themselves, and by watchdogs, nonprofits and NGOs, regulatory bodies, and others, it will lead to improved environmental and climate change practices as the entities make informed decisions [26,27]. Thus, we analyze corporate disclosures in their SEC filings to elicit sustainability concerns and issues, particularly in terms of climate change and the environment.
The rest of this study is arranged as follows: Section 2 provides an overview of the research background with discussions of sustainability, sustainability disclosures and reporting, and machine learning and text analytics. Methods, results, and analysis are presented in Section 3; Section 4 provides a discussion, while Section 5 highlights the scope and limitations; and lastly, Section 6 offers conclusions and future possibilities for research.
2. Research Background
2.1. Sustainability
The Brundtland Commission’s 1987 report labeled “Our Common Future” defined sustainability as “development that meets the needs of the present without compromising the ability of future generations to meet their own needs [48]” (World Commission on Environment and Development, 1987). Though this definition emphasizes the environment, the report is also aware that “development cannot be said to be sustainable if it is not equitable, or if it does not meet the pressing needs of the majority of the inhabitants of the globe.” Not only must sustainability engage the current generation, but it must also be mindful of the needs of future generations. This view of sustainability has been accepted universally [26,49]. Though the Brundtland report provides a working definition, the expression “corporate sustainability” generally refers to companies [49]. While some researchers have focused on the environmental components of sustainability, others have highlighted the social dimensions. Yet others have adopted a more holistic approach integrating environmental, social, and economic issues, without any one issue dominating [49,50]. Corporate sustainability is generally perceived to be the epitome of economic function, environmental performance, and social responsibility [15,51]. Focusing on environmental performance, the related practices address the consequences of the expending of natural resources and the production of gaseous emissions. Generally, it would be expected that the two be at the minimum levels possible to safeguard the overall ecosystem [15]. It follows then that positive environmental practices are associated with diminishing environmental destruction by preserving natural resources, including energy [52], and efficient waste disposal, for example, via recycling [53]. The typical metrics of measuring the environment are air emissions, biodiversity, energy usage, natural resource loss, solid waste, transportation, and water use and discharge [51,54]. The social practice has the goal of improving the quality of life of local communities [15,52]. Economic sustainability practices support long-term liquidity and healthy returns on investments to the stakeholders [15]. However, the latter two are not the targets of this study. Companies are also motivated to immerse themselves in sustainability due to regulation [51]. In addition, they are susceptible to encouragement from customers, employees, and suppliers [55], as well as stress from management, since long-term investments in sustainability may indeed promote financial performance [56]. All in all, it is settled that companies the world over are engaged in CSR/ESG as indicators of sustainability. The key challenge then is, how do these companies’ practices become more visible? The answer lies in the timely and effective disclosure of their sustainability practices, namely filings and reports [49].
2.2. Sustainability Disclosures and Reporting
Sustainability and its disclosure go together. To wit, companies share their sustainability-related information to enable transparency and thereby create a positive image about themselves. Numerous theories attempt to explain the dynamics of sustainability disclosures. Applying legitimacy and stakeholder theories, the disclosure of financial, environmental, and social information (i.e., corporate sustainability disclosure—CSD) is inclusive in the ongoing conversation between a company and its stakeholders. A disclosure facilitates relevant information flow regarding a company’s sustainability-related initiative that assists in the company’s activities that help validate its attitude, educate the public, and transform expectations [57,58]. The prospect of the enabling the possibility of environmental reporting is fueling increased awareness of the promise of sustainability [59]. Researchers have therefore emphasized the imperative to inquire further into the various dimensions and nuances that can leverage disclosure practices [59,60]. As indicated before, the literature has confirmed the companies’ use of sustainability disclosure to manage their reputation risk [59,61]. Corporate sustainability disclosure has also been used as a channel to connect with the various stakeholders. It implies the company may communicate with different stakeholders, including regulatory agencies, nonprofits and NGOs, activist shareholders, institutional investors, and others [62,63,64,65,66]. Researchers, realizing the value of these disclosures, have quested to extract the nuggets of insight from the companies’ dissemination via various publication outlets. While the disclosures may provide biased views, namely those of management, nevertheless, one gets at least an inkling of the company’s practices [26,67]. Numerous researchers have studied various aspects of corporate sustainability reports (disclosures) by adopting different methods [3,68]. Sobhani et al. (2011), for example, identified the performance indicators disclosed in corporate sustainability reports [3,65]. Managers can use sustainability reports to obtain accurate knowledge of a company’s sustainability efforts, rather than relying on third parties. Similarly, researchers can analyze sustainability reports to test hypotheses entailing sustainability reporting and performance. Studies that have conducted content analysis of sustainability reports have investigated the evolution of report content quality [25,65] or trends in sustainability reporting in industries [69,70] or countries. Tate et al. (2010) analyzed sustainability reports using content analysis with automated software to uncover supply chain sustainability themes, which are compared against companies’ geographic location and revenues [71]. Montabon et al. (2007) also conducted content analysis of corporate sustainability reports based on environmental management practices identified in the literature [72]. However, there were limitations as to how the analysis was carried out [61]. Several previous studies have measured companies’ level of disclosure through simply the number of relevant words, sentences, or pages in sustainability reports on different themes [69,70]. The drawback of counting space allocations for certain words or themes is that such an approach fails to capture the information in the reports [65]. In a similar vein, some researchers have used computer-aided text analysis to uncover supply chain sustainability (e.g., [71]). However, many sustainability reports present information graphically, which limits the use of computer-aided text analysis [65]. In addition, sustainability reporting in Australia, for example, is largely voluntary and most reports/disclosures are not subject to the process of assurance. Overall, as researchers continue to use several qualitative techniques, due to a paucity of quantitative data that would be more amenable to analysis, machine learning and text analytics have emerged as the leading methods with immense potential in this domain [18,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74].
2.3. Machine-Learning-Based Text Analytics
The increasing number of sustainability reports and growing interest among the diverse stakeholder groups have created new challenges for policymakers and researchers in measuring the effectiveness and exploring the information of these reports, given the diversity of the reports’ content and structures [30]. Machine-learning-based text analytics has emerged as a viable technique to analyze large corpuses of unstructured data, such as textual data. Multiple tasks, such as text categorization, text clustering, entity extraction, document summarization, and entity relation modeling, can be implemented via text mining [75]. Recently, machine-learning-based text analytics has garnered enthusiasm in its application to the analysis of sustainability and CSR/ESG-related textual disclosures and reports [18,26,27]. Considering the wide prevalence of unstructured data, extracting knowledge from these corporate disclosures is discerning for application and research. For instance, Bayesian-combined text mining has been applied to the study of economic, environmental, and social factors in a large sample of sustainability reports, revealing that the content of the reports is fragmented even under the Global Reporting Initiative [30]. Text mining using multivariate discriminant analysis has been used to investigate sustainability reports, indicating that firms’ disclosures differ across industrial sectors. Text mining has also been implemented in the comparative analysis of sustainability reports before and after an industrial crisis, demonstrating a decline in the quantity of disclosed information during the year of the crisis [41]. Computer-based content analysis has been used to explore sustainable supply chain management trends based on news articles and sustainability reports [76]. Green IT practices across industries have also been researched using dictionary-based text mining [77]. The trend and pattern of firms’ sustainability reports in the Indonesian construction industry have been studied via text mining, enabling stakeholders to interpret the firms’ sustainability strategies [78]. Moreover, semi-supervised text mining using association rules has been adopted to study the maritime industry’s sustainability reports, providing managerial implications for the maritime industry to satisfy SDGs [79]. Hence, applying a text-mining method to explore sustainability disclosure information is acknowledged as appropriate. Due to the prevalence of unstructured data, extraction of knowledge from these corporate disclosures is highly significant for research as well as for application.
The power of natural language processing (NLP), machine learning (ML), and text analytics together in spontaneously discovering patterns and trends in textual documents cannot be underestimated. By using these multiple methods simultaneously, the researcher can prepare and can convert large volumes of unstructured data (“big data”) into a ordered format that is appropriate for the application of ML techniques, and further conduct analysis [80]. Using text analytics, researchers can wrest various facets and the relationships among them. In the current research context, SEC filings about sustainability proffer a justified arena of study. It provides a substantial canon of ill-structured textual data. It is also indicative of significant reflections of top management and therefore presents a requisite depository to investigate. Since businesses disseminate their allegiance to various societal missions, it is only natural that the disclosures be probed in detail.
The methodology of machine-learning-based text analytics has been applied in numerous streams of research to analyze unstructured textual data, such as tweets on vaccination [81], shareholder resolutions about sustainability [18], sustainability disclosure on the web [17,26,27], or legal analysis [82]. To reiterate, machine learning and text analytics together offer powerful analytical tools to interpret big text data [26,27,32,83]. The understanding gained from the study would guide top executives, shareholders, institutional and individual investors, and watchdogs to make informed decisions.
Compared with other text-mining methods, LDA is the most straightforward probabilistic approach to model topics from textual documents [26,27]. LDA generates a list of topics that have weights for each document, making it suitable to handle large datasets and produces insightful results. The methods of TF-IDF, cluster analysis, and LDA are three of the more appropriate and widely used ML-based text analytic methods and, therefore, used in this study [26,27].
3. Methods, Results, and Analysis
This text-analytics-based research study delves into the sustainability disclosures of companies to understand the expansive sustainability issues, practices, and trends. The origin of the disclosures’ data is the website of Ceres, a sustainability-focused, non-profit concern that has made available several searchable databases (
Table 1 summarizes the disclosure rate for the period 2011–2020 that we extracted and synthesized from the files. It shows the percentage of companies that disclosed climate-change-related information in their filings for each of the years. As seen, the number of companies providing disclosures related to climate change was relatively stable during the 10 years of this study and most of the disclosures were typically provided by publicly traded companies.
Likewise, it was observed that all disclosures follow the same structure in the Ceres API. For example, Figure 2 shows the disclosure information for the company 1ST SOURCE CORP in 2017, which is under the “Banks and Financial Service” industry group and the “Finance, Insurance, and Real Estate” division. It also shows all disclosure contents related to climate change for the company in the 2017 annual filing.
As mentioned previously, Python was used to retrieve all the data from Ceres, which were then transformed into a structured DataFrame. Table 2 shows the variables extracted from each company’s disclosure collectively. Besides the basic information about a company, such as the name, industry, and ticker, Ceres also provided four risk ratios. These ratios were extracted as well. They were calculated by dividing the frequency of relevant keywords/phrases in each risk category by the frequency of total keywords/phrases appearing in the excerpt. The four ratios would add up to 1 (100%).
The four risk ratios were pre-defined by Ceres. They highlighted the relevant keywords/phrases in the excerpt and classified them into four risk categories. Ceres then calculated the distribution for terms in each risk category. This was to provide the values for the four risk ratios for each report (disclosure). The sample terms and phrases for each risk category are shown in Table 3. Appendix A provides a snapshot view of the industry (division) of companies with disclosures in the year 2020.
3.1. Text Analytics
Next, machine-learning-based text analytics was applied to examine the disclosures. To apply ML and text analytics, considerable data engineering preparation and data-preprocessing methods were adopted. The retrieved data for each disclosure were saved as a text file prior to vectorization. A smatter of disclosures was deleted due to a paucity of relevancy/materiality and broken links. A caveat with text data is that predictive analytic models developed with text data muddy the analytical methods. First, textual data cannot be used as input in many precise numerical or quantitative models. Therefore, an NLP program was run to transfigure the textual content into distinct components elements for additional scrutiny. Second, text-based datasets are bigger in terms of volume vis-à-vis quantitative datasets. Therefore, a robust archetype warrants data extraction by recognizing the most appropriate chunks of data. In the data engineering step, the granular information in the summary section was converted into simple text output (files) through the removal of typical punctuation marks, spaces, numbers, and standard stop words using the Natural Language Toolkit (NLTK). Next, the text was changed to lowercase using the NLTK and TextBlob (
As shown in Figure 3, the darker the color and the larger the size, the more frequently the word appears in the overall disclosure report. It is obvious that energy-related words, such as gas, emission, power, and oil, formed a substantial portion of the content. Climate-change-related words, such as greenhouse, ghg, water, air, and carbon dioxide, were also quite significant. Furthermore, regulation-related terms, such as legislation, compliance, regulation, and agreement, were prominent as well.
The term frequency–inverse document frequency (TF-IDF) technique mentioned before was next applied to calculate the weight of each term to denote its relative importance in a document. This is a data retrieval approach that allocates a weight to a term’s frequency (TF) and its inverse document frequency (IDF). Each term is given these two scores. The weight of the term is then computed as a product of these scores. TF-IDF is a statistical modeling tool used to assess the worth of a word to a document that belongs in a corpus. The value of a word increases vis-à-vis the frequency of its appearance in a document [29,44]. Simultaneously, it decreases in reverse symmetry to the frequency of its presence in the corpus. The primary thrust of TF-IDF is that when a word or phrase appears in a document with a greater frequency of TF, and seldom shows up in other documents, the word or phrase has reasonable classification prowess and is competent for classification problems. In this study, the companies’ disclosures were divided into 31 industry groups to calculate the TF-IDF value for the 55 extracted keywords. The resulting matrix is a data frame with a size of 31 × 55. Figure 4 displays partial examples of the final matrix.
There were a total of 31 rows representing 31 different industry group. Each row contained 55 values, and a high value for a keyword implied that the high term frequency was weighted by the oddity of the keyword’s occurrence. Subsequently, the K-means clustering model was developed to surface-out the key sustainability concepts. K-means, a widely used ML algorithm for clustering, is typically set to classify cases based on similarity measures (i.e., the distance between the cases). It is generally applied to solve classification and pattern recognition problems. For this study, six high-level clusters were identified with a word cloud utility. Classification with the KNN classifier (a supervised machine learning algorithmic method) next ensued. In this application, the data were partitioned into two sets (training and testing datasets) to assess the success of the classifying algorithm. The rationale is to recognize the varied subclasses (i.e., the clusters) in the data such that the observations (cases) in the same subclass are akin, while the observations in the different clusters are distinct from each other. The TF-IDF features of the top 55 keywords for each industry were extracted as our model input variables. K-means clustering was implemented to obtain the clusters, which were then labeled numerically. Since the goal was to find the patterns of keywords that describe different industries, the K-means algorithm was used to build the segmentation of six clusters formed from among the 31 industry groups that contained 4461 companies. The model was evaluated using the silhouette score. The silhouette score is an average measure of how close each point in one cluster is to points in the neighboring clusters, and a positive value indicates the point has been classified to a correct cluster. The six-cluster model had an acceptable average silhouette score of 0.4, as shown in Figure 5.
In Figure 6, the taxonomy of clusters and keywords is displayed. The six different clusters are shown in different colors and shapes, and for each cluster, the most important keywords are identified. One can describe each cluster based on its keywords. The silhouette score is used to appraise the caliber of a cluster identified with the use of standard clustering algorithms, such as K-means. The issue addressed is to what degree are the observations clustered with other observations that are like one another. The value of the silhouette score varies from −1 to 1. When the score is 1, the clusters are distinct and sharply delineated. A value close to 0 indicates congruent clusters with the observations in proximity to the decision boundary of the nearby clusters. In this study, the derived silhouette score of 0.4 indicated the clusters were quite distinct.
There were in total 6 clusters that were marked with different colors and shapes. The number of shapes in each cluster represents the number of industries that have been grouped into that cluster.
As observed in Figure 6, the blue circle represents the top five key features in cluster 1. These are different from the other five clusters. Table 4 gives detailed information about each cluster, the key features, the number of industries, and the specific industries.
Besides common key features, such as gas, ghg, and climate, industries in different clusters have their own specific focus. For example, oil/gas manufacturing and consumption industries, such as oil and gas, transportation, and aerospace and defense are in cluster 1 with the unique key feature “oil.” An eclectic group of industries, representing “climate” change, are in cluster 2. One could extrapolate that all industries are concerned about climate change. Indeed, several industries fall into this cluster. Cluster 3 is labeled as “energy,” and the main industries in this cluster are electric power and gas utilities. Cluster 4 has the greatest number of industries and is associated with “impact,” which has a couple of interpretations. One, companies are concerned about the impact of climate change and environmental degradation on their business and on the planet. Two, companies have been concerned in recent years with the effect of climate change and the environment on financial performance and shareholder value. In cluster 5, we see some association with “cost,” likely related to the loss and damage incurred due to climate change and an adverse environment (e.g., the negative effect of extreme weather).
Heavy emission industries, such as automobile and coal mining, are in cluster 6 with the unique key feature “ghg.” When compared to the “issues” available at
As discussed previously, a text-based dataset contains rich, yet complicated information compared to numerical datasets, so a data-preprocessing step was needed to tease the relevant information out of the lengthy text. Second, besides TF-IDF, several other textual analysis methods, such as sentiment analysis, POS tagging, and doc2vec, were also applied.
To gain insight into the regulatory aspects mentioned by companies in their sustainability disclosures, the statutes typically mentioned in terms of compliance by extracting law-related phrases using spaCy (
[Figure omitted. See PDF]
However, we noticed some specific statutes or act names were abbreviated or contained values such as “10-K SEC Act,” which could not be captured by the POS tagging patterns. Given the fact that the descriptive words before “Act” always modify it in this case, another approach was taken by setting the wildcard token (token with wildcard logic) as the matching pattern, which tells the matcher to search for and extract any two or three tokens that the word “Act” follows. As a result of this method, one can locate all the sentences that contain “Act” and find out which “Act” is the most concerning and frequently mentioned during the past 10 years. In Figure 7, a bar chart shows the most cited statutes (acts) in the disclosure reports during the past 10 years.
The top 30 federal and state statutes by frequency cited in the disclosures are displayed. The implication is the higher the frequency of mention, the more relevant the statute to sustainability-related activities. The extraction of the statutes is a novel contribution of this study. Further, we were also interested in the types of greenhouse gases mentioned most frequently in the companies’ disclosure reports during the past 10 years. The following donut chart was developed by counting the number of occurrences of various types of greenhouse gases mentioned in the companies’ disclosures and the proportional relationship between them and the total number of counts. This is shown in Figure 8.
The chart shows that carbon dioxide is the main source of greenhouse gases, followed by methane. Companies appear to pay close attention to the emission of carbon dioxide and methane, followed by other gases. The fact that greenhouse gases are even mentioned in the disclosures indicate companies are getting interested in addressing greenhouse-gas-caused climate change. To get a sense of how much a company’s disclosure changes over time, the content variance for each company was calculated. First, the NLTK POS tagging package was used to extract top 20 most frequent noun keywords from the companies’ most recent and oldest disclosure reports, and then, cosine similarity was applied between the two vectorized lists. For example, the company Advance Auto Parts Inc. (ticker: AAP) has been providing climate-change-related disclosures consistently from 2011 to 2020. For the 10 years of disclosure content, the top 20 most frequent noun keywords of its disclosure report for 2011 and those in its report for 2020 were compared. A content variance of 1 means the two disclosures’ top 20 keywords are the same, whereas a content variance of 0 means the two disclosures’ top 20 keywords share nothing in common. The content variance for each company was calculated and the results visualized using a box-and-whisker plot for all sectors. Figure 9, a box-and-whisker graph, exhibits the content variance distribution across the different sectors of all companies that provided climate-change-related disclosures.
Figure 9 shows that the median content variance scores for eight sectors were all around 0.45. Since its box is more skewed downward, the transportation and communications division tends to have a similarity score closer to 0 compared to other sectors. We concluded that the disclosure contents for companies in the transportation and communications sector vary more over time when compared to other sectors. Doc2Vec similarity captures how similar a company’s disclosure compares with other companies’ disclosure reports within the same industry. The methodology for Doc2Vec (
The darker and larger the box, the larger the variance, and the more variant the content within that specific sector. As shown in Figure 10, the variance of Doc2Vec similarity scores was the highest in the wholesale trade sector, meaning the disclosure content of companies in the wholesale trade sector varies more from each other than from companies in other sectors.
3.2. Disclosure Sentiment
The well-known and commonly used technique of sentiment analysis computationally distinguishes and compartmentalizes beliefs expressed in the text. This helps especially to determine what the opinionator’s stance is toward a particular topic, etc. (
Three sets of sentiment analysis were conducted to capture companies’ perspectives on the three risk categories. The keywords were selected from the top 30 frequent keywords that were mentioned in the 25,428 disclosures.
Figure 11 is a line chart showing the compound sentiment score for the aspects of regulatory-related company disclosures. As shown in the chart, the increasing trend in the wholesale trade industry implies companies in this sector have complied with laws and regulation more effectively during the recent year. The compound sentiment in the mining sector decreased from 2019 to 2020.
Figure 12 shows the compound sentiment score for the aspects of climate-change-related company disclosures. As shown in the chart, the compound sentiment score for agriculture disclosures dramatically decreased during recent years. This implies that the agricultural business has been severely affected by the recent escalation in climate change and that companies perceive climate change as a risk factor to their business.
Figure 13 shows the compound sentiment score for the aspects of renewable-energy-related company disclosures. As shown in the chart, the compound sentiment score in agriculture disclosures dropped during the year 2020. This is because the number of disclosures in the agricultural sector is low, so the base number of the average is high, thereby resulting in extreme values. We next turn our attention to topic modeling.
3.3. Topic Modeling (LDA)
The aim of latent Dirichlet allocation (LDA) is to assist in the classification of the content of disclosures reports and find a mix of topics that companies focused on. LDA was applied to the disclosure content to elicit the key topics addressed by the disclosures [26,27]. By assigning each disclosure to a cluster, the model can glean the hidden characteristics in the text and generate labels for the supervised learning model. Since unsupervised learning models have a high degree of uncertainty, Gensim’s LDA implementation was used (
To build the initial LDA model, the parameter for the number of topics was set to 20, alpha to 0.01, and beta to 0.1 to develop the initial model as a baseline model for performance comparison. The perplexity level and coherence scores were calculated for this purpose. The perplexity level measures the probability of unexpected results in the classification. The coherence score is a value of the semantic similarity between high-frequency words. It assesses the quality of the learned topics; therefore, the higher the score, the better the quality of the topics extracted. The perplexity score measures how credibly the model represents or reproduces the statistics of the hold-out data. From a performance perspective, one would want to build a model with a low perplexity level and a high coherence score by tweaking the topics and other coefficients.
The overall performance of the initial model is shown below:
Perplexity: −6.39885911951267
Coherence score: 0.4222068061701333
The initial model demonstrated reasonable performance, as indicated by the perplexity level, while the coherence score indicated the model needs additional refinement. Tuning is a way to maximize the performance of a model without overfitting or producing a large variance. A series of sensitivity tests were conducted to determine the number of topics (K), Dirichlet hyperparameter alpha (Document-Topic Density), and Dirichlet hyperparameter beta (Word-Topic Density). The tests were performed sequentially. One parameter was examined at a time by holding others constant. Figure 15 shows the coherence score for the number of topics across one epoch of the validation set with a fixed alpha = 0.01 and beta = 0.1. It plotted the number of topics by coherence score and studied the trend as the number of topics ranged from 2 to 15. We observed that while the coherence score was volatile as the number of topics changed, it had the best value when the number of topics was 6. As the coherence score appeared to decrease after 6 topics, K = 6 was chosen.
The evaluation metrics of the optimal model are shown below.
Perplexity: −7.196056573358106
Coherence score: 0.5500174631782340
In further comparing model 1 (Gensim’s LDA) to model 2 (mallet LDA), the perplexity level and coherence scores showed nearly identical performance with similar keywords in each topic. Therefore, to scope the analysis, only the results from model 1 (Gensim) were examined further. The results in Table 6 were generated with this LDA model. Six topics were extracted along with their corresponding terms. On examination of the terms, each topic was assigned a label that most reflected the terms within. These topics and terms are indicative of the sustainability issues that companies are typically concerned with. Label categories to some extent reflect the companies’ emphasis on a certain topic. The six key topics are gas emission, carbon risk, climate change, loss and damage, renewable energy, and financial impact. When compared to the “issues” at
Figure 16 displays the topic distance map; the area within the circle characterizes the importance of each topic over the entire corpus, and the distance between the centers of the circles describes the similarity between the topics. The topic distance map was drawn using the built-in function of the pyLDAvis package to visualize the volume of topics and the keywords contained in the topic [86,87]. To the left, the different-size bubbles represent the topics; the larger the bubble, the more companies’ disclosures in that topic. The distance between the topics approximates the extent of the semantic relationship between the topics, and if the topics share common keywords, the bubbles will overlap (in proximity). The right part of Figure 16 is a bar chart showing the relative importance of each term in topic 1. The red-shaded area describes the frequency of each term in each topic, while the blue bar shows the frequency distribution of the terms in all disclosures. Topic 1 is the largest topic in all the disclosures put together; the top 30 keywords in the right bar chart represent the most frequent keywords in this topic group. Studying the keywords in Table 6 and Figure 16, topic 1 seems to have disclosures related to greenhouse gas emissions and corresponding legislative action. Topic 1 also overlaps with topic 4 by having a few keywords. Topic 6 has a large distance from the other topics. This makes sense as topic 6 describes the financial impact aspect of sustainability. As mentioned previously, the related keywords describe issues different than the keywords of the other traditional sustainability-related topics. Topics 1, 2, 3, and 4 are in proximity and collectively describe environment-related topics, since they share the related keywords. Among the environmental topics, topic 1 focuses more on greenhouse gas emissions, and topic 2 pays more attention to carbon-related emissions affecting sustainability. Topic 3, however, shows a specific focus on climate change correlated with weather and risks. Topic 4 contains more natural-disaster-related keywords, while topic 5 deals with renewable energy technologies. Topic 6, of course, highlights the financial impact [88,89]. This topic is an interesting finding of this study. Figure 17 is a highlight table showing the count of companies within different sectors that focus predominantly on specific topics. As seen, each sector has its own focus.
For example, manufacturing emphasized topic 2 (carbon risk) and topic 3 (climate change), while finance, insurance, and real estate focused on topic 4 (loss and damage). However, these are not mutually exclusive. Every sector is associated with each of the six topics. Caution must be paid attention to the total number of companies in each sector, as frequency counts are not being compared.
4. Discussion
Our research applied topic modeling and other text analytic methods to approximately 25,428 disclosure reports filed with the SEC in the years 2011–2020 to identify key climate change and environmental issues companies are concerned with. We identify the six key topics as gas emission, carbon risk, climate change, loss and damage, renewable energy, and financial impact. These topics and terms are indicative of the sustainability issues that companies are typically concerned with. Financial impact is a major topic identified in this study. This is consistent with the recent literature that has emerged on the association between ESG/sustainability and company financial performance. Specifically, the analysis indicates carbon dioxide is the main source of greenhouse gases, followed by methane. Companies appear to pay close attention to the emission of carbon dioxide and methane, followed by other gases. The fact that greenhouse gases are even mentioned in the disclosures indicate companies are getting interested in addressing greenhouse-gas-caused climate change. The study also finds that besides common key features, such as “gas,” “ghg,” and “climate,” industries in different clusters have their own specific focus. For example, oil/gas manufacturing and consumption industries, such as oil and gas, transportation, and aerospace and defense, are described by a unique key feature “oil.” An eclectic group of industries is characterized by “climate change.” One could extrapolate that all industries are concerned about climate change. Energy describes the primary industries of electric power and gas utilities. Interestingly, a large number of industries is associated with “impact.” Overall, companies are concerned about the impact of climate change and environmental degradation on their business and on the planet. In addition, companies have been concerned in recent years with the effect of climate change and the environment on financial performance and shareholder value. Some industries are associated with “cost,” likely related to the loss and damage incurred due to climate change and an adverse environment (e.g., the negative effect of extreme weather). Heavy emission industries, such as automobile and coal mining, are characterized by “ghg.” Generally, our study indicates that the number of companies providing disclosures related to climate change has been relatively stable during the 10 years of this study and that most of the disclosures were typically provided by publicly traded companies.
5. Scope and Limitations
From a research method perspective, the reproducibility and replication of results are some of the major challenges related to applying machine learning and text analytics to unstructured data. Additionally, despite algorithmic modeling, the intuitive naming of clusters (topics) limits our conclusions somewhat. Still, based on comparisons with other studies, we are confident the findings here are consistent with those from prior research. Thus, the methods developed and applied here lend themselves to future studies in this area. Additional limitations extend to the reliability of disclosure data and/or the validity of data engineering methods. In addition, this is a data-driven study wherein extracted disclosures were subject to machine-learning-based text analytics. Therefore, our research background that summarizes contemporary studies in sustainability-related topics is not subject to a formal method to identify the literature. Rather, several searches were conducted, including in Scopus, to ensure recency of cited studies.
There are limits to the generalizability of the clusters (topics), particularly when examining purely voluntary disclosures and not those of all registered companies. The disclosures may not sufficiently reflect overall corporate initiatives or efforts relating to sustainability. Further, ML algorithm models can only extract limited insight, being constrained by their assumptions. Additionally, this study used the disclosure tool at
6. Conclusions and Future Research
This exploratory study attempted to shed light on climate-change- and environment-related sustainability issues of companies by studying the sustainability-related disclosures submitted to the SEC in various filings. Our study provided a comprehensive overview of the key climate change and environment topics and issues that companies are concerned with in terms of sustainability. In addition to the dimension-level analysis, the study provided an overview of corporate sustainability disclosure at the sector and industry levels. The research objective was met by the identification of six key clusters using the K-means method and six main topics using the LDA method that related to the disclosure of climate-change-related environmental concerns. As discussed before, both methods produced results that are generally overlapping. The overlapping results further reinforce and enhance our understanding of climate-change-related disclosure at various levels, such as sector, industry, and topic. Our analysis shows that companies are concerned primarily with the topics of gas emission, carbon risk, climate change, loss and damage, renewable energy, and financial impact. While a wider range of topics, such as CSR and ESG, may be included in sustainability, the sustainability disclosures examined in this study predominantly focused on climate-change- and environmental-related issues only. However, many other issues necessitate further research. For example, this study found hardly any information about social, governance, ethics, or human rights topics [90,91,92]. However, recent research is developing new models of sustainability applications and practices and alternative cost–benefit and performance models. For example, Novoselov et al. (2022) discuss social investment approaches versus traditional compensation models in the sustainable development of the Arctic [93,94]. They argue for a “social orientation” that considers the aspirations of the native population. In the case of the Artic, this implies the development of contemporary social infrastructure indigenous to the polar, energy-conscious, and environmentally appropriate dwellings; the promotion of the local business; the nurturing of local tourism; and the protection of cultural traditions and the conservancy of ethnic groups of the Arctic [93,94]. Rather than focusing on firm-level financial performance, the authors [93,94] suggest that the industrialization of the Arctic should dwell on the quality of life by enabling the creation of jobs such that the people do not have to migrate and can live in their own communities and reduce poverty and disease [91,92]. Future research should investigate the linkages between corporate CSR, ESG, and sustainability in the context of the SDGs.
One may also investigate how textual documents such as SEC filings can be used to conduct predictive analytics to get a handle on direction and action. Sentiment analysis can be deployed to explore stakeholders’ attitudes and shifts in the long run. The effect of disclosures on the community in general and stakeholders (e.g., climate change activists) can be analyzed from the perspective of social media (public sentiment analysis). Further, as recently emerging studies indicate, the effect of good sustainability practices on corporate performance (e.g., financial, reputation) is another area of potential study. Notwithstanding the limitations, our study contributes to sustainability policy and research in several ways. First, we used data on disclosures published as recently as 2020 and looked at a time frame that includes multiple years. Second, we analyzed a significantly large number of disclosures. By extending the time frame and number of disclosures, we extended the coverage to many divisions, industries, and sectors. This also enabled us to explore inter-industry/inter-sector comparisons. Third, we contributed to the advancement of the ever-increasing application of machine learning and text analytics to understand large corpuses of text data, namely the disclosures. Text analysis offers the novel ability to deep-dive into text data without assuming a pre-defined list of terms. Fourth, our study analyzed corporate sustainability from the company’s perspective (i.e., management).
Companies themselves, as well as watchdogs, nonprofits and NGOs, regulatory bodies, and others, will be able to use the findings of this study to make informed decisions, leading to improved sustainability practices. Future studies can continue to analyze disclosure data using alternative ML techniques, such as deep learning, to obtain more robust and richer analysis. For example, prescriptive and discovery analytics can be explored for not just predicting likely outcomes of practices but also developing creative solutions to climate change and other problems.
Other future research directions encompass exploring the phenomenon of sustainability disclosure across industries or across the globe and analyzing the association between sustainability and company performance. Research into sustainability is still at a nascent stage, but rapid advances in analytical platforms and tools can accelerate the maturing process.
All the authors (W.R., S.J.W. and V.R.) contributed equally to conceptualization, methodology, software, validation, formal analysis, investigation, resources, data curation, writing, visualization, supervision, and project administration. All authors have read and agreed to the published version of the manuscript.
Not applicable.
Not applicable.
The data will be made available upon request to the corresponding author.
The authors declare no conflict of interest.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Figure 10. Heat map for variance of the Doc2Vec similarity score of different sectors.
Figure 12. Disclosure sentiment score for climate change (emission and greenhouse gas), 2011–2020.
Filings’ summary in the Ceres database.
| Year | Total # of Filings | Total # of Filings with Climate Change Disclosure | Disclosure Rate |
| 2011 | 5623 | 2485 | 44% |
| 2012 | 5620 | 2557 | 45% |
| 2013 | 5601 | 2625 | 47% |
| 2014 | 5569 | 2695 | 48% |
| 2015 | 5535 | 2683 | 48% |
| 2016 | 5580 | 2578 | 46% |
| 2017 | 5373 | 2168 | 40% |
| 2018 | 5235 | 2704 | 52% |
| 2019 | 5268 | 2754 | 52% |
| 2020 | 4014 | 2179 | 54% |
| Total | 53,418 | 25,428 | 48% |
Notes: Total # of fillings: total number of annual filings in the Ceres database in each year. Total # of filings with climate-change-related disclosure: total number of annual filings that have mentioned climate change risks in the Ceres database in each year. Disclosure rate: percentage of annual filings that have mentioned climate change risks in the Ceres database in each year.
Disclosure structure in the CERES dataset.
| Variables | Description | Example |
| Company name | Company name | 3 M CO |
| Industry group | Industry of the company | Medical equipment manufacturing |
| Sector | Sector of each company | Manufacturing |
| Year | Year of disclosure | 2019 |
| Ticker | Ticker number for publicly trading company | MMM |
| Climate risk ratio | Non-specific climate disclosure | 10% |
| Regulatory risk ratio | Regulatory risk/impact | 15% |
| Physical risk ratio | Physical risk/impact | 0% |
| Energy ratio | Renewable energy/clean technology/energy efficiency | 75% |
| Content | All text content in one disclosure report | This segment’s energy... |
Sample terms and phrases for risk categories (source:
| Risk Categories | Sample Terms and Phrases |
| Non-specific climate disclosure | Climate change, fossil fuels, consumes significant energy |
| Regulatory risk/impact | Clean Power Plan, Cross-State Air Pollution Rule, Clean Air Act |
| Physical risk/impact | Extreme weather conditions, severe storms, droughts |
| Renewable energy/clean technology/energy efficiency | Energy efficiency, renewable energy, energy costs |
Cluster taxonomy.
| Cluster | Key Features | Number of Industries | Industries |
| Cluster 1 | gas, greenhouse, climate, weather, energy, oil, carbon, air, impact, fuel | 5 | Aerospace and Defense, Hospitality and Tourism, Oil and Gas, Transportation, Waste Management |
| Cluster 2 | weather, climate, impact, energy, ability, increase, loss, tax, effect, insurance | 9 | Agriculture, Apparel and Textiles, Entertainment and Recreation, Food and Beverages, Insurance Services, Media, Pharmaceuticals/Health Care, Retail, Wholesale |
| Cluster 3 | energy, power, gas, climate, weather, electricity, ghg, generation, market, development | 4 | Electric Power and Gas Utilities, Electronics, Services—Business Related, Services—Educational |
| Cluster 4 | energy, climate, weather, carbon, gas, impact, greenhouse, ability, power increase | 10 | Banks and Financial Services, Chemicals, Consumer Goods, Information Technology, Manufacturing and Industrial Materials, Medical Equipment Manufacturing, Mining, Real Estate Finance/Property Development/Construction, Services—Other, Telecommunications |
| Cluster 5 | water, weather, demand, climate, supply, ability, impact, increase, energy, cost | 1 | Water Utility/Services |
| Cluster 6 | fuel, gas, ghg, greenhouse, emission, power, energy, dioxide, carbon, air | 2 | Automotive, Coal Mining |
| Total | 31 |
Selected keywords for each topic.
| Climate Issue | Regulation | Renewable Energy |
| Emission | Regulation | Renewable |
| Gas | GHG | Technology |
| Energy | EPA | Efficiency |
| Weather | Legislation | |
| Greenhouse | Law |
LDA-generated topics and terms.
| Topic No. | Label | Most Probable Terms |
| 1 | Gas Emission | 0.032%emission; 0.028%gas; 0.017%regulation; 0.014%oil; 0.012%state; 0.012%ghg; 0.011%natural; 0.010%greenhouse; 0.010%epa; 0.010%change |
| 2 | Carbon Risk | 0.015%carbon; 0.012%energy; 0.012%climate; 0.010%emission; 0.009%risk; 0.009%product; 0.008%environmental; 0.007% sustainability; 0.007%change; 0.007%business |
| 3 | Climate Change | 0.022%change; 0.019%condition; 0.016%weather; 0.016%result; 0.015%operation; 0.014%cost; 0.013%business; 0.012%impact; 0.011%climate; 0.010%risk |
| 4 | Loss and Damage | 0.028%loss; 0.023%hurricane; 0.022%event; 0.019%insurance; 0.019%property; 0.013%storm; 0.013%weather; 0.012%damage; 0.011%natural; 0.011%disaster |
| 5 | Renewable Energy | 0.037%energy; 0.022%power; 0.018%renewable; 0.016%gas; 0.015%electric; 0.011%customer; 0.010%generation; 0.009%utility; 0.009%coal; 0.009% project |
| 6 | Financial Impact | 0.101%gaap; 0.016%segment; 0.012%stock; 0.012%note; 0.011%credit; 0.010%value; 0.010%income; 0.010%srt; 0.010% debt; 0.010%tax |
Appendix A
Industry distribution of companies with disclosures in 2020.
| Industry-Division | Count of Industry | # of Companies |
| Agriculture, Forestry, and Fishing | 1 | 7 |
| Agriculture | 1 | 7 |
| Finance, Insurance, and Real Estate | 3 | 580 |
| Banks and Financial Services | 1 | 239 |
| Insurance Services | 1 | 93 |
| Real Estate Finance/Property Development/Construction | 1 | 248 |
| Manufacturing | 9 | 715 |
| Aerospace and Defense | 1 | 19 |
| Automotive | 1 | 10 |
| Chemicals | 1 | 72 |
| Consumer Goods | 1 | 53 |
| Electronics | 1 | 81 |
| Food and Beverages | 1 | 75 |
| Manufacturing and Industrial Materials | 1 | 189 |
| Medical Equipment Manufacturing | 1 | 43 |
| Pharmaceuticals/Health Care | 1 | 173 |
| Mining | 3 | 282 |
| Coal Mining | 1 | 13 |
| Mining | 1 | 59 |
| Oil and Gas | 1 | 210 |
| Retail Trade | 2 | 96 |
| Apparel and Textiles | 1 | 28 |
| Retail | 1 | 68 |
| Services | 6 | 200 |
| Entertainment and Recreation | 1 | 12 |
| Hospitality and Tourism | 1 | 41 |
| Information Technology | 1 | 59 |
| Services—Business Related | 1 | 43 |
| Services—Educational | 1 | 4 |
| Services—Other | 1 | 41 |
| Transportation, Communications, Electric, Gas, and Sanitary Services | 6 | 259 |
| Electric Power and Gas Utilities | 1 | 121 |
| Media | 1 | 14 |
| Telecommunications | 1 | 27 |
| Transportation | 1 | 75 |
| Waste Management | 1 | 11 |
| Water Utility/Services | 1 | 11 |
| Wholesale Trade | 1 | 40 |
| Wholesale | 1 | 40 |
| Grand Total | 31 | 2179 |
Appendix B
Top 55 nouns and proper nouns and counts in all disclosure reports.
| Word | Count | |
| 0 | energy | 105,739 |
| 1 | gas | 91,349 |
| 2 | power | 44,252 |
| 3 | climate | 43,274 |
| 4 | weather | 41,135 |
| 5 | greenhouse | 40,045 |
| 6 | oil | 38,019 |
| 7 | carbon | 31,092 |
| 8 | demand | 29,841 |
| 9 | production | 27,434 |
| 10 | ability | 26,641 |
| 11 | air | 26,634 |
| 12 | legislation | 22,534 |
| 13 | development | 21,302 |
| 14 | emission | 20,759 |
| 15 | market | 20,617 |
| 16 | cost | 19,995 |
| 17 | addition | 19,994 |
| 18 | effect | 19,862 |
| 19 | equipment | 19,625 |
| 20 | water | 19,617 |
| 21 | regulation | 19,423 |
| 22 | fuel | 18,468 |
| 23 | impact | 18,142 |
| 24 | industry | 17,918 |
| 25 | act | 17,545 |
| 26 | generation | 17,193 |
| 27 | control | 17,112 |
| 28 | capital | 16,989 |
| 29 | ghg | 16,801 |
| 30 | tax | 16,161 |
| 31 | loss | 15,672 |
| 32 | compliance | 15,649 |
| 33 | electricity | 15,169 |
| 34 | reduction | 15,071 |
| 35 | insurance | 14,884 |
| 36 | management | 14,367 |
| 37 | government | 13,955 |
| 38 | price | 13,801 |
| 39 | agreement | 13,163 |
| 40 | condition | 13,091 |
| 41 | rule | 12,885 |
| 42 | risk | 12,824 |
| 43 | construction | 12,594 |
| 44 | availability | 12,415 |
| 45 | program | 12,039 |
| 46 | damage | 12,020 |
| 47 | cash | 11,892 |
| 48 | technology | 11,757 |
| 49 | performance | 11,706 |
| 50 | increase | 11,705 |
| 51 | dioxide | 11,517 |
| 52 | supply | 11,503 |
| 53 | property | 11,429 |
| 54 | transportation | 11,211 |
References
1. Allen, C.; Metternicht, G.; Wiedmann, T. Initial progress in implementing the Sustainable Development Goals (SDGs): A review of evidence from countries. Sustain. Sci.; 2018; 13, pp. 1453-1467. [DOI: https://dx.doi.org/10.1007/s11625-018-0572-3]
2. Rosati, F.; Faria, L.G. Addressing the SDGs in sustainability reports: The relationship with institutional factors. J. Clean. Prod.; 2019; 215, pp. 1312-1326. [DOI: https://dx.doi.org/10.1016/j.jclepro.2018.12.107]
3. Gómez-Bezares, F.; Przychodzen, W.; Przychodzen, J. Bridging the gap: How sustainable development can help companies create shareholder value and improve financial performance. Bus. Ethics A Eur. Rev.; 2017; 26, pp. 1-17. [DOI: https://dx.doi.org/10.1111/beer.12135]
4. Przychodzen, J.; Przychodzen, W. Corporate sustainability and shareholder wealth. J. Environ. Plan. Manag.; 2013; 56, pp. 474-493. [DOI: https://dx.doi.org/10.1080/09640568.2012.685927]
5. Sobhani, F.A.; Amran, A.; Zainuddin, Y. Revisiting the corporate social and environmental practices in Bangladesh. Corp. Soc. Res. Environ. Manag.; 2009; 16, pp. 167-183. [DOI: https://dx.doi.org/10.1002/csr.193]
6. Herbohn, K.; Walker, J.; Loo, H.Y.M. Corporate social responsibility: The link between sustainability disclosure and sustainability performance. Abacus; 2014; 50, pp. 422-459. [DOI: https://dx.doi.org/10.1111/abac.12036]
7. Kaymak, T.; Bektas, E. Corporate social responsibility and governance: Information disclosure in multinational corporations. Corp. Soc. Responsib. Environ. Manag.; 2017; 24, pp. 555-569. [DOI: https://dx.doi.org/10.1002/csr.1428]
8. Clarkson, P.M.; Li, Y.; Richardson, G.D.; Vasvari, F.P. Revisiting the relation between environmental performance and environmental disclosure: An empirical analysis. Account. Organ. Soc.; 2008; 33, pp. 303-327. [DOI: https://dx.doi.org/10.1016/j.aos.2007.05.003]
9. Galbreath, J. ESG in focus: The Australian evidence. J. Bus. Ethics; 2013; 118, pp. 529-541. [DOI: https://dx.doi.org/10.1007/s10551-012-1607-9]
10. Klettner, A.; Clarke, T.; Boersma, M. The governance of corporate sustainability: Empirical insights into the development, leadership and implementation of responsible business strategy. J. Bus. Ethics; 2014; 122, pp. 145-165. [DOI: https://dx.doi.org/10.1007/s10551-013-1750-y]
11. Velte, P. Does ESG performance have an impact on financial performance? Evidence from Germany. J. Glob. Responsib.; 2017; 80, pp. 169-178. [DOI: https://dx.doi.org/10.1108/JGR-11-2016-0029]
12. Henri, J.; Journeault, M. Eco-control: The influence of management control systems on environmental and economic performance. Account. Organ. Soc.; 2010; 35, 6380. [DOI: https://dx.doi.org/10.1016/j.aos.2009.02.001]
13. Szekely, F.; Knirsch, M. Responsible leadership and corporate social responsibility metrics for sustainable performance. Eur. Manag. J.; 2005; 23, pp. 628-647. [DOI: https://dx.doi.org/10.1016/j.emj.2005.10.009]
14. Van Marrewijk, M. Concepts and definitions of CSR and corporate sustainability: Between agency and communion. J. Bus. Ethics; 2003; 44, pp. 95-105. [DOI: https://dx.doi.org/10.1023/A:1023331212247]
15. Dyllick, T.; Hockerts, K. Beyond the business case for corporate sustainability. Bus. Strategy Environ.; 2002; 11, pp. 130-141. [DOI: https://dx.doi.org/10.1002/bse.323]
16. Lo, S.F.; Sheu, H.J. Is Corporate Sustainability a Value-Increasing Strategy for Business?. Corp. Gov. Int. Rev.; 2007; 15, pp. 345-358. [DOI: https://dx.doi.org/10.1111/j.1467-8683.2007.00565.x]
17. Raghupathi, V.; Raghupathi, W. Corporate Sustainability Reporting and Disclosure on the Web: An Exploratory Study. Inf. Resour. Manag. J.; 2019; 32, pp. 1-27. [DOI: https://dx.doi.org/10.4018/IRMJ.2019010101]
18. Raghupathi, V.; Ren, J.; Raghupathi, W. Identifying Corporate Sustainability Issues by Analyzing Shareholder Resolutions: A Machine-Learning Text Analytics Approach. Sustainability; 2020; 12, 4753. [DOI: https://dx.doi.org/10.3390/su12114753]
19. Braam, G.; Peters, R. Corporate Sustainability Performance and Assurance on Sustainability Reports: Diffusion of Accounting Practices in the Realm of Sustainable Development. Corp. Soc. Responsib. Environ. Manag.; 2018; 25, pp. 164-181. [DOI: https://dx.doi.org/10.1002/csr.1447]
20. Sandberg, M.; Holmlund, M. Impression management tactics in sustainability reporting. Soc. Responsib. J.; 2015; 11, pp. 677-689. [DOI: https://dx.doi.org/10.1108/SRJ-12-2013-0152]
21. De Silva Lokuwaduge, C.S.; Heenetigala, K. Integrating environmental, social and governance (ESG) disclosure for a sustainable development. Bus. Strategy Environ.; 2017; 26, pp. 438-450. [DOI: https://dx.doi.org/10.1002/bse.1927]
22. Elkington, J. Cannibals with Forks: The Triple Bottom Line of the 21st Century; New Society Publishers: Stoney Creek, CT, USA, 1998.
23. Belz, F.; Binder, J. Sustainable Entrepreneurship: A Convergent Process Model. Bus. Strategy Environ.; 2015; 26, pp. 1-17. [DOI: https://dx.doi.org/10.1002/bse.1887]
24. Freundlieb, M.; Teuteberg, F. Corporate social responsibility reporting-a transnational analysis of online corporate social responsibility reports by market–listed companies: Contents and their evolution. Int. J. Innov. Sustain. Dev.; 2013; 7, pp. 1-26. [DOI: https://dx.doi.org/10.1504/IJISD.2013.052117]
25. Kolk, A. A decade of sustainability reporting: Developments and significance. Int. J. Environ. Sustain. Dev.; 2004; 3, pp. 51-64. [DOI: https://dx.doi.org/10.1504/IJESD.2004.004688]
26. Székely, N.; vom Brocke, J. What can we learn from corporate sustainability reporting? Deriving propositions for research and practice from over 9500 corporate sustainability reports published between 1999 and 2015 using topic modelling technique. PLoS ONE; 2017; 12, e0174807. [DOI: https://dx.doi.org/10.1371/journal.pone.0174807]
27. Zhou, Y.; Wang, X.; Yuen, K.F. Sustainability disclosure for container shipping: A text-mining approach. Transp. Policy; 2021; 110, pp. 465-477. [DOI: https://dx.doi.org/10.1016/j.tranpol.2021.06.020]
28. Brammer, S.; Pavelin, S. Factors influencing the quality of corporate environmental disclosure. Bus. Strategy Environ.; 2008; 17, pp. 120-136. [DOI: https://dx.doi.org/10.1002/bse.506]
29. Bjørn, A.; Bey, N.; Georg, S.; Røpke, I.; Hauschild, M.Z. Is Earth recognized as a finite system in corporate responsibility reporting?. J. Clean. Production; 2016; 163, pp. 106-117. [DOI: https://dx.doi.org/10.1016/j.jclepro.2015.12.095]
30. Modapothala, J.R.; Issac, B. Evaluation of Corporate Environmental Reports Using Data Mining Approach; IEEE: New York, NY, USA, 2009; pp. 543-547.
31. Modapothala, J.R.; Issac, B.; Jayamani, E. Appraising the corporate sustainability reports–text mining and multi-discriminatory analysis. Innovations in Computing Sciences and Software Engineering; Springer: Berlin/Heidelberg, Germany, 2010; pp. 489-494.
32. Shahi, A.M.; Issac, B.; Modapothala, J.R. Analysis of supervised text classification algorithms on corporate sustainability reports. Proceedings of the 2011 International Conference on Computer Science and Network Technology; Harbin, China, 24–26 December 2011; IEEE: New York, NY, USA, 2011; Volume 1, pp. 96-100.
33. Blei, D.M. Probabilistic topic models. Commun. ACM; 2012; 55, pp. 77-84. [DOI: https://dx.doi.org/10.1145/2133806.2133826]
34. Higgins, K.; White, J.; Beller, A.; Schapiro, M. The SEC and improving sustainability reporting. J. Appl. Corp. Financ.; 2017; 29, pp. 22-31. [DOI: https://dx.doi.org/10.1111/jacf.12230]
35. Rodriguez, A.; Cotran, H.; Stewart, L.S. Evaluating the effectiveness of sustainability disclosure: Findings from a recent SASB study. J. Appl. Corp. Financ.; 2017; 29, pp. 100-108. [DOI: https://dx.doi.org/10.1111/jacf.12237]
36. Carberry, E.J.; Bharati, P.; Levy, D.L.; Chaudhury, A. Social movements as catalysts for corporate social innovation: Environmental activism and the adoption of green information systems. Bus. Soc.; 2019; 58, pp. 1083-1127. [DOI: https://dx.doi.org/10.1177/0007650317701674]
37. Tian, H.; Tian, J. The mediating role of responsible innovation in the relationship between stakeholder pressure and corporate sustainability performance in times of crisis: Evidence from selected regions in China. Int. J. Environ. Res. 71. Public Health; 2021; 18, 7277. [DOI: https://dx.doi.org/10.3390/ijerph18147277] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34299728]
38. Domenec, F. The “greening” of the annual letters published by Exxon, Chevron and BP between 2003 and 2009. J. Commun. Manag.; 2012; 16, pp. 296-311. [DOI: https://dx.doi.org/10.1108/13632541211245767]
39. Galbreath, J. How does corporate social responsibility benefit firms? Evidence from Australia. Eur. Bus. Rev.; 2010; 22, pp. 411-431. [DOI: https://dx.doi.org/10.1108/09555341011056186]
40. Cormier, D.; Magnan, M. The economic relevance of environmental disclosure and its impact on corporate legitimacy: An empirical investigation. Bus. Strategy Environ.; 2015; 24, pp. 431-450. [DOI: https://dx.doi.org/10.1002/bse.1829]
41. Aureli, S.; Del Baldo, M.; Lombardi, R.; Nappo, F. Nonfinancial reporting regulation and challenges in sustainability disclosure and corporate governance practices. Bus. Strategy Environ.; 2020; 29, pp. 2392-2403. [DOI: https://dx.doi.org/10.1002/bse.2509]
42. Helfaya, A.; Whittington, M. Does designing environmental sustainability disclosure quality measures make a difference?. Bus. Strategy Environ.; 2019; 28, pp. 525-541. [DOI: https://dx.doi.org/10.1002/bse.2262]
43. Järvinen, J.T.; Laine, M.; Hyvönen, T.; Kantola, H. Just look at the numbers: A case study on quantification in corporate environmental disclosures. J. Bus. Ethics; 2020; pp. 1-22. [DOI: https://dx.doi.org/10.1007/s10551-020-04600-7]
44. Berry, H.L.; Bowen, K.; Kjellstrom, T. Climate change and mental health: A causal pathways framework. Int. J. Public Health; 2010; 55, pp. 123-132. [DOI: https://dx.doi.org/10.1007/s00038-009-0112-0]
45. Bosello, F.; Roson, R.; Tol, R.S. Economy-wide estimates of the implications of climate change: Human health. Ecol. Econ.; 2006; 58, pp. 579-591. [DOI: https://dx.doi.org/10.1016/j.ecolecon.2005.07.032]
46. Hill, J.; Polasky, S.; Nelson, E.; Tilman, D.; Huo, H.; Ludwig, L.; Bonta, D. Climate change and health costs of air emissions from biofuels and gasoline. Proc. Natl. Acad. Sci. USA; 2009; 106, pp. 2077-2082. [DOI: https://dx.doi.org/10.1073/pnas.0812835106] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19188587]
47. Benn, S.; Edwards, M.; Williams, T. Organizational Change for Corporate Sustainability; Routledge: Abingdon, UK, 2014.
48. World Commission on Environment and Development. Report of the World Commission on Environment and Development: Our common Future; United Nations: New York, NY, USA, 1987.
49. Linnenluecke, M.K.; Griffiths, A. Corporate sustainability and organizational culture. J. World Bus.; 2010; 45, pp. 357-366. [DOI: https://dx.doi.org/10.1016/j.jwb.2009.08.006]
50. Hahn, R.; Lülfs, R. Legitimizing negative aspects in GRI-oriented sustainability reporting: A qualitative analysis of corporate disclosure strategies. J. Bus. Ethics; 2014; 123, pp. 401-420. [DOI: https://dx.doi.org/10.1007/s10551-013-1801-4]
51. Azapagic, A.; Perdan, S. Managing corporate social responsibility: Translating theory into business practice. Int. J. Corp. Sustain.; 2003; 10, pp. 97-108.
52. Krug, B.A.; Burnett, S.E.; Dennis, J.H.; Lopez, R.G. Growers look at operating a sustainable greenhouse. GMPro; 2008; 28, pp. 43-45.
53. Belu, C. Ranking corporations based on sustainable and socially responsible practices. A data envelopment analysis (DEA) approach. Sustain. Dev.; 2009; 17, pp. 257-268. [DOI: https://dx.doi.org/10.1002/sd.390]
54. Hahn, T.; Scheermesser, M. Approaches to corporate sustainability among German companies. Corp. Soc. Responsib. Environ. Manag.; 2006; 13, pp. 150-165. [DOI: https://dx.doi.org/10.1002/csr.100]
55. Wilkinson, A.; Hill, M.; Gollan, P. The sustainability debate. Int. J. Oper. Prod. Manag.; 2001; 21, pp. 1492-1502. [DOI: https://dx.doi.org/10.1108/01443570110410865]
56. Ameer, R.; Othman, R. Sustainability practices and corporate financial performance: A study based on the top global corporations. J. Bus. Ethics; 2012; 108, pp. 61-79. [DOI: https://dx.doi.org/10.1007/s10551-011-1063-y]
57. Adams, C.A.; McNicholas, P. Making a difference: Sustainability reporting, accountability and organizational change. Account. Audit. Account. J.; 2007; 20, pp. 382-402. [DOI: https://dx.doi.org/10.1108/09513570710748553]
58. Gnanaweera, K.A.K.; Kunori, N. Corporate sustainability reporting: Linkage of corporate disclosure information and performance indicators. Cogent Bus. Manag.; 2018; 5, 1423872. [DOI: https://dx.doi.org/10.1080/23311975.2018.1423872]
59. Michelon, G. Sustainability disclosure and reputation: A comparative study. Corp. Reput. Rev.; 2011; 14, pp. 79-96. [DOI: https://dx.doi.org/10.1057/crr.2011.10]
60. Adams, C.A. Internal Organizational Factors Influencing Corporate Social and Ethical Reporting: Beyond Current Theorizing. Account. Audit. Account. J.; 2002; 15, pp. 223-250. [DOI: https://dx.doi.org/10.1108/09513570210418905]
61. Bebbington, J.; Larrinaga-González, C.; Moneva, J.M. Corporate Social Responsibility and Reputation Risk Management. Account. Audit. Account. J.; 2008; 21, pp. 337-362. [DOI: https://dx.doi.org/10.1108/09513570810863932]
62. Bae, S.M.; Masud, M.; Kaium, A.; Kim, J.D. A cross-country investigation of corporate governance and corporate sustainability disclosure: A signaling theory perspective. Sustainability; 2018; 10, 2611. [DOI: https://dx.doi.org/10.3390/su10082611]
63. Fernández-Gago, R.; Cabeza-García, L.; Nieto, M. Independent directors’ background and CSR disclosure. Corp. Soc. Responsib. Environ. Manag.; 2018; 25, pp. 991-1001. [DOI: https://dx.doi.org/10.1002/csr.1515]
64. Masud, M.; Kaium, A.; Hossain, M.S.; Kim, J.D. Is green regulation effective or a failure: Comparative analysis between Bangladesh Bank (BB) green guidelines and global reporting initiative guidelines. Sustainability; 2018; 10, 1267. [DOI: https://dx.doi.org/10.3390/su10041267]
65. Papoutsi, A.; Sodhi, M.S. Does disclosure in sustainability reports indicate actual sustainability performance?. J. Clean. Prod.; 2020; 260, 121049. [DOI: https://dx.doi.org/10.1016/j.jclepro.2020.121049]
66. Perrault, E.; Clark, C. Environmental shareholder activism: Considering status and reputation in firm responsiveness. Organ. Environ.; 2016; 29, pp. 194-211. [DOI: https://dx.doi.org/10.1177/1086026615571939]
67. Ali, W.; Frynas, J.G.; Mahmood, Z. Determinants of corporate social responsibility (CSR) disclosure in developed and developing countries: A literature review. Corp. Soc. Responsib. Environ. Manag.; 2017; 24, pp. 273-294. [DOI: https://dx.doi.org/10.1002/csr.1410]
68. Raucci, D.; Tarquinio, L. A study of the economic and non-financial performance indicators in corporate sustainability reports. J. Sustain. Dev.; 2015; 8, pp. 216-230. [DOI: https://dx.doi.org/10.5539/jsd.v8n6p216]
69. Roca, L.C.; Searcy, C. An analysis of indicators disclosed in corporate sustainability reports. J. Clean. Prod.; 2012; 20, pp. 103-118. [DOI: https://dx.doi.org/10.1016/j.jclepro.2011.08.002]
70. Patten, D.M.; Zhao, N. Standalone CSR reporting by US retail companies. Accounting Forum; Taylor and Francis Ltd.: Oxford, UK, 2014; Volume 38, pp. 132-144.
71. Tate, W.L.; Ellram, L.M.; Kirchoff, J.F. Corporate social responsibility reports: A thematic analysis related to supply chain management. J. Supply Chain Manag.; 2010; 46, pp. 19-44. [DOI: https://dx.doi.org/10.1111/j.1745-493X.2009.03184.x]
72. Montabon, F.; Sroufe, R.; Narasimhan, R. An examination of corporate reporting, environmental management practices and firm performance. J. Oper. Manag.; 2007; 25, pp. 998-1014. [DOI: https://dx.doi.org/10.1016/j.jom.2006.10.003]
73. Azhar, N.A.; Pan, G.; Seow, P.S.; Koh, A.; Tay, W.Y. Text analytics approach to examining corporate social responsibility. Asian J. Account. Gov.; 2019; 11, pp. 85-96.
74. Nilashi, M.; Rupani, P.F.; Rupani, M.M.; Kamyab, H.; Shao, W.; Ahmadi, H.; Rashid, T.A.; Aljojo, N. Measuring sustainability through ecological sustainability and human sustainability: A machine learning approach. J. Clean. Prod.; 2019; 240, 118162. [DOI: https://dx.doi.org/10.1016/j.jclepro.2019.118162]
75. Aggarwal, C.C.; Zhai, C. A survey of text classification algorithms. Mining Text Data; Springer: Boston, MA, USA, 2012; pp. 163-222.
76. Kim, D.; Kim, S. Sustainable supply chain based on news articles and sustainability reports: Text mining with Leximancer and DICTION. Sustainability; 2017; 9, 1008. [DOI: https://dx.doi.org/10.3390/su9061008]
77. Deng, Q.; Ji, S.; Wang, Y. Green IT practice disclosure: An examination of corporate sustainability reporting in IT sector. J. Inf. Commun. Ethics Soc.; 2017; 15, pp. 145-164. [DOI: https://dx.doi.org/10.1108/JICES-12-2016-0046]
78. Harymawan, I.; Nasih, M.; Salsabilla, A.; Putra, F.K.G. External assurance on sustainability report disclosure and firm value: Evidence from Indonesia and Malaysia. Entrep. Sustain. Issues; 2020; 7, pp. 1500-1512. [DOI: https://dx.doi.org/10.9770/jesi.2020.7.3(5)]
79. Wang, X.; Yuen, K.F.; Wong, Y.D.; Li, K.X. How can the maritime industry meet Sustainable Development Goals? An analysis of sustainability reports from the social entrepreneurship perspective. Transp. Res. Part D Transp. Environ.; 2020; 78, 102173. [DOI: https://dx.doi.org/10.1016/j.trd.2019.11.002]
80. Shelley, M.; Krippendorff, K. Content analysis: An introduction to its methodology. J. Am. Stat. Assoc.; 1984; 79, 240. [DOI: https://dx.doi.org/10.2307/2288384]
81. Raghupathi, V.; Ren, J.; Raghupathi, W. Studying public perception about vaccination: A sentiment analysis of tweets. Int. J. Environ. Res. Public Health; 2020; 17, 3464. [DOI: https://dx.doi.org/10.3390/ijerph17103464] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32429223]
82. Raghupathi, V.; Zhou, Y.; Raghupathi, W. Legal decision support: Exploring big data analytics approach to modeling pharma patent validity cases. IEEE Access; 2018; 6, pp. 41518-41528. [DOI: https://dx.doi.org/10.1109/ACCESS.2018.2859052]
83. Dietterich, T.G. Machine learning in ecosystem informatics and sustainability. Proceedings of the Twenty-First International Joint Conference on Artificial Intelligence; Pasadena, CA, USA, 11–17 July 2009.
84. Parris, T.M. Corporate sustainability reporting. Environment; 2006; 48, 3. [DOI: https://dx.doi.org/10.3200/ENVT.48.5.3-3]
85. Smith, J.A., III. The CERES principles: A voluntary code for corporate environmental responsibility. Yale J. Int’l L.; 1993; 18, 307.
86. Chuang, J.; Manning, C.D.; Heer, J. Termite: Visualization techniques for assessing textual topic models. Proceedings of the International Working Conference on Advanced Visual Interfaces; Capri, Italy, 21–25 May 2012; pp. 74-77.
87. Sievert, C.; Shirley, K. LDAvis: A method for visualizing and interpreting topics. Proceedings of the Workshop on Interactive Language Learning, Visualization, and Interfaces; Baltimore, MD, USA, 27 June 2014; pp. 63-70.
88. De Lucia, C.; Pazienza, P.; Bartlett, M. Does good ESG lead to better financial performances by firms? Machine learning and logistic regression models of public enterprises in Europe. Sustainability; 2020; 12, 5317. [DOI: https://dx.doi.org/10.3390/su12135317]
89. Sardana, D.; Gupta, N.; Kumar, V.; Terziovski, M. CSR ‘sustainability’ practices and firm performance in an emerging economy. J. Clean. Prod.; 2020; 258, 120766. [DOI: https://dx.doi.org/10.1016/j.jclepro.2020.120766]
90. Armstrong, A. Ethics and ESG. Australas. Account. Bus. Financ. J.; 2020; 14, pp. 6-17. [DOI: https://dx.doi.org/10.14453/aabfj.v14i3.2]
91. Chouaibi, S.; Affes, H. The effect of social and ethical practices on environmental disclosure: Evidence from an international ESG data. Corp. Gov. Int. J. Bus. Soc.; 2021; 21, pp. 1293-1317. [DOI: https://dx.doi.org/10.1108/CG-03-2020-0087]
92. Chouaibi, S.; Rossi, M.; Siggia, D.; Chouaibi, J. Exploring the moderating role of social and ethical practices in the relationship between environmental disclosure and financial performance: Evidence from ESG companies. Sustainability; 2022; 14, 209. [DOI: https://dx.doi.org/10.3390/su14010209]
93. Novoselov, A.; Potravny, I.; Novoselova, I.; Gassiy, V. Social investing modeling for sustainable development of the Russian Arctic. Sustainability; 2022; 14, 933. [DOI: https://dx.doi.org/10.3390/su14020933]
94. Novoselov, A.; Potravny, I.; Novoselova, I.; Gassiy, V.; Sharkova, A. Harmonization of interests during Arctic industrial development: The case of mining corporation and indigenous peoples in Russia. Polar Sci.; 2022; 100915. [DOI: https://dx.doi.org/10.1016/j.polar.2022.100915]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
As sustainability becomes fundamental to companies, voluntary and mandatory disclosures or corporate sustainability practices have become a key source of information for various stakeholders, including regulatory bodies, environmental watchdogs, nonprofits and NGOs, investors, shareholders, and the public at large. Understanding sustainability practices by analyzing a large volume of disclosures poses major challenges, given that the information is mostly in the form of text. Applying machine learning and text analytic methods, we analyzed approximately 25,428 disclosure reports for the period of 2011 to 2020, extracted from the Securities and Exchange Commission (SEC) filings and made available at the Ceres website via application programming interfaces (APIs). Our study identified six industry clusters from the K-means and six main topics from the latent Dirichlet allocation (LDA) method that related to the disclosure of climate-change-related environmental concerns. Both methods produced overlapping results that further reinforce and enhance our understanding of climate-change-related disclosure at various levels, such as sector, industry, and topic. Our analysis shows that companies are concerned primarily with the topics of gas emission, carbon risk, climate change, loss and damage, renewable energy, and financial impact when disclosing climate-change-related issues to the government. The study has implications for corporate sustainability practices, the communication and dissemination of such practices to stakeholders at large and furthering our understanding of sustainability in general.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer





