Part 6
Henrique C. Martins
This is a lecture about:
Why this type of lecture is important?
Well, if we don’t understand what works and what don’t, we can never improve.
Also, analyzing data might be the best way to separate the underlying effects from the all the “buzz” that is out there.
The source of data is: Refinitiv for the years 2015-2020.
Therefore, our discussion can be affected by their score methodology.
Finally, these are simple stylized facts about the data. We cannot make strong conclusions using these graphs.
There are several question we can try answer with data:
library(ggpubr)
g1 <- ggplot(esg, aes(x=`ESG Score`)) + geom_histogram(binwidth=1) + theme_solarized()
g2 <- ggplot(esg, aes(x=`Environmental Pillar Score`)) + geom_histogram(binwidth=1) + theme_solarized()
g3 <- ggplot(esg, aes(x=`Social Pillar Score`)) + geom_histogram(binwidth=1) + theme_solarized()
g4 <- ggplot(esg, aes(x=`Governance Pillar Score`)) + geom_histogram(binwidth=1) + theme_solarized()
ggarrange(g1, g2, g3, g4, ncol = 2, nrow = 2)
h1<- ggplot(esg, aes(y= `Environmental Pillar Score`, x=`Social Pillar Score`)) + geom_point() + geom_smooth(method=lm, se=FALSE) + theme_solarized()
h2<- ggplot(esg, aes(y= `Environmental Pillar Score`, x=`Governance Pillar Score`)) + geom_point() + geom_smooth(method=lm, se=FALSE) + theme_solarized()
h3<- ggplot(esg, aes(y= `Social Pillar Score`, x=`Governance Pillar Score`)) + geom_point() + geom_smooth(method=lm, se=FALSE) + theme_solarized()
ggarrange(h1, h2, h3, ncol = 2, nrow = 2)
There are several question we can try answer with data:
Call:
lm(formula = esg$MtB ~ esg$`ESG Score`)
Residuals:
Min 1Q Median 3Q Max
-2.527 -1.209 -0.651 0.330 59.583
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 2.664010 0.057120 46.64 <2e-16 ***
esg$`ESG Score` -0.019416 0.001228 -15.81 <2e-16 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 2.665 on 11378 degrees of freedom
(1556 observations deleted due to missingness)
Multiple R-squared: 0.0215, Adjusted R-squared: 0.02142
F-statistic: 250.1 on 1 and 11378 DF, p-value: < 2.2e-16
There are several question we can try answer with data:
There are several question we can try answer with data:
There are several question we can try answer with data:
“Improving” as measured by the Score.
There are several question we can try answer with data:
There are several question we can try answer with data:
# A tibble: 18 × 2
`NAICS Sector Name` mean
<chr> <dbl>
1 Accommodation and Food Services 44.8
2 Administrative and Support and Waste Management and Remediation Servic… 41.9
3 Agriculture, Forestry, Fishing and Hunting 36.0
4 Arts, Entertainment, and Recreation 29.6
5 Construction 40.6
6 Educational Services 35.3
7 Finance and Insurance 35.8
8 Health Care and Social Assistance 39.5
9 Information 37.7
10 Manufacturing 43.3
11 Mining, Quarrying, and Oil and Gas Extraction 39.0
12 Other Services (except Public Administration) 32.3
13 Professional, Scientific, and Technical Services 35.4
14 Real Estate and Rental and Leasing 41.0
15 Retail Trade 46.3
16 Transportation and Warehousing 43.8
17 Utilities 49.3
18 Wholesale Trade 38.4
Find me at: