ESG data remains remarkably inconsistent across companies and markets. Some firms report detailed carbon emissions. Others give partial figures. Some barely report anything at all. That creates a real problem for investors who want to compare companies on climate risk, efficiency, and sustainability.
This paper asks a direct question: can machine learning fill the gap? The answer is yes — and the way it does so reveals something important about the relationship between how companies are governed and how they manage their environmental footprint.
The study: what the researchers tested
The authors tested multiple machine learning models to forecast carbon emission intensity for publicly listed firms. The models were evaluated across both pre-pandemic and post-pandemic periods to check whether they hold up when the business environment fundamentally changes.
The key result: XGBoost, a non-linear ensemble model, consistently outperformed other algorithms. This isn't just a statistical win — it means that carbon emissions are shaped by complex, non-linear relationships that simpler linear approaches miss. And the model remains robust across different economic regimes, which matters for real-world deployment.
Governance as a proxy for emissions
The most striking finding is what happens when direct emissions data are unavailable. The authors found that corporate governance metrics can serve as powerful proxies for environmental performance.
The governance variables that stood out:
- Board characteristics — composition, independence, and expertise
- Oversight structures — how environmental risks are monitored at board level
- Management quality — broader operational discipline reflected in governance scores
These variables carry meaningful signals about a firm's likely carbon profile. In other words, the way a company is structured and managed tells you something about how it handles its carbon footprint — even before you see the emissions data.
If emissions data are missing or unreliable, you don't have to stop your ESG analysis. Governance data can serve as a meaningful signal, combined with ML models that predict likely emission intensity. This is especially valuable in markets where disclosure standards are weak or inconsistent.
Why management quality predicts carbon performance
Firms with better board oversight and stronger governance structures tend to be more disciplined about monitoring emissions, responding to climate risks, and implementing environmental controls. Management quality isn't just about decision-making — it reflects broader operational discipline that extends to environmental performance.
This confirms an intuition many ESG practitioners share: governance is not a box-ticking exercise. It actually reflects how well a company manages its environmental footprint.
Who should pay attention
| Audience | What this means for you |
|---|---|
| ESG fund managers | Use governance data to screen companies when emissions disclosure is incomplete — don't wait for perfect data |
| Climate-risk analysts | XGBoost models can benchmark likely emission intensity across portfolios with uneven disclosure |
| Credit teams | Governance metrics add a predictive layer to credit assessments for climate-exposed sectors |
| Policymakers | AI-based methods can identify firms needing closer scrutiny and support sustainable finance regulation |
The bigger picture
This research points toward a more sophisticated view of sustainability analysis — one that is data-driven, adaptive, and predictive. Instead of treating emissions as a single number, you can think about the systems and structures that shape them.
Machine learning won't solve every disclosure problem. But it can help investors and policymakers make better judgments in a world where carbon data are often incomplete. And as reporting standards improve, these models will become even more accurate.
The combination of machine learning, governance data, and carbon analysis is where ESG is heading. Not just reporting better numbers — but building smarter systems that work with imperfect data.
Explore more on AI and ESG in real estate
Sherry Xu's books cover the practical intersection of AI, sustainability, and real estate investment.