ESG Greenwashing Detection Model

Multi-task PhoBERT model for Vietnamese ESG content analysis.

Model Architecture

4-task learning:

  1. Greenwashing Classification (Legitimate/Greenwashing/Uncertain)
  2. ESG Pillar Classification (Environmental/Social/Governance/General)
  3. Content Quality Scoring (0-100)
  4. ESG Score Prediction (0-100)

Training

  • Base Model: vinai/phobert-base
  • Strategy: stratified_group_kfold_3
  • Folds: 2
  • Total Samples: 46

Performance

Greenwashing Detection

  • F1 Score: 0.473
  • Precision: 0.480
  • Recall: 0.483

Pillar Classification

  • Accuracy: 0.804
  • F1 Macro: 0.297

Quality Scoring

  • MAE: 38.771
  • R²: -64.195

ESG Score Prediction

  • MAE: 38.070
  • R²: -5.921
  • Correlation: -0.034

Usage

from transformers import AutoTokenizer, AutoModel
import torch

tokenizer = AutoTokenizer.from_pretrained("hiennthp/esg-bank-model-v3")
# Load model architecture then weights
# model = MultiTaskPhoBERT(config)
# model.load_state_dict(torch.load("best_model_fold0.pt"))

Files

  • best_model_fold0.pt - Fold 0 model weights
  • best_model_fold2.pt - Fold 1 model weights
  • step5_metrics.json - Detailed metrics with per-fold breakdown
  • tokenizer/ - PhoBERT tokenizer files

Citation

@software{esg_greenwashing_model,
  author = {ESG Research Team},
  title = {Vietnamese ESG Greenwashing Detection Model},
  year = {2026},
  url = {https://huggingface.co/hiennthp/esg-bank-model-v3}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support