ESG Greenwashing Detection Model
Multi-task PhoBERT model for Vietnamese ESG content analysis.
Model Architecture
4-task learning:
- Greenwashing Classification (Legitimate/Greenwashing/Uncertain)
- ESG Pillar Classification (Environmental/Social/Governance/General)
- Content Quality Scoring (0-100)
- ESG Score Prediction (0-100)
Training
- Base Model: vinai/phobert-base
- Strategy: stratified_group_kfold_3
- Folds: 2
- Total Samples: 46
Performance
Greenwashing Detection
- F1 Score: 0.473
- Precision: 0.480
- Recall: 0.483
Pillar Classification
- Accuracy: 0.804
- F1 Macro: 0.297
Quality Scoring
- MAE: 38.771
- R²: -64.195
ESG Score Prediction
- MAE: 38.070
- R²: -5.921
- Correlation: -0.034
Usage
from transformers import AutoTokenizer, AutoModel
import torch
tokenizer = AutoTokenizer.from_pretrained("hiennthp/esg-bank-model-v3")
# Load model architecture then weights
# model = MultiTaskPhoBERT(config)
# model.load_state_dict(torch.load("best_model_fold0.pt"))
Files
best_model_fold0.pt- Fold 0 model weightsbest_model_fold2.pt- Fold 1 model weightsstep5_metrics.json- Detailed metrics with per-fold breakdowntokenizer/- PhoBERT tokenizer files
Citation
@software{esg_greenwashing_model,
author = {ESG Research Team},
title = {Vietnamese ESG Greenwashing Detection Model},
year = {2026},
url = {https://huggingface.co/hiennthp/esg-bank-model-v3}
}