SDG Fine-tuned Enhanced Model

A Sentence Transformer model fine-tuned for SDG (Sustainable Development Goals) alignment tasks. This model is designed to classify and analyze text activities according to the 17 UN Sustainable Development Goals.

Model Description

  • Base Model: all-mpnet-base-v2
  • Fine-tuned for: SDG alignment and classification
  • Embedding Dimension: 768
  • Max Sequence Length: 384
  • Language: English

Intended Use

This model is specifically trained to:

  1. Align activities with SDGs: Determine which SDG(s) a given activity or text passage relates to
  2. Semantic similarity: Measure how similar activities are in the context of SDG alignment
  3. Text classification: Classify text into SDG categories for sustainability reporting

Primary Use Cases

  • Local government annual report analysis for SDG alignment
  • Corporate sustainability reporting
  • Research on SDG-related activities
  • Automated SDG tagging systems

Usage

Using Sentence Transformers

from sentence_transformers import SentenceTransformer

# Load the model
model = SentenceTransformer('voyager205/sdg-finetuned-enhanced')

# Encode sentences
sentences = [
    "The council implemented a new recycling program to reduce waste.",
    "New solar panels were installed on community buildings.",
    "The health department launched a vaccination campaign."
]

embeddings = model.encode(sentences)
print(f"Embedding shape: {embeddings.shape}")  # (3, 768)

For SDG Alignment

from sentence_transformers import SentenceTransformer
import numpy as np

model = SentenceTransformer('voyager205/sdg-finetuned-enhanced')

# SDG descriptions or keywords
sdg_texts = [
    "No Poverty - End poverty in all its forms everywhere",
    "Zero Hunger - End hunger, achieve food security",
    "Good Health and Well-being - Ensure healthy lives",
    # ... all 17 SDGs
]

# Encode SDG descriptions and activities
sdg_embeddings = model.encode(sdg_texts)
activity_embedding = model.encode(["The council built affordable housing units."])

# Find most aligned SDG
similarities = np.dot(activity_embedding, sdg_embeddings.T)
most_aligned_sdg = np.argmax(similarities) + 1
print(f"Most aligned SDG: {most_aligned_sdg}")

Training Details

Training Data

The model was fine-tuned on council annual reports and SDG-related activities from Australian local governments, including:

  • Activity descriptions from annual reports
  • SDG-aligned text samples for each of the 17 SDGs
  • Manually curated and validated training examples

Training Configuration

  • Base Model: all-mpnet-base-v2
  • Fine-tuning Method: Contrastive learning with SDG-specific pairs
  • Training Framework: Sentence Transformers

Performance

This enhanced model provides improved accuracy for SDG alignment tasks compared to the base model:

  • Better semantic understanding of SDG-related content
  • More accurate classification of government activities
  • Improved handling of Australian local government terminology

Limitations

  • Trained primarily on Australian local government documents
  • May have reduced accuracy for other contexts or regions
  • English language only
  • Best suited for activity-level text (sentences to paragraphs)

Ethical Considerations

  • This model should be used as a tool to assist human analysis, not replace it
  • Results should be validated by domain experts for critical applications
  • Consider potential biases in the training data

Citation

If you use this model in your research, please cite:

@software{sdg_finetuned_enhanced,
  author = {voyager205},
  title = {SDG Fine-tuned Enhanced Model},
  year = {2025},
  url = {https://huggingface.co/voyager205/sdg-finetuned-enhanced}
}

License

MIT License

Contact

For questions or issues with this model, please open an issue on the Hugging Face repository.

Downloads last month
45
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support