SDG Fine-tuned Enhanced Model

A Sentence Transformer model fine-tuned for SDG (Sustainable Development Goals) alignment tasks. This model is designed to classify and analyze text activities according to the 17 UN Sustainable Development Goals.

Model Description

Base Model: all-mpnet-base-v2
Fine-tuned for: SDG alignment and classification
Embedding Dimension: 768
Max Sequence Length: 384
Language: English

Intended Use

This model is specifically trained to:

Align activities with SDGs: Determine which SDG(s) a given activity or text passage relates to
Semantic similarity: Measure how similar activities are in the context of SDG alignment
Text classification: Classify text into SDG categories for sustainability reporting

Primary Use Cases

Local government annual report analysis for SDG alignment
Corporate sustainability reporting
Research on SDG-related activities
Automated SDG tagging systems

Usage

Using Sentence Transformers

from sentence_transformers import SentenceTransformer

# Load the model
model = SentenceTransformer('voyager205/sdg-finetuned-enhanced')

# Encode sentences
sentences = [
    "The council implemented a new recycling program to reduce waste.",
    "New solar panels were installed on community buildings.",
    "The health department launched a vaccination campaign."
]

embeddings = model.encode(sentences)
print(f"Embedding shape: {embeddings.shape}")  # (3, 768)

For SDG Alignment

from sentence_transformers import SentenceTransformer
import numpy as np

model = SentenceTransformer('voyager205/sdg-finetuned-enhanced')

# SDG descriptions or keywords
sdg_texts = [
    "No Poverty - End poverty in all its forms everywhere",
    "Zero Hunger - End hunger, achieve food security",
    "Good Health and Well-being - Ensure healthy lives",
    # ... all 17 SDGs
]

# Encode SDG descriptions and activities
sdg_embeddings = model.encode(sdg_texts)
activity_embedding = model.encode(["The council built affordable housing units."])

# Find most aligned SDG
similarities = np.dot(activity_embedding, sdg_embeddings.T)
most_aligned_sdg = np.argmax(similarities) + 1
print(f"Most aligned SDG: {most_aligned_sdg}")

Training Details

Training Data

The model was fine-tuned on council annual reports and SDG-related activities from Australian local governments, including:

Activity descriptions from annual reports
SDG-aligned text samples for each of the 17 SDGs
Manually curated and validated training examples

Training Configuration

Base Model: all-mpnet-base-v2
Fine-tuning Method: Contrastive learning with SDG-specific pairs
Training Framework: Sentence Transformers

Performance

This enhanced model provides improved accuracy for SDG alignment tasks compared to the base model:

Better semantic understanding of SDG-related content
More accurate classification of government activities
Improved handling of Australian local government terminology

Limitations

Trained primarily on Australian local government documents
May have reduced accuracy for other contexts or regions
English language only
Best suited for activity-level text (sentences to paragraphs)

Ethical Considerations

This model should be used as a tool to assist human analysis, not replace it
Results should be validated by domain experts for critical applications
Consider potential biases in the training data

Citation

If you use this model in your research, please cite:

@software{sdg_finetuned_enhanced,
  author = {voyager205},
  title = {SDG Fine-tuned Enhanced Model},
  year = {2025},
  url = {https://huggingface.co/voyager205/sdg-finetuned-enhanced}
}

License

MIT License

Contact

For questions or issues with this model, please open an issue on the Hugging Face repository.

Downloads last month: 45

Safetensors

Model size

0.1B params

Tensor type

F32