pii-redactor / NOTICE
louis030195's picture
contact: correct Google Workspace inbox (louis@screenpi.pe)
c1b2bc4 verified
screenpipe-pii-redactor
Copyright 2026 Mediar, Inc.
This product is a derivative work of the OpenAI Privacy Filter
(https://github.com/openai/privacy-filter), licensed under the
Apache License, Version 2.0. The full upstream Apache 2.0 license
text is preserved in LICENSE.upstream-apache2.txt.
The base model architecture, tokenizer, and pretrained weights are the
work of OpenAI and are licensed Apache 2.0. screenpipe-pii-redactor
extends those weights via supervised fine-tuning on a custom corpus and
re-initializes the output head for a 12-label PII taxonomy specific to
desktop activity logs.
Significant modifications introduced by this derivative:
- Output head re-initialized for a 12-class PII label space
(29 rows copied from upstream where labels aligned, 20 rows
initialized from random for new classes; see model/finetune_summary.json).
- Fine-tuned for 3 epochs on a mixed corpus of:
* synthetic accessibility / window-title / OCR data
(privatenot redistributed)
* a 25% slice of ai4privacy/pii-masking-300k (CC-BY-4.0)
with labels mapped to the 12-class taxonomy
* targeted secret-shape augmentation (private)
- Context window n_ctx raised from 128 to 256.
- Hyperparameters: batch_size=4, lr=1e-4, weight_decay=0,
max_grad_norm=1.0, shuffle_seed=1337.
Distribution license:
- The fine-tuned weights and accompanying materials in this repository
(the "Derivative Work") are licensed under CC BY-NC 4.0; see LICENSE.
- The Apache 2.0 obligations on the base model are preserved by:
(a) shipping LICENSE.upstream-apache2.txt with this repo,
(b) attributing OpenAI Privacy Filter in README.md,
(c) declaring significant modifications above.
- "OpenAI" and "Privacy Filter" are trademarks / brands of OpenAI; this
derivative does not use those marks to endorse or suggest endorsement
of this work by OpenAI.
Third-party datasets used during fine-tuning:
- ai4privacy/pii-masking-300k
https://huggingface.co/datasets/ai4privacy/pii-masking-300k
Licensed CC-BY-4.0.
Questions about license compatibility or commercial use:
louis@screenpi.pe