gaurv007's picture
Add model card
1b0f4f7 verified
metadata
language: en
license: apache-2.0
tags:
  - legal
  - contract-analysis
  - clause-classification
  - onnx
  - quantized
  - int8
  - legal-bert
  - cuad
pipeline_tag: text-classification
base_model: nlpaueb/legal-bert-base-uncased
datasets:
  - theatticusproject/cuad-qa
library_name: optimum

🛡️ ClauseGuard CUAD Classifier — ONNX INT8 Quantized

ONNX Runtime + dynamic INT8 quantized version of the ClauseGuard legal clause classifier for 2-4x faster CPU inference.

Model Details

Property Value
Base model nlpaueb/legal-bert-base-uncased (110M params)
LoRA adapter Mokshith31/legalbert-contract-clause-classification
Export method merge_and_unload() → ONNX → dynamic INT8 quantization
Labels 41 CUAD clause categories
ONNX FP32 size 438.3 MB
INT8 size 110.3 MB (75% reduction)
Runtime ONNX Runtime (CPU)
Expected speedup 2-4x over PyTorch on CPU

Usage

from optimum.onnxruntime import ORTModelForSequenceClassification
from transformers import AutoTokenizer
import torch

model = ORTModelForSequenceClassification.from_pretrained(
    "gaurv007/clauseguard-onnx-int8",
    file_name="model_quantized.onnx"
)
tokenizer = AutoTokenizer.from_pretrained("gaurv007/clauseguard-onnx-int8")

texts = ["The company may terminate your account at any time without notice."]
inputs = tokenizer(texts, return_tensors="pt", padding=True, truncation=True, max_length=512)

with torch.no_grad():
    outputs = model(**inputs)
    probs = torch.softmax(outputs.logits, dim=-1)
    predicted = probs.argmax(dim=-1)
    print(f"Label: {model.config.id2label[str(predicted.item())]}")
    print(f"Confidence: {probs.max().item():.3f}")

41 CUAD Labels

Document Name, Parties, Agreement Date, Effective Date, Expiration Date, Renewal Term, Notice Period to Terminate Renewal, Governing Law, Most Favored Nation, Non-Compete, Exclusivity, No-Solicit of Customers, No-Solicit of Employees, Non-Disparagement, Termination for Convenience, ROFR/ROFO/ROFN, Change of Control, Anti-Assignment, Revenue/Profit Sharing, Price Restriction, Minimum Commitment, Volume Restriction, IP Ownership Assignment, Joint IP Ownership, License Grant, Non-Transferable License, Affiliate License-Licensor, Affiliate License-Licensee, Unlimited/All-You-Can-Eat License, Irrevocable or Perpetual License, Source Code Escrow, Post-Termination Services, Audit Rights, Uncapped Liability, Cap on Liability, Liquidated Damages, Warranty Duration, Insurance, Covenant Not to Sue, Third Party Beneficiary, Other

Part of ClauseGuard

This model powers ClauseGuard, the open-source legal contract analysis tool.

License

Apache 2.0