contactdoc-tokenizer / tokenizer_config.json
WillHeld: Upload ContactDoc fixed-vocab tokenizer (commit aebda50, verified)
{
  "backend": "tokenizers",
  "bos_token": "<begin_sequence>",
  "eos_token": "<end>",
  "model_max_length": 1000000000000000019884624838656,
  "pad_token": "<pad>",
  "tokenizer_class": "PreTrainedTokenizerFast",
  "unk_token": "<UNK>"
}
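
A minimal sketch of how this config could be inspected with the Python standard library alone. The config text is copied inline so the snippet runs without downloading anything; the helper name `summarize_config` and the constant `NO_LIMIT_SENTINEL` are hypothetical and not part of the repository. The sketch assumes the `model_max_length` value is the `int(1e30)` placeholder that the `transformers` library typically writes when no explicit length limit is set, rather than a real context length.

```python
import json

# Config text copied from tokenizer_config.json above.
CONFIG_TEXT = """
{
  "backend": "tokenizers",
  "bos_token": "<begin_sequence>",
  "eos_token": "<end>",
  "model_max_length": 1000000000000000019884624838656,
  "pad_token": "<pad>",
  "tokenizer_class": "PreTrainedTokenizerFast",
  "unk_token": "<UNK>"
}
"""

# Assumption: values at or above int(1e30) mean "no explicit length limit".
NO_LIMIT_SENTINEL = int(1e30)


def summarize_config(text):
    """Return the special-token map and whether a real length limit is set."""
    config = json.loads(text)
    # Collect every key that names a special token (bos, eos, pad, unk).
    special_tokens = {k: v for k, v in config.items() if k.endswith("_token")}
    max_len = config.get("model_max_length")
    has_length_limit = max_len is not None and max_len < NO_LIMIT_SENTINEL
    return special_tokens, has_length_limit


special_tokens, has_length_limit = summarize_config(CONFIG_TEXT)
print(special_tokens)
print("explicit length limit:", has_length_limit)
```

If the assumption about the placeholder holds, any sequence-length cap has to be applied by the caller (for example when truncating inputs), since the config itself does not supply one.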