CoreX Tokenizer Information
==========================
Vocabulary Size: 32000
Model Type: unigram
Special Tokens:
  PAD: 0 -> ''
  UNK: 1 -> ''
  BOS: 2 -> ''
  EOS: 3 -> ''
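
The special-token IDs listed above could be used roughly as follows when preparing a fixed-length input. This is a minimal sketch, not CoreX's own API: the names `SPECIAL_TOKEN_IDS`, `VOCAB_SIZE`, and `wrap_and_pad` are hypothetical, and it assumes that BOS/EOS are added by the caller and that padding goes on the right, neither of which is stated in this file.

```python
# IDs copied from the listing above; all names here are illustrative only.
SPECIAL_TOKEN_IDS = {"PAD": 0, "UNK": 1, "BOS": 2, "EOS": 3}
VOCAB_SIZE = 32000


def wrap_and_pad(token_ids, max_len):
    """Surround token_ids with BOS/EOS, then right-pad with PAD to max_len.

    Assumption: right-padding and caller-added BOS/EOS; the tokenizer itself
    may handle these differently.
    """
    # Reject IDs outside the vocabulary range stated above.
    for tid in token_ids:
        if not 0 <= tid < VOCAB_SIZE:
            raise ValueError(f"token id {tid} is outside [0, {VOCAB_SIZE})")

    seq = [SPECIAL_TOKEN_IDS["BOS"], *token_ids, SPECIAL_TOKEN_IDS["EOS"]]
    if len(seq) > max_len:
        raise ValueError("sequence is longer than max_len")

    # Fill the remaining positions with the PAD id.
    return seq + [SPECIAL_TOKEN_IDS["PAD"]] * (max_len - len(seq))
```

For example, `wrap_and_pad([10, 11], 6)` places ID 2 first, ID 3 after the content tokens, and fills the remaining positions with ID 0.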