#tokenizer

1 post tagged with #tokenizer.

2025
📏 Bits per byte (BPB) for LLMs: tokenizer-agnostic loss