Research
U-Net, a new recursive tokenizer
It avoids predefined vocabularies and memory-heavy embedding tables. Instead, it uses autoregressive U-Nets to embed information directly from raw bytes, which allows an unbounded vocabulary, among other benefits. https:
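
To make the "raw bytes" idea concrete, here is a minimal sketch of byte-level input in Python. It illustrates only the input side and is not the U-Net model itself; the helper names `to_byte_ids` and `from_byte_ids` are hypothetical and chosen for this example. Every string maps to UTF-8 byte values in the range 0 to 255, so no vocabulary file is needed and no word can be out of vocabulary.

```python
def to_byte_ids(text: str) -> list[int]:
    """Encode text as UTF-8 and return the byte values (each in 0..255)."""
    return list(text.encode("utf-8"))


def from_byte_ids(ids: list[int]) -> str:
    """Invert to_byte_ids: rebuild the string from its UTF-8 byte values."""
    return bytes(ids).decode("utf-8")


# Hypothetical usage: a non-ASCII character simply becomes several bytes.
ids = to_byte_ids("héllo")
print(ids)                 # [104, 195, 169, 108, 108, 111]
print(from_byte_ids(ids))  # héllo
```

The input alphabet stays fixed at 256 symbols regardless of language or spelling. How those bytes are then grouped and embedded is the job of the model described above, which this sketch does not attempt to reproduce.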