The basic units of text that language models process, typically word fragments.
Tokens are the fundamental units that language models use to process and generate text. Rather than working with individual characters or complete words, most LLMs use subword tokenization, which breaks text into meaningful chunks that balance vocabulary size with representation efficiency.
Common tokenization schemes include BPE (Byte Pair Encoding) and SentencePiece. A rough approximation is that 1 token equals about 4 characters or 0.75 words in English, though this varies by language and content type. Code often tokenizes less efficiently than prose.
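To make the merge process behind BPE concrete, here is a toy sketch in plain Python (not a real tokenizer; the corpus and merge count are arbitrary illustrations). It starts from individual characters and repeatedly fuses the most frequent adjacent pair into a new subword symbol:

```python
from collections import Counter

def most_frequent_pair(tokens):
    """Count adjacent symbol pairs and return the most frequent one."""
    pairs = Counter(zip(tokens, tokens[1:]))
    return pairs.most_common(1)[0][0] if pairs else None

def bpe_train(text, num_merges):
    """Learn a merge table by repeatedly fusing the most frequent pair."""
    tokens = list(text)  # start from individual characters
    merges = []
    for _ in range(num_merges):
        pair = most_frequent_pair(tokens)
        if pair is None:
            break
        merged = pair[0] + pair[1]
        merges.append(pair)
        # Replace every occurrence of the pair with the merged symbol.
        out, i = [], 0
        while i < len(tokens):
            if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
                out.append(merged)
                i += 2
            else:
                out.append(tokens[i])
                i += 1
        tokens = out
    return tokens, merges

tokens, merges = bpe_train("low lower lowest", num_merges=3)
print(tokens)  # subword units after 3 merges, e.g. ['low', ' ', 'lowe', 'r', ...]
print(merges)  # the learned merge rules, e.g. [('l', 'o'), ('lo', 'w'), ...]
```

Production tokenizers apply the same idea at byte level over massive corpora with tens of thousands of merges, which is how frequent words end up as single tokens while rare words split into fragments.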
Understanding tokens is essential for AI engineers because: (1) API pricing is typically per-token, (2) context windows are measured in tokens, (3) generation speed is measured in tokens per second, and (4) token boundaries can affect model behavior. Tools like tiktoken help count tokens for specific models, enabling cost estimation and context management in production applications.
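A minimal sketch of token counting with tiktoken, the library named above. The choice of the cl100k_base encoding and the per-token price are assumptions for illustration; match the encoding to your target model and check your provider's current rates:

```python
import tiktoken

# Load an encoding used by many recent OpenAI models (an assumption;
# use tiktoken.encoding_for_model(...) to match a specific model).
enc = tiktoken.get_encoding("cl100k_base")

prose = "Tokens are the fundamental units that language models process."
code = "def f(x):\n    return x ** 2 + 1"

# Compare how efficiently prose and code tokenize.
for label, text in [("prose", prose), ("code", code)]:
    n_tokens = len(enc.encode(text))
    print(f"{label}: {len(text)} chars -> {n_tokens} tokens "
          f"({len(text) / n_tokens:.1f} chars/token)")

# Hypothetical input price, illustrative only -- not a real rate.
PRICE_PER_1K_INPUT_TOKENS = 0.0025  # USD per 1,000 tokens (assumed)
prompt_tokens = len(enc.encode(prose))
cost = prompt_tokens / 1000 * PRICE_PER_1K_INPUT_TOKENS
print(f"Estimated input cost: ${cost:.6f}")
```

Counting tokens this way before sending a request lets you budget context-window usage and estimate cost without calling the API.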