Pricing & Performance
Tokens Per Second
The rate at which a model generates tokens after the first one.
Published 2026-06-12
Related terms
Explore the glossary
Find definitions for AI, LLM, MCP, RAG, agent, and prompt engineering terms.
Browse all termsRelated Resources
API Pricing
GlossaryThe cost structure for using a model or service via API, usually per input and output tokens.
DeepSeek Coder Architect
PromptLeverage DeepSeek Coder for complex software architecture, code generation, and technical problem-solving with advanced reasoning.
3D Printing Optimizer
SkillOptimize 3D models for additive manufacturing considering orientation, supports, infill, and material properties.
Firecrawl
MCP ServerOfficial Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.
Input Token
GlossaryA token counted from the prompt sent to a model.