Skip to main content
API Pricing

ByteDance: UI-TARS 7B API Pricing

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

Input / 1M tokens
$0.10
Output / 1M tokens
$0.20
Cost per request (1K in / 500 out)
$0.0002
~$0.20 per 1M tokens mixed

Specifications

Context Window 128K tokens
Max Output Tokens 2,048
Vision Support Yes
Function Calling No
Streaming Yes
Modality multimodal
Release Date 2025-07-22
Price history will appear here after the first price change is detected.

Similar Models from Other Providers

Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 2.0M Vision Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 262K Vision Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 262K Vision Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Vision Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Vision Streaming