API Pricing

Qwen: Qwen3 VL 8B Thinking API Pricing

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

Input / 1M tokens

$0.12

Output / 1M tokens

$1.36

Cost per request (1K in / 500 out)

$0.0008

~$0.80 per 1M tokens mixed

Specifications

Context Window 256K tokens

Max Output Tokens 32,768

Vision Support Yes

Function Calling No

Streaming Yes

Modality multimodal

Release Date 2025-10-14

Price history will appear here after the first price change is detected.

More from Qwen

Qwen: Qwen Plus 0728

Input / 1M tokens

$0.26

Output / 1M tokens

$0.78

Context: 1.0M Streaming

Qwen: Qwen Plus 0728 (thinking)

Input / 1M tokens

$0.26

Output / 1M tokens

$0.78

Context: 1.0M Streaming

Qwen: Qwen-Plus

Input / 1M tokens

$0.26

Output / 1M tokens

$0.78

Context: 1.0M Streaming

Qwen: Qwen2.5 7B Instruct

Input / 1M tokens

$0.04

Output / 1M tokens

$0.10

Context: 131K Streaming

Qwen: Qwen2.5 VL 72B Instruct

Multimodal

Input / 1M tokens

$0.80

Output / 1M tokens

$1.00

Context: 131K Vision Streaming

Similar Models from Other Providers

Auto Router

Multimodal

Input / 1M tokens

Variable

Output / 1M tokens

Variable

Context: 2.0M Vision Streaming

Google: Gemma 4 26B A4B (free)

Multimodal

Input / 1M tokens

Free

Output / 1M tokens

Free

Context: 262K Vision Streaming

Google: Gemma 4 31B (free)

Multimodal

Input / 1M tokens

Free

Output / 1M tokens

Free

Context: 262K Vision Streaming

Google: Lyria 3 Clip Preview

Multimodal

Input / 1M tokens

Free

Output / 1M tokens

Free

Context: 1.0M Vision Streaming

Google: Lyria 3 Pro Preview

Multimodal

Input / 1M tokens

Free

Output / 1M tokens

Free

Context: 1.0M Vision Streaming

Compare Qwen: Qwen3 VL 8B Thinking

Qwen: Qwen3 VL 8B Thinking

Side-by-side comparison

Qwen: Qwen3 VL 8B Thinking

Google: Gemma 4 26B A4B (free)

Side-by-side comparison

Qwen: Qwen3 VL 8B Thinking

Google: Gemma 4 31B (free)

Side-by-side comparison

Cost Calculator

Estimate your API spend

Side-by-side comparison

Lowest cost models ranked

All Qwen Models

Browse provider pricing