Google logo
Multimodal

Gemini 2.5 Pro Preview

Google Gemini 2.5-pro-preview-06-05 June 5, 2025
Featured

June 2025 preview with Deep-Think reasoning, 2 M-token context, full multimodality.

Model Capabilities

Input

Accepted Formats

Text
Image
Audio

Input Length

Context Window: 1M Tokens

Output

Generated Formats

Text
Image
Audio
Video

Output Length

Max Output: 65,000 Tokens

Core Features & Strengths

Function Calling / Tools
Function Calling
Multimodal Input
Large Context

Specifications & Pricing

Specifications

Model Size: Unknown params
Architecture: Transformer

Pricing

Input / 1M tk

$0.1500

Output / 1M tk

$0.6000

Est. cost in USD

Usage Guidance

Prompting Guide

General Tips

  • **Be Specific & Clear:** Detail the desired outcome, format, tone, and constraints.
  • **Provide Context:** Explain the background, your role, or the purpose (e.g., "Act as a helpful architect...").
  • **Specify Format:** Request bullet points, JSON, markdown, code blocks, etc.
  • **Use Examples (Few-Shot):** Provide 1-3 examples of input/output for complex tasks.
  • **Iterate:** Refine prompts based on the model's responses. Don't expect perfection first try.

Vision Prompt Example

Analyze the attached site plan. List potential challenges for drainage and suggest mitigation strategies.

Function Calling Example

"Book a meeting room for 3 people tomorrow at 2 PM for 1 hour. Find available rooms first."

// Model should trigger 'find_rooms' then 'book_room'