Multimodal

Gemini 2.5 Pro Preview

Provider: Google

Model ID: gemini-2.5-pro-preview-06-05

Released: June 5, 2025


Google DeepMind's most advanced model in the Gemini series as of mid-2025, with enhanced reasoning and coding capabilities. It introduces a "Deep Think" mode for complex tasks and supports native audio output with improved security.

Model Capabilities

Input

Accepted Formats

Text

Image

Audio

Input Length

Context Window: 1M Tokens

Output

Generated Formats

Text

Image

Audio

Video

Output Length

Max Output: 65,000 Tokens

Core Features & Strengths

Function Calling / Tools

Function Calling

Multimodal Input

Large Context

What is it good for?

Primary Use Cases

Visual Analysis: Image understanding, document analysis, chart interpretation

Tool Integration: API calls, data processing, automated workflows

Long Documents: Research papers, books, large codebases analysis

Content Creation: Writing, editing, summarization, translation

Code Assistance: Programming help, debugging, code review

Best For

Business Applications: Customer service, content generation

Specifications & Pricing

Specifications

Model Size: Parameter count not disclosed

Architecture: Transformer

Open Source: No

Pricing

Input (per 1M tokens): $0.1500

Output (per 1M tokens): $0.6000

Est. cost in USD

💡 What does this cost in practice?

📖 300-page book: ~200,000 tokens ≈ $0.0300

📄 Research paper: ~15,000 tokens ≈ $0.0022

💬 Chat conversation: ~1,000 tokens ≈ $0.0001

✍️ Generated response: ~500 tokens ≈ $0.0003
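The estimates above appear to follow from simple per-token arithmetic: the first three use the input price and the generated response uses the output price. A minimal sketch of that calculation, assuming the listed prices apply uniformly at every prompt length (the function and constant names are illustrative, not part of any SDK):

```python
# Prices per 1M tokens in USD, copied from the pricing listed on this page.
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60


def estimate_cost(input_tokens: int = 0, output_tokens: int = 0) -> float:
    """Return the estimated cost in USD for one request.

    ASSUMPTION: flat per-token pricing with no tiers, caching, or discounts.
    """
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Reproduce the scenarios from the list above.
scenarios = {
    "300-page book (input)": estimate_cost(input_tokens=200_000),
    "Research paper (input)": estimate_cost(input_tokens=15_000),
    "Chat conversation (input)": estimate_cost(input_tokens=1_000),
    "Generated response (output)": estimate_cost(output_tokens=500),
}
for label, cost in scenarios.items():
    print(f"{label}: ${cost:.5f}")
```

A real request usually combines both sides, for example `estimate_cost(input_tokens=15_000, output_tokens=500)` for summarizing a research paper.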


Usage Guidance

How to Prompt This Model

✨ General Tips

🎯 Be Specific: Detail the desired outcome, format, tone, and constraints.

🎭 Set Context: Explain your role or the purpose (e.g., "Act as a helpful architect...").

📝 Specify Format: Request bullet points, JSON, markdown, code blocks, etc.

🔄 Iterate: Refine prompts based on responses. Don't expect perfection on the first try.
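The tips above can be combined into a small prompt template. The following is a minimal sketch, not a prescribed method: `build_prompt` and its parameters are hypothetical names chosen for illustration, and the resulting string can be sent to the model through whatever client you use.

```python
from typing import Optional, Sequence


def build_prompt(
    task: str,
    role: Optional[str] = None,
    constraints: Sequence[str] = (),
    output_format: Optional[str] = None,
) -> str:
    """Assemble a prompt from context, a specific task, constraints, and format."""
    parts = []
    if role:
        # "Set Context": tell the model which role to take.
        parts.append(f"Act as {role}.")
    # "Be Specific": the task itself, stated in full.
    parts.append(task.strip())
    if constraints:
        bullet_list = "\n".join(f"- {item}" for item in constraints)
        parts.append(f"Constraints:\n{bullet_list}")
    if output_format:
        # "Specify Format": request the output shape explicitly.
        parts.append(f"Format the response as {output_format}.")
    return "\n\n".join(parts)


prompt = build_prompt(
    task="Review this floor plan for accessibility issues.",
    role="a helpful architect",
    constraints=["Keep it under 200 words", "Use plain language"],
    output_format="bullet points",
)
print(prompt)
```

Iterating then amounts to adjusting the arguments (tightening a constraint, changing the format) and comparing responses.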

🚀 Best Practices for Gemini 2.5 Pro Preview

Large Context Window

Leverage the 1M-token context window by including full documents, multiple examples, or comprehensive background information.
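Before sending full documents, it can help to check roughly whether they fit. The sketch below assumes about 4 characters per token, a common rule of thumb for English text rather than an exact tokenizer count; for precise numbers, use the provider's own token-counting facility. All names are illustrative.

```python
from typing import Sequence

CONTEXT_WINDOW_TOKENS = 1_000_000  # context window listed on this page
CHARS_PER_TOKEN = 4  # ASSUMPTION: rough heuristic for English text


def estimate_tokens(text: str) -> int:
    """Roughly estimate the token count from character length."""
    return len(text) // CHARS_PER_TOKEN + 1


def fits_in_context(documents: Sequence[str], reserve_tokens: int = 0) -> bool:
    """Check whether the documents plus an optional safety margin fit.

    reserve_tokens is a hypothetical margin for instructions or estimate error.
    """
    total = sum(estimate_tokens(doc) for doc in documents)
    return total + reserve_tokens <= CONTEXT_WINDOW_TOKENS


documents = ["Chapter one text. " * 1000, "Chapter two text. " * 1000]
print(fits_in_context(documents, reserve_tokens=10_000))
```

If the check fails, splitting the material or summarizing the least relevant documents first is usually simpler than trimming text arbitrarily.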

Vision Capabilities

Describe what you want analyzed in images. Be specific about details, text, or patterns you want identified.

Function Calling

Describe your goal naturally. The model will determine which tools to use and in what order.

📋 Example Prompts

Content Creation Example

Write a professional email to a client explaining a project delay. The project is a website redesign, the delay is due to additional security requirements, and the new timeline is 2 weeks. Keep the tone apologetic but confident.

Vision Analysis Example

Analyze this architectural drawing. Identify:
1. Room types and their approximate sizes
2. Potential accessibility issues
3. Suggestions for improving natural light
4. Any code compliance concerns you notice

Function Calling Example

I need to schedule a team meeting for next Tuesday at 2 PM. First check everyone's availability, then book a conference room that fits 8 people, and finally send calendar invites to the team.
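For a prompt like this to work, the application has to declare the tools and execute whatever calls the model requests. The sketch below illustrates that application side only, under stated assumptions: the three tool names, their parameters, the stub handlers, and the call format are all hypothetical and not tied to any specific SDK, and the model's requests are simulated with a hard-coded list.

```python
from typing import Any, Callable

# HYPOTHETICAL tool declarations in a JSON-Schema-like shape.
TOOLS: list[dict[str, Any]] = [
    {
        "name": "check_availability",
        "description": "Check whether all attendees are free at a given time.",
        "parameters": {
            "type": "object",
            "properties": {
                "attendees": {"type": "array", "items": {"type": "string"}},
                "start": {"type": "string", "description": "ISO 8601 start time"},
            },
            "required": ["attendees", "start"],
        },
    },
    {
        "name": "book_room",
        "description": "Book a conference room with at least the given capacity.",
        "parameters": {
            "type": "object",
            "properties": {
                "capacity": {"type": "integer"},
                "start": {"type": "string"},
            },
            "required": ["capacity", "start"],
        },
    },
    {
        "name": "send_invites",
        "description": "Send calendar invites to the attendees.",
        "parameters": {
            "type": "object",
            "properties": {
                "attendees": {"type": "array", "items": {"type": "string"}},
                "start": {"type": "string"},
                "room": {"type": "string"},
            },
            "required": ["attendees", "start", "room"],
        },
    },
]


# Stub handlers standing in for real calendar and room-booking systems.
def check_availability(attendees: list[str], start: str) -> dict[str, Any]:
    return {"all_free": True, "start": start}


def book_room(capacity: int, start: str) -> dict[str, Any]:
    return {"room": "Room A", "capacity": capacity, "start": start}


def send_invites(attendees: list[str], start: str, room: str) -> dict[str, Any]:
    return {"sent": len(attendees), "room": room, "start": start}


HANDLERS: dict[str, Callable[..., dict[str, Any]]] = {
    "check_availability": check_availability,
    "book_room": book_room,
    "send_invites": send_invites,
}


def dispatch(call: dict[str, Any]) -> dict[str, Any]:
    """Run the handler named in a tool call with the supplied arguments."""
    handler = HANDLERS[call["name"]]
    return handler(**call["args"])


# Simulated sequence of calls a model might request for the prompt above.
team = ["ana@example.com", "ben@example.com"]  # placeholder addresses
start = "2025-06-10T14:00"  # placeholder for "next Tuesday at 2 PM"
requested_calls = [
    {"name": "check_availability", "args": {"attendees": team, "start": start}},
    {"name": "book_room", "args": {"capacity": 8, "start": start}},
    {"name": "send_invites", "args": {"attendees": team, "start": start, "room": "Room A"}},
]
results = [dispatch(call) for call in requested_calls]
for call, result in zip(requested_calls, results):
    print(call["name"], "->", result)
```

In a real integration, each result would be returned to the model so it can decide the next call; here the order is fixed only to mirror the three steps in the prompt.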

Long Document Analysis

[Attach full research paper]

Analyze this research paper and provide:
1. A 3-sentence summary of the main findings
2. List of 5 key methodologies used
3. Potential limitations or biases
4. How this relates to current industry practices
5. Actionable insights for a product manager
