The Keyword
Gemini 3 Pro: the frontier of vision AI

Image with black background and Gemini 3 Pro logo
Vision AI benchmarks table
Input image of an old merchant's handbook ledger alongside an output image showing a clearly reconstructed transcription

Example 1: Complex handwritten table from an 18th-century Albany Merchant’s Handbook

Input image of a scan of an equation alongside an output of the model solving the equation

Example 2: Reconstructing equations from an image

Image showing a scanned diagram as input and an interactive chart as output

Example 3: Reconstructing Florence Nightingale's original Polar Area Diagram into an interactive chart (with a toggle!)

PDF image highlighting the numbers -1.2 and 3.2

Visual Extraction: To answer the Gini Index Comparison question, Gemini located and cross-referenced two statements: Figure 3 states that “Money Income decreased by 1.2 percent” and Table B-3 states that “Post-Tax Income increased by 3.2 percent”

PDF image highlighting the ARPA policies lapsing in 2021 and the stimulus payments ending

Causal Logic: Crucially, Gemini 3 does not stop at the numbers; it correlates this gap with the text’s policy analysis, correctly identifying the Lapse of ARPA Policies and the end of Stimulus Payments as the main causes.

PDF image highlighting the numbers 2.9 and 3.0 for 2021 and 2022 respectively

Numerical Comparison: To determine whether the lowest quintile’s share was rising or falling, Gemini 3 looked at Table A-3, compared the values 2.9 (2021) and 3.0 (2022), and concluded that “the share of aggregate household income held by the lowest quintile was rising.”

Final model response text

Final Model Answer

Image showing a cluttered box, a bottle, a screwdriver, a pouch and a measuring tape on a table. A line drawn by Gemini 3 Pro traces a clear path between the measuring tape and the box
A picture of a cluttered kitchen counter with open cabinets. Three lines drawn by Gemini 3 Pro show the trajectories from the mug, the glass and the bowl to the specific spots in the cabinet where they should go
Picture of a circuit board with each distinct item labeled by Gemini 3 Pro

By processing video at 10 FPS (10x the default sampling rate), Gemini 3 Pro catches every swing and shift in weight, unlocking deep insights into player mechanics.
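
To make the difference in frame rate concrete, here is a minimal sketch of uniform frame sampling. It assumes the default is 1 FPS (inferred from the "10x" statement above) and that frames are taken at evenly spaced timestamps. The function name `sample_timestamps` is hypothetical and is not part of any Gemini API.

```python
def sample_timestamps(duration_s: float, fps: float) -> list[float]:
    """Return the timestamps (in seconds) of frames sampled uniformly at `fps`.

    Illustrative only: assumes evenly spaced sampling starting at t = 0.
    """
    if duration_s < 0 or fps <= 0:
        raise ValueError("duration_s must be >= 0 and fps must be > 0")
    frame_count = int(duration_s * fps)  # whole frames that fit in the clip
    return [i / fps for i in range(frame_count)]


# A 2-second swing: assumed 1 FPS default versus 10 FPS.
default_frames = sample_timestamps(2.0, 1)
dense_frames = sample_timestamps(2.0, 10)
print(len(default_frames), len(dense_frames))
```

Under these assumptions, a two-second motion yields only a couple of frames at the default rate but twenty at 10 FPS, which is why fast actions such as a swing become visible to the model.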

Prompt: “Here is a photo of my homework attempt. Please check my steps and tell me where I went wrong. Instead of explaining in text, show me visually on my image.” (Note: Student work is shown in blue; model corrections are shown in red). [See prompt in Google AI Studio]

Image showing input of a handwritten equation on the left and the model's correction annotated on top of the handwritten equation

Input image from MicroVQA - a benchmark for microscopy-based biological research

Image showing a stained kidney cortex image on the left and the model prompt and response on the right

Gemini 3 Pro is not intended for clinical diagnosis or patient care and is not a substitute for professional medical advice.
