Skip to main content
The Keyword
Build with Gemini Deep Research
["How does Gemini work in Google Maps?", "What is quantum computing?", "What are the camera features on Pixel 10?"]

Build with Gemini Deep Research

Gemini Deep Research Agent Text logo
Listen to article
This content is generated by Google AI. Generative AI is experimental
[[duration]] minutes

Gemini Deep Research achieves state-of-the-art 46.4% on the full Humanity’s Last Exam (HLE) set, 66.1% on DeepSearchQA and a high 59.2% on BrowseComp

Benchmark showcase DeepSearchQA, Humanity's Last Exam and BrowseComp.

Comparing pass@8 vs. pass@1 results demonstrates the value of letting the agent explore multiple parallel trajectories for answer verification. These results were computed on a 200-prompt subset of DeepSearchQA.

Inference Time Scaling

Let’s stay in touch. Get the latest news from Google in your inbox.

Subscribe