Google AI Build with Gemini Deep Research : US Pioneer Global VC DIFCHQ SFO NYC Singapore – Riyadh Swiss Our Mind

We have reimagined Gemini Deep Research to be more powerful than ever. It is now accessible to developers via the new Interactions API, launching alongside DeepSearchQA, a benchmark for complex web search tasks.

Benchmark showcase DeepSearchQA, Humanity's Last Exam and BrowseComp.

Gemini Deep Research achieves state-of-the-art 46.4% on the full Humanity’s Last Exam (HLE) set, 66.1% on DeepSearchQA and a high 59.2% on BrowseComp

Inference Time Scaling

Comparing pass@8 vs. pass@1 results demonstrates the value of letting the agent explore multiple parallel trajectories for answer verification. These results were computed on a 200-prompt subset of DeepSearchQA.