Fanout Query Analysis

by

in ,

When AI models like Gemini, GPT or Nova answer a question using web search, they don’t just run your query as-is. They generate their own internal search queries, or fanout queries. A single user prompt can trigger multiple fanout queries as the model breaks down the question, explores subtopics and verifies information.

We captured 365,920 of these fanout queries across three providers, Google (Gemini), OpenAI (GPT) and Amazon (Nova), by logging the grounding metadata returned from their APIs during citation mining runs. This data comes from real production workloads across multiple projects, not synthetic benchmarks.

Below is an analysis of how these providers differ in the queries they generate.

ProviderCountAvg CharsMinMax1-3 words4-6 words7+ words
Google158,1865202524.5%30.6%64.9%
OpenAI207,1746063233.4%20.8%75.8%
Amazon56059281980.2%16.2%83.6%
Total~365,9205603233.9%25.0%71.1%

Google (n=158,184)

WordsCount%Cumul%
1530.0%0.0%
21,0920.7%0.7%
35,9943.8%4.5%
414,9169.4%13.9%
517,47111.0%25.0%
615,92310.1%35.1%
718,08011.4%46.5%
820,32512.8%59.3%
920,01312.7%72.0%
1016,96810.7%82.7%
1111,7407.4%90.1%
127,3164.6%94.8%
134,0432.6%97.3%
142,1241.3%98.7%
15+1,1460.7%100.0%

OpenAI (n=207,174)

WordsCount%Cumul%
16160.3%0.3%
23,7151.8%2.1%
32,6911.3%3.4%
47,3603.6%6.9%
514,5167.0%13.9%
621,22110.2%24.2%
726,54412.8%37.0%
828,91214.0%51.0%
927,86113.4%64.4%
1023,35411.3%75.7%
1117,8758.6%84.3%
1212,3396.0%90.3%
137,9833.9%94.1%
144,9592.4%96.5%
15+5,2282.5%100.0%

Amazon (n=560)

WordsCount%Cumul%
310.2%0.2%
440.7%0.9%
5234.1%5.0%
66411.4%16.4%
710218.2%34.6%
811019.6%54.3%
911320.2%74.5%
106411.4%85.9%
11356.2%92.1%
12203.6%95.7%
1391.6%97.3%
1450.9%98.2%
15+101.8%100.0%

POS Distribution by Provider

GroupGoogleOpenAIAmazon
Nouns52.3%58.4%50.2%
Verbs11.3%9.9%8.5%
Adjectives11.0%8.9%18.6%
Prepositions7.4%3.5%10.3%
Wh-words3.6%2.1%1.5%
Numbers2.2%5.3%2.8%
Determiners2.6%1.8%0.1%
Conjunctions1.6%0.6%2.4%
Adverbs0.6%0.7%2.3%
Modals0.7%0.5%0.0%
Pronouns1.2%0.9%0.1%
  • OpenAI is the most noun-heavy (58.4%), especially proper nouns (18.9% vs Google’s 8.6%) — it generates more entity-specific queries
  • Amazon leans heavily into adjectives (18.6% vs ~10% for others) — more descriptive, qualifier-rich queries like “best,” “top,” “most effective”
  • Google uses more wh-words and verbs — generates more question-style queries (“what,” “how,” “which”)
  • OpenAI uses 2x more numbers (5.3%) — likely year references and quantities in queries

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *