Sona

Google-Extended

A robots.txt token — not a separate crawler. Controls AI training use without affecting Google Search.

OperatorGoogle
PowersGemini and Vertex AI generative training
PurposeTraining opt-out token
User-agent tokenGoogle-Extended
Respects robots.txtYes

Google-Extended is not a distinct bot with its own user-agent traffic. It is a robots.txt control token that tells Google whether content it already crawls may be used to train and ground generative models like Gemini.

Crucially, disallowing Google-Extended does NOT remove you from Google Search and does not affect your rankings. It only governs generative-AI training use. Googlebot indexing is controlled separately.

Allow Google-Extended

Your content can be used to improve and ground Google's generative AI products.

User-agent: Google-Extended
Allow: /

Block Google-Extended

You want to stay in Google Search but opt out of generative-AI training — block Google-Extended while leaving Googlebot allowed.

User-agent: Google-Extended
Disallow: /

Can Google-Extended read your page right now?

Test any URL and see exactly what AI crawlers receive.

Check my site