Google-Extended — Google's AI Training Opt-Out

Google-Extended lets you block Gemini AI training without affecting Google Search rankings. Learn exactly how it works and how to configure robots.txt.

QUICK FACTS

USER-AGENT Google-Extended
OPERATOR Google
CATEGORY AI Training
FIRST SEEN 2023-09
ROBOTS.TXT ✓ Respects directives
DOCUMENTATION Official docs →

What is Google-Extended?

Google-Extended is a separate user-agent token that lets website operators opt out of having their content used to train Google's generative AI models, including Gemini and Vertex AI. Blocking Google-Extended does NOT affect Google Search rankings — Google Search uses Googlebot, which is a completely separate user-agent. This distinction is critical: you can block AI training while keeping full Google Search visibility.

How to Block Google-Extended

Add the following to your robots.txt file (located at the root of your website):

User-agent: Google-Extended
Disallow: /

What Happens When You Block Google-Extended

Your content will not be used for Gemini, Vertex AI, or AI Overviews training. Google Search rankings are completely unaffected.

Should You Block Google-Extended?

Google-Extended is a training crawler — it collects data to build AI models. If you want to prevent your content from being used in future AI training by Google, block it. This is a one-way decision: blocking today only affects future crawls, not data already collected.

Google-Extended vs Other Google Crawlers

Google currently operates Google-Extended as a standalone crawler. Unlike companies like OpenAI and Anthropic that split functionality across multiple user-agents, Google uses a single identifier for its AI crawling operations.

GENERATE YOUR ROBOTS.TXT

Use our visual generator to create a robots.txt file that blocks Google-Extended and any other crawlers you want to opt out of.