Should I block /_astro/ in Astro robots.txt?

No. /_astro/ contains the JS and CSS bundles that hydrate islands. Google needs them to render pages. Blocking it hurts rankings.

What is the sitemap URL when using @astrojs/sitemap?

The integration emits /sitemap-index.xml at the site root referencing sitemap-0.xml etc. Add Sitemap: https://yourdomain.com/sitemap-index.xml to robots.txt.

Should I use a static robots.txt file or an endpoint in Astro?

Use the static file in public/ for simple rules. Use the endpoint when you need env-var-based logic, e.g. blocking previews or per-environment rules.

Robots.txt for Astro — Static Site & Dynamic Endpoint Examples

Q: Does Astro generate robots.txt automatically?

No. Astro has no built-in robots.txt convention. Either ship a static file from public/ or define a dynamic endpoint at src/pages/robots.txt.ts.

Add robots.txt to your Astro site: simple static file in /public, or a dynamic endpoint that reads your sitemap config. Plus what NOT to block.

Astro has no special robots convention. The simplest path is a static file in public/; a dynamic endpoint at src/pages/robots.txt.ts works when you need build-time logic.

OPEN GENERATOR →

Option 1: Static File in `public/`

Drop the file into public/robots.txt and Astro copies it as-is to the build output. This is the recommended approach unless you need conditional logic.

# public/robots.txt
User-agent: *
Allow: /

Sitemap: https://yourdomain.com/sitemap-index.xml

If you use the official @astrojs/sitemap integration, the sitemap is emitted at /sitemap-index.xml (with one or more sitemap-N.xml files referenced from it).

Option 2: Dynamic Endpoint at `src/pages/robots.txt.ts`

Astro pages can be plain .ts files that export a GET handler. The file becomes a route, and the response body is whatever you return. This lets you read env vars or compute rules at build/render time.

// src/pages/robots.txt.ts
import type { APIRoute } from 'astro';

const SITE = 'https://yourdomain.com';

const body = `User-agent: *
Allow: /
Disallow: /api/

User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: CCBot
Disallow: /

Sitemap: ${SITE}/sitemap-index.xml
`;

export const GET: APIRoute = () => new Response(body, {
  headers: { 'Content-Type': 'text/plain; charset=utf-8' },
});

The endpoint runs at build time for static output and per request for SSR — both produce the correct /robots.txt URL.

Option 3: Per-Environment Robots.txt

Block crawlers entirely on staging or preview deploys to keep them out of the index:

// src/pages/robots.txt.ts
import type { APIRoute } from 'astro';

export const GET: APIRoute = () => {
  const isProd = import.meta.env.PROD && import.meta.env.PUBLIC_SITE_ENV === 'production';
  const body = isProd
    ? `User-agent: *\nAllow: /\n\nSitemap: https://yourdomain.com/sitemap-index.xml\n`
    : `User-agent: *\nDisallow: /\n`;
  return new Response(body, {
    headers: { 'Content-Type': 'text/plain; charset=utf-8' },
  });
};

Set PUBLIC_SITE_ENV=production in your production deploy environment so non-production builds always disallow everything.

Don't Block `/_astro/`

The /_astro/ directory contains the JavaScript and CSS chunks that hydrate your islands. Google fetches these to render your page during indexing. Blocking the directory in robots.txt will degrade rendering — leave it open.

i18n Routing

If you use Astro's i18n config (locale subpaths like /ja/, /ar/), you don't need separate robots rules per locale. Use hreflang tags in your layout (Astro generates them via the link rel="alternate" pattern) and let crawlers see all locales freely.

Block AI Crawlers

Whether you use the static file or the endpoint approach, the AI crawler block is the same standard robots.txt syntax — five user-agent groups (GPTBot, ClaudeBot, anthropic-ai, Google-Extended, CCBot), each with Disallow: /.

Verify After Build

Run astro build && astro preview and open http://localhost:4321/robots.txt.
Confirm the file is served with Content-Type: text/plain.
After deploy, fetch the production URL and verify the same content.
Submit your sitemap in Google Search Console if you haven't already.

Astro Robots.txt FAQ

Does Astro generate robots.txt automatically?

No. Astro has no built-in robots.txt convention. You either ship a static file from public/ or define a dynamic endpoint at src/pages/robots.txt.ts.

Should I block /_astro/?

No. /_astro/ contains the JS and CSS bundles that hydrate your islands. Google needs them to render the page during indexing. Blocking /_astro/ will hurt rendering and rankings.

What's the sitemap URL when using @astrojs/sitemap?

The integration emits /sitemap-index.xml at the site root, which references one or more sitemap-0.xml, sitemap-1.xml files. Add Sitemap: https://yourdomain.com/sitemap-index.xml to your robots.txt.

Static file or endpoint — which should I use?

Use the static file in public/robots.txt for simple, fixed rules. Switch to the endpoint when you need to read env vars (e.g. blocking previews), generate rules from a config object, or share rule logic with another part of your app.

Will the endpoint work with static output: 'static'?

Yes. Astro pre-renders the endpoint at build time and writes the response body to dist/robots.txt. No SSR runtime is needed.

RELATED TOOLS

SITEMAP XML OG TAGS SCHEMA MARKUP