---
title: "AI Directives · Nuxt Robots · Nuxt SEO"
meta:
  "og:description": "Control how AI systems interact with your content using Content-Usage and Content-Signal directives."
  "og:title": "AI Directives · Nuxt Robots · Nuxt SEO"
  description: "Control how AI systems interact with your content using Content-Usage and Content-Signal directives."
---

**Core Concepts**

# **AI Directives**

Last updated **Jan 28, 2026** by [Harlan Wilton](https://github.com/harlan-zw) in [chore: repo clean up / sync](https://github.com/nuxt-modules/robots/commit/05cd53f8396872c518aae32c86d98a9bdbe8431b).

[Copy for LLMs

AI Directives allow you to express preferences about how AI systems, search engines, and automated tools should interact with your content. Two standards are supported:

- **[**Content-Usage**](https://ietf-wg-aipref.github.io/drafts/draft-ietf-aipref-vocab.html)** - IETF standard with broader automation categories
- **[**Content-Signal**](https://contentsignals.org/)** - Cloudflare's widely-deployed implementation focused on AI use cases

Both can be used together in your robots.txt file and are enforced through the robots.txt protocol.

[](https://nuxtseo.com/tools/robots-txt-generator)**Test AI crawler blocking** - Our [**Robots.txt Generator**](https://nuxtseo.com/tools/robots-txt-generator) includes presets for GPTBot, ClaudeBot, and other AI crawlers.

**Important:** AI directives rely on voluntary compliance by crawlers and AI systems. They are not enforced by web servers and should be combined with other protection methods for sensitive content.

## [Content-Usage (IETF aipref-vocab)](#content-usage-ietf-aipref-vocab)

The Content-Usage directive follows the [**IETF AI Preferences specification**](https://ietf-wg-aipref.github.io/drafts/draft-ietf-aipref-vocab.html), providing a standardized way to express automation preferences.

### [Categories](#categories)

| **Category** | **Description** | **Example Use Case** |
| --- | --- | --- |
| `train-ai` | Foundation Model Production | Training large language models |

### [Values](#values)

- `y` - Allow this category of use
- `n` - Disallow this category of use

### [Syntax](#syntax)

robots.txt

```
User-agent: *
Content-Usage: <category>=<value>[, <category>=<value>]
Content-Usage: /path/ <category>=<value>[, <category>=<value>]
```

### [Examples](#examples)

#### [Block AI Training Globally](#block-ai-training-globally)

robots.txt

```
User-agent: *
Allow: /
Content-Usage: train-ai=n
```

#### [Allow Bots, Block AI Training](#allow-bots-block-ai-training)

robots.txt

```
User-agent: *
Allow: /
Content-Usage: bots=y, train-ai=n
```

#### [Path-Specific Rules](#path-specific-rules)

robots.txt

```
User-agent: *
Allow: /
Content-Usage: train-ai=n
Content-Usage: /docs/ train-ai=y
Content-Usage: /api/ train-ai=n
```

### [Programmatic Configuration](#programmatic-configuration)

**Object Format (Recommended)** - Type-safe with autocomplete:

nuxt.config.ts

```
export default defineNuxtConfig({
  robots: {
    groups: [
      {
        userAgent: '*',
        allow: '/',
        contentUsage: {
          'train-ai': 'n'
        }
      }
    ]
  }
})
```

## [Content-Signal (Cloudflare/IETF aipref-contentsignals)](#content-signal-cloudflareietf-aipref-contentsignals)

Content-Signal is [**Cloudflare's implementation**](https://blog.cloudflare.com/content-signals-policy/) based on [**IETF aipref-contentsignals**](https://www.ietf.org/archive/id/draft-romm-aipref-contentsignals-00.html).

### [Categories](#categories-1)

| **Category** | **Description** | **Example Use Case** |
| --- | --- | --- |
| `search` | Search Applications | Indexing for search results and snippets |
| `ai-input` | AI Input | RAG, grounding, generative AI search answers |
| `ai-train` | AI Training | Training or fine-tuning AI models |

### [Values](#values-1)

- `yes` - Allow this category of use
- `no` - Disallow this category of use

### [Syntax](#syntax-1)

robots.txt

```
User-agent: *
Content-Signal: <category>=<value>[, <category>=<value>]
Content-Signal: /path/ <category>=<value>[, <category>=<value>]
```

### [Examples](#examples-1)

#### [Block AI Training, Allow Search](#block-ai-training-allow-search)

robots.txt

```
User-agent: *
Allow: /
Content-Signal: ai-train=no, search=yes
```

#### [Block All AI Usage](#block-all-ai-usage)

robots.txt

```
User-agent: *
Allow: /
Content-Signal: ai-train=no, ai-input=no, search=yes
```

#### [Path-Specific Rules](#path-specific-rules-1)

robots.txt

```
User-agent: *
Allow: /
Content-Signal: ai-train=no, search=yes
Content-Signal: /docs/ ai-input=yes
Content-Signal: /api/ ai-train=no, ai-input=no, search=no
```

### [Programmatic Configuration](#programmatic-configuration-1)

**Object Format (Recommended)** - Type-safe with autocomplete:

nuxt.config.ts

```
export default defineNuxtConfig({
  robots: {
    groups: [
      {
        userAgent: '*',
        allow: '/',
        contentSignal: {
          'ai-train': 'no',
          'search': 'yes'
        }
      }
    ]
  }
})
```

## [Vendor-Specific AI Tokens](#vendor-specific-ai-tokens)

While `Content-Usage` and `Content-Signal` are emerging standards, some major AI providers offer specific User-Agent tokens to opt-out of AI training while maintaining search visibility.

These are highly effective as they are strictly adhered to by their respective companies.

### [Google-Extended](#google-extended)

[**Google-Extended**](https://developers.google.com/search/docs/crawling-indexing/overview-google-crawlers#google-extended) is a standalone token that allows you to control whether your content is used to help improve Google's generative AI APIs and services (Gemini, Vertex AI).

- **Does NOT** affect your site's ranking in Google Search.
- **Does NOT** stop Googlebot from crawling your site for indexing.

robots.txt

```
User-agent: Google-Extended
Disallow: /
```

### [Applebot-Extended](#applebot-extended)

[**Applebot-Extended**](https://support.apple.com/en-us/119829) allows you to opt-out of having your website content used to train Apple's foundation models that power generative AI features across Apple products.

- **Does NOT** affect your site's ranking in Apple Search (Spotlight, Siri).
- **Does NOT** stop Applebot from crawling your site.

robots.txt

```
User-agent: Applebot-Extended
Disallow: /
```

### [Dataset Crawlers](#dataset-crawlers)

Some crawlers are specifically designed to build massive datasets used for training many different AI models. Blocking these can be a broad-stroke approach to AI protection.

- **CCBot (Common Crawl)**: Used by many open-source and commercial models (including early GPT versions).
- **Bytespider**: Crawler for ByteDance (TikTok) AI models.
- **Diffbot**: AI-powered knowledge extraction.

robots.txt

```
User-agent: CCBot
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: Diffbot
Disallow: /
```

## [Using Both Together](#using-both-together)

You can use both Content-Usage and Content-Signal in the same robots.txt for comprehensive coverage:

robots.txt

```
User-agent: *
Allow: /
Content-Usage: bots=y, train-ai=n
Content-Signal: ai-train=no, search=yes
```

```
export default defineNuxtConfig({
  robots: {
    groups: [
      {
        userAgent: '*',
        allow: '/',
        contentUsage: {
          'train-ai': 'n'
        },
        contentSignal: {
          'ai-train': 'no',
          'search': 'yes'
        }
      }
    ]
  }
})
```

```
export default defineNuxtConfig({
  robots: {
    groups: [
      {
        userAgent: '*',
        allow: '/',
        contentUsage: ['bots=y, train-ai=n'],
        contentSignal: ['ai-train=no, search=yes']
      }
    ]
  }
})
```

## [Examples](#examples-2)

### [Block All AI Training](#block-all-ai-training)

```
User-agent: *
Allow: /
Content-Usage: train-ai=n
```

```
User-agent: *
Allow: /
Content-Signal: ai-train=no
```

### [Documentation-Only Training](#documentation-only-training)

```
User-agent: *
Allow: /
Content-Usage: train-ai=n
Content-Usage: /docs/ train-ai=y
```

```
User-agent: *
Allow: /
Content-Signal: ai-train=no
Content-Signal: /docs/ ai-train=yes
```

[Edit this page](https://github.com/nuxt-modules/robots/edit/main/docs/content/2.guides/5.ai-directives.md)

[Markdown For LLMs](https://nuxtseo.com/docs/robots/guides/ai-directives.md)

**Did this page help you? **

[**Bot Detection** Detect and classify bots with server-side header analysis and client-side browser fingerprinting.](https://nuxtseo.com/docs/robots/guides/bot-detection) [**Yandex: Clean-param** Learn how to use the `clean-param` directive to remove query parameters from URLs with Yandex.](https://nuxtseo.com/docs/robots/advanced/yandex)

**On this page**

- [Content-Usage (IETF aipref-vocab)](#content-usage-ietf-aipref-vocab)
- [Content-Signal (Cloudflare/IETF aipref-contentsignals)](#content-signal-cloudflareietf-aipref-contentsignals)
- [Vendor-Specific AI Tokens](#vendor-specific-ai-tokens)
- [Using Both Together](#using-both-together)
- [Examples](#examples-2)