updated robots.txt by TimLFletcher · Pull Request #884 · couchbase/docs-site

TimLFletcher · 2026-03-30T12:00:05Z

As per the ticket: https://jira.issues.couchbase.com/browse/DOC-14190

_This is mostly political and boiler plate. We already allow blanket access for agents but many tools that critique AI discoverability will not recognise this or penalise us for not having more granular rules.

Since it doesn’t actually harm anything to implement such rules… I’m going to suggest we just do it._

Let me know if you think this is a bad idea.

osfameron · 2026-03-30T15:15:49Z

+    User-agent: GPTBot
+    User-agent: PerplexityBot
+    User-agent: Google-Extended
+    Allow: /


does this override the all-agent setting below?
We still want to disallow /sdk-api/ (to encourage crawlers to only crawl the most recent versions)

It's a bit of a black box and likely depends on how each specific crawler behaves.

The AI discoverability tools are yelling at me about disallowing .md now because there's no explicit allowance for crawlers to see it. When I looked up how these rules work there's a handwaving "more explicit rules overrule more general rules".

So I've made some changes. Now it gives a set of explicit rules for the bots and then a set of explicit rules for general user agents. I think this should work but.... I dunno. Take a look.

osfameron · 2026-03-30T15:16:32Z

@@ -0,0 +1,81 @@
+# npx antora --clean --fetch antora-playbook.yml


did you mean to check in this local playbook?

I did not. Unsure why this one change persisted but I've cleaned it up.

osfameron · 2026-04-09T12:58:07Z

+    Allow: /sdk-api/couchbase-core-io/
+    Allow: /sdk-api/couchbase-transactions-dotnet/
+    # Sitemap and LLM index
+    Sitemap: https://docs.couchbase.com/sitemap.xml


Is the Sitemap directive "top-level" or also scoped under the User-agent bracket? 🤔
If latter, might be worth repeating for *

I don't think it matters but the redundancy is unlikely to hurt so we'll pop it in for now.

TimLFletcher · 2026-04-09T12:58:12Z

Made some further changes to try and make the robots.txt permissions as explicit and granular as possible.

osfameron

Looks OK, I still don't fully understand why this is needed, but doesn't look like it'll do any harm, and the explicit Sitemap: directive is a good call 👍

updated robots.txt

799830c

TimLFletcher requested a review from osfameron March 30, 2026 12:13

osfameron reviewed Mar 30, 2026

View reviewed changes

TimLFletcher added 2 commits April 9, 2026 13:47

cleanup and tweaks

e9270ca

super explicit robots.txt

3523be2

osfameron reviewed Apr 9, 2026

View reviewed changes

TimLFletcher requested a review from osfameron April 9, 2026 12:58

osfameron approved these changes Apr 9, 2026

View reviewed changes

redundancy in sitemap declare

13cf38a

TimLFletcher merged commit 7954d61 into master Apr 10, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

updated robots.txt#884

updated robots.txt#884
TimLFletcher merged 4 commits intomasterfrom
DOC-14190

TimLFletcher commented Mar 30, 2026

Uh oh!

osfameron Mar 30, 2026

Uh oh!

TimLFletcher Apr 9, 2026

Uh oh!

osfameron Mar 30, 2026

Uh oh!

TimLFletcher Apr 9, 2026

Uh oh!

osfameron Apr 9, 2026

Uh oh!

TimLFletcher Apr 9, 2026

Uh oh!

TimLFletcher commented Apr 9, 2026

Uh oh!

osfameron left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1,81 @@
		# npx antora --clean --fetch antora-playbook.yml

Conversation

TimLFletcher commented Mar 30, 2026

Uh oh!

osfameron Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

TimLFletcher Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

osfameron Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

TimLFletcher Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

osfameron Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

TimLFletcher Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

TimLFletcher commented Apr 9, 2026

Uh oh!

osfameron left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants