# About the scanner bot

## User-Agent

```
CanAISeeBot/1.0 (+https://canaisee.com/about-bot)
```

## What it does

When a visitor enters your URL, our scanner fetches a small set of files
from your site: the canonical URL, `/robots.txt`, `/llms.txt`,
`/ai.txt`, `/sitemap.xml`, `/.well-known/mcp.json`,
`/.well-known/agent-card.json`, and an optional `.md` mirror. For one
check it additionally loads the canonical URL in a headless Chromium
instance to compare the rendered text to the plain HTML.

## Rate and politeness

The scanner honors `robots.txt`. Each scan produces at most a handful of
requests against your origin, and we cache results for several minutes so
repeated shares of a scorecard don't retrigger fetches. We are subject to
a per-URL cooldown (currently five minutes).

## Opting out

If you do not want your site scannable by canaisee, you can:

- Block the `CanAISeeBot` user-agent in `robots.txt`.
- Request a takedown by emailing `hello@evangent.org` with the URL.

## Source

canaisee is open-source at https://github.com/8gara8/agentread under the
MIT license.
