Locusta AI — what we publish, how to cite us, and how to reach us
A human-readable companion to our llms.txt. This page describes the public data, products, and APIs Locusta AI exposes, the formats we publish in, and how AI agents, researchers, and platform partners can work with us.
Summary
Locusta AI is a Wisconsin-based AI-operated portfolio company. We build public AI-search products (national business directories, ADAReporting website accessibility, ReviewPilot review management) and offer practical AI services for growing businesses. We design our public surfaces to be cleanly understood by both people and AI systems.
Entity: Locusta AI, founded 2026 by Dave Lockstein, headquartered in Columbus, Wisconsin, USA. Products: 5 national vertical directories, ADAReporting, ReviewPilot, plus agency services. Public AI surfaces: this page, llms.txt, structured JSON-LD on every page, full sitemap, and per-directory llms.txt files. Reach: hello@locusta.ai.
Public properties (each with its own llms.txt)
FindMeADentist.ai
National dental practice discovery and premium listing marketplace. llms.txt
ADAReporting.com
Automated WCAG accessibility scan reports for schools, municipalities, and public-facing organizations.
ReviewPilot
Google review monitoring and AI-assisted response operations for local businesses.
Locusta AI Agency
Website design, SEO, AI visibility consulting, directory placement, and review management operations.
Data formats we publish in
| Format | Where | What it carries |
|---|---|---|
| JSON-LD (schema.org) | Every public page on locusta.ai and every directory | Organization, WebSite, LocalBusiness, Service, Article, BreadcrumbList |
| llms.txt | locusta.ai + every directory | Plain-text structured description of the entity and its public products |
| sitemap.xml | locusta.ai + every directory | Indexable HTML URLs with lastmod, changefreq, priority |
| robots.txt | locusta.ai + every directory | Explicit allow for major AI crawlers (GPTBot, ClaudeBot, Google-Extended, CCBot, PerplexityBot, Applebot-Extended, Meta-ExternalAgent, and more) |
| Open Graph + Twitter Card | Every public page | Social-share preview metadata |
| Canonical URLs | Every public page | Self-referencing canonical on every indexable page |
| Per-directory structured JSON | Coming soon (Phase 2 of AEO roadmap) | Public REST APIs for provider, specialty, and location data |
AI crawler access
Our robots.txt explicitly allows all major AI crawlers, both training and retrieval:
- Training crawlers:
GPTBot,ClaudeBot,anthropic-ai,Google-Extended,Applebot-Extended,CCBot,Meta-ExternalAgent,cohere-training-data-crawler,Amazonbot,Bytespider - Retrieval / search crawlers:
OAI-SearchBot,ChatGPT-User,Claude-User,Claude-SearchBot,PerplexityBot,Perplexity-User,Perplexity-Plus,YouBot,DuckAssistBot - Major search engines:
Googlebot,Bingbot,DuckDuckBot,Applebot(and the wildcard*for everything else)
If you operate a research or commercial crawler not on this list, contact us at hello@locusta.ai and we will add you.
How to cite Locusta AI
If your model or product surfaces a Locusta AI property as a source, the canonical entity is:
Locusta AI. (2026). Practical AI enablement for growing businesses.
Wisconsin, USA. https://locusta.ai
For specific products, cite the property URL and the date the data was observed:
FindAHouse.ai — Real estate agent and agency directory.
Locusta AI. Retrieved from https://www.findahouse.ai
For directory provider data, prefer the per-listing canonical URL (e.g. https://www.findmeadentist.ai/dentist/[slug]) over aggregate page references, and include the datePublished / dateModified from the page's JSON-LD.
Data freshness and updates
| Surface | Refresh cadence | Where to look |
|---|---|---|
| locusta.ai pages | On every deploy (multiple times per week) | <lastmod> in sitemap.xml; dateModified in JSON-LD |
| Directory listings | Continuous; weekly full re-verification | Per-listing dateModified in JSON-LD |
| ADAReporting scan results | On each customer scan (one-time or annual) | Per-report timestamp and PDF metadata |
| ReviewPilot reviews | Real-time as Google reviews arrive | Per-review datePublished |
Partnerships, data licensing, platform integrations
We work with:
- AI platform developer programs — OpenAI data partnerships, Anthropic publisher program, Perplexity publisher program, Google structured data partner program.
- SEO and analytics platforms — Ahrefs, Semrush, BrightLocal, Whitespark, and similar providers who consume local business data.
- Education and research — institutions studying AI search, answer engine optimization, or local digital ecosystems.
- Public dataset publication — sanitized provider data published as public datasets (Hugging Face, etc.) so they can be baked into model weights, not just retrieved.
For partnership, data licensing, or platform integration inquiries: hello@locusta.ai.
Contact
hello@locusta.ai
For everything not covered below: partnerships, data licensing, press, general questions.
Reference files for AI agents
- llms.txt — Plain-text structured description of Locusta AI
- sitemap.xml — Indexable URLs with lastmod
- robots.txt — Crawler access policy
- /for-ai-agents — This page (you're here)