# Allow all web crawlers access to all content by default
User-agent: *
Allow: /

# Disallow specific AI/SEO/Data bots
User-agent: Amazonbot
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: meta-externalagent
Disallow: /

User-agent: YouBot
Disallow: /

User-agent: Omgili
Disallow: /

User-agent: Diffbot
Disallow: /

User-agent: cohere-ai
Disallow: /

# Disallow specific SEO/Data bots
User-agent: DataForSeoBot
Disallow: /

User-agent: Seobility
Disallow: /

User-agent: AhrefsBot
Disallow: /

User-agent: Barkrowler
Disallow: /

User-agent: BLEXBot
Disallow: /

User-agent: DotBot
Disallow: /

User-agent: MJ12bot
Disallow: /

User-agent: SemrushBot
Disallow: /

User-agent: Mediatoolkitbot
Disallow: /

User-agent: VelenPublicWebCrawler
Disallow: /

# Specify sitemap location
Sitemap: https://www.signalbloom.ai/sitemap.xml