Fedizens who self-host services, do you:
-
Fedizens who self-host services, do you:
- Have a service exposed on a domain name matching
knowledge.DOMAIN? - Notice heavy waves of crawlers accessing this specific domain?
- (optional) Use
iocaineor a similar poisoning tool?
I'm also interested if you have multiple subdomains protected by
iocaineand notice one in particular gets hit.
- Have a service exposed on a domain name matching
-
Fedizens who self-host services, do you:
- Have a service exposed on a domain name matching
knowledge.DOMAIN? - Notice heavy waves of crawlers accessing this specific domain?
- (optional) Use
iocaineor a similar poisoning tool?
I'm also interested if you have multiple subdomains protected by
iocaineand notice one in particular gets hit.
The reason why I ask about "knowledge" in particular is that I saw several LLM websites using that domain name for their stuff, so I wonder if they think I host some kind of LLM and thus go on this domain name in particular
- Have a service exposed on a domain name matching
-
The reason why I ask about "knowledge" in particular is that I saw several LLM websites using that domain name for their stuff, so I wonder if they think I host some kind of LLM and thus go on this domain name in particular
@Soblow I wonder if it’s because “knowledge base”
I’ll add one after work and report back if it’s hit
-
@Soblow I wonder if it’s because “knowledge base”
I’ll add one after work and report back if it’s hit
@ChlorideCull Yeah, that's why I wonder if the domain name matters
I'm highly interested in your results!
-
Fedizens who self-host services, do you:
- Have a service exposed on a domain name matching
knowledge.DOMAIN? - Notice heavy waves of crawlers accessing this specific domain?
- (optional) Use
iocaineor a similar poisoning tool?
I'm also interested if you have multiple subdomains protected by
iocaineand notice one in particular gets hit.
@Soblow the general consensus seems to be the AI scrapers are using certificate transparency logs to find subdomains to scrape
- Have a service exposed on a domain name matching
-
@Soblow the general consensus seems to be the AI scrapers are using certificate transparency logs to find subdomains to scrape
@niko yup, but not all my domains are scrapped equally, why?
-
@Soblow the general consensus seems to be the AI scrapers are using certificate transparency logs to find subdomains to scrape
-
Fedizens who self-host services, do you:
- Have a service exposed on a domain name matching
knowledge.DOMAIN? - Notice heavy waves of crawlers accessing this specific domain?
- (optional) Use
iocaineor a similar poisoning tool?
I'm also interested if you have multiple subdomains protected by
iocaineand notice one in particular gets hit.
- Have a service exposed on a domain name matching
-
we don't have a
knowledgesubdomain but we do use Iocaine to poison the relatively large proportion of scrapers -
we don't have a
knowledgesubdomain but we do use Iocaine to poison the relatively large proportion of scrapers -
@niko yup, but not all my domains are scrapped equally, why?
@Soblow random chance afaict
-
across all of them, even across two domains; They used to use HTTPs transparency logs to find any new subdomain to insta-spam them
-
0 0mega@sk.zehnvorne.social shared this topic