My instance is getting pummeled by scrapers crawling nonsense. Like issue and pull searches with every single variant of label combinations.
Everything’s coming from a shitload of different residential IPs at a very fast cadence.
There’s just not that much content on my instance to warrant this traffic. It could be scraped in a minute or two like this if it were legitimate traffic.


Yes! Exactly how you describe. They were going through certain repos and parsing every commit. I couldn’t block them because there were loads of different residential IPs and random user-agents. :-(
Well, at least it doesn’t seem targeted, then. Did you do anything to remedy the situation?
Why do you need a self hosted instance open to the World? Mine is behind a CloudFlare rule that allows connections only from a list of IPs, like my self hosted WireGuard instance.
Because I can and I want to?
Leaving the “know how” part for last, I see.
Imagine never learning through trying things. Also, you’re on Lemmy arguing against self-hosting.