Be a Good Maven Citizen: Don't Scrape the Whole Central Repository

By

1 minute read time

For the last few nights, we have had some particularly bad Maven citizens averaging almost 3,000 connections in a 10-minute period. This particular abuser comes from the Netherlands, and this is the first time I have completely blocked an IP permanently from the Central Repository (I'm sure they will just use another one, but it's a start). I've started trying to track down who the IP (82.94.207.11) belongs to. I have a pretty good suspicion who it is.

We're happy to work with groups who want to mirror the repository using rsync provided you are using that mirror to service other Maven users. Trying to scape Maven Central cripples the primary feeder to all the mirrors and the synchronization with other open source organizations we work with. By opening 3,000 connections to Central and scraping it you screw every other Maven user on the planet, you idiots. Anyone who knows me knows how rabidly tenacious I am, and I will track down every IP you have and create a blacklist that every Maven mirror and Maven repository manager will just drop connections to. If you make the life of Maven users more difficult than necessary, I will find a way to do the same to you.

Picture of Jason van Zyl

Written by Jason van Zyl

Jason is a co-founder and the former CTO of Sonatype.

Tags