Mastodon Skip to content
  • Home
  • Aktuell
  • Tags
  • Über dieses Forum
Einklappen
Grafik mit zwei überlappenden Sprechblasen, eine grün und eine lila.
Abspeckgeflüster – Forum für Menschen mit Gewicht(ung)

Kostenlos. Werbefrei. Menschlich. Dein Abnehmforum.

  1. Home
  2. Uncategorized
  3. To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

Geplant Angeheftet Gesperrt Verschoben Uncategorized
openstreetmapbotsabuse
41 Beiträge 26 Kommentatoren 0 Aufrufe
  • Älteste zuerst
  • Neuste zuerst
  • Meiste Stimmen
Antworten
  • In einem neuen Thema antworten
Anmelden zum Antworten
Dieses Thema wurde gelöscht. Nur Nutzer mit entsprechenden Rechten können es sehen.
  • osm_tech@en.osm.townO osm_tech@en.osm.town

    To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

    If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
    🙏🌍 #AI #Bots #Abuse

    jonsaenzagirre@mastodon.eusJ This user is from outside of this forum
    jonsaenzagirre@mastodon.eusJ This user is from outside of this forum
    jonsaenzagirre@mastodon.eus
    schrieb zuletzt editiert von
    #19

    @osm_tech question. Why do people scrape server which make the data freely available? And, probably, better structured in the final product. I don't see the point.

    osm_tech@en.osm.townO vampirdaddy@chaos.socialV 2 Antworten Letzte Antwort
    0
    • jonsaenzagirre@mastodon.eusJ jonsaenzagirre@mastodon.eus

      @osm_tech question. Why do people scrape server which make the data freely available? And, probably, better structured in the final product. I don't see the point.

      osm_tech@en.osm.townO This user is from outside of this forum
      osm_tech@en.osm.townO This user is from outside of this forum
      osm_tech@en.osm.town
      schrieb zuletzt editiert von
      #20

      @JonSaenzAgirre It is a good questions, and we don't know the answer either. Our planet data is so much easier to process and use.

      ff7@freiburg.socialF 1 Antwort Letzte Antwort
      0
      • utf_7@mastodon.socialU utf_7@mastodon.social

        @osm_tech tHeN yOu jUsT neEd tO sCaLe

        osm_tech@en.osm.townO This user is from outside of this forum
        osm_tech@en.osm.townO This user is from outside of this forum
        osm_tech@en.osm.town
        schrieb zuletzt editiert von
        #21

        @utf_7 In this economy with RAM prices what they are?!? 😉

        1 Antwort Letzte Antwort
        0
        • osm_tech@en.osm.townO osm_tech@en.osm.town

          To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

          If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
          🙏🌍 #AI #Bots #Abuse

          gme@bofh.socialG This user is from outside of this forum
          gme@bofh.socialG This user is from outside of this forum
          gme@bofh.social
          schrieb zuletzt editiert von
          #22

          @osm_tech@en.osm.town
          Could something like Anubis help you guys?

          1 Antwort Letzte Antwort
          0
          • wando@troet.cafeW wando@troet.cafe shared this topic
          • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

            @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

            jay0@alico.nexusJ This user is from outside of this forum
            jay0@alico.nexusJ This user is from outside of this forum
            jay0@alico.nexus
            schrieb zuletzt editiert von
            #23

            @HunterZ@mastodon.sdf.org @osm_tech@en.osm.town lots of mobile/desktop apps, browser extensions, and even IoT devices are paid by "residential proxy" companies to prey on their users by selling said users's connections to AI scrapers https://www.spamhaus.org/resource-hub/compromised/lets-talk-about-the-danger-of-residential-proxy-networks/

            1 Antwort Letzte Antwort
            0
            • ryanprior@mastodon.socialR ryanprior@mastodon.social

              @HunterZ @osm_tech this is actually quite common. Mobile advertising SDKs for games, background apps, etc include residential scraping proxy functionality that they can sell to the highest bidder, and then when scrapers want to avoid restrictions they can pay a fraction of a penny to send their requests via your phone. Millions of people use apps with this built in and have no idea. Most websites don't want to ban the residential scrapers because it can hurt growth.

              olbohlen@norden.socialO This user is from outside of this forum
              olbohlen@norden.socialO This user is from outside of this forum
              olbohlen@norden.social
              schrieb zuletzt editiert von
              #24

              @ryanprior @HunterZ @osm_tech I have that scraping also on my private webserver and it forced me to make a whole bunch of content private. yet still the botnet scrapes onto it and gets 404s now. Every single request from a different IP...

              ryanprior@mastodon.socialR 1 Antwort Letzte Antwort
              0
              • osm_tech@en.osm.townO osm_tech@en.osm.town

                To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                🙏🌍 #AI #Bots #Abuse

                sadmin@social.tchncs.deS This user is from outside of this forum
                sadmin@social.tchncs.deS This user is from outside of this forum
                sadmin@social.tchncs.de
                schrieb zuletzt editiert von
                #25

                @osm_tech one day if you'd like to switch to nginx, I lend you a hand if you have a specific problem

                1 Antwort Letzte Antwort
                0
                • osm_tech@en.osm.townO osm_tech@en.osm.town

                  To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                  If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                  🙏🌍 #AI #Bots #Abuse

                  zymurgic@mastodon.onlineZ This user is from outside of this forum
                  zymurgic@mastodon.onlineZ This user is from outside of this forum
                  zymurgic@mastodon.online
                  schrieb zuletzt editiert von
                  #26

                  @osm_tech I wonder if the culprit will ever come forward, apologise, and change their ways? Someone tasked these proxy scrapers with ridiculous requests.
                  Have they been targeting the main OSM API, the website interface designed for humans, or Overpass?

                  grechaw@sfba.socialG 1 Antwort Letzte Antwort
                  0
                  • olbohlen@norden.socialO olbohlen@norden.social

                    @ryanprior @HunterZ @osm_tech I have that scraping also on my private webserver and it forced me to make a whole bunch of content private. yet still the botnet scrapes onto it and gets 404s now. Every single request from a different IP...

                    ryanprior@mastodon.socialR This user is from outside of this forum
                    ryanprior@mastodon.socialR This user is from outside of this forum
                    ryanprior@mastodon.social
                    schrieb zuletzt editiert von
                    #27

                    @olbohlen @HunterZ @osm_tech sad to hear that! It's wild though, you can sign up for a scraper proxy service in minutes. They're legal, inexpensive, and easy to use. Admins who assume scrapers are using their own machines that inauthentic traffic will come from a few IP addresses are sadly living in the past.

                    olbohlen@norden.socialO 1 Antwort Letzte Antwort
                    0
                    • ryanprior@mastodon.socialR ryanprior@mastodon.social

                      @olbohlen @HunterZ @osm_tech sad to hear that! It's wild though, you can sign up for a scraper proxy service in minutes. They're legal, inexpensive, and easy to use. Admins who assume scrapers are using their own machines that inauthentic traffic will come from a few IP addresses are sadly living in the past.

                      olbohlen@norden.socialO This user is from outside of this forum
                      olbohlen@norden.socialO This user is from outside of this forum
                      olbohlen@norden.social
                      schrieb zuletzt editiert von
                      #28

                      @ryanprior @HunterZ @osm_tech sure I could, but I refuse to put my selfhosted stuff behind some new dependency...

                      ryanprior@mastodon.socialR 1 Antwort Letzte Antwort
                      0
                      • olbohlen@norden.socialO olbohlen@norden.social

                        @ryanprior @HunterZ @osm_tech sure I could, but I refuse to put my selfhosted stuff behind some new dependency...

                        ryanprior@mastodon.socialR This user is from outside of this forum
                        ryanprior@mastodon.socialR This user is from outside of this forum
                        ryanprior@mastodon.social
                        schrieb zuletzt editiert von
                        #29

                        @olbohlen @HunterZ @osm_tech the complexity of setting up defenses for this is regrettable

                        1 Antwort Letzte Antwort
                        0
                        • osm_tech@en.osm.townO osm_tech@en.osm.town

                          To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                          If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                          🙏🌍 #AI #Bots #Abuse

                          hlunke@darmstadt.socialH This user is from outside of this forum
                          hlunke@darmstadt.socialH This user is from outside of this forum
                          hlunke@darmstadt.social
                          schrieb zuletzt editiert von
                          #30

                          @osm_tech

                          Might be a good idea to become OSMF Member now or just donate some money.
                          Membership is starting at 15£/yer
                          https://supporting.openstreetmap.org/

                          1 Antwort Letzte Antwort
                          0
                          • osm_tech@en.osm.townO osm_tech@en.osm.town

                            @JonSaenzAgirre It is a good questions, and we don't know the answer either. Our planet data is so much easier to process and use.

                            ff7@freiburg.socialF This user is from outside of this forum
                            ff7@freiburg.socialF This user is from outside of this forum
                            ff7@freiburg.social
                            schrieb zuletzt editiert von
                            #31

                            @osm_tech @JonSaenzAgirre thats dumb ai, probably. No "i" at all...

                            1 Antwort Letzte Antwort
                            0
                            • osm_tech@en.osm.townO osm_tech@en.osm.town

                              To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                              If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                              🙏🌍 #AI #Bots #Abuse

                              tykayn@mastodon.cipherbliss.comT This user is from outside of this forum
                              tykayn@mastodon.cipherbliss.comT This user is from outside of this forum
                              tykayn@mastodon.cipherbliss.com
                              schrieb zuletzt editiert von
                              #32

                              @osm_tech
                              Have you tried some #iocaine, #anubis, and shared block lists for #fail2ban ?
                              I think some defense guides for admin sys would be useful for a few people around here that are hosting things for osm too

                              #fuckiascraping

                              1 Antwort Letzte Antwort
                              0
                              • osm_tech@en.osm.townO osm_tech@en.osm.town

                                To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                                If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                                🙏🌍 #AI #Bots #Abuse

                                burtyb@widget.ukB This user is from outside of this forum
                                burtyb@widget.ukB This user is from outside of this forum
                                burtyb@widget.uk
                                schrieb zuletzt editiert von
                                #33

                                @osm_tech sounds familiar, last year I braved turning cloudflares "under attack" mode off for https://dnshistory.org/ and saw an extra 5 million requests/day (500k unique IPs) overloading things. It's still blocking >700k requests/day a month later...

                                1 Antwort Letzte Antwort
                                0
                                • osm_tech@en.osm.townO osm_tech@en.osm.town

                                  To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                                  If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                                  🙏🌍 #AI #Bots #Abuse

                                  clarinerd@mastodon.socialC This user is from outside of this forum
                                  clarinerd@mastodon.socialC This user is from outside of this forum
                                  clarinerd@mastodon.social
                                  schrieb zuletzt editiert von
                                  #34

                                  @osm_tech and we can tell the scrapers are AI built because a cursory glance at the documentation on the "coders" part would've prevented this problem.

                                  jkb@gotosocial.jkbockstael.beJ 1 Antwort Letzte Antwort
                                  0
                                  • osm_tech@en.osm.townO osm_tech@en.osm.town

                                    To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                                    If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                                    🙏🌍 #AI #Bots #Abuse

                                    chestycougth@mastodon.socialC This user is from outside of this forum
                                    chestycougth@mastodon.socialC This user is from outside of this forum
                                    chestycougth@mastodon.social
                                    schrieb zuletzt editiert von
                                    #35

                                    @osm_tech Thank you. I'm a beginner who has just been doing toy projects and has barely any notion of what web scraping is but I'm very happy to learn that your data can be downloaded 🙏

                                    1 Antwort Letzte Antwort
                                    0
                                    • zymurgic@mastodon.onlineZ zymurgic@mastodon.online

                                      @osm_tech I wonder if the culprit will ever come forward, apologise, and change their ways? Someone tasked these proxy scrapers with ridiculous requests.
                                      Have they been targeting the main OSM API, the website interface designed for humans, or Overpass?

                                      grechaw@sfba.socialG This user is from outside of this forum
                                      grechaw@sfba.socialG This user is from outside of this forum
                                      grechaw@sfba.social
                                      schrieb zuletzt editiert von
                                      #36

                                      @zymurgic @osm_tech this kind of abuse has become normal and normalized. It's the AI way. Makes it tough for the legit crawlers out there, too.

                                      1 Antwort Letzte Antwort
                                      0
                                      • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

                                        @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

                                        marcel@waldvogel.familyM This user is from outside of this forum
                                        marcel@waldvogel.familyM This user is from outside of this forum
                                        marcel@waldvogel.family
                                        schrieb zuletzt editiert von
                                        #37

                                        @HunterZ @osm_tech
                                        My first guess would be some dual-use browser extension. Aka Trojan.

                                        1 Antwort Letzte Antwort
                                        0
                                        • osm_tech@en.osm.townO osm_tech@en.osm.town

                                          To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                                          If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                                          🙏🌍 #AI #Bots #Abuse

                                          apnoe_soeren@mastodon.socialA This user is from outside of this forum
                                          apnoe_soeren@mastodon.socialA This user is from outside of this forum
                                          apnoe_soeren@mastodon.social
                                          schrieb zuletzt editiert von
                                          #38

                                          @osm_tech Limit the speed to Modem 14400 speed each IP for a month or so. 😅

                                          1 Antwort Letzte Antwort
                                          0
                                          • necrosis@chaos.socialN necrosis@chaos.social shared this topic
                                          Antworten
                                          • In einem neuen Thema antworten
                                          Anmelden zum Antworten
                                          • Älteste zuerst
                                          • Neuste zuerst
                                          • Meiste Stimmen



                                          Copyright (c) 2025 abSpecktrum (@abspecklog@fedimonster.de)

                                          Erstellt mit Schlaflosigkeit, Kaffee, Brokkoli & ♥

                                          Impressum | Datenschutzerklärung | Nutzungsbedingungen

                                          • Anmelden

                                          • Du hast noch kein Konto? Registrieren

                                          • Anmelden oder registrieren, um zu suchen
                                          • Erster Beitrag
                                            Letzter Beitrag
                                          0
                                          • Home
                                          • Aktuell
                                          • Tags
                                          • Über dieses Forum