Mastodon Skip to content
  • Home
  • Aktuell
  • Tags
  • Über dieses Forum
Einklappen
Grafik mit zwei überlappenden Sprechblasen, eine grün und eine lila.
Abspeckgeflüster – Forum für Menschen mit Gewicht(ung)

Kostenlos. Werbefrei. Menschlich. Dein Abnehmforum.

  1. Home
  2. Uncategorized
  3. To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

Geplant Angeheftet Gesperrt Verschoben Uncategorized
openstreetmapbotsabuse
41 Beiträge 26 Kommentatoren 0 Aufrufe
  • Älteste zuerst
  • Neuste zuerst
  • Meiste Stimmen
Antworten
  • In einem neuen Thema antworten
Anmelden zum Antworten
Dieses Thema wurde gelöscht. Nur Nutzer mit entsprechenden Rechten können es sehen.
  • osm_tech@en.osm.townO osm_tech@en.osm.town

    To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

    If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
    🙏🌍 #AI #Bots #Abuse

    hunterz@mastodon.sdf.orgH This user is from outside of this forum
    hunterz@mastodon.sdf.orgH This user is from outside of this forum
    hunterz@mastodon.sdf.org
    schrieb zuletzt editiert von
    #7

    @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

    ryanprior@mastodon.socialR artandtechnic@digipres.clubA jay0@alico.nexusJ marcel@waldvogel.familyM 4 Antworten Letzte Antwort
    0
    • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

      @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

      ryanprior@mastodon.socialR This user is from outside of this forum
      ryanprior@mastodon.socialR This user is from outside of this forum
      ryanprior@mastodon.social
      schrieb zuletzt editiert von
      #8

      @HunterZ @osm_tech this is actually quite common. Mobile advertising SDKs for games, background apps, etc include residential scraping proxy functionality that they can sell to the highest bidder, and then when scrapers want to avoid restrictions they can pay a fraction of a penny to send their requests via your phone. Millions of people use apps with this built in and have no idea. Most websites don't want to ban the residential scrapers because it can hurt growth.

      tehstu@hachyderm.ioT olbohlen@norden.socialO 2 Antworten Letzte Antwort
      0
      • osm_tech@en.osm.townO osm_tech@en.osm.town

        To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

        If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
        🙏🌍 #AI #Bots #Abuse

        utf_7@mastodon.socialU This user is from outside of this forum
        utf_7@mastodon.socialU This user is from outside of this forum
        utf_7@mastodon.social
        schrieb zuletzt editiert von
        #9

        @osm_tech how can you even scrape a mapsite? isnt osm just a big canvas like google maps that is also nearly impossible to automate?

        osm_tech@en.osm.townO 1 Antwort Letzte Antwort
        0
        • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

          @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

          artandtechnic@digipres.clubA This user is from outside of this forum
          artandtechnic@digipres.clubA This user is from outside of this forum
          artandtechnic@digipres.club
          schrieb zuletzt editiert von
          #10

          @HunterZ @osm_tech Yes. There’s also (at least) one BitTorrent client that has one built in as well.

          1 Antwort Letzte Antwort
          0
          • utf_7@mastodon.socialU utf_7@mastodon.social

            @osm_tech how can you even scrape a mapsite? isnt osm just a big canvas like google maps that is also nearly impossible to automate?

            osm_tech@en.osm.townO This user is from outside of this forum
            osm_tech@en.osm.townO This user is from outside of this forum
            osm_tech@en.osm.town
            schrieb zuletzt editiert von
            #11

            @utf_7 It is madness, start here: https://www.openstreetmap.org/node/1 and keep going once you reach https://www.openstreetmap.org/node/10000000000, then start on ways, and relations 😛 or just download the latest weekly export from planet.openstreetmap.org 😏

            utf_7@mastodon.socialU 1 Antwort Letzte Antwort
            0
            • osm_tech@en.osm.townO osm_tech@en.osm.town

              @utf_7 It is madness, start here: https://www.openstreetmap.org/node/1 and keep going once you reach https://www.openstreetmap.org/node/10000000000, then start on ways, and relations 😛 or just download the latest weekly export from planet.openstreetmap.org 😏

              utf_7@mastodon.socialU This user is from outside of this forum
              utf_7@mastodon.socialU This user is from outside of this forum
              utf_7@mastodon.social
              schrieb zuletzt editiert von
              #12

              @osm_tech uff, i am a noob so forgive my stupid question: cant you somehow limit the requests. like 10 requests per minute or so. normal users will not be affected and scrapers will take forever?

              osm_tech@en.osm.townO 1 Antwort Letzte Antwort
              0
              • ryanprior@mastodon.socialR ryanprior@mastodon.social

                @HunterZ @osm_tech this is actually quite common. Mobile advertising SDKs for games, background apps, etc include residential scraping proxy functionality that they can sell to the highest bidder, and then when scrapers want to avoid restrictions they can pay a fraction of a penny to send their requests via your phone. Millions of people use apps with this built in and have no idea. Most websites don't want to ban the residential scrapers because it can hurt growth.

                tehstu@hachyderm.ioT This user is from outside of this forum
                tehstu@hachyderm.ioT This user is from outside of this forum
                tehstu@hachyderm.io
                schrieb zuletzt editiert von
                #13

                @ryanprior @HunterZ @osm_tech I had no idea this was a thing. And presumably, as requests come from you, not the advertiser, Pihole (and other network blockers) treat it as legitimate traffic?

                ryanprior@mastodon.socialR hunterz@mastodon.sdf.orgH 2 Antworten Letzte Antwort
                0
                • utf_7@mastodon.socialU utf_7@mastodon.social

                  @osm_tech uff, i am a noob so forgive my stupid question: cant you somehow limit the requests. like 10 requests per minute or so. normal users will not be affected and scrapers will take forever?

                  osm_tech@en.osm.townO This user is from outside of this forum
                  osm_tech@en.osm.townO This user is from outside of this forum
                  osm_tech@en.osm.town
                  schrieb zuletzt editiert von
                  #14

                  @utf_7 We've had 400,000 IPs in the last 24 hours. Each IP only does a few requests. Technically we're managing, but no fun fighting this daily rather than building new things.

                  utf_7@mastodon.socialU 1 Antwort Letzte Antwort
                  0
                  • tehstu@hachyderm.ioT tehstu@hachyderm.io

                    @ryanprior @HunterZ @osm_tech I had no idea this was a thing. And presumably, as requests come from you, not the advertiser, Pihole (and other network blockers) treat it as legitimate traffic?

                    ryanprior@mastodon.socialR This user is from outside of this forum
                    ryanprior@mastodon.socialR This user is from outside of this forum
                    ryanprior@mastodon.social
                    schrieb zuletzt editiert von
                    #15

                    @tehstu @HunterZ @osm_tech anything your pihole would let you request, it'd let the scraper request. If the scraper wanted to scrape some ads from another network it might get blocked, I guess.

                    1 Antwort Letzte Antwort
                    0
                    • tehstu@hachyderm.ioT tehstu@hachyderm.io

                      @ryanprior @HunterZ @osm_tech I had no idea this was a thing. And presumably, as requests come from you, not the advertiser, Pihole (and other network blockers) treat it as legitimate traffic?

                      hunterz@mastodon.sdf.orgH This user is from outside of this forum
                      hunterz@mastodon.sdf.orgH This user is from outside of this forum
                      hunterz@mastodon.sdf.org
                      schrieb zuletzt editiert von
                      #16

                      @tehstu @ryanprior @osm_tech pihole works by refusing to provide DNS resolution for domains on its blocklists, so it could block a scraper *if* its functionality depends on resolving a domain name that is blocked by pihole.

                      hunterz@mastodon.sdf.orgH 1 Antwort Letzte Antwort
                      0
                      • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

                        @tehstu @ryanprior @osm_tech pihole works by refusing to provide DNS resolution for domains on its blocklists, so it could block a scraper *if* its functionality depends on resolving a domain name that is blocked by pihole.

                        hunterz@mastodon.sdf.orgH This user is from outside of this forum
                        hunterz@mastodon.sdf.orgH This user is from outside of this forum
                        hunterz@mastodon.sdf.org
                        schrieb zuletzt editiert von
                        #17

                        @tehstu @ryanprior @osm_tech oh and of course the scraper would have to respect pihole versus using its own hard coded DNS IP to resolve things.

                        1 Antwort Letzte Antwort
                        0
                        • osm_tech@en.osm.townO osm_tech@en.osm.town

                          @utf_7 We've had 400,000 IPs in the last 24 hours. Each IP only does a few requests. Technically we're managing, but no fun fighting this daily rather than building new things.

                          utf_7@mastodon.socialU This user is from outside of this forum
                          utf_7@mastodon.socialU This user is from outside of this forum
                          utf_7@mastodon.social
                          schrieb zuletzt editiert von
                          #18

                          @osm_tech tHeN yOu jUsT neEd tO sCaLe

                          osm_tech@en.osm.townO 1 Antwort Letzte Antwort
                          0
                          • osm_tech@en.osm.townO osm_tech@en.osm.town

                            To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                            If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                            🙏🌍 #AI #Bots #Abuse

                            jonsaenzagirre@mastodon.eusJ This user is from outside of this forum
                            jonsaenzagirre@mastodon.eusJ This user is from outside of this forum
                            jonsaenzagirre@mastodon.eus
                            schrieb zuletzt editiert von
                            #19

                            @osm_tech question. Why do people scrape server which make the data freely available? And, probably, better structured in the final product. I don't see the point.

                            osm_tech@en.osm.townO vampirdaddy@chaos.socialV 2 Antworten Letzte Antwort
                            0
                            • jonsaenzagirre@mastodon.eusJ jonsaenzagirre@mastodon.eus

                              @osm_tech question. Why do people scrape server which make the data freely available? And, probably, better structured in the final product. I don't see the point.

                              osm_tech@en.osm.townO This user is from outside of this forum
                              osm_tech@en.osm.townO This user is from outside of this forum
                              osm_tech@en.osm.town
                              schrieb zuletzt editiert von
                              #20

                              @JonSaenzAgirre It is a good questions, and we don't know the answer either. Our planet data is so much easier to process and use.

                              ff7@freiburg.socialF 1 Antwort Letzte Antwort
                              0
                              • utf_7@mastodon.socialU utf_7@mastodon.social

                                @osm_tech tHeN yOu jUsT neEd tO sCaLe

                                osm_tech@en.osm.townO This user is from outside of this forum
                                osm_tech@en.osm.townO This user is from outside of this forum
                                osm_tech@en.osm.town
                                schrieb zuletzt editiert von
                                #21

                                @utf_7 In this economy with RAM prices what they are?!? 😉

                                1 Antwort Letzte Antwort
                                0
                                • osm_tech@en.osm.townO osm_tech@en.osm.town

                                  To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                                  If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                                  🙏🌍 #AI #Bots #Abuse

                                  gme@bofh.socialG This user is from outside of this forum
                                  gme@bofh.socialG This user is from outside of this forum
                                  gme@bofh.social
                                  schrieb zuletzt editiert von
                                  #22

                                  @osm_tech@en.osm.town
                                  Could something like Anubis help you guys?

                                  1 Antwort Letzte Antwort
                                  0
                                  • wando@troet.cafeW wando@troet.cafe shared this topic
                                  • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

                                    @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

                                    jay0@alico.nexusJ This user is from outside of this forum
                                    jay0@alico.nexusJ This user is from outside of this forum
                                    jay0@alico.nexus
                                    schrieb zuletzt editiert von
                                    #23

                                    @HunterZ@mastodon.sdf.org @osm_tech@en.osm.town lots of mobile/desktop apps, browser extensions, and even IoT devices are paid by "residential proxy" companies to prey on their users by selling said users's connections to AI scrapers https://www.spamhaus.org/resource-hub/compromised/lets-talk-about-the-danger-of-residential-proxy-networks/

                                    1 Antwort Letzte Antwort
                                    0
                                    • ryanprior@mastodon.socialR ryanprior@mastodon.social

                                      @HunterZ @osm_tech this is actually quite common. Mobile advertising SDKs for games, background apps, etc include residential scraping proxy functionality that they can sell to the highest bidder, and then when scrapers want to avoid restrictions they can pay a fraction of a penny to send their requests via your phone. Millions of people use apps with this built in and have no idea. Most websites don't want to ban the residential scrapers because it can hurt growth.

                                      olbohlen@norden.socialO This user is from outside of this forum
                                      olbohlen@norden.socialO This user is from outside of this forum
                                      olbohlen@norden.social
                                      schrieb zuletzt editiert von
                                      #24

                                      @ryanprior @HunterZ @osm_tech I have that scraping also on my private webserver and it forced me to make a whole bunch of content private. yet still the botnet scrapes onto it and gets 404s now. Every single request from a different IP...

                                      ryanprior@mastodon.socialR 1 Antwort Letzte Antwort
                                      0
                                      • osm_tech@en.osm.townO osm_tech@en.osm.town

                                        To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                                        If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                                        🙏🌍 #AI #Bots #Abuse

                                        sadmin@social.tchncs.deS This user is from outside of this forum
                                        sadmin@social.tchncs.deS This user is from outside of this forum
                                        sadmin@social.tchncs.de
                                        schrieb zuletzt editiert von
                                        #25

                                        @osm_tech one day if you'd like to switch to nginx, I lend you a hand if you have a specific problem

                                        1 Antwort Letzte Antwort
                                        0
                                        • osm_tech@en.osm.townO osm_tech@en.osm.town

                                          To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                                          If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                                          🙏🌍 #AI #Bots #Abuse

                                          zymurgic@mastodon.onlineZ This user is from outside of this forum
                                          zymurgic@mastodon.onlineZ This user is from outside of this forum
                                          zymurgic@mastodon.online
                                          schrieb zuletzt editiert von
                                          #26

                                          @osm_tech I wonder if the culprit will ever come forward, apologise, and change their ways? Someone tasked these proxy scrapers with ridiculous requests.
                                          Have they been targeting the main OSM API, the website interface designed for humans, or Overpass?

                                          grechaw@sfba.socialG 1 Antwort Letzte Antwort
                                          0
                                          Antworten
                                          • In einem neuen Thema antworten
                                          Anmelden zum Antworten
                                          • Älteste zuerst
                                          • Neuste zuerst
                                          • Meiste Stimmen



                                          Copyright (c) 2025 abSpecktrum (@abspecklog@fedimonster.de)

                                          Erstellt mit Schlaflosigkeit, Kaffee, Brokkoli & ♥

                                          Impressum | Datenschutzerklärung | Nutzungsbedingungen

                                          • Anmelden

                                          • Du hast noch kein Konto? Registrieren

                                          • Anmelden oder registrieren, um zu suchen
                                          • Erster Beitrag
                                            Letzter Beitrag
                                          0
                                          • Home
                                          • Aktuell
                                          • Tags
                                          • Über dieses Forum