Mastodon Skip to content
  • Home
  • Aktuell
  • Tags
  • Über dieses Forum
Einklappen
Grafik mit zwei überlappenden Sprechblasen, eine grün und eine lila.
Abspeckgeflüster – Forum für Menschen mit Gewicht(ung)

Kostenlos. Werbefrei. Menschlich. Dein Abnehmforum.

  1. Home
  2. Uncategorized
  3. To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

Geplant Angeheftet Gesperrt Verschoben Uncategorized
openstreetmapbotsabuse
41 Beiträge 26 Kommentatoren 0 Aufrufe
  • Älteste zuerst
  • Neuste zuerst
  • Meiste Stimmen
Antworten
  • In einem neuen Thema antworten
Anmelden zum Antworten
Dieses Thema wurde gelöscht. Nur Nutzer mit entsprechenden Rechten können es sehen.
  • osm_tech@en.osm.townO osm_tech@en.osm.town

    @JonSaenzAgirre It is a good questions, and we don't know the answer either. Our planet data is so much easier to process and use.

    ff7@freiburg.socialF This user is from outside of this forum
    ff7@freiburg.socialF This user is from outside of this forum
    ff7@freiburg.social
    schrieb zuletzt editiert von
    #31

    @osm_tech @JonSaenzAgirre thats dumb ai, probably. No "i" at all...

    1 Antwort Letzte Antwort
    0
    • osm_tech@en.osm.townO osm_tech@en.osm.town

      To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

      If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
      🙏🌍 #AI #Bots #Abuse

      tykayn@mastodon.cipherbliss.comT This user is from outside of this forum
      tykayn@mastodon.cipherbliss.comT This user is from outside of this forum
      tykayn@mastodon.cipherbliss.com
      schrieb zuletzt editiert von
      #32

      @osm_tech
      Have you tried some #iocaine, #anubis, and shared block lists for #fail2ban ?
      I think some defense guides for admin sys would be useful for a few people around here that are hosting things for osm too

      #fuckiascraping

      1 Antwort Letzte Antwort
      0
      • osm_tech@en.osm.townO osm_tech@en.osm.town

        To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

        If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
        🙏🌍 #AI #Bots #Abuse

        burtyb@widget.ukB This user is from outside of this forum
        burtyb@widget.ukB This user is from outside of this forum
        burtyb@widget.uk
        schrieb zuletzt editiert von
        #33

        @osm_tech sounds familiar, last year I braved turning cloudflares "under attack" mode off for https://dnshistory.org/ and saw an extra 5 million requests/day (500k unique IPs) overloading things. It's still blocking >700k requests/day a month later...

        1 Antwort Letzte Antwort
        0
        • osm_tech@en.osm.townO osm_tech@en.osm.town

          To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

          If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
          🙏🌍 #AI #Bots #Abuse

          clarinerd@mastodon.socialC This user is from outside of this forum
          clarinerd@mastodon.socialC This user is from outside of this forum
          clarinerd@mastodon.social
          schrieb zuletzt editiert von
          #34

          @osm_tech and we can tell the scrapers are AI built because a cursory glance at the documentation on the "coders" part would've prevented this problem.

          jkb@gotosocial.jkbockstael.beJ 1 Antwort Letzte Antwort
          0
          • osm_tech@en.osm.townO osm_tech@en.osm.town

            To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

            If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
            🙏🌍 #AI #Bots #Abuse

            chestycougth@mastodon.socialC This user is from outside of this forum
            chestycougth@mastodon.socialC This user is from outside of this forum
            chestycougth@mastodon.social
            schrieb zuletzt editiert von
            #35

            @osm_tech Thank you. I'm a beginner who has just been doing toy projects and has barely any notion of what web scraping is but I'm very happy to learn that your data can be downloaded 🙏

            1 Antwort Letzte Antwort
            0
            • zymurgic@mastodon.onlineZ zymurgic@mastodon.online

              @osm_tech I wonder if the culprit will ever come forward, apologise, and change their ways? Someone tasked these proxy scrapers with ridiculous requests.
              Have they been targeting the main OSM API, the website interface designed for humans, or Overpass?

              grechaw@sfba.socialG This user is from outside of this forum
              grechaw@sfba.socialG This user is from outside of this forum
              grechaw@sfba.social
              schrieb zuletzt editiert von
              #36

              @zymurgic @osm_tech this kind of abuse has become normal and normalized. It's the AI way. Makes it tough for the legit crawlers out there, too.

              1 Antwort Letzte Antwort
              0
              • hunterz@mastodon.sdf.orgH hunterz@mastodon.sdf.org

                @osm_tech does coming from residential IPs mean that someone has baked a scraper into some popular tool that people don't realize is doing that?

                marcel@waldvogel.familyM This user is from outside of this forum
                marcel@waldvogel.familyM This user is from outside of this forum
                marcel@waldvogel.family
                schrieb zuletzt editiert von
                #37

                @HunterZ @osm_tech
                My first guess would be some dual-use browser extension. Aka Trojan.

                1 Antwort Letzte Antwort
                0
                • osm_tech@en.osm.townO osm_tech@en.osm.town

                  To keep #OpenStreetMap.org up and running while we're being deluged by scrapers, we've blocked 320,000+ primarily residential IPv4 addresses in the last 24 hours (+ 100,000 IPv6) involved in scraping.

                  If you need OSM data, please don't scrape the website - use the official downloads at https://planet.openstreetmap.org
                  🙏🌍 #AI #Bots #Abuse

                  apnoe_soeren@mastodon.socialA This user is from outside of this forum
                  apnoe_soeren@mastodon.socialA This user is from outside of this forum
                  apnoe_soeren@mastodon.social
                  schrieb zuletzt editiert von
                  #38

                  @osm_tech Limit the speed to Modem 14400 speed each IP for a month or so. 😅

                  1 Antwort Letzte Antwort
                  0
                  • necrosis@chaos.socialN necrosis@chaos.social shared this topic
                  • jonsaenzagirre@mastodon.eusJ jonsaenzagirre@mastodon.eus

                    @osm_tech question. Why do people scrape server which make the data freely available? And, probably, better structured in the final product. I don't see the point.

                    vampirdaddy@chaos.socialV This user is from outside of this forum
                    vampirdaddy@chaos.socialV This user is from outside of this forum
                    vampirdaddy@chaos.social
                    schrieb zuletzt editiert von
                    #39

                    @JonSaenzAgirre @osm_tech
                    The scrapers are DUMB.
                    They are not curated, have only basic maintenance, are built to gobble up ANYTHING textual they encounter, without respect, mercy or reason.

                    Just collect meaningless data.

                    That’s the nature of the coveted LLMs: just statistics, no understanding, structure or meaning.

                    And greedy crooks in haste to make quick money just grab everything they can.

                    The AI bubble needs to pop really soon.

                    1 Antwort Letzte Antwort
                    0
                    • clarinerd@mastodon.socialC clarinerd@mastodon.social

                      @osm_tech and we can tell the scrapers are AI built because a cursory glance at the documentation on the "coders" part would've prevented this problem.

                      jkb@gotosocial.jkbockstael.beJ This user is from outside of this forum
                      jkb@gotosocial.jkbockstael.beJ This user is from outside of this forum
                      jkb@gotosocial.jkbockstael.be
                      schrieb zuletzt editiert von
                      #40

                      @ClariNerd @osm_tech Because their IP ranges are increasingly being blocked by servers following their harmful scraping habits, AI companies are now releasing "browsers" so they can scrape from residential IPs instead and circumvent blocks. Oh, sorry, I meant "so they can empower users with AI insight in this new era of information".

                      clarinerd@mastodon.socialC 1 Antwort Letzte Antwort
                      0
                      • jkb@gotosocial.jkbockstael.beJ jkb@gotosocial.jkbockstael.be

                        @ClariNerd @osm_tech Because their IP ranges are increasingly being blocked by servers following their harmful scraping habits, AI companies are now releasing "browsers" so they can scrape from residential IPs instead and circumvent blocks. Oh, sorry, I meant "so they can empower users with AI insight in this new era of information".

                        clarinerd@mastodon.socialC This user is from outside of this forum
                        clarinerd@mastodon.socialC This user is from outside of this forum
                        clarinerd@mastodon.social
                        schrieb zuletzt editiert von
                        #41

                        @jkb @osm_tech brb repeatedly slamming my forehead against my desk for the next five minutes. Then I will reread that and hopefully it will seem less dystopian.

                        1 Antwort Letzte Antwort
                        0
                        • angelacarstensen@mastodon.onlineA angelacarstensen@mastodon.online shared this topic
                        Antworten
                        • In einem neuen Thema antworten
                        Anmelden zum Antworten
                        • Älteste zuerst
                        • Neuste zuerst
                        • Meiste Stimmen



                        Copyright (c) 2025 abSpecktrum (@abspecklog@fedimonster.de)

                        Erstellt mit Schlaflosigkeit, Kaffee, Brokkoli & ♥

                        Impressum | Datenschutzerklärung | Nutzungsbedingungen

                        • Anmelden

                        • Du hast noch kein Konto? Registrieren

                        • Anmelden oder registrieren, um zu suchen
                        • Erster Beitrag
                          Letzter Beitrag
                        0
                        • Home
                        • Aktuell
                        • Tags
                        • Über dieses Forum