Mastodon Skip to content
  • Home
  • Aktuell
  • Tags
  • Über dieses Forum
Einklappen
Grafik mit zwei überlappenden Sprechblasen, eine grün und eine lila.
Abspeckgeflüster – Forum für Menschen mit Gewicht(ung)

Kostenlos. Werbefrei. Menschlich. Dein Abnehmforum.

  1. Home
  2. Uncategorized
  3. If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks.

If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks.

Geplant Angeheftet Gesperrt Verschoben Uncategorized
openstreetmapbotsabuse
114 Beiträge 92 Kommentatoren 0 Aufrufe
  • Älteste zuerst
  • Neuste zuerst
  • Meiste Stimmen
Antworten
  • In einem neuen Thema antworten
Anmelden zum Antworten
Dieses Thema wurde gelöscht. Nur Nutzer mit entsprechenden Rechten können es sehen.
  • floris@freiburg.socialF floris@freiburg.social

    Vielleicht ist das ein Thema für die @lagedernation?

    lagedernation@chaos.socialL This user is from outside of this forum
    lagedernation@chaos.socialL This user is from outside of this forum
    lagedernation@chaos.social
    schrieb zuletzt editiert von
    #96

    @floris
    @osm_tech Hi, please get in touch, we've covered OSM many times before and would love to learn more: team a lagedernation org

    1 Antwort Letzte Antwort
    0
    • osm_tech@en.osm.townO osm_tech@en.osm.town

      If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

      fantafanta@mastodon.socialF This user is from outside of this forum
      fantafanta@mastodon.socialF This user is from outside of this forum
      fantafanta@mastodon.social
      schrieb zuletzt editiert von
      #97

      @osm_tech Interesting. Perhaps we could follow-up via e-mail or DM? alexander.fanta@ftm.nl

      1 Antwort Letzte Antwort
      0
      • osm_tech@en.osm.townO osm_tech@en.osm.town

        If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

        ea5iyl@mastodon.radioE This user is from outside of this forum
        ea5iyl@mastodon.radioE This user is from outside of this forum
        ea5iyl@mastodon.radio
        schrieb zuletzt editiert von
        #98

        @osm_tech Wow. TIL that software development kits are more or less silently embedding internet scrapers in (unrelated) end-user applications to distribute AI data scraping across residential addresses and therefore be harder to defend against.
        Hey, Tim, were you expecting stuff like this 35 years down the line?

        1 Antwort Letzte Antwort
        0
        • osm_tech@en.osm.townO osm_tech@en.osm.town

          @BalooUriza We use fail2ban to handle some of this with custom rules, but eventually fail2ban becomes a bottleneck after 100,000 IP addresses.

          mnalis@mastodon.onlineM This user is from outside of this forum
          mnalis@mastodon.onlineM This user is from outside of this forum
          mnalis@mastodon.online
          schrieb zuletzt editiert von
          #99

          @osm_tech @BalooUriza is it using ipset hashsets, or default rule-per-ip rules? raw namespace or? I don't know the details of implementation, but if it is L7 load that is problematic (instead of pure bandwidth DDoS), it might be worth to consider whitelisting instead. I.e. whitelist addresses (or /24s) that have *not* had excessive requests lately, and put them in priority network bucket, and the rest (which is not blacklisted) goes in best-effort bucket (to maybe migrate to whitelist later)

          1 Antwort Letzte Antwort
          0
          • osm_tech@en.osm.townO osm_tech@en.osm.town

            If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

            orfanik@witter.czO This user is from outside of this forum
            orfanik@witter.czO This user is from outside of this forum
            orfanik@witter.cz
            schrieb zuletzt editiert von
            #100

            @osm_tech

            @jakubzelenka

            1 Antwort Letzte Antwort
            0
            • dalias@hachyderm.ioD dalias@hachyderm.io

              @osm_tech @BalooUriza For IPv4, a bitmask of the entire address space is a viable "efficient" implementation of blocking. I wonder if there are tools that can do it that way rather than needing a gigantic list.

              magezwitscher@det.socialM This user is from outside of this forum
              magezwitscher@det.socialM This user is from outside of this forum
              magezwitscher@det.social
              schrieb zuletzt editiert von
              #101

              @dalias @BalooUriza But that is one of the points @osm_tech are making in their post. These crawlers resort to using massive amounts of "scrapers hiding behind residential proxy/embedded-SDK networks" - meaning they are using Adware-infested phones all over the world for their scraping attaks. So banning IP ranges won't help much. Playing cat-and-mouse with these scrapers is resource intensive, which is increasingly hard for FOSS projects and is also driving up cost for commercial offerings.

              dalias@hachyderm.ioD 1 Antwort Letzte Antwort
              0
              • magezwitscher@det.socialM magezwitscher@det.social

                @dalias @BalooUriza But that is one of the points @osm_tech are making in their post. These crawlers resort to using massive amounts of "scrapers hiding behind residential proxy/embedded-SDK networks" - meaning they are using Adware-infested phones all over the world for their scraping attaks. So banning IP ranges won't help much. Playing cat-and-mouse with these scrapers is resource intensive, which is increasingly hard for FOSS projects and is also driving up cost for commercial offerings.

                dalias@hachyderm.ioD This user is from outside of this forum
                dalias@hachyderm.ioD This user is from outside of this forum
                dalias@hachyderm.io
                schrieb zuletzt editiert von
                #102

                @magezwitscher @BalooUriza @osm_tech Not ranges. Just the single IP, and a short-lived ban. All you need to do is get them down from thousands of requests per minute to one request per hour (because they get banned for an hour each time they start again).

                1 Antwort Letzte Antwort
                0
                • mimesatwork@wandering.shopM mimesatwork@wandering.shop

                  @robz @osm_tech And who is it going to reach?

                  robz@toot.robzazueta.comR This user is from outside of this forum
                  robz@toot.robzazueta.comR This user is from outside of this forum
                  robz@toot.robzazueta.com
                  schrieb zuletzt editiert von
                  #103

                  @Mimesatwork @osm_tech The same people this message reached for a start.

                  Journalists no longer have the reach you think they do. They have become extremely unreliable.

                  Write the post, spread it the same way they spread their request for a journo...

                  They got at least you and I and the person who shared it with me initially so... they have some reach, especially into the people who care about this kind of thing.

                  1 Antwort Letzte Antwort
                  0
                  • mrgrumpymonkey@mastodon.socialM mrgrumpymonkey@mastodon.social

                    @osm_tech Pinging @GarretSidzaka as he might have some leads.

                    garretsidzaka@mastodon.socialG This user is from outside of this forum
                    garretsidzaka@mastodon.socialG This user is from outside of this forum
                    garretsidzaka@mastodon.social
                    schrieb zuletzt editiert von
                    #104

                    @mrgrumpymonkey @osm_tech
                    Brian Krebs is on it

                    1 Antwort Letzte Antwort
                    0
                    • osm_tech@en.osm.townO osm_tech@en.osm.town

                      If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                      papuass@toot.lvP This user is from outside of this forum
                      papuass@toot.lvP This user is from outside of this forum
                      papuass@toot.lv
                      schrieb zuletzt editiert von
                      #105

                      @osm_tech hey, @arstechnica it is not only @wikipedia that suffers

                      1 Antwort Letzte Antwort
                      0
                      • osm_tech@en.osm.townO osm_tech@en.osm.town

                        If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                        oscherler@tooting.chO This user is from outside of this forum
                        oscherler@tooting.chO This user is from outside of this forum
                        oscherler@tooting.ch
                        schrieb zuletzt editiert von
                        #106

                        @osm_tech Good luck finding a journalist in 2026.

                        1 Antwort Letzte Antwort
                        0
                        • corbet@social.kernel.orgC corbet@social.kernel.org
                          @osm_tech You are definitely not alone: https://lwn.net/Articles/1008897/ The situation is not sustainable but I'm not sure what we do about it beyond waiting for the AI bubble to burst.
                          soaproot@sfba.socialS This user is from outside of this forum
                          soaproot@sfba.socialS This user is from outside of this forum
                          soaproot@sfba.social
                          schrieb zuletzt editiert von
                          #107

                          @corbet @osm_tech I don't have answers either but I hope something emerges because waiting for the bubble to burst still may face the "the market can remain irrational longer than you can remain solvent" problem.

                          1 Antwort Letzte Antwort
                          0
                          • osm_tech@en.osm.townO osm_tech@en.osm.town

                            If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                            gregoa_@chaos.socialG This user is from outside of this forum
                            gregoa_@chaos.socialG This user is from outside of this forum
                            gregoa_@chaos.social
                            schrieb zuletzt editiert von
                            #108

                            @suka_hiroaki ↑

                            1 Antwort Letzte Antwort
                            0
                            • osm_tech@en.osm.townO osm_tech@en.osm.town

                              If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                              derjoeffekt@sueden.socialD This user is from outside of this forum
                              derjoeffekt@sueden.socialD This user is from outside of this forum
                              derjoeffekt@sueden.social
                              schrieb zuletzt editiert von
                              #109

                              @osm_tech Heise has written about it in German/English.

                              https://www.heise.de/en/news/OpenStreetMap-is-concerned-thousands-of-AI-bots-are-collecting-data-11157359.html

                              1 Antwort Letzte Antwort
                              0
                              • thatprilla@theatl.socialT thatprilla@theatl.social

                                @osm_tech

                                I feel for yall. These residential proxies and the sdk networks are the bane of my existence and I’m paid to deal with them.

                                eq@mas.toE This user is from outside of this forum
                                eq@mas.toE This user is from outside of this forum
                                eq@mas.to
                                schrieb zuletzt editiert von
                                #110

                                @ThatPrilla @osm_tech It would be very useful with a tool running on ISP hardware that could detect residential proxies. Or is there anything we can stuff into our DNS'es to blackhole the proxies backbone?

                                1 Antwort Letzte Antwort
                                0
                                • osm_tech@en.osm.townO osm_tech@en.osm.town

                                  If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                                  hypolite@friendica.mrpetovan.comH This user is from outside of this forum
                                  hypolite@friendica.mrpetovan.comH This user is from outside of this forum
                                  hypolite@friendica.mrpetovan.com
                                  schrieb zuletzt editiert von
                                  #111
                                  @osm_tech Ping @404mediaco
                                  1 Antwort Letzte Antwort
                                  0
                                  • eyjala@mastodon.socialE eyjala@mastodon.social shared this topic
                                  • osm_tech@en.osm.townO osm_tech@en.osm.town

                                    If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                                    F This user is from outside of this forum
                                    F This user is from outside of this forum
                                    froztbyte@mastodon.social
                                    schrieb zuletzt editiert von
                                    #112

                                    @osm_tech might be a thing @davidgerard could do on pivot

                                    davidgerard@circumstances.runD 1 Antwort Letzte Antwort
                                    0
                                    • F froztbyte@mastodon.social

                                      @osm_tech might be a thing @davidgerard could do on pivot

                                      davidgerard@circumstances.runD This user is from outside of this forum
                                      davidgerard@circumstances.runD This user is from outside of this forum
                                      davidgerard@circumstances.run
                                      schrieb zuletzt editiert von
                                      #113

                                      @froztbyte @osm_tech yeah i'm getting the same AI assholes

                                      as is @RationalWiki (i'm the sysadmin trying to keep the site up in the face of the hammering - we can either lose Google search listing, or we can be literally unusable for humans)

                                      as is @corbet at Linux Weekly News - OSM might be relevant to LWN, a free content project getting hammered by the AI bots

                                      they botnet suburban Android boxes

                                      covered it a bit previously on Pivot:

                                      https://pivot-to-ai.com/2025/06/02/fighting-the-ai-scraper-bots-at-pivot-to-ai-and-rationalwiki/
                                      https://pivot-to-ai.com/2025/09/07/the-ai-scraper-bots-are-hammering-pivot-to-ai-again-please-test/

                                      1 Antwort Letzte Antwort
                                      0
                                      • sjvn@mastodon.socialS sjvn@mastodon.social

                                        @osm_tech Tell me more. You can reach me at sjvn01 <at> gmail.com

                                        davidgerard@circumstances.runD This user is from outside of this forum
                                        davidgerard@circumstances.runD This user is from outside of this forum
                                        davidgerard@circumstances.run
                                        schrieb zuletzt editiert von
                                        #114

                                        @sjvn @osm_tech do contact sjvn!

                                        1 Antwort Letzte Antwort
                                        0
                                        • bugspriet@social.tchncs.deB bugspriet@social.tchncs.de shared this topic
                                        Antworten
                                        • In einem neuen Thema antworten
                                        Anmelden zum Antworten
                                        • Älteste zuerst
                                        • Neuste zuerst
                                        • Meiste Stimmen



                                        Copyright (c) 2025 abSpecktrum (@abspecklog@fedimonster.de)

                                        Erstellt mit Schlaflosigkeit, Kaffee, Brokkoli & ♥

                                        Impressum | Datenschutzerklärung | Nutzungsbedingungen

                                        • Anmelden

                                        • Du hast noch kein Konto? Registrieren

                                        • Anmelden oder registrieren, um zu suchen
                                        • Erster Beitrag
                                          Letzter Beitrag
                                        0
                                        • Home
                                        • Aktuell
                                        • Tags
                                        • Über dieses Forum