Mastodon Skip to content
  • Home
  • Aktuell
  • Tags
  • Über dieses Forum
Einklappen
Grafik mit zwei überlappenden Sprechblasen, eine grün und eine lila.
Abspeckgeflüster – Forum für Menschen mit Gewicht(ung)

Kostenlos. Werbefrei. Menschlich. Dein Abnehmforum.

  1. Home
  2. Uncategorized
  3. If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks.

If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks.

Geplant Angeheftet Gesperrt Verschoben Uncategorized
openstreetmapbotsabuse
114 Beiträge 92 Kommentatoren 0 Aufrufe
  • Älteste zuerst
  • Neuste zuerst
  • Meiste Stimmen
Antworten
  • In einem neuen Thema antworten
Anmelden zum Antworten
Dieses Thema wurde gelöscht. Nur Nutzer mit entsprechenden Rechten können es sehen.
  • baloouriza@social.tulsa.ok.usB baloouriza@social.tulsa.ok.us

    @osm_tech I wonder if there's a way to fail2ban requests coming in faster than typically found in human requests.

    nicd@masto.ahlcode.fiN This user is from outside of this forum
    nicd@masto.ahlcode.fiN This user is from outside of this forum
    nicd@masto.ahlcode.fi
    schrieb zuletzt editiert von
    #80

    @BalooUriza The problem is, who do you ban? Since the requests keep changing IPs and user agents.

    1 Antwort Letzte Antwort
    0
    • chillicampari@layer8.spaceC chillicampari@layer8.space

      @osm_tech tagging @mfeilner

      sl007@digitalcourage.socialS This user is from outside of this forum
      sl007@digitalcourage.socialS This user is from outside of this forum
      sl007@digitalcourage.social
      schrieb zuletzt editiert von
      #81

      @chillicampari @osm_tech So if it is too late to tag @mfeilner now, I am tagging @evawolfangel

      Journa et al.
      - #OSM is equally important as #wikipedia and wikibase
      - Please do not only report about critical infrastructure problems if an OSS project has birthday …

      1 Antwort Letzte Antwort
      0
      • osm_tech@en.osm.townO osm_tech@en.osm.town

        If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

        gusseting@mastodon.socialG This user is from outside of this forum
        gusseting@mastodon.socialG This user is from outside of this forum
        gusseting@mastodon.social
        schrieb zuletzt editiert von
        #82

        @osm_tech @camwilson fyi

        1 Antwort Letzte Antwort
        0
        • dalias@hachyderm.ioD dalias@hachyderm.io

          @osm_tech @BalooUriza For IPv4, a bitmask of the entire address space is a viable "efficient" implementation of blocking. I wonder if there are tools that can do it that way rather than needing a gigantic list.

          slink@fosstodon.orgS This user is from outside of this forum
          slink@fosstodon.orgS This user is from outside of this forum
          slink@fosstodon.org
          schrieb zuletzt editiert von
          #83

          @dalias @osm_tech @BalooUriza we have a very efficient implementation in #vinylcache (formerly #varnishcache )

          1 Antwort Letzte Antwort
          0
          • blub@norden.socialB blub@norden.social

            @osm_tech Or @heiseonline ?

            christopherkunz@chaos.socialC This user is from outside of this forum
            christopherkunz@chaos.socialC This user is from outside of this forum
            christopherkunz@chaos.social
            schrieb zuletzt editiert von
            #84

            @blub @osm_tech @heiseonline Yeah I already replied.

            1 Antwort Letzte Antwort
            0
            • osm_tech@en.osm.townO osm_tech@en.osm.town

              If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

              stuartyeates@cloudisland.nzS This user is from outside of this forum
              stuartyeates@cloudisland.nzS This user is from outside of this forum
              stuartyeates@cloudisland.nz
              schrieb zuletzt editiert von
              #85

              @osm_tech

              The real solution here is for app stores to give users proper per-app security settings. If an app isn't doesn't have a good reason to be sending email, it shouldn't be trying.

              1 Antwort Letzte Antwort
              0
              • L linux@bahn.social

                @osm_tech
                Maybe @adfichter for @republik_magazin ?

                adfichter@infosec.exchangeA This user is from outside of this forum
                adfichter@infosec.exchangeA This user is from outside of this forum
                adfichter@infosec.exchange
                schrieb zuletzt editiert von
                #86

                @Linux after vacation;) @osm_tech @republik_magazin

                1 Antwort Letzte Antwort
                0
                • bjoerne@norden.socialB bjoerne@norden.social shared this topic
                • osm_tech@en.osm.townO osm_tech@en.osm.town

                  If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                  droidboy@social.cologneD This user is from outside of this forum
                  droidboy@social.cologneD This user is from outside of this forum
                  droidboy@social.cologne
                  schrieb zuletzt editiert von
                  #87

                  @osm_tech @publictorsten

                  1 Antwort Letzte Antwort
                  0
                  • jorgesanz@mapstodon.spaceJ jorgesanz@mapstodon.space

                    @osm_tech maybe @civio @dcabo can be interested or help finding someone

                    dcabo@mastodon.socialD This user is from outside of this forum
                    dcabo@mastodon.socialD This user is from outside of this forum
                    dcabo@mastodon.social
                    schrieb zuletzt editiert von
                    #88

                    @jorgesanz @osm_tech @civio hmm, it doesn’t fit in Civio’s scope I’m afraid. But it’s definitely an issue I’m aware of, it’s worse now with all the AI scrapers and I wonder if we should block them all, they flood my apps too 😕 Maybe the 404 Media guys would be interested in this? https://www.404media.co/ai-scraping-bots-are-breaking-open-libraries-archives-and-museums/

                    1 Antwort Letzte Antwort
                    0
                    • osm_tech@en.osm.townO osm_tech@en.osm.town

                      If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                      nodami@hcommons.socialN This user is from outside of this forum
                      nodami@hcommons.socialN This user is from outside of this forum
                      nodami@hcommons.social
                      schrieb zuletzt editiert von
                      #89

                      @osm_tech
                      Maybe @La_Directa @donestech
                      @tunubesecamirio
                      @albalafarga
                      @mediapart
                      @mainichi
                      @heisec

                      Not Sure If they are already aware 😅

                      I remember @FediTips shared a list of News Media here in the fediverse, I'll try to find it.... Here it is https://fedi.directory/tag/investigative-journalism/

                      1 Antwort Letzte Antwort
                      0
                      • osm_tech@en.osm.townO osm_tech@en.osm.town

                        If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                        kaasbaas@mastodon.africaK This user is from outside of this forum
                        kaasbaas@mastodon.africaK This user is from outside of this forum
                        kaasbaas@mastodon.africa
                        schrieb zuletzt editiert von
                        #90

                        @osm_tech @theregister ?

                        1 Antwort Letzte Antwort
                        0
                        • osm_tech@en.osm.townO osm_tech@en.osm.town

                          If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                          wumbo@infosec.exchangeW This user is from outside of this forum
                          wumbo@infosec.exchangeW This user is from outside of this forum
                          wumbo@infosec.exchange
                          schrieb zuletzt editiert von
                          #91

                          @osm_tech hey, look into spur.us, they can help with the residential proxy issue.

                          1 Antwort Letzte Antwort
                          0
                          • osm_tech@en.osm.townO osm_tech@en.osm.town

                            If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                            wumbo@infosec.exchangeW This user is from outside of this forum
                            wumbo@infosec.exchangeW This user is from outside of this forum
                            wumbo@infosec.exchange
                            schrieb zuletzt editiert von
                            #92

                            @osm_tech @briankrebs

                            1 Antwort Letzte Antwort
                            0
                            • osm_tech@en.osm.townO osm_tech@en.osm.town

                              If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                              joris@hostux.socialJ This user is from outside of this forum
                              joris@hostux.socialJ This user is from outside of this forum
                              joris@hostux.social
                              schrieb zuletzt editiert von
                              #93

                              @osm_tech in my experience, it helps if you have local representatives so journalists can speak with, and write about, a person in their own region.
                              I could nudge Dutch (/Belgian) press!

                              1 Antwort Letzte Antwort
                              0
                              • osm_tech@en.osm.townO osm_tech@en.osm.town

                                If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                                vickyjo@mastodon.socialV This user is from outside of this forum
                                vickyjo@mastodon.socialV This user is from outside of this forum
                                vickyjo@mastodon.social
                                schrieb zuletzt editiert von
                                #94

                                @osm_tech I tried to find you on BSky - I'd try over there...

                                https://bsky.app/profile/joetidy.bsky.social
                                https://bsky.app/profile/zsk.bsky.social
                                https://bsky.app/profile/404media.co

                                1 Antwort Letzte Antwort
                                0
                                • ryanvade@mas.toR ryanvade@mas.to

                                  @osm_tech @404mediaco

                                  naturemc@mastodon.onlineN This user is from outside of this forum
                                  naturemc@mastodon.onlineN This user is from outside of this forum
                                  naturemc@mastodon.online
                                  schrieb zuletzt editiert von
                                  #95

                                  @ryanvade Was also my idea - and @heiseonline @heisec

                                  @osm_tech @404mediaco

                                  1 Antwort Letzte Antwort
                                  0
                                  • floris@freiburg.socialF floris@freiburg.social

                                    Vielleicht ist das ein Thema für die @lagedernation?

                                    lagedernation@chaos.socialL This user is from outside of this forum
                                    lagedernation@chaos.socialL This user is from outside of this forum
                                    lagedernation@chaos.social
                                    schrieb zuletzt editiert von
                                    #96

                                    @floris
                                    @osm_tech Hi, please get in touch, we've covered OSM many times before and would love to learn more: team a lagedernation org

                                    1 Antwort Letzte Antwort
                                    0
                                    • osm_tech@en.osm.townO osm_tech@en.osm.town

                                      If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                                      fantafanta@mastodon.socialF This user is from outside of this forum
                                      fantafanta@mastodon.socialF This user is from outside of this forum
                                      fantafanta@mastodon.social
                                      schrieb zuletzt editiert von
                                      #97

                                      @osm_tech Interesting. Perhaps we could follow-up via e-mail or DM? alexander.fanta@ftm.nl

                                      1 Antwort Letzte Antwort
                                      0
                                      • osm_tech@en.osm.townO osm_tech@en.osm.town

                                        If you write about the messy reality behind "free" internet services: we're seeing #OpenStreetMap hammered by scrapers hiding behind residential proxy/embedded-SDK networks. We're a volunteer-run service and the costs are real. We'd love to talk to a journalist about what we're seeing + how we're responding. #AI #Bots #Abuse

                                        ea5iyl@mastodon.radioE This user is from outside of this forum
                                        ea5iyl@mastodon.radioE This user is from outside of this forum
                                        ea5iyl@mastodon.radio
                                        schrieb zuletzt editiert von
                                        #98

                                        @osm_tech Wow. TIL that software development kits are more or less silently embedding internet scrapers in (unrelated) end-user applications to distribute AI data scraping across residential addresses and therefore be harder to defend against.
                                        Hey, Tim, were you expecting stuff like this 35 years down the line?

                                        1 Antwort Letzte Antwort
                                        0
                                        • osm_tech@en.osm.townO osm_tech@en.osm.town

                                          @BalooUriza We use fail2ban to handle some of this with custom rules, but eventually fail2ban becomes a bottleneck after 100,000 IP addresses.

                                          mnalis@mastodon.onlineM This user is from outside of this forum
                                          mnalis@mastodon.onlineM This user is from outside of this forum
                                          mnalis@mastodon.online
                                          schrieb zuletzt editiert von
                                          #99

                                          @osm_tech @BalooUriza is it using ipset hashsets, or default rule-per-ip rules? raw namespace or? I don't know the details of implementation, but if it is L7 load that is problematic (instead of pure bandwidth DDoS), it might be worth to consider whitelisting instead. I.e. whitelist addresses (or /24s) that have *not* had excessive requests lately, and put them in priority network bucket, and the rest (which is not blacklisted) goes in best-effort bucket (to maybe migrate to whitelist later)

                                          1 Antwort Letzte Antwort
                                          0
                                          Antworten
                                          • In einem neuen Thema antworten
                                          Anmelden zum Antworten
                                          • Älteste zuerst
                                          • Neuste zuerst
                                          • Meiste Stimmen



                                          Copyright (c) 2025 abSpecktrum (@abspecklog@fedimonster.de)

                                          Erstellt mit Schlaflosigkeit, Kaffee, Brokkoli & ♥

                                          Impressum | Datenschutzerklärung | Nutzungsbedingungen

                                          • Anmelden

                                          • Du hast noch kein Konto? Registrieren

                                          • Anmelden oder registrieren, um zu suchen
                                          • Erster Beitrag
                                            Letzter Beitrag
                                          0
                                          • Home
                                          • Aktuell
                                          • Tags
                                          • Über dieses Forum