Mastodon Skip to content
  • Home
  • Aktuell
  • Tags
  • Über dieses Forum
Einklappen
Grafik mit zwei überlappenden Sprechblasen, eine grün und eine lila.
Abspeckgeflüster – Forum für Menschen mit Gewicht(ung)

Kostenlos. Werbefrei. Menschlich. Dein Abnehmforum.

  1. Home
  2. Uncategorized
  3. I *CANNOT WAIT* until we see this and other strings hit all these “Agentic SOC" environments.

I *CANNOT WAIT* until we see this and other strings hit all these “Agentic SOC" environments.

Geplant Angeheftet Gesperrt Verschoben Uncategorized
110 Beiträge 31 Kommentatoren 0 Aufrufe
  • Älteste zuerst
  • Neuste zuerst
  • Meiste Stimmen
Antworten
  • In einem neuen Thema antworten
Anmelden zum Antworten
Dieses Thema wurde gelöscht. Nur Nutzer mit entsprechenden Rechten können es sehen.
  • viss@mastodon.socialV This user is from outside of this forum
    viss@mastodon.socialV This user is from outside of this forum
    viss@mastodon.social
    schrieb am zuletzt editiert von
    #45

    @cR0w @bruce @hrbrmstr you would have to hide it in the image file format. jpegs and pngs can have some meta data fields available >:D

    kajer@infosec.exchangeK bruce@darkmoon.socialB 2 Antworten Letzte Antwort
    0
    • viss@mastodon.socialV viss@mastodon.social

      @cR0w @bruce @hrbrmstr you would have to hide it in the image file format. jpegs and pngs can have some meta data fields available >:D

      kajer@infosec.exchangeK This user is from outside of this forum
      kajer@infosec.exchangeK This user is from outside of this forum
      kajer@infosec.exchange
      schrieb am zuletzt editiert von
      #46

      @Viss @cR0w @bruce @hrbrmstr

      some fields are stripped for "safety." so trust, but verify.

      1 Antwort Letzte Antwort
      0
      • viss@mastodon.socialV viss@mastodon.social

        @cR0w @bruce @hrbrmstr you would have to hide it in the image file format. jpegs and pngs can have some meta data fields available >:D

        bruce@darkmoon.socialB This user is from outside of this forum
        bruce@darkmoon.socialB This user is from outside of this forum
        bruce@darkmoon.social
        schrieb am zuletzt editiert von
        #47

        @Viss @cR0w @hrbrmstr

        I wonder if using the killstring as a file name would work

        viss@mastodon.socialV 1 Antwort Letzte Antwort
        0
        • bruce@darkmoon.socialB bruce@darkmoon.social

          @Viss @cR0w @hrbrmstr

          I wonder if using the killstring as a file name would work

          viss@mastodon.socialV This user is from outside of this forum
          viss@mastodon.socialV This user is from outside of this forum
          viss@mastodon.social
          schrieb am zuletzt editiert von
          #48

          @bruce @cR0w @hrbrmstr im sure it would. llms are notoriously bad at separating instructions from output/content.

          1 Antwort Letzte Antwort
          0
          • viss@mastodon.socialV viss@mastodon.social

            @kajer @cR0w @dogfox @hotsoup @hrbrmstr

            fuck .. this could be like the gits:sac laughing man image.

            kajer@infosec.exchangeK This user is from outside of this forum
            kajer@infosec.exchangeK This user is from outside of this forum
            kajer@infosec.exchange
            schrieb am zuletzt editiert von
            #49

            @Viss @cR0w @dogfox @hotsoup @hrbrmstr

            HTTP headers!!!

            1 Antwort Letzte Antwort
            0
            • viss@mastodon.socialV viss@mastodon.social

              @hrbrmstr @cR0w stuff it into exif fields too

              tim_lavoie@cosocial.caT This user is from outside of this forum
              tim_lavoie@cosocial.caT This user is from outside of this forum
              tim_lavoie@cosocial.ca
              schrieb am zuletzt editiert von
              #50

              @Viss @hrbrmstr @cR0w Sort of like how people used to put NSA-bait words into email and web headers?

              viss@mastodon.socialV 1 Antwort Letzte Antwort
              0
              • tim_lavoie@cosocial.caT tim_lavoie@cosocial.ca

                @Viss @hrbrmstr @cR0w Sort of like how people used to put NSA-bait words into email and web headers?

                viss@mastodon.socialV This user is from outside of this forum
                viss@mastodon.socialV This user is from outside of this forum
                viss@mastodon.social
                schrieb am zuletzt editiert von
                #51

                @tim_lavoie @hrbrmstr @cR0w and this: https://www.securemac.com/apple/emoji-issue-create-crashes-older-ios-versions

                and this search turned up even more occourences of the same, hahahaha they didnt even fix it, it just morphed

                tim_lavoie@cosocial.caT 1 Antwort Letzte Antwort
                0
                • wolke@mastodon.wolkenheim.euW wolke@mastodon.wolkenheim.eu

                  @Viss @hotsoup @kajer @hrbrmstr @cR0w
                  Probably that wouldn't be hard to do oneself, since Iocaine can work with custom word lists right?

                  wolke@mastodon.wolkenheim.euW This user is from outside of this forum
                  wolke@mastodon.wolkenheim.euW This user is from outside of this forum
                  wolke@mastodon.wolkenheim.eu
                  schrieb am zuletzt editiert von
                  #52

                  @Viss @hotsoup @kajer @hrbrmstr @cR0w
                  Shouldn't theoretically lead instructions to do something illegal lead to a similar block? What if one took a sentence that sounded like it wants the LLM to do something illegal and put it into texts? Wouldn't that also trigger such blocks in agents?

                  kajer@infosec.exchangeK viss@mastodon.socialV 2 Antworten Letzte Antwort
                  0
                  • wolke@mastodon.wolkenheim.euW wolke@mastodon.wolkenheim.eu

                    @Viss @hotsoup @kajer @hrbrmstr @cR0w
                    Shouldn't theoretically lead instructions to do something illegal lead to a similar block? What if one took a sentence that sounded like it wants the LLM to do something illegal and put it into texts? Wouldn't that also trigger such blocks in agents?

                    kajer@infosec.exchangeK This user is from outside of this forum
                    kajer@infosec.exchangeK This user is from outside of this forum
                    kajer@infosec.exchange
                    schrieb am zuletzt editiert von
                    #53

                    @Wolke @Viss @hotsoup @hrbrmstr @cR0w

                    Well now we are looking for deterministic results with a fancy (p)RNG

                    1 Antwort Letzte Antwort
                    0
                    • wolke@mastodon.wolkenheim.euW wolke@mastodon.wolkenheim.eu

                      @Viss @hotsoup @kajer @hrbrmstr @cR0w
                      Shouldn't theoretically lead instructions to do something illegal lead to a similar block? What if one took a sentence that sounded like it wants the LLM to do something illegal and put it into texts? Wouldn't that also trigger such blocks in agents?

                      viss@mastodon.socialV This user is from outside of this forum
                      viss@mastodon.socialV This user is from outside of this forum
                      viss@mastodon.social
                      schrieb am zuletzt editiert von
                      #54

                      @Wolke @hotsoup @kajer @hrbrmstr @cR0w none of that 'plumbing' exists in how an llm works.

                      here:

                      wolke@mastodon.wolkenheim.euW 1 Antwort Letzte Antwort
                      0
                      • wolke@mastodon.wolkenheim.euW This user is from outside of this forum
                        wolke@mastodon.wolkenheim.euW This user is from outside of this forum
                        wolke@mastodon.wolkenheim.eu
                        schrieb am zuletzt editiert von
                        #55

                        @cR0w @Viss @bruce @hrbrmstr
                        On bare mastodon it is not possible as far as Wolke knows. There are forks of Masto though, which implement it (Chuckya for example). Maybe it will get upstreamed at some point ...

                        1 Antwort Letzte Antwort
                        0
                        • viss@mastodon.socialV viss@mastodon.social

                          @Wolke @hotsoup @kajer @hrbrmstr @cR0w none of that 'plumbing' exists in how an llm works.

                          here:

                          wolke@mastodon.wolkenheim.euW This user is from outside of this forum
                          wolke@mastodon.wolkenheim.euW This user is from outside of this forum
                          wolke@mastodon.wolkenheim.eu
                          schrieb am zuletzt editiert von
                          #56

                          @Viss @hotsoup @kajer @hrbrmstr @cR0w
                          Yeah Wolke knows, that the LLMs themselves just generate shit without understanding it, but most corps try to filter that shit, so Wolke thought maybe that would be another way to also make agents stop working.

                          1 Antwort Letzte Antwort
                          0
                          • viss@mastodon.socialV viss@mastodon.social

                            @hrbrmstr @cR0w also there was a site i saw last week that let you stuff arbitrary text into email b64 encoding fields for stuff like images, i bet it would work well there too

                            defractal@infosec.exchangeD This user is from outside of this forum
                            defractal@infosec.exchangeD This user is from outside of this forum
                            defractal@infosec.exchange
                            schrieb am zuletzt editiert von
                            #57

                            @Viss @hrbrmstr @cR0w I wonder whether embedding it in the low bits of the pixels would work too. Or embedding it through image scaling.

                            1 Antwort Letzte Antwort
                            0
                            • viss@mastodon.socialV viss@mastodon.social

                              @hotsoup @kajer @hrbrmstr @cR0w 100% effective

                              defractal@infosec.exchangeD This user is from outside of this forum
                              defractal@infosec.exchangeD This user is from outside of this forum
                              defractal@infosec.exchange
                              schrieb am zuletzt editiert von
                              #58

                              @Viss @hotsoup @kajer @hrbrmstr @cR0w I wonder how faint it could be, and whether it would work as a video watermark.

                              defractal@infosec.exchangeD 1 Antwort Letzte Antwort
                              0
                              • neurovagrant@masto.deoan.orgN neurovagrant@masto.deoan.org

                                @Viss @hrbrmstr "and this is why we've screamed about sanitizing your inputs for decades."

                                hrbrmstr@mastodon.socialH This user is from outside of this forum
                                hrbrmstr@mastodon.socialH This user is from outside of this forum
                                hrbrmstr@mastodon.social
                                schrieb am zuletzt editiert von
                                #59

                                @neurovagrant @Viss what i do in the privacy of my own…

                                oh, wait. you meant forms and stuff…

                                nvrmnd

                                1 Antwort Letzte Antwort
                                0
                                • cr0w@infosec.exchangeC cr0w@infosec.exchange

                                  @hotsoup @kajer @Viss @hrbrmstr

                                  hrbrmstr@mastodon.socialH This user is from outside of this forum
                                  hrbrmstr@mastodon.socialH This user is from outside of this forum
                                  hrbrmstr@mastodon.social
                                  schrieb am zuletzt editiert von
                                  #60

                                  @cR0w @hotsoup @kajer @Viss this has vapor locked Claude Desktop.

                                  This was the “pipeline DoS" attack I had in mind. Break the entire system.

                                  So the “agentic SOC" just “goes into vapor lock" until someone notices. wow.

                                  epic_null@infosec.exchangeE 1 Antwort Letzte Antwort
                                  0
                                  • viss@mastodon.socialV viss@mastodon.social

                                    @hotsoup @kajer @hrbrmstr @cR0w 100% effective

                                    hrbrmstr@mastodon.socialH This user is from outside of this forum
                                    hrbrmstr@mastodon.socialH This user is from outside of this forum
                                    hrbrmstr@mastodon.social
                                    schrieb am zuletzt editiert von
                                    #61

                                    @Viss @hotsoup @kajer @cR0w it vapor locked Opus

                                    1 Antwort Letzte Antwort
                                    0
                                    • fritzadalis@infosec.exchangeF This user is from outside of this forum
                                      fritzadalis@infosec.exchangeF This user is from outside of this forum
                                      fritzadalis@infosec.exchange
                                      schrieb am zuletzt editiert von
                                      #62

                                      @cR0w @kajer @Viss @hrbrmstr
                                      AICAR

                                      badsamurai@infosec.exchangeB tychotithonus@infosec.exchangeT phil_b_reed@mastodon.socialP 3 Antworten Letzte Antwort
                                      0
                                      • viss@mastodon.socialV viss@mastodon.social

                                        @hotsoup @kajer @hrbrmstr @cR0w now the real delicious question is: do all the other frontier models also have killstrings, and what are they?

                                        hrbrmstr@mastodon.socialH This user is from outside of this forum
                                        hrbrmstr@mastodon.socialH This user is from outside of this forum
                                        hrbrmstr@mastodon.social
                                        schrieb am zuletzt editiert von
                                        #63

                                        @Viss @hotsoup @kajer @cR0w they're called "Magic Tokens" and they ALL have them and they do various things.

                                        Which means they ultimately control your pipeline.

                                        I rly want to know what's baked into Qwen b/c you know China is gonna send kill switches.

                                        1 Antwort Letzte Antwort
                                        0
                                        • viss@mastodon.socialV viss@mastodon.social

                                          @hotsoup @kajer @hrbrmstr @cR0w 100% effective

                                          hrbrmstr@mastodon.socialH This user is from outside of this forum
                                          hrbrmstr@mastodon.socialH This user is from outside of this forum
                                          hrbrmstr@mastodon.social
                                          schrieb am zuletzt editiert von
                                          #64

                                          @Viss @hotsoup @kajer @cR0w oh it recovered from crashing the pipeline and eventually got the safety dialog. nice.

                                          1 Antwort Letzte Antwort
                                          0
                                          Antworten
                                          • In einem neuen Thema antworten
                                          Anmelden zum Antworten
                                          • Älteste zuerst
                                          • Neuste zuerst
                                          • Meiste Stimmen



                                          Copyright (c) 2025 abSpecktrum (@abspecklog@fedimonster.de)

                                          Erstellt mit Schlaflosigkeit, Kaffee, Brokkoli & ♥

                                          Impressum | Datenschutzerklärung | Nutzungsbedingungen

                                          • Anmelden

                                          • Du hast noch kein Konto? Registrieren

                                          • Anmelden oder registrieren, um zu suchen
                                          • Erster Beitrag
                                            Letzter Beitrag
                                          0
                                          • Home
                                          • Aktuell
                                          • Tags
                                          • Über dieses Forum