Mastodon Skip to content
  • Home
  • Aktuell
  • Tags
  • Über dieses Forum
Einklappen
Grafik mit zwei überlappenden Sprechblasen, eine grün und eine lila.
Abspeckgeflüster – Forum für Menschen mit Gewicht(ung)

Kostenlos. Werbefrei. Menschlich. Dein Abnehmforum.

  1. Home
  2. Uncategorized
  3. A few days ago, a client’s data center "vanished" overnight.

A few days ago, a client’s data center "vanished" overnight.

Geplant Angeheftet Gesperrt Verschoben Uncategorized
sysadminhorrorstoriesithorrorstoriesmonitoring
111 Beiträge 46 Kommentatoren 0 Aufrufe
  • Älteste zuerst
  • Neuste zuerst
  • Meiste Stimmen
Antworten
  • In einem neuen Thema antworten
Anmelden zum Antworten
Dieses Thema wurde gelöscht. Nur Nutzer mit entsprechenden Rechten können es sehen.
  • _elena@mastodon.social_ _elena@mastodon.social

    @EnigmaRotor reading this at lunch in a cafe near my house and I keep chuckling and smiling from ear to ear. @stefano is such a treasure 🙌🏆

    ozoned@btfree.socialO This user is from outside of this forum
    ozoned@btfree.socialO This user is from outside of this forum
    ozoned@btfree.social
    schrieb zuletzt editiert von
    #48

    @_elena@mastodon.social When you direct the movie, can I star as the legendary @stefano@mastodon.bsd.cafe ​?

    _elena@mastodon.social_ stefano@mastodon.bsd.cafeS 2 Antworten Letzte Antwort
    0
    • bojanlandekic@mastodon.socialB bojanlandekic@mastodon.social

      @stefano so refreshing to read a quality tech tale on Mastodon. Thanks for sharing!

      stefano@mastodon.bsd.cafeS This user is from outside of this forum
      stefano@mastodon.bsd.cafeS This user is from outside of this forum
      stefano@mastodon.bsd.cafe
      schrieb zuletzt editiert von
      #49

      @bojanlandekic thank you! I'm just trying to spread some real life experiences

      bojanlandekic@mastodon.socialB 1 Antwort Letzte Antwort
      0
      • jamesoff@mastodon.jamesoff.netJ jamesoff@mastodon.jamesoff.net

        @rhoot @stefano I have my cronjob scripts touch a file as their final action and my monitoring stuff alarms if the file is too old

        randomized@masto.bikeR This user is from outside of this forum
        randomized@masto.bikeR This user is from outside of this forum
        randomized@masto.bike
        schrieb zuletzt editiert von
        #50

        @jamesoff
        I have
        my backup scripts write their return code in a file.

        I monitor file content and mtime, get an alert if content not 0 or file too old

        I also regularly manually test backup restore.

        Then I can sleep

        @rhoot @stefano

        jamesoff@mastodon.jamesoff.netJ 1 Antwort Letzte Antwort
        0
        • neurovagrant@masto.deoan.orgN neurovagrant@masto.deoan.org

          @stefano This is such a good, if niche, example of "paying attention to the fundamentals and the alerts covers all sorts of things you'd never imagine happening."

          Thanks for sharing.

          stefano@mastodon.bsd.cafeS This user is from outside of this forum
          stefano@mastodon.bsd.cafeS This user is from outside of this forum
          stefano@mastodon.bsd.cafe
          schrieb zuletzt editiert von
          #51

          @neurovagrant thank you! My rule is: we need moooarr alerts, as you never know how and when (not if - we know it will happen) your alertil system will break.

          1 Antwort Letzte Antwort
          0
          • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

            A few days ago, a client’s data center "vanished" overnight. My monitoring showed that all devices were unreachable. Not even the ISP routers responded, so I assumed a sudden connectivity drop. The strange part? Not even via 4G.

            I then suspected a power failure, but the UPS should have sent an alert.

            The office was closed for the holidays, but I contacted the IT manager anyway. He was home sick with a serious family issue, but he got moving.

            To make a long story short: the company deals in gold and precious metals. They have an underground bunker with two-meter thick walls. They were targeted by a professional gang. They used a tactic seen in similar hits: they identify the main power line, tamper with it at night, and send a massive voltage spike through it.

            The goal is to fry all alarm and surveillance systems. Even if battery-backed, they rarely survive a surge like that. Thieves count on the fact that during holidays, owners are away and fried systems can't send alerts. Monitoring companies often have reduced staff and might not notice the "silence" immediately.

            That is exactly what happened here. But there is a "but": they didn't account for my Uptime Kuma instance monitoring their MikroTik router, installed just weeks ago. Since it is an external check, it flagged the lack of response from all IPs without needing an internal alert to be triggered from the inside.

            The team rushed to the site and found the mess. Luckily, they found an emergency electrical crew to bypass the damage and restore the cameras and alarms. They swapped the fried server UPS with a spare and everything came back up.

            The police warned that the chances of the crew returning the next night to "finish" the job were high, though seeing the systems back online would likely make them move on. They also warned that thieves sometimes break in just to destroy servers to wipe any video evidence.

            Nothing happened in the end. But in the meantime, I had to sync all their data off-site (thankfully they have dual 1Gbps FTTH), set up an emergency cluster, and ensure everything was redundant.

            Never rely only on internal monitoring. Never.

            #IT #SysAdmin #HorrorStories #ITHorrorStories #Monitoring

            stux@mstdn.socialS This user is from outside of this forum
            stux@mstdn.socialS This user is from outside of this forum
            stux@mstdn.social
            schrieb zuletzt editiert von
            #52

            @stefano Great job!

            This is why is always run up time on different servers in other places!

            Perfect!

            stefano@mastodon.bsd.cafeS 1 Antwort Letzte Antwort
            0
            • _elena@mastodon.social_ _elena@mastodon.social

              @EnigmaRotor reading this at lunch in a cafe near my house and I keep chuckling and smiling from ear to ear. @stefano is such a treasure 🙌🏆

              stefano@mastodon.bsd.cafeS This user is from outside of this forum
              stefano@mastodon.bsd.cafeS This user is from outside of this forum
              stefano@mastodon.bsd.cafe
              schrieb zuletzt editiert von
              #53

              @_elena @EnigmaRotor thank you!

              1 Antwort Letzte Antwort
              0
              • ozoned@btfree.socialO ozoned@btfree.social

                @_elena@mastodon.social When you direct the movie, can I star as the legendary @stefano@mastodon.bsd.cafe ​?

                _elena@mastodon.social_ This user is from outside of this forum
                _elena@mastodon.social_ This user is from outside of this forum
                _elena@mastodon.social
                schrieb zuletzt editiert von
                #54

                @ozoned @stefano maybe! Especially if it’s motivation enough for you to keep practicing your Italian! 😂 and definitely at the very least a cameo with a line from Spaceballs

                ozoned@btfree.socialO 1 Antwort Letzte Antwort
                0
                • ozoned@btfree.socialO ozoned@btfree.social

                  @_elena@mastodon.social When you direct the movie, can I star as the legendary @stefano@mastodon.bsd.cafe ​?

                  stefano@mastodon.bsd.cafeS This user is from outside of this forum
                  stefano@mastodon.bsd.cafeS This user is from outside of this forum
                  stefano@mastodon.bsd.cafe
                  schrieb zuletzt editiert von
                  #55

                  @ozoned @_elena 😆 sure, just continue to practice with your Italian

                  1 Antwort Letzte Antwort
                  0
                  • randomized@masto.bikeR randomized@masto.bike

                    @jamesoff
                    I have
                    my backup scripts write their return code in a file.

                    I monitor file content and mtime, get an alert if content not 0 or file too old

                    I also regularly manually test backup restore.

                    Then I can sleep

                    @rhoot @stefano

                    jamesoff@mastodon.jamesoff.netJ This user is from outside of this forum
                    jamesoff@mastodon.jamesoff.netJ This user is from outside of this forum
                    jamesoff@mastodon.jamesoff.net
                    schrieb zuletzt editiert von
                    #56

                    @randomized @rhoot @stefano how do you monitor your sleep 😛

                    randomized@masto.bikeR 1 Antwort Letzte Antwort
                    0
                    • _elena@mastodon.social_ _elena@mastodon.social

                      @stefano you’re a hero Stefano! As your Fedi friend and documentary filmmaker I hope I get preferential treatment when one of your amazing stories gets optioned for a film 🤗

                      stefano@mastodon.bsd.cafeS This user is from outside of this forum
                      stefano@mastodon.bsd.cafeS This user is from outside of this forum
                      stefano@mastodon.bsd.cafe
                      schrieb zuletzt editiert von
                      #57

                      @_elena Thank you! Sure, I will 👍
                      But, to be honest, I don't think any of those stories will ever be a film.

                      The big, most scary one is yet to come, anyway...

                      elaterite@mastoart.socialE 1 Antwort Letzte Antwort
                      0
                      • n_dimension@infosec.exchangeN n_dimension@infosec.exchange

                        @stefano

                        You are the hero I aspire to be!

                        lienrag@mastodon.tedomum.netL This user is from outside of this forum
                        lienrag@mastodon.tedomum.netL This user is from outside of this forum
                        lienrag@mastodon.tedomum.net
                        schrieb zuletzt editiert von
                        #58

                        @n_dimension

                        Why ?
                        It's a cool story for everyone that does network supervision, but why take sides there ?
                        The attackers probably worked more than the company owners for their gold, so why denying it to them would be a good thing ?

                        Even the sysadmin in https://xkcd.com/705/ only did his job, he didn't choose a side.

                        @stefano

                        1 Antwort Letzte Antwort
                        0
                        • stux@mstdn.socialS stux@mstdn.social

                          @stefano Great job!

                          This is why is always run up time on different servers in other places!

                          Perfect!

                          stefano@mastodon.bsd.cafeS This user is from outside of this forum
                          stefano@mastodon.bsd.cafeS This user is from outside of this forum
                          stefano@mastodon.bsd.cafe
                          schrieb zuletzt editiert von
                          #59

                          @stux thank you! Yes, that's a very wise approach. I have some internal and external monitoring tools. And the monitoring tools monitoring the monitoring tools, with different technologies (so a bug won't hit all the tools at the same time). Yet, I always feel I need moooarrr monitoring 🙂

                          1 Antwort Letzte Antwort
                          0
                          • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

                            @rhoot @jamesoff I have two different, remote monitors that monitor the monitor

                            mkj@social.mkj.earthM This user is from outside of this forum
                            mkj@social.mkj.earthM This user is from outside of this forum
                            mkj@social.mkj.earth
                            schrieb zuletzt editiert von
                            #60

                            @stefano But what monitors the monitor monitors? We need an audio technician in here, stat! 😉

                            @rhoot @jamesoff

                            jamesoff@mastodon.jamesoff.netJ 1 Antwort Letzte Antwort
                            0
                            • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

                              A few days ago, a client’s data center "vanished" overnight. My monitoring showed that all devices were unreachable. Not even the ISP routers responded, so I assumed a sudden connectivity drop. The strange part? Not even via 4G.

                              I then suspected a power failure, but the UPS should have sent an alert.

                              The office was closed for the holidays, but I contacted the IT manager anyway. He was home sick with a serious family issue, but he got moving.

                              To make a long story short: the company deals in gold and precious metals. They have an underground bunker with two-meter thick walls. They were targeted by a professional gang. They used a tactic seen in similar hits: they identify the main power line, tamper with it at night, and send a massive voltage spike through it.

                              The goal is to fry all alarm and surveillance systems. Even if battery-backed, they rarely survive a surge like that. Thieves count on the fact that during holidays, owners are away and fried systems can't send alerts. Monitoring companies often have reduced staff and might not notice the "silence" immediately.

                              That is exactly what happened here. But there is a "but": they didn't account for my Uptime Kuma instance monitoring their MikroTik router, installed just weeks ago. Since it is an external check, it flagged the lack of response from all IPs without needing an internal alert to be triggered from the inside.

                              The team rushed to the site and found the mess. Luckily, they found an emergency electrical crew to bypass the damage and restore the cameras and alarms. They swapped the fried server UPS with a spare and everything came back up.

                              The police warned that the chances of the crew returning the next night to "finish" the job were high, though seeing the systems back online would likely make them move on. They also warned that thieves sometimes break in just to destroy servers to wipe any video evidence.

                              Nothing happened in the end. But in the meantime, I had to sync all their data off-site (thankfully they have dual 1Gbps FTTH), set up an emergency cluster, and ensure everything was redundant.

                              Never rely only on internal monitoring. Never.

                              #IT #SysAdmin #HorrorStories #ITHorrorStories #Monitoring

                              darkling@mstdn.socialD This user is from outside of this forum
                              darkling@mstdn.socialD This user is from outside of this forum
                              darkling@mstdn.social
                              schrieb zuletzt editiert von
                              #61

                              @stefano About 15 years ago, the place I worked had a supercomputer. One night, the aircon in the machine room failed. The machine kept computing, and the temperature rose. It rose *quite a lot*.

                              Sadly, the first thing to fail from the heat was the core switch for the room. You know, the one that handles all of the network for everything in the room. Including the temperature alerts.

                              It was finally spotted about 8am when the security patrol wondered why the door shutters were so hot.

                              stefano@mastodon.bsd.cafeS 1 Antwort Letzte Antwort
                              0
                              • darkling@mstdn.socialD darkling@mstdn.social

                                @stefano About 15 years ago, the place I worked had a supercomputer. One night, the aircon in the machine room failed. The machine kept computing, and the temperature rose. It rose *quite a lot*.

                                Sadly, the first thing to fail from the heat was the core switch for the room. You know, the one that handles all of the network for everything in the room. Including the temperature alerts.

                                It was finally spotted about 8am when the security patrol wondered why the door shutters were so hot.

                                stefano@mastodon.bsd.cafeS This user is from outside of this forum
                                stefano@mastodon.bsd.cafeS This user is from outside of this forum
                                stefano@mastodon.bsd.cafe
                                schrieb zuletzt editiert von
                                #62

                                @darkling nice story! Unfortunately I had to manage something like that, too (A/C broken - switch dead, etc)

                                darkling@mstdn.socialD 1 Antwort Letzte Antwort
                                0
                                • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

                                  @darkling nice story! Unfortunately I had to manage something like that, too (A/C broken - switch dead, etc)

                                  darkling@mstdn.socialD This user is from outside of this forum
                                  darkling@mstdn.socialD This user is from outside of this forum
                                  darkling@mstdn.social
                                  schrieb zuletzt editiert von
                                  #63

                                  @stefano Fortunately, the only thing that did fail after the aircon was the switch. (And a pair of ear muffs which had been hanging on a metal rail -- they'd melted).

                                  The fire brigade turned up, checked everything, and ran some big positive pressure fans to get airflow through the room from one door to the other to cool everything down.

                                  robert@irrelevant.me.ukR 1 Antwort Letzte Antwort
                                  0
                                  • _elena@mastodon.social_ _elena@mastodon.social

                                    @ozoned @stefano maybe! Especially if it’s motivation enough for you to keep practicing your Italian! 😂 and definitely at the very least a cameo with a line from Spaceballs

                                    ozoned@btfree.socialO This user is from outside of this forum
                                    ozoned@btfree.socialO This user is from outside of this forum
                                    ozoned@btfree.social
                                    schrieb zuletzt editiert von
                                    #64

                                    @_elena@mastodon.social @stefano@mastodon.bsd.cafe ​How do I say "I knew it! I'm surrounded by assholes!" in Italian?

                                    1 Antwort Letzte Antwort
                                    0
                                    • jamesoff@mastodon.jamesoff.netJ jamesoff@mastodon.jamesoff.net

                                      @randomized @rhoot @stefano how do you monitor your sleep 😛

                                      randomized@masto.bikeR This user is from outside of this forum
                                      randomized@masto.bikeR This user is from outside of this forum
                                      randomized@masto.bike
                                      schrieb zuletzt editiert von
                                      #65

                                      @jamesoff
                                      Sport watch 😁
                                      @rhoot @stefano

                                      1 Antwort Letzte Antwort
                                      0
                                      • mkj@social.mkj.earthM mkj@social.mkj.earth

                                        @stefano But what monitors the monitor monitors? We need an audio technician in here, stat! 😉

                                        @rhoot @jamesoff

                                        jamesoff@mastodon.jamesoff.netJ This user is from outside of this forum
                                        jamesoff@mastodon.jamesoff.netJ This user is from outside of this forum
                                        jamesoff@mastodon.jamesoff.net
                                        schrieb zuletzt editiert von
                                        #66

                                        @mkj @stefano @rhoot oh if audio's getting involved, you can use `ping -a` 😄

                                        mkj@social.mkj.earthM 1 Antwort Letzte Antwort
                                        0
                                        • stefano@mastodon.bsd.cafeS stefano@mastodon.bsd.cafe

                                          A few days ago, a client’s data center "vanished" overnight. My monitoring showed that all devices were unreachable. Not even the ISP routers responded, so I assumed a sudden connectivity drop. The strange part? Not even via 4G.

                                          I then suspected a power failure, but the UPS should have sent an alert.

                                          The office was closed for the holidays, but I contacted the IT manager anyway. He was home sick with a serious family issue, but he got moving.

                                          To make a long story short: the company deals in gold and precious metals. They have an underground bunker with two-meter thick walls. They were targeted by a professional gang. They used a tactic seen in similar hits: they identify the main power line, tamper with it at night, and send a massive voltage spike through it.

                                          The goal is to fry all alarm and surveillance systems. Even if battery-backed, they rarely survive a surge like that. Thieves count on the fact that during holidays, owners are away and fried systems can't send alerts. Monitoring companies often have reduced staff and might not notice the "silence" immediately.

                                          That is exactly what happened here. But there is a "but": they didn't account for my Uptime Kuma instance monitoring their MikroTik router, installed just weeks ago. Since it is an external check, it flagged the lack of response from all IPs without needing an internal alert to be triggered from the inside.

                                          The team rushed to the site and found the mess. Luckily, they found an emergency electrical crew to bypass the damage and restore the cameras and alarms. They swapped the fried server UPS with a spare and everything came back up.

                                          The police warned that the chances of the crew returning the next night to "finish" the job were high, though seeing the systems back online would likely make them move on. They also warned that thieves sometimes break in just to destroy servers to wipe any video evidence.

                                          Nothing happened in the end. But in the meantime, I had to sync all their data off-site (thankfully they have dual 1Gbps FTTH), set up an emergency cluster, and ensure everything was redundant.

                                          Never rely only on internal monitoring. Never.

                                          #IT #SysAdmin #HorrorStories #ITHorrorStories #Monitoring

                                          lasseleegaard@mastodon.socialL This user is from outside of this forum
                                          lasseleegaard@mastodon.socialL This user is from outside of this forum
                                          lasseleegaard@mastodon.social
                                          schrieb zuletzt editiert von
                                          #67

                                          @stefano 10+ years ago i started volunteering at a festival. Everything was new that year including the small outdoor racks for the area field routers (Juniper MX80). They barely fit but we managed. The racks were left in the sun in the summer. It was only when we enabled Observium (LibreNMS predecessor) that graphs almost everything it gets from SNMP that we discovered the inlet temperature was getting close to 80 degrees C. #monitorallthethings

                                          stefano@mastodon.bsd.cafeS lasseleegaard@mastodon.socialL 2 Antworten Letzte Antwort
                                          0
                                          Antworten
                                          • In einem neuen Thema antworten
                                          Anmelden zum Antworten
                                          • Älteste zuerst
                                          • Neuste zuerst
                                          • Meiste Stimmen



                                          Copyright (c) 2025 abSpecktrum (@abspecklog@fedimonster.de)

                                          Erstellt mit Schlaflosigkeit, Kaffee, Brokkoli & ♥

                                          Impressum | Datenschutzerklärung | Nutzungsbedingungen

                                          • Anmelden

                                          • Du hast noch kein Konto? Registrieren

                                          • Anmelden oder registrieren, um zu suchen
                                          • Erster Beitrag
                                            Letzter Beitrag
                                          0
                                          • Home
                                          • Aktuell
                                          • Tags
                                          • Über dieses Forum