Abspeckgeflüster – a forum for people with weight(ing)

We've built our own text-to-speech system with an initial English language model we trained ourselves with fully open source data.

42 posts, 28 commenters, 0 views
  • grapheneos@grapheneos.social wrote:

    Existing implementations of text-to-speech and speech-to-text didn't meet our functionality or usability requirements. We want at least very high quality, low latency and robust implementations of both for English included in the OS. It will help make GrapheneOS more accessible.

    tchambers@indieweb.social wrote (#21):

    @GrapheneOS Fascinating. Is the text-to-speech (and vice versa) model and code you're working on platform specific?

    • grapheneos@grapheneos.social wrote:

      Our full time developer working on this already built their own Transcribro app for on-device speech-to-text available in the Accrescent app store. For GrapheneOS itself, we want actual open source implementations of these features rather than OpenAI's phony open source though.

      hipsterelectron@circumstances.run wrote (#22):

      @GrapheneOS i was really impressed with the efficacy and UI of transcribro. no surprise to hear that was the mark of a grapheneos app

      • grapheneos@grapheneos.social wrote:

        Whisper is actually closed source. Open weights is another way of saying permissively licensed closed source. Our implementation of both text-to-speech and speech-to-text will be actual open source which means people can actually fork it and add/change/remove training data, etc.

        hipsterelectron@circumstances.run wrote (#23):

        @GrapheneOS the "largeness" of language models is precisely a measure of the difficulty to reproduce them. this methodology has some similarities to something i proposed to huggingface a few years back in a cover letter. no surprise to see they were not interested in reproducibility or the scientific method

          grapheneos@grapheneos.social wrote (#24):

          @tchambers It's not really platform specific. It currently runs on the CPU but we plan to add TPU support for Tensor and NPU support for Snapdragon in the future. It's made for GrapheneOS and we're not interested in doing any significant work on use outside of GrapheneOS. It will be possible to install it from our App Store on other Android 16+ operating systems but it's not our focus. We're focused on making GrapheneOS better and haven't gotten much out of making stuff available elsewhere.

          • grapheneos@grapheneos.social wrote:

            We've built our own text-to-speech system with an initial English language model we trained ourselves with fully open source data. It will be added to our App Store soon and then included in GrapheneOS as a default enabled TTS backend once some more improvements are made to it.

            4a4a0ea6f24fe54ca08a20f5ada65e42efdb692f6b8912ff6c1e521c024afa61@mostr.pub wrote (#25):
            that is awesome.

            how long, if ever, until we have a stable terminal app that can be run from any user profile?

              hipsterelectron@circumstances.run wrote (#26):

              @GrapheneOS i have also been trying to find similarly motivated people to collaborate with on a research project to reproduce the fawkes facial recognition poisoner upon a mobile device (ideally as an asynchronous but fully local image postprocessing technique) cc @xyhhx @bunnyhero

                hipsterelectron@circumstances.run wrote (#27):

                @GrapheneOS @xyhhx @bunnyhero i have been putting it off repeatedly but the fawkes paper itself is very high quality and imo intended to be reproduced. if there are resources your team has developed or considered regarding modern hardware on mobile phones for statistical training and inference (fawkes especially requires a training step with local user input iirc) it would be tremendously helpful for our goals here.

                  hipsterelectron@circumstances.run wrote (#28):

                  @GrapheneOS @xyhhx @bunnyhero we obviously expect reduced efficacy vs the SANDlab implementation with GPU acceleration but the math and the code are both very approachable and since its publication we have seen phones add specific "NPU" chips for matmul/etc and this would be a fun way to subvert the utility of "AI" ubiquitization to embed panoptic surveillance

                    king_of_ooo@defcon.social wrote (#29):

                    @GrapheneOS I replied to one of your posts a couple months ago when yall asked about TTS, suggesting Piper TTS models (https://github.com/OHF-Voice/piper1-gpl). There are def some quality (English) and performant models, though I haven't dug into whether they are truly open source (aka open dataset) or just open weights.

                    Either way, I am very excited to see more projects by gOS and more quality options in the TTS & STT spaces. People with disabilities deserve equal access to technology, and anything that brings us closer to a world where that is possible is a good thing.

                      wass47@mastodon.social wrote (#30):

                      @GrapheneOS happy to help with French

                      • grapheneos@grapheneos.social wrote:

                        We're going to build our own speech-to-text implementation to go along with this too. We're starting with an English model for both but we can add other languages which have high quality training data available. English and Mandarin have by far the most training data available.

                        malmen@masto.pt wrote (#31):

                        @GrapheneOS will this enable speech commands on Android Auto?

                          topcaser@mastodontech.de wrote (#32):

                          @GrapheneOS please, please add German to that list

                            emberfox@bark.lgbt wrote (#33):

                            @GrapheneOS I am very excited to not have to use an external tool to do this anymore.

                              lunareclipse@snug.moe wrote (#34):

                              @GrapheneOS highly interested in seeing high quality open source TTS/STT, great work!

                                catsalad@infosec.exchange wrote (#35):

                                @GrapheneOS How good is this model at meowing?

                                  float13@masto.hackers.town wrote (#36):

                                  @catsalad @GrapheneOS

                                    catsalad@infosec.exchange wrote (#37):

                                    @float13 @GrapheneOS

                                      detietsch@mastodon.social wrote (#38):

                                      @GrapheneOS Wow! This is great! Good work, GOS team 👍

                                        tmakarios@theres.life wrote (#39):

                                        @GrapheneOS
                                        This is great news! I'll be interested to find out how well your English speech-to-text model copes with non-rhotic accents. (Think of a posh British "received pronunciation" accent, or anyone from southeast England, or Australia or New Zealand.) Currently, if I want something like Dicio to understand me, I have to put on an American accent, which my wife says is "creepy".

                                          dei@is.nota.live wrote (#40):

                                          @GrapheneOS Perfect timing! Since December 2025, Google text-to-speech no longer works without exploit protection compatibility mode.

                                          Copyright (c) 2025 abSpecktrum (@abspecklog@fedimonster.de)
