We've built our own text-to-speech system with an initial English language model we trained ourselves with fully open source data.

tchambers@indieweb.social

@GrapheneOS Fascinating is the text to speech and vice versa model and code you’re working on platform specific?

hipsterelectron@circumstances.run

@GrapheneOS i was really impressed with the efficacy and UI of transcribro. no surprise to hear that was the mark of a grapheneos app

hipsterelectron@circumstances.run

@GrapheneOS the "largeness" of language models is precisely a measure of the difficulty to reproduce them. this methodology has some similarities to something i proposed to huggingface a few years back in a cover letter. no surprise to see they were not interested in reproducibility or the scientific method

grapheneos@grapheneos.social

@tchambers It's not really platform specific. It currently runs on the CPU but we plan to add TPU support for Tensor and NPU support for Snapdragon in the future. It's made for GrapheneOS and we're not interested in doing any significant work on use outside of GrapheneOS. It will be possible to install it from our App Store on other Android 16+ operating systems but it's not our focus. We're focused on making GrapheneOS better and haven't gotten much out of making stuff available elsewhere.

grapheneos@grapheneos.social

that is awesome.

how far if ever until we have a stable terminal app that can be run from any user profile?

hipsterelectron@circumstances.run

@GrapheneOS i have also been trying to find similarly motivated people to collaborate with on a research project to reproduce the fawkes facial recognition poisoner upon a mobile device (ideally as an asynchronous but fully local image postprocessing technique) cc @xyhhx @bunnyhero

hipsterelectron@circumstances.run

@GrapheneOS @xyhhx @bunnyhero i have been putting it off repeatedly but the fawkes paper itself is very high quality and imo intended to be reproduced. if there are resources your team has developed or considered regarding modern hardware on mobile phones for statistical training and inference (fawkes especially requires a training step with local user input iirc) it would be tremendously helpful for our goals here.

hipsterelectron@circumstances.run

@GrapheneOS @xyhhx @bunnyhero we obviously expect reduced efficacy vs the SANDlab implementation with GPU acceleration but the math and the code are both very approachable and since its publication we have seen phones add specific "NPU" chips for matmul/etc and this would be a fun way to subvert the utility of "AI" ubiquitization to embed panoptic surveillance

king_of_ooo@defcon.social

@GrapheneOS I replied to one of your posts a couple months ago when yall asked about TTS, suggesting Piper TTS models (https://github.com/OHF-Voice/piper1-gpl). There are def some quality (English) and performant models, though I haven't dug into whether they are truly open source (aka open dataset) or just open weights.

Either way, I am very excited to see more projects by gOS and more quality options in the TTS & STT spaces. People with disabilities deserve equal access to technology, and anything that brings us closer to a world were that is possible is a good thing.

wass47@mastodon.social

@GrapheneOS happy to help with French

malmen@masto.pt

@GrapheneOS will this enable speach commands on android auto?

topcaser@mastodontech.de

@GrapheneOS please, please add German to that list

emberfox@bark.lgbt

@GrapheneOS I am very excited to not have to use an external tool to do this anymore.

lunareclipse@snug.moe

@GrapheneOS highly interested in seeing high quality open source TTS/STT, great work!

catsalad@infosec.exchange

@GrapheneOS How good is this model at meowing?

float13@masto.hackers.town

@catsalad @GrapheneOS

catsalad@infosec.exchange

@float13 @GrapheneOS

detietsch@mastodon.social

@GrapheneOS wow ! this is great ! Good Work GOS-Team

tmakarios@theres.life

@GrapheneOS
This is great news! I'll be interested to find out how well your English speech-to-text model copes with non-rhotic accents. (Think of a posh British "received pronunciation" accent, or anyone from southeast England, or Australia or New Zealand.) Currently, if I want something like Dicio to understand me, I have to put on an American accent, which my wife says is "creepy".

dei@is.nota.live

@GrapheneOS Perfect timing! Since december 2025 google text to speach is not working anymore without Exploit protection compatibility mode.

Abspeckgeflüster – Forum für Menschen mit Gewicht(ung)

We've built our own text-to-speech system with an initial English language model we trained ourselves with fully open source data.