Most AI translation tools rely on cloud services.

Audio leaves your device, gets processed elsewhere, and comes back translated.

As open speech recognition, translation, and TTS models continue to improve, it feels increasingly possible to build communication tools that run on infrastructure users actually control.

That’s one of the ideas behind PolyTalk, an open-source translation platform we’re building.

Privacy, ownership, and transparency may soon matter as much as model quality.

Do you think communication tools like translation, transcription, and speech interfaces will eventually move back toward local and self-hosted deployments?

GitHub: https://github.com/PolyTalkIO/polytalk

  • CrypticCoffee@lemmy.ml
    2 days ago

    Got a complaint that this is an ad. We don’t really have a rule against that and I’d assume in an open source community, sharing repos and projects is part of it. There is a repo link.

    I’ll allow it. Discuss as you will.

  • thingsiplay@lemmy.ml
    2 days ago

    Post title is clickbait. It should contain the name of the program. Really this is not YouTube… Also why is every sentence its own paragraph?

    Sorry for being this negative, nothing against the project, just against this post that looks like something out of a marketing team to me.

    • PolyTalk_BizzAppDev@lemmy.world (OP)
      1 day ago

      Fair point. I was trying to focus on the broader topic rather than lead with the project, but I can see why that might come across as marketing-style framing.

  • anamethatisnt@sopuli.xyz
    2 days ago

    There are a ton of great self-hosted tools for TTS and similar interfaces.
    I used https://github.com/resemble-ai/chatterbox to make my own voice read my epubs, albeit with an American accent, which I definitely don’t have in real life. It was close enough to put the voice in the uncanny valley, according to my wife.

    I think most end users will go for a cloud app or website for their needs, though; playing around with self-hosting isn’t for everyone.