• taladar@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    11
    ·
    5 hours ago

    Most of these languages dont even have enough professional voice actors to cover the bandwidth.

    And you think anyone is training AI voice models for those languages? Have you even seen how long it takes even large companies like Google to support the languages with hundreds of millions of speakers?

    • JohnEdwa@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      1 hour ago

      That’s the benefit of using AI and machine learning - once you have enough source material, you can throw it all in and it’ll eventually spit out a model.
      Which is exactly what Meta did with their Massively Multilingual Speech project which supports text-to-speech and speech-to-text for 1107 different languages.

      Is it actually any good in 99% of them, I don’t have a clue, but it exists.

      • taladar@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        1
        ·
        57 minutes ago

        Seems more like a proof of concept project for that paper than something they are pursuing seriously judging by the GitHub location in some example folder that hasn’t seen any significant updates in over a year. If it is so great I would assume they would pursue it more actively and replace existing models with it two years later.

    • Dr. Moose@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      4
      ·
      4 hours ago

      It becomes easier and cheaper every day. Today’s open source LLMs are better than last year’s best model.

      • Jhex@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 hour ago

        Is it? I just tried again yesterday for a simple script since coding is the one thing apparently AI will replace people like me and it could not put together a working JavaScript script.

        I have yet to see tangible results not announced by the people with sunken cost exploding their balls.

      • ExperiencedWinter@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        4 hours ago

        You’re fundamentally misunderstanding the comment you replied to, they are not saying that voice AI are bad, they are saying there is not enough training data to improve the AI for these languages. How will it improve without good training data?