• Dr. Moose@lemmy.world
    link
    fedilink
    English
    arrow-up
    14
    arrow-down
    7
    ·
    edit-2
    6 hours ago

    This is clearly the future despite the outrage here.

    There are at least 389 living languages with over 1M speakers. That alone means it’s impossible to reach some people and they get left out. Most of these languages dont even have enough professional voice actors to cover the bandwidth.

    There are thousands of books released every year. That’s impossible to cover even in English alone.

    Its an objective net good to have more accessible audio books and the privileged people who do care about this stuff can very much afford to vote with their wallets for non-ai voices.

    In fact since AI moat is so minimal this will very quickly be adapted by open source solution providing audio book access to millions if not billions of people to whom this was not an option. Its amazing.

    • taladar@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      7
      ·
      2 hours ago

      Most of these languages dont even have enough professional voice actors to cover the bandwidth.

      And you think anyone is training AI voice models for those languages? Have you even seen how long it takes even large companies like Google to support the languages with hundreds of millions of speakers?

      • Dr. Moose@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        21 minutes ago

        It becomes easier and cheaper every day. Today’s open source LLMs are better than last year’s best model.

        • ExperiencedWinter@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          4 minutes ago

          You’re fundamentally misunderstanding the comment you replied to, they are not saying that voice AI are bad, they are saying there is not enough training data to improve the AI for these languages. How will it improve without good training data?