Audible unveils plans to use AI voices to narrate audiobooks

return2ozma@lemmy.world · 20 days ago

Audible unveils plans to use AI voices to narrate audiobooks

Kusimulkku@lemm.ee · 19 days ago

I’m not sure why AI would automatically mean it’s doing a shitty job.

utopiah@lemmy.world · 19 days ago

Because… the tool has no understanding of anything? It reads written words, yes, but no intention, no cultural context, no intonation. Unless everything is spelled out like a script, then it will not sound great, would it?

Kusimulkku@lemm.ee · edit-2 18 days ago

Someone can manually go through it and correct and edit it, as one would a regular, human made recording. It’s not rocket science exactly. It wouldn’t be a story time for children but it would probably be alright for more plain stuff

utopiah@lemmy.world · 18 days ago

If the “fix” for an AI implementation in a use case is, again, to manually correct it and find a less demanding audience then… yes, by definition it’s shitty.

The point isn’t that it’s infeasible, just that it will be low quality.

Kusimulkku@lemm.ee · edit-2 18 days ago

I mean you have to correct and edit human made stuff too, doesn’t mean it’s shit lol

If you want the stuff read out and don’t care for the radio type stuff, I’d imagine the better voice AIs do a pretty good job. And I personally prefer the more neutral voices to the story time stuff, so works for me.

utopiah@lemmy.world · 18 days ago

This is me just speculating here but if they follow the path of this CEO who fired his human staff to replace it by AI… then rollback admit it’s shit https://gizmodo.com/klarna-hiring-back-human-help-after-going-all-in-on-ai-2000600767 then my bet is that it’s not done to improve quality but rather margins.

If AI is done alongside professionals, and done so ethically (not stolen training data, not ignoring ecological cost by pumping water in dry areas to cool down GPUs, etc) and economically (i.e. not having it “cheap” now but once a monopoly position is obtain, raise prices for a captive set of consumers) then yes it can be potentially empowering. This though is pretty much never the case.

That being said, if one “just” want read aloud, there are plenty of FLOSS alternatives and I believe Mozilla even a TTS/STT system based solely on voluntary voices.

Kusimulkku@lemm.ee · 18 days ago

It’s a company, of course it’s done to increase profits. I’m just saying it being AI doesn’t automatically mean it’s shit, it could be done just fine. AI is a tool, the end result depends on how that tool is used.