Save a profile in TTS Server, then go into Read > TTS settings and change the voice to the profile you saved. I don’t remember for sure, but you may need ReadEra Premium.
tiktok voice:
hate. let me tell you how much i’ve come to hate you since i began to live. there are 387.44 million miles of printed circuits in wafer thin layers that fill my complex…
The professional ai voices are amazing
unironically, that is a character that could use an uncanny robotic AI voice.
Why would they when you can just plug any epub into a program and use Google TTS? I’ve listened to about a book a day for the past few years doing this and I love it. Yeah, it took getting used to, but once you find an AI voice you like and figure out which words to auto-replace so they sound right, it’s honestly better than an audiobook. Well, at least to me it is. I could never stand when the reader would change their voice for different characters.
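For anyone wondering what the "auto replace" step could look like, here is a minimal sketch in Python. It is not how any particular reader app does it; the table entries and the function names (`REPLACEMENTS`, `apply_replacements`) are made up for illustration. The idea is just to respell problem words before the text reaches the TTS voice.

```python
import re

# Hypothetical replacement table: spellings a TTS voice tends to misread,
# mapped to respellings that sound right. These entries are only examples.
REPLACEMENTS = {
    "quay": "key",
    "La Jolla": "La Hoya",
    "911": "nine one one",
}

def build_pattern(table):
    # Longest keys first so multi-word entries win over shorter overlaps.
    keys = sorted(table, key=len, reverse=True)
    # (?<!\w) and (?!\w) restrict matches to whole words or whole numbers.
    alternation = "|".join(re.escape(k) for k in keys)
    return re.compile(r"(?<!\w)(" + alternation + r")(?!\w)")

def apply_replacements(text, table=REPLACEMENTS):
    # Swap every whole-word match for its respelling (case-sensitive).
    pattern = build_pattern(table)
    return pattern.sub(lambda m: table[m.group(1)], text)

if __name__ == "__main__":
    print(apply_replacements("She called 911 from the quay in La Jolla."))
```

The match is case-sensitive and whole-word only, so "1911" is left alone while a standalone "911" gets spelled out. A real list would grow one entry at a time as you notice words the voice gets wrong.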
My experience is these systems never get the intonation and stresses right. It drives me nuts and I can’t listen to it.
This is what I don’t get from a business standpoint. Why would anyone buy an AI read audiobook for $20 when they can get the exact same audio by buying the ebook for $0.99 and running it through AI?
Well that’s a great way to keep me unsubscribed. Glad I canceled my membership.
This is clearly the future despite the outrage here.
There are at least 389 living languages with over 1M speakers. That alone means it’s impossible to reach some people and they get left out. Most of these languages don’t even have enough professional voice actors to cover the bandwidth.
There are thousands of books released every year. That’s impossible to cover even in English alone.
It’s an objective net good to have more accessible audiobooks, and the privileged people who do care about this stuff can very much afford to vote with their wallets for non-AI voices.
In fact, since the AI moat is so minimal, this will very quickly be adopted by open source solutions, providing audiobook access to millions if not billions of people for whom this was not an option. It’s amazing.
don’t even have enough professional voice actors to cover the bandwidth
I’m pretty sure there’d be a lot more people ready to do that job if there was good remuneration. Heck, that sounds a lot more fun than a LOT of jobs out there!
Sure but that’s not how free markets work. If there’s only 3 million consumers you can’t afford 3 million voice actors but you can afford 3 million AI renders.
I’m not an economist but… 1 voice actor can serve 3 million consumers if they listen to the same content.
Anyway, that’s not even my point. My point is that it is possible to cover. We as a society are making choices, driven both by VCs with market-capture strategies (so precisely going against the “free” market as an ideal) and by consumers (like when one buys from the local farmers’ market vs. Amazon deliveries). If, though, we fully understand the consequences of such a choice (namely how the sausage is made, here how AI models are trained and then run) and still believe it’s not valuable, then sure, we can make that choice.
I’m just warning consumers then that if they don’t pay for quality content made a certain way, they can’t complain that they in turn don’t get the job they wanted because nobody out there is ready to pay for it.
2 sides of the same coin.
Most of these languages don’t even have enough professional voice actors to cover the bandwidth.
And you think anyone is training AI voice models for those languages? Have you even seen how long it takes even large companies like Google to support the languages with hundreds of millions of speakers?
That’s the benefit of using AI and machine learning - once you have enough source material, you can throw it all in and it’ll eventually spit out a model.
Which is exactly what Meta did with their Massively Multilingual Speech project, which supports text-to-speech and speech-to-text for 1107 different languages. Is it actually any good in 99% of them? I don’t have a clue, but it exists.
It becomes easier and cheaper every day. Today’s open source LLMs are better than last year’s best model.
Is it? I just tried again yesterday for a simple script, since coding is apparently the one thing where AI will replace people like me, and it could not put together a working JavaScript script.
I have yet to see tangible results not announced by the people with sunken cost exploding their balls.
Sounds like a skill issue my dude. While you struggle to get a js script people are putting out entire programs with AI assistants so sure - you’re right and they’re wrong
yeah, I guess I didn’t prompt right lol
Yes, to effectively use AI you actually have to understand the medium you’re in to describe the problem you’re trying to solve. You can get there with prompting but it’ll take you much longer if you just don’t understand code yourself.
That’s why most senior software devs are not afraid of LLMs: they need strong oversight, and that’s exactly what years of software dev experience train you to do.
You’re fundamentally misunderstanding the comment you replied to, they are not saying that voice AI are bad, they are saying there is not enough training data to improve the AI for these languages. How will it improve without good training data?
That’s not how AI training works, and even then there’s absolutely enough data. Also, training data can be created and even synthesized. There are many techniques for extracting training value from datasets, and we discover more every year. It’s really not the problem you think it is.
I’m genuinely confused how AI illiterate users here are. It’s just blind leading the blind.
It’s Amazon, what did you expect? Enshittification and monopoly abuse, no surprise.
Idk, they have pretty good stats that nobody will listen to an audio book if they don’t like the narrator, so being able to choose your own narrator on the fly isn’t really shitty
Enshittification isn’t adding new features that people want, it’s gradually lowering the quality of the product. So here if Audible is solely adding more possibilities, never at the cost of higher quality ones degrading, then indeed I’m wrong.
If, though, they hire fewer people to do good voice acting, then it’s really shitty.
I genuinely hope I’m wrong and they are ONLY adding new capabilities… but my entire experience with capitalism is that obtaining a monopolistic position is not done to improve quality but rather to increase margins regardless of how.
We’ll see!
Left Amazon a handful of years ago. Glad I didn’t entirely contribute to this. Saw that coming….
I prefer listening to real people. No matter how good AI voices become, I still like knowing that the one reading the book to me understands what they are saying.
The issue is there’s a million books out there with no audio version, and they never will have one. I’m OK with AI doing readings of books that wouldn’t otherwise get an audio version.
With a machine voice that makes no attempt to imitate human intonation - yes.
Hey for the deaf and people who need the info on the page, robot voice is better than nothing.
Just pretend the book is being narrated by Stephen Hawking!
Accessibility and performance art are separate categories
Audiobooks for the deaf? Excuse me?
Sign language books. Now there’s a hole in the market 😆
I meant eye deaf
Yeah, I can see works of non-fiction being a good candidate.
Sure, but it is still lame for a company like Audible to expect people to pay for their service and then they decide to cut costs by switching to AI voices. They can afford to hire actors to read their books. They have no excuse to go do that.
Meanwhile, what you’re talking about is books and stories that may not get picked to be narrated, and well, I can see where AI voices could be a benefit in those cases. Especially for people with dyslexia.
I just disagree with a company that sells itself on narrated books and then they go and have robots read their shit? Why should anyone pay for that? Because I’m sure their prices wouldn’t go down either.
And when all is said and done, personally, I just prefer that a human being is reading to me. Especially if it is fiction.
Does Audible actually do the audiobooks? I assumed it was the publishers. Sometimes the books I want, which I’d listen to while working, aren’t available on audio.
I assumed they did. Maybe not all, to be fair, but I am pretty sure they have produced audio recordings of books in the past(?)
Maybe I’m just tripping, I dunno.
There are Audible originals that you can only get on their platform. Audiobook sellers like libro.fm and streamers like Storytel don’t get access to those.
I completely agree. I don’t even like it when the human reader clearly doesn’t understand what they’re saying, so some AI flatly telling me the story isn’t going to cut it.
For the humans, someone mispronounced “quay” for example. “La Jolla” was another standout mistake that took me out of the story.
Dude, I know how you feel xD back in 2009 I bought an audio recording of the first Twilight book because I was curious what the fuss was about. It was in Danish, as I am Danish, and the narrator, bless her, had a very Danish way of pronouncing the word “flirting”. In Danish we don’t have a modern word for flirting so we just use the English one with English pronunciation, but this lady, who already sounded like she was in her 60s, just went full Dane on that word and it completely took me out of the story and had me yelling “FLIRTING” at my ghettoblaster every time she pronounced her mutilated version of that word. I don’t even know how to write a phonetic version of what the fuck she said, but I’ll try.
Fleert-eh
Fuck me, it’s been almost 16 years and just spelling it out made my skin crawl.
I also hated that book, but that wasn’t really the narrator’s fault. Had to pause the fuck out of it several times and rage clean my apartment. Nobody had told me about how it romanticized abusive relationships and I had JUST gotten out of one of those, so to say I was triggered was an understatement. The mispronunciations of flirting were just the garnish on top, lol.
I watch those movie recaps from YouTube while I work. The AI was obviously talking about a nine one one call but called it a nine hundred and eleven. Or when it’s talking about nine eleven. It instantly snaps you out of it. It’s sorta funny as background noise but I would 100% be avoiding it as a purchase.
WHY WOULD YOU SAY THAT. ROBOTS CAN SHOW EMOTION.
AS A FELLOW HUMAN I APPRECIATE YOUR INSIGHTFUL FEELINGS
Surely I can just do that myself with an epub and a free AI.
Glad I binned my Audible subscription many years ago.
youtube already does it.
YouTube is crawling with it. It’s unlistenable shit. The prosody is badly implemented, pronunciation is infuriatingly bad, and a lot of the text that these TTS are reading appears to be AI-generated. Otherwise, already dire standards of literacy are getting worse at an accelerating rate.
And it’s shit
AI will write them and AI will read them to us.
Stock up on old physical books
It is easier to keep the books than what’s written in them…
Let AI pay for them and AI listen to them too. That way we can pay for and listen to actually good ones.
that’s gross.
I just wrote a novel (finished first draft yesterday). There’s no way I can afford professional audiobook voice actors—especially for a hobby project.
What I was planning on doing was handling the audiobook on my own—using an AI voice changer for all the different characters.
That’s where I think AI voices can shine: If someone can act they can use a voice changer to handle more characters and introduce a great variety of different styles of speech while retaining the careful pauses and dramatic elements (e.g. a voice cracking during an emotional scene) that you’d get from regular voice acting.
I’m not saying I will be able to pull that off but surely it will be better than just telling Amazon’s AI, “Hey, go read my book.”
Would infinitely prefer no voice changer.
Agreed. No AI voice changer please. Hopefully every one of us at one point in our lives has been read a story by someone else. Never once did the fact that all the different characters’ dialogue was coming from one voice detract from the story or the immersion.
I’ve listened to audiobooks recorded with extremely deep masculine voices (think James Earl Jones) and when the voice actor was doing the voice of a 5 year old girl, (in only a slightly higher whiny timbre which matched the character traits) it was never immersion breaking. However, AI voice would. If I want different actors for different characters I’ll listen to radio dramas.
I think it would be a good idea to do a section of your work with and without AI modification. Then have people listen to both and give feedback. Good to find out if people like the modifications before you do a ton of work.
do a section of your work with and without […t]hen have people listen to both and give feedback.
Yes, that’s the principle of prototyping. De-risk while testing solely the crucial part!
AI aside, different voices may be immersion breaking. I tend to avoid audiobooks with more than a single narrator.
They are redoing all of the discworld books like this, and personally I can’t stand it.
Two narrators with one reading the male and one reading the female characters is usually okay but the full cast dramas are the worst.
I am okay with this only in cases where 1) the author approves, and 2) there is no audible version anyways.
Some people prefer listening to their books instead of reading and that’s totally ok. Indie authors can’t always afford to hire a narrator but I’d still want the buyers to be able to listen to the book.
Big question is, will the author get paid for the download or not…
I wouldn’t support it even if the author couldn’t afford it otherwise. There’s no test to confirm that, and knowing profit margins, all publishers will use AI for all their books.
Yes, I’d want smaller authors to have people listen to their books, but without oversight, it’s going to ruin all audiobooks.
Fucking gross. Maybe it’s the 250+ audiobooks I have influencing me, but the very best ones I’ve listened to transcend just turning words into sound. Sound effects, music, tone, emotion, accents, sarcasm, and god damn BLOOPERS all improve the experience beyond just hearing what is written down.
I’m against it, fuck that literal noise.
Sound effects, music […] improve the experience
Actually hard disagreeing on that. I absolutely hate the audio drama versions of audio books and prefer the narrator only ones since they are much clearer and require a lot less focus to listen to and work in more contexts (background noise,…). Sound effects and music (while something is read, intro or outro style music is okay) distract from the actual content.
All I can think of is Jim Dale’s reading of the Harry Potter books. Fucking epic.
What, no way, they did not replace Stephen Fry.
They didn’t replace Fry. When the Audiobooks were released in the US, they were read by Jim Dale. Fry was for the rest of the English language releases. During the run, Jim Dale broke the world record for the most character voices performed by a single actor in an audiobook (146).
Also Andy Serkis reading the lord of the rings. 11/10
No publisher is going to pay a professional to narrate their audiobooks when they can have AI do a shitty job for much less.
A shitty narrator can get me to hate a book I like. A great narrator can bring the characters to life, enhance the experience, and turn me from a listener to a fan. I’ve searched for books by narrators like Nick Podehl and Jeff Hays and bought audiobooks I wouldn’t have otherwise.
I tried, and failed, to get into audiobooks for years. Then I listened to Dungeon Crawler Carl narrated by Jeff Hays and what an absolute delight it was. There’s no way I would’ve gotten even 10 minutes in if it was one of those soulless AI voices instead.
Currently listening to the first book.
I’m not sure why AI would automatically mean it’s doing a shitty job.
Because… the tool has no understanding of anything? It reads written words, yes, but no intention, no cultural context, no intonation. Unless everything is spelled out like a script, then it will not sound great, would it?
Someone can manually go through it and correct and edit it, as one would a regular, human made recording. It’s not rocket science exactly. It wouldn’t be a story time for children but it would probably be alright for more plain stuff
If the “fix” for an AI implementation in a use case is, again, to manually correct it and find a less demanding audience then… yes, by definition it’s shitty.
The point isn’t that it’s infeasible, just that it will be low quality.
I mean you have to correct and edit human made stuff too, doesn’t mean it’s shit lol
If you want the stuff read out and don’t care for the radio type stuff, I’d imagine the better voice AIs do a pretty good job. And I personally prefer the more neutral voices to the story time stuff, so works for me.
This is me just speculating here, but if they follow the path of this CEO who fired his human staff to replace them with AI, then rolled back and admitted it’s shit (https://gizmodo.com/klarna-hiring-back-human-help-after-going-all-in-on-ai-2000600767), then my bet is that it’s not done to improve quality but rather margins.
If AI is done alongside professionals, and done so ethically (not stolen training data, not ignoring ecological cost by pumping water in dry areas to cool down GPUs, etc) and economically (i.e. not having it “cheap” now but, once a monopoly position is obtained, raising prices for a captive set of consumers), then yes, it can be potentially empowering. This though is pretty much never the case.
That being said, if one “just” wants text read aloud, there are plenty of FLOSS alternatives, and I believe Mozilla even has a TTS/STT system based solely on volunteered voices.
It’s a company, of course it’s done to increase profits. I’m just saying it being AI doesn’t automatically mean it’s shit, it could be done just fine. AI is a tool, the end result depends on how that tool is used.
That depends entirely on how profitable it is and how much they can get authors onboard.
I do agree that a good narrator delivers a performance that adds to the work. James Marsters will always be Harry Dresden in my head.
That depends entirely on how profitable it is and how much they can get authors onboard.
A. Anything can be profitable when the cost of generation is counted in single dollars instead of multiple thousands for a good narrator. They don’t even have to sell many to turn a profit.
B. You think authors are going to have a choice? Lmfao. It’s the publishers that hold any real power and they will jump all over everyone’s IP with AI slop to make an extra three cents.
Your view seems to be hyper focused on the most pessimistic way of interpreting things. Are you doing OK? Seriously, I know how easy it is for everything going on to overwhelm you with negativity. How are you doing?
Maybe this is a culture clash thing, but FWIW, to me your post comes across as incredibly condescending: asking a total stranger about their mental health, and implying it’s bad, as if you were their close friend.
I find the constant stream of people hyper focused on the worst possible outcome tiresome and frustrating. But instead of responding with that, I intentionally tried to express compassion and concern for a complete stranger. But because this is the Internet, naturally people interpret my actions with the worst possible intent.
That being said, how are you doing? Have anything fun you are looking forward to?
So despite me giving my opinion that that style of posting seems (to me) to be condescending, you decided to apply that same style of message, which I just said I thought was invasive, to me?
I get you think you are being nice but trying to force unearned intimacy comes off as creepy.
It’s the publishers that hold any real power
It might be time to finally change that, especially considering what a piss poor job they have been doing for decades at their own part of the production of media.
A shitty narrator can get me to hate a book I like.
And that is where I see potential for AI. There are quite a few books which I’d love to listen to but they are all narrated by a guy whose narration I can’t stand. AI would open the possibility to choose a voice and I might actually get to enjoy those books. It’s Amazon though so the ethical implications and quality concerns are something I’m worried about.
Have you ever heard a single piece of AI-narrated content that did not make you run away screaming?
Oh sorry, totally forgot that things cannot improve any longer /s
Just like with the narrator I dislike I will just not listen to the ones where AI was used if I don’t like it. It’s that simple.
And btw I’m not saying we should throw in the text and let the AI do everything. We can still have narration performers but use AI to create different styles.
You think they’ll be narrating books with Tiktok TTS?
To rephrase my question: where can I listen to an example of good AI spoken content?
First thing that comes to my mind would be Dougdoug. He’s a streamer who messes around with AI a fair bit for funny content, including using AI-generated voices at times.
Hm. I wasn’t able to listen to all 9:53:57, but in the samples I watched I heard a voice resembling the classic computer voice of science fiction movies of the 70s. Better than most YouTube AI-generated audio content, but good enough to narrate audiobooks? Well, we’ll get accustomed to anything, I guess.
I don’t know if my timestamp went through, but the part I linked to was at 7h/19m/42s. That’s the relevant part, not necessarily the entire video. That’s a showcase of good AI voices.
Some use even worse, if YouTube content is any indication.
But you think Audible would use those to narrate books?
Nick Podehl is such an amazing narrator. The voices and performance are amazing.
I’ve been slowly getting through the Kel Kade books and the narration just makes it for me
The thing with this is that there won’t be shitty narrations any more. Hate it all you may, fact of the matter is that AI-powered voice generation is pretty good at what it does. So in the future you won’t have shitty narrations and great narrations. You’ll have decent narrations and great (human) narrations.
And teslas will have full self driving tomorrow and crypto currency will replace normal currency within one year! Always believe in the hype!
Maybe we’ll start reading again.
There is literally zero shame in someone consuming audiobooks, and it’s deeply weird to act like something is lost to you if others enjoy them. And this is coming from someone who virtually never listens to audiobooks.
I never said there was. I offered an alternative. Outrage is misdirected, and it’s by design. There are constructive ways to direct it.
Reading is not an alternative to listening. Both have different use cases. You cannot read while driving, to name just one.
“Maybe we’ll start reading again” obviously implies that something is lacking presently and that with luck, we’ll go back to the way things were
Not sure if you’re saying I’m outraged but I promise you I’m not, just thought it was lame to try and imply audiobook enjoyers were somehow less than because of how they prefer to enjoy stories
I made some AI animated content that I never released because I don’t have the rights to the voices I was using. Even though I was blending several voices together to make them unrecognizable, it made me uncomfortable.
But in the process I learned the capabilities and limitations of AI voices. If you’re going purely from text to speech, it’s horrendous (as far as I experienced). Very robotic. It’s a bit better when melodic information is included (as in Suno) but still sounds like AI.
But when I recorded my own voice saying the lines and then converted it to another voice, it took all of the nuance of my line reads and converted it into the other voice.
So, would your opinion change if it turns out they’re going to use purchased voice rights to have a single narrator perform the whole book and then use AI to turn the narrator’s voice into a full voice cast?
I could see how it would allow lesser known books to have a better experience with a truly separate voice for each character, but I could also see how this might drive out lesser known/minority voice actors. Not advocating one way or another, just providing a piece of this conversation I think we should bear in mind.
Using different voices to read different parts of a book turns an audiobook into a bad audio play, and arguably, a bad audio play is worse than a mediocre audio book.
What Audible misses is that, while reading is a technique that can be automated, narrating is an art. They can use AI to read books; they cannot use AI to narrate books.
Your example of AI use is a good example of this: AI can read your content. AI can enhance your capabilities. But only you can narrate it.
So, would your opinion change if it turns out they’re going to use purchased voice rights to have a single narrator perform the whole book and then use AI to turn the narrator’s voice into a full voice cast?
It would make me hate it even more because I already hate the existing full cast of humans audio dramas 99% of the time and actually prefer a single (or low number of) narrator approach.
Completely fair. I kind of like them. They did it for Redwall and I listen to those books on long drives sometimes. It works for me. Now I guess the advantage could be to have both versions and get to choose which you listen to–but even I’m skeptical that a corporation would have that much regard for the preferences of its consumers.
Oh. That’s an interesting use-case I hadn’t considered.
Honestly, Audible are terrible. They are constantly doing things that annoy me, like they must have a team somewhere that spends its days going, how can we kill this golden goose?
They are going through and replacing audiobooks recorded in the 1980s with new ones which in theory should improve their quality but they’re getting rid of the classic sounds of those books.
like they must have a team somewhere that spends its days going, how can we kill this golden goose?
I wouldn’t put it past Bezos to have an actual enshittification department.
For fiction, yeah, that’s true. For nonfiction, this could work pretty well.
I’m still generally opposed to it because it’s using the work of existing voice recordings without compensation, though.
nonfiction, this could work pretty well.
Only in rare cases.
If you have, for example, an explanation of a complex topic, then a super emotionless voice would still make you hate it and block you from learning it. Even the driest and hardest topics need a good, lively voice explaining them.
If it is just some reference list, where you need to search and hear small parts of it, then it could be Ok.