Audible unveils plans to use AI voices to narrate audiobooks (www.theguardian.com)
Posted by return2ozma@lemmy.world to Technology@lemmy.world · English · 1 year ago · 189 comments
venusaur@lemmy.world (1 year ago): Sure there are. ElevenLabs is one. You can probably tell they’re not human but they’re really decent.
Echo Dot@feddit.uk (1 year ago): They still don’t understand the context of what they’re reading though so they can’t apply tone correctly.
ssillyssadass@lemmy.world (1 year ago): From what I’ve been able to hear it’s not that bad. They’re pretty good at having a general tone. But they may fail when it comes to emotional tones, like anger or sadness. But for just reading a book aloud there shouldn’t be any issue.
venusaur@lemmy.world (1 year ago): Fair. Definitely some awkward phrasing, but it’ll get better.
Landless2029@lemmy.world (1 year ago): Just tried it. Still a machine but much better than default TTS.
venusaur@lemmy.world (1 year ago): In 10 years it’s probably gonna be really impressive.