this post was submitted on 05 Mar 2025
34 points (100.0% liked)

Technology

38603 readers
416 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 3 years ago
MODERATORS
 

In late 2013, the Spike Jonze film Her imagined a future where people would form emotional connections with AI voice assistants. Nearly 12 years later, that fictional premise has veered closer to reality with the release of a new conversational voice model from AI startup Sesame that has left many users both fascinated and unnerved.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 10 points 2 months ago* (last edited 2 months ago) (4 children)

Here's the video from the article:

Video

I would not be able to tell this is AI.

[–] [email protected] 4 points 2 months ago (2 children)

It feels weird, like maybe over-practiced, but I agree that it sounds human enough to fool me.

[–] [email protected] 3 points 2 months ago (1 children)

It feels reminiscent of the way narrators used to do books on tape. Modern ones are better imho, but all the pausing and intonation definitely seems "professional" more than conversational. Still extremely good.

[–] [email protected] 3 points 2 months ago

I listened to an audiobook by Levar Burton a few days ago, and this sounds similar enough to his pattern of speech during the intro that I wouldn't have known there was anything unusual about the AI voice. If I'd heard it read a book, I would have just assumed that the pauses were a style choice.

[–] [email protected] 2 points 2 months ago (1 children)

Yeah, it sounds like it was recorded in a recording studio, but not like it's a robot. Very creepy.

[–] [email protected] 2 points 2 months ago (1 children)

Oh! Interesting! So for you this is in the uncanny valley?

I'm on the fence about that.

[–] [email protected] 2 points 2 months ago

No, I think it sounds very normal, I just think the idea that I can no longer distinguish between fake and real voices creepy.

[–] [email protected] 2 points 2 months ago

At first, I thought it said it likes peanut butter & people sandwiches, and was like, WTF???? 🫥

[–] [email protected] 2 points 2 months ago

It sounds a bit off. But if you're not looking for it, you won't find it. That, I believe, is enough to fool most everyone, which is arguably a bad thing.

[–] [email protected] 1 points 2 months ago

For reference, this is what Maya reminds me of, Merle Dandridge, VA for Half Life 2's Alyx Vance.

I've skipped to some slower commentary, just so that you can kinda see what a human and AI can sound like with similar pacing while reflecting on a question:

https://youtu.be/GCgkK-Y_uwg?t=8m