this post was submitted on 22 Jun 2026
901 points (99.0% liked)

Technology

85670 readers
4454 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 3 years ago
MODERATORS
 

fuck offffff

you are viewing a single comment's thread
view the rest of the comments
[–] bitwolf@sh.itjust.works 13 points 23 hours ago (1 children)

How did these people expect "Hey Siri" / "Hey Google" to work?

[–] NotMyOldRedditName@lemmy.world 16 points 23 hours ago* (last edited 23 hours ago) (3 children)

Im a perfect world, as they claim, its a secondary system listening that isn't recording or transmitting anything, and is meant to be low power. If it hears the wake up word, it wakes up the other mic and starts recording.

Thats how they claim the smart speakers work anyway.

This would be different.

[–] huey_m@reddthat.com 4 points 8 hours ago (2 children)

This was my understanding, but I just don't believe it anymore. There have been way, way too many time my wife and I were talking about an incredibly niche thing that didn't come up through the internet in any way, and lo and behold the algorithm presented those key words. Nobody will ever convince me it isn't being done to some extent.

[–] NotMyOldRedditName@lemmy.world 2 points 1 hour ago* (last edited 1 hour ago)

I had this happen many years ago, to the point there was no chance something wasn't listening. We suspected it was my partners iPhone with Facebook installed before they got better about preventing abuse like that, as it was a Facebook ad that showed up.

Were talking about something where we never use the product, would never use the product, but it came up in a conversation just between the 2 of us, and there were ads the next day.

It happened a few times.

[–] GalacticRobot@lemmy.world 4 points 4 hours ago (1 children)

It doesn't need to, that's the issue, there is so much other data you are generating that can be harvested. Nothing you talk about is completely random, so it's incredibly easy to build profiles about you, without listening to a single word.

[–] huey_m@reddthat.com 1 points 4 hours ago

I understand that's the theory, but these situations were specifically not something that could be easily gleaned. We're talking like reminiscing about things that happened in our pre internet youth that there's no record of anywhere and that came up randomly in conversation. I'm definitely aware of the dynamic, even before ubiquity of the internet, it's true that sometimes companies would know people were pregnant before the person did based on their purchasing profile. This wasn't that though, there's just no possible connection.

That happened a few times now, so pretty much nothing is going to convince me it's not the case.

[–] nevyn@slrpnk.net 1 points 14 hours ago

It can't hear if it isn't already listening.

[–] orioler25@lemmy.world 2 points 21 hours ago (4 children)

"How they claim?" Is there no way to confirm that?

[–] emergencyfood@sh.itjust.works 3 points 4 hours ago (1 children)

You look for network traffic. You might not be able to see inside the packets, but you can know when they're sending packets, and how many. As far as I know, voice assistant systems that claim to use a secondary local circuit to detect calls are telling the truth.

[–] orioler25@lemmy.world 1 points 3 hours ago (1 children)

That's kind of what I was wondering, I figured this as well as a way to track that it is at least sending data at unusual times. Someone else in this thread explained that actually determining what that data is would be difficult yeah: https://lemmy.world/post/48510943/24408747

I don't know enough about system security or forensics to evaluate this, but it does make sense based on what I know.

[–] emergencyfood@sh.itjust.works 1 points 53 minutes ago

The consensus so far seems to be that they don't collect as much data as people think, partly because they can't process all of it, and partly because educated guesses are good enough to target ads often enough.

[–] neclimdul@lemmy.world 4 points 19 hours ago

I have a memory of people black boxing it and seeing power usage and network traffic that supported the claims but that was a snapshot in time and as others note its all proprietary.

It takes a lifetime to build a good reputation, but you can lose it in a minute.

[–] HK65@sopuli.xyz 4 points 21 hours ago (2 children)

They ship with proprietary code, this would be the point of open source.

In practice in my experience, every company is at least skirting the law regarding privacy, and I never worked for one big enough that could lobby itself out of a fine.

[–] orioler25@lemmy.world 1 points 20 hours ago (2 children)

would this not be detectable by tracking the data sent through your network?

[–] blargh513@sh.itjust.works 3 points 18 hours ago (1 children)

I used to run forensic network capture and analysis tools.

First thing, traffic is encrypted. All you will see is a blob of traffic passing through. You used to see hostnames with TLS, but now with quic, you see nothing. This makes it hard.

You could root the phone and install a root ca certificate for a decrypting proxy, you might see more, but the data itself (not just the transport protocol) could be encoded or even encrypted within the network encapsulation.

Next, you'd have to reverse engineer the protocol if they're using something nonstandard. Also, malware can often be set up to "behave" when it can detect analysis. I'm all but certain Google would do this.

Maybe you could do statistical analysis of the traffic and attempt to baseline normal vs when it's transmitting audio. It would be a bit of a blind guess at best.

If I had more time, I'd love to try it. I have an old pixel7 pro. Maybe I can sort something out.

[–] GalacticRobot@lemmy.world 2 points 4 hours ago

People have already done that and shown that no the device isn't listening to you 24/7 and sending all your data out. There are plenty of papers on the subject, and it makes sense. Why record, decode and analyze all audio when your digital footprint is so much easier to compile and analyze. People aren't random, so it's easy to put them into statistical buckets of how to target them. Here is one reference paper (of many): https://recon.meddle.mobi/papers/panoptispy18pets.pdf

[–] NotMyOldRedditName@lemmy.world 2 points 20 hours ago* (last edited 20 hours ago)

If its real time monitoring you, but not if its logging data to send later when it would be expected to be doing so.

Audio doesnt take up much space.

[–] NotMyOldRedditName@lemmy.world 1 points 20 hours ago

Even if it was open source, you'd need to be able to verify what they ship matches the specs. Allowing you to flash whatever you want onto it helps, but you still need to validate the hardware.

[–] NotMyOldRedditName@lemmy.world 0 points 21 hours ago* (last edited 21 hours ago)

I dont know. You'd need to reverse engineer the hardware and software to be confident, and could a OTA update then sneak a bypass in anyway?

Edit: i think Amazon might have abandoned this as well and always records on echos now too.