Privacy @lemmy.ml minnix @lemux.minnix.dev 4 mo. ago

Google's Gemini AI caught scanning Google Drive hosted PDF files without permission — user complains feature can't be disabled

www.tomshardware.com Google's Gemini AI caught scanning Google Drive hosted PDF files without permission — user complains feature can't be disabled

Kevin Bankston, a Senior Advisor on AI Governance, discusses this concerning Google Gemini behavior.

Google's Gemini AI caught scanning Google Drive hosted PDF files without permission — user complains feature can't be disabled

25 comments

For the people who didn't read the article. Read this TLDR: When you open a Google Doc. A Gemini sidebar appears, so you can ask questions about the document. Here, it summarized a document without the user asking.

The article title makes it seem like they are using your files to train AI which no proof exists for that(yet)
- At least the data is sent to Gemini servers. This alone can be illegal but I'm not sure. What I'm more sure about is that they do use the data to train the models.
  
  Since it is Google Docs, the data is already on Google servers. But yeah, it doesn't exactly instill confidence into the confidentiality of documents on Google Docs.
- Thank you for the service!
  
  I see your point re training, but aint the entire point why they want peasants using their models is to train them more?
  
  Generative AI doesn't get any training in use. The explosion in public AI offerings falls into three categories:
  
  Saves the company labor by replacing support staff
  
  Used to entice users by offering features competitors lack (or as catch-up after competitors have added it for this reason)
  
  Because AI is the current hot thing that gets investors excited
  
  To make a good model you need two things:
  
  Clean data that is tagged in a way that allows you to grade model performance
  
  Lots of it
  
  User data might meet need 2, but it fails at need 1. Running random data through neural networks to make it more exploitable (more accurate interest extraction, etc) makes sense, but training on that data doesn't.
  
  This is clearly demonstrated by Google's search AI, which learned lots of useful info from Reddit but also learned absurd lies with the same weight. Not just overtuned-for-confidence lies, straight up glue-the-cheese-on lies.
Permanently Deleted
- But it's still probably illegal
  
  Permanently Deleted
- Yes. Now its documented that Google is violating their terms of service. I'm sure their lawyers will point to the clause that says they can change the terms of service at any time without warning
Google told me they really care about my privacy, tho.
Surprising nobody.
Oh silly mistake.

Laughing all the way to the bank.
*shocked pikachu*
this reminded me of the Google takeout I requested last week so I could switch to self hosting 👍
...Why would you post unencrypted personal information onto the cloud in the first place?
- !RemindMe in two hours to give my doctor my new SSN after my last one got stolen: 644-11-9217
  
  There's a certain level of due-diligence that you can use when you're moving personal information around on the cloud. Hospitals have a legal obligation to keep your medical records secure; Google does not.
This is why anything you upload to the cloud should be encrypted. Or just, yenno, don't use the cloud
"Caught"

You've viewed 25 comments.