I recently posted a blog message to friends and family about genAi...and then I realized I mentioned generating character images using Ai...and then I realized the community probably has a growing use of Ai:
- People prolly using Ai images
- People prolly testing code updates against GPT or other engines
- People prolly using Ai to generate text/writing now
So I figured...what the heck. May as well post it, here, too. Might protect some of the lesser savvy folks.
For people who aren't aware how Ai (Generative Ai) works, and why regular people should be aware of it.
(Please read if you want to protect your data around Ai engines, let me put on my "IT guy" hat, and please ignore if you already knew this stuff)
Ai isn't some "intelligence" inside of a machine. It's just a "learning engine". Here's an example:
You ask a brand-new Ai engine about rocks:
You: Please explain rocks to me
Ai: What is a rock?
<you give it access to a bunch of information on rocks>
Now the Ai engine has access to that definition of a rock and can/will apply it to future references of "rock"
You: What is a rock?
Ai: (now with data) "A rock is a..."
Now just picture thousands of repeated/refined saved data on rocks over a year, questions others asked about rocks, and the engine will attempt to use all of that data to get the best response.
This is how "ChatGPT" became "racist" in 48 hours. It isn't a childlike mind that needs to learn, and it surely has no bias against race. It's just that racist USERS of the software flooded the Ai engine with racist sentences, questions, and racially-charged responses to the Ai's queries. Due to the % of racist data on the server (which the Ai has no true understanding of racism), ChatGPT thought it was giving accurate data.
So...
You: What is the best weapon to use against <race>
Ai: No data
You spend 20 minutes flooding the engine with false data related to how rocks are used to attack <race>
You: Tell me about rocks
Ai: <bunch of data about rocks> + "often used to kill <racial slur>"
++HOW IS THIS A RISK TO YOU???++
IF you are messing around with a generative Ai engine, it requires the INPUT of data to GENERATE data. It TAKES data from the user, searches for other data within the engine with similar tags, and generates a response based on what it finds.
You must FEED it prompts (you type into the bar what you're looking for), it saves that data, then returns data based on what you asked for.
-
if you accidentally paste private info into a GenAi engine, that data will REMAIN INSIDE OF THE ENGINE.
-
so if you accidentally paste your email address and password into an Ai engine, it'll remain in there. There's no guarantee it'll show up on a search about rocks, but it's in there SOMEWHERE
If you do not own the Ai engine, you cannot confirm the data is in there or can be removed. Your Uid/pass may show up in images or other GenAi searches because the data remains.
This means that, in theory, all generative Ai engines that are not properly maintained and audited may actually be hot targets for data mining. This means that if you search an Ai engine with higher classified company data into an Ai engine, that data may have just been leaked. This means that all art you make (digital or scan) and upload to an Ai engine remains in the engine for others to generate off of
So, please, consider this before entering anything personal, private, or important into a GenerativeAi engine. I suspect in less than 5 years, we may hear a lot about the GDPR, data retention on Ai engines, and Europe's "right to be forgotten", and a need for Ai engines to purge personally identifiable information stored in their databases.