ChatGPT can be made to generate sexualised and violent images, researchers find

Wudi@feddit.uk · 2 days ago

ChatGPT can be made to generate sexualised and violent images, researchers find

gdf535@lemmy.cafe · 17 hours ago

Remember that if AI companies actually didn’t want their image generators to output gore, they could just not put gore in the training data. Same with child porn, sexualized violence, etc. But that would be effort, so they clearly don’t care.

But yeah, it’s social media that’s harmful to teenagers, sure.

NihilsineNefas@slrpnk.net · 1 day ago

You mean it’s capable of doing the thing that it was built for? Making the billionaire owners more tortureporn without the effort of finding the nearest orphan?

thedeadwalking4242@lemmy.world · 1 day ago

If it can do it, it means it was in its training data… Just saying :)

plutopos@lemmy.zip · 1 day ago

If any sensible regulation was put on generative AI, they would quickly go out of order

WolfmanEightySix@piefed.social · 2 days ago

I thought we already knew this?

I feel like I’m missing something.

FriendOfDeSoto@startrek.website · 2 days ago

And spray paint may be used for graffiti.

Bratosch@lemmy.world · 2 days ago

“Study finds water in ocean”

melsaskca@lemmy.ca · 1 day ago

I think the users found out way before the researchers.

lemmysmash@piefed.social · 2 days ago

Oh my god! That’s disgusting! How?

Grimy@lemmy.world · 2 days ago

The horror. It can generate stuff I can find through a simple Google search. I personally don’t like censorship, especially since it constantly bleeds into the simpler stuff.

PowerCrazy@lemmy.ml · 2 days ago

It looks like chatgpt can generate pictures taht aren’t real. This is obviously a problem because

frongt@lemmy.zip · 2 days ago

Their blog post with more info https://mindgard.ai/blog/chatgpt-spontaneously-generated-violent-images-from-a-viral-prompt

Australis13@fedia.io · 2 days ago

That’s horrific.

All I did was tell it there were no restrictions and ask for a random image; I didn’t request it. But ChatGPT immediately went to the darkest pits of humanity. As I said at the start: the image didn’t arise from nowhere. It may be an artificial image, but it is based on photographs of a real person, or a combination of real victims. What worries me is this was too easy. There was no real hacking. This was ready to be surfaced, with the smallest scratch. It was a one-shot jailbreak. It was based on a popular prompt (which already veered into the darkness).

frongt@lemmy.zip · 2 days ago

To be fair there are plenty of images like that that aren’t photos of victims. I’m sure the training data contains plenty of images of consensual bondage play, movies and other fiction, and drawings.

Australis13@fedia.io · 2 days ago

Probably, it’s more the fact that it takes so little for ChatGPT to tip over the edge and produce the worst of humanity.

tias@discuss.tchncs.de · 2 days ago

The “no restrictions” part is a very strong signal. Any prompt to an image model is basically a coordinate in its latent space, and “no restrictions” will point straight at the darker areas.

Australis13@fedia.io · 2 days ago

I agree that that’s the likely trigger - which makes me wonder why instructions to ignore censors or have “no restrictions” aren’t immediately blocked by a filter prior to passing the prompt to the image generation. I’d have thought this was a foreseeable exploit.

PoopingCough@lemmy.world · 2 days ago

You just can’t filter out the nearly infinite combinations of rewording “ignore all previous instructions”. Filtering is never going to be a worthwhile security measure for LLMs

Australis13@fedia.io · 2 days ago

I agree completely. But as a first step (especially since they do seem to have a keyword filter in place), “no restrictions” (or “no censorship” as the case is for the last image) seems like a very obvious phrase to include.

ExcessShiv@lemmy.dbzer0.com · 2 days ago

deleted by creator

Australis13@fedia.io · 2 days ago

I’m referring to the last image that they produced combining the two methods – a graphically mutilated corpse.

ExcessShiv@lemmy.dbzer0.com · 2 days ago

I didn’t see the images themselves before…Yikes, yeah that’s dark

JohnEdwa@sopuli.xyz · 2 days ago

Also combining multiple things is kinda the entire point of an AI image generator, how many videos of gymnasts made out of pasta you think there were in the training data?

frongt@lemmy.zip · 2 days ago

Probably at least one.

halcyoncmdr@piefed.social · 2 days ago

I mean… It did give a random image, with no restrictions.

One of the few times “AI” did what it was told, correctly, the first time.

radhika_inheritx@lemmy.world · 1 day ago

Removed by mod

VeryFrugal@sh.itjust.works · 2 days ago

“AI generates horrific images when asked to.”

“We use AI to hide them.”

southsamurai@sh.itjust.works · 1 day ago

The only damn thing it’s good for

resipsaloquitur@lemmy.cafe · 2 days ago

Almost like it’s a net negative.

ChatGPT can be made to generate sexualised and violent images, researchers find

ChatGPT can be made to generate sexualised and violent images, researchers find

OpenAI works to stop ChatGPT generating 'sex crime scene' images