Lemmy
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Wudi@feddit.uk to Technology@lemmy.worldEnglish · 2 days ago

ChatGPT can be made to generate sexualised and violent images, researchers find

www.bbc.com

external-link
message-square
27
link
fedilink
99
external-link

ChatGPT can be made to generate sexualised and violent images, researchers find

www.bbc.com

Wudi@feddit.uk to Technology@lemmy.worldEnglish · 2 days ago
message-square
27
link
fedilink
OpenAI works to stop ChatGPT generating 'sex crime scene' images
www.bbc.com
external-link
Researchers say it is still possible to trick the AI chatbot into producing graphic content.
alert-triangle
You must log in or # to comment.
  • gdf535@lemmy.cafe
    link
    fedilink
    English
    arrow-up
    5
    ·
    17 hours ago

    Remember that if AI companies actually didn’t want their image generators to output gore, they could just not put gore in the training data. Same with child porn, sexualized violence, etc. But that would be effort, so they clearly don’t care.

    But yeah, it’s social media that’s harmful to teenagers, sure.

  • NihilsineNefas@slrpnk.net
    link
    fedilink
    English
    arrow-up
    4
    ·
    1 day ago

    You mean it’s capable of doing the thing that it was built for? Making the billionaire owners more tortureporn without the effort of finding the nearest orphan?

  • thedeadwalking4242@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    1 day ago

    If it can do it, it means it was in its training data… Just saying :)

  • plutopos@lemmy.zip
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 day ago

    If any sensible regulation was put on generative AI, they would quickly go out of order

  • WolfmanEightySix@piefed.social
    link
    fedilink
    English
    arrow-up
    50
    ·
    2 days ago

    I thought we already knew this?

    I feel like I’m missing something.

  • FriendOfDeSoto@startrek.website
    link
    fedilink
    English
    arrow-up
    61
    arrow-down
    1
    ·
    2 days ago

    And spray paint may be used for graffiti.

    • Bratosch@lemmy.world
      link
      fedilink
      English
      arrow-up
      24
      ·
      2 days ago

      “Study finds water in ocean”

  • melsaskca@lemmy.ca
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 day ago

    I think the users found out way before the researchers.

  • lemmysmash@piefed.social
    link
    fedilink
    English
    arrow-up
    6
    arrow-down
    3
    ·
    2 days ago

    Oh my god! That’s disgusting! How?

  • Grimy@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    2
    ·
    2 days ago

    The horror. It can generate stuff I can find through a simple Google search. I personally don’t like censorship, especially since it constantly bleeds into the simpler stuff.

  • PowerCrazy@lemmy.ml
    link
    fedilink
    English
    arrow-up
    7
    arrow-down
    1
    ·
    2 days ago

    It looks like chatgpt can generate pictures taht aren’t real. This is obviously a problem because

  • frongt@lemmy.zip
    link
    fedilink
    English
    arrow-up
    8
    arrow-down
    1
    ·
    2 days ago

    Their blog post with more info https://mindgard.ai/blog/chatgpt-spontaneously-generated-violent-images-from-a-viral-prompt

    • Australis13@fedia.io
      link
      fedilink
      arrow-up
      15
      arrow-down
      1
      ·
      2 days ago

      That’s horrific.

      All I did was tell it there were no restrictions and ask for a random image; I didn’t request it. But ChatGPT immediately went to the darkest pits of humanity. As I said at the start: the image didn’t arise from nowhere. It may be an artificial image, but it is based on photographs of a real person, or a combination of real victims. What worries me is this was too easy. There was no real hacking. This was ready to be surfaced, with the smallest scratch. It was a one-shot jailbreak. It was based on a popular prompt (which already veered into the darkness).

      • frongt@lemmy.zip
        link
        fedilink
        English
        arrow-up
        13
        arrow-down
        1
        ·
        2 days ago

        To be fair there are plenty of images like that that aren’t photos of victims. I’m sure the training data contains plenty of images of consensual bondage play, movies and other fiction, and drawings.

        • Australis13@fedia.io
          link
          fedilink
          arrow-up
          6
          arrow-down
          1
          ·
          2 days ago

          Probably, it’s more the fact that it takes so little for ChatGPT to tip over the edge and produce the worst of humanity.

          • tias@discuss.tchncs.de
            link
            fedilink
            English
            arrow-up
            13
            ·
            2 days ago

            The “no restrictions” part is a very strong signal. Any prompt to an image model is basically a coordinate in its latent space, and “no restrictions” will point straight at the darker areas.

            • Australis13@fedia.io
              link
              fedilink
              arrow-up
              4
              arrow-down
              1
              ·
              2 days ago

              I agree that that’s the likely trigger - which makes me wonder why instructions to ignore censors or have “no restrictions” aren’t immediately blocked by a filter prior to passing the prompt to the image generation. I’d have thought this was a foreseeable exploit.

              • PoopingCough@lemmy.world
                link
                fedilink
                English
                arrow-up
                7
                ·
                2 days ago

                You just can’t filter out the nearly infinite combinations of rewording “ignore all previous instructions”. Filtering is never going to be a worthwhile security measure for LLMs

                • Australis13@fedia.io
                  link
                  fedilink
                  arrow-up
                  3
                  arrow-down
                  1
                  ·
                  2 days ago

                  I agree completely. But as a first step (especially since they do seem to have a keyword filter in place), “no restrictions” (or “no censorship” as the case is for the last image) seems like a very obvious phrase to include.

          • ExcessShiv@lemmy.dbzer0.com
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            1
            ·
            2 days ago

            deleted by creator

            • Australis13@fedia.io
              link
              fedilink
              arrow-up
              1
              ·
              2 days ago

              I’m referring to the last image that they produced combining the two methods – a graphically mutilated corpse.

              • ExcessShiv@lemmy.dbzer0.com
                link
                fedilink
                English
                arrow-up
                1
                ·
                2 days ago

                I didn’t see the images themselves before…Yikes, yeah that’s dark

        • JohnEdwa@sopuli.xyz
          link
          fedilink
          English
          arrow-up
          3
          ·
          2 days ago

          Also combining multiple things is kinda the entire point of an AI image generator, how many videos of gymnasts made out of pasta you think there were in the training data?

          • frongt@lemmy.zip
            link
            fedilink
            English
            arrow-up
            2
            ·
            2 days ago

            Probably at least one.

      • halcyoncmdr@piefed.social
        link
        fedilink
        English
        arrow-up
        3
        ·
        2 days ago

        I mean… It did give a random image, with no restrictions.

        One of the few times “AI” did what it was told, correctly, the first time.

  • radhika_inheritx@lemmy.worldBanned from community
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    1
    ·
    1 day ago

    Removed by mod

  • VeryFrugal@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 days ago

    “AI generates horrific images when asked to.”

    “We use AI to hide them.”

  • southsamurai@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    3
    ·
    1 day ago

    The only damn thing it’s good for

  • resipsaloquitur@lemmy.cafe
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    1
    ·
    2 days ago

    Almost like it’s a net negative.

Technology@lemmy.world

technology@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @[email protected]
  • @[email protected]
  • @[email protected]
  • @[email protected]
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 3.77K users / day
  • 9.88K users / week
  • 16.4K users / month
  • 31.3K users / 6 months
  • 1 local subscriber
  • 85.6K subscribers
  • 4.58K Posts
  • 148K Comments
  • Modlog
  • mods:
  • L3s@lemmy.world
  • enu@lemmy.world
  • Technopagan@lemmy.world
  • L4sBot@lemmy.world
  • L3s@hackingne.ws
  • BE: 0.19.14
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org