Google has struck a deal with Reddit that will allow the search engine maker to train its AI models on Reddit’s vast catalog of user-generated content, the two companies announced. Under the arrangement, Google will get access to Reddit’s Data API, which will help the company “better understand” content from the site.

The deal also provides Google with a valuable source of content it can use to train its AI models. “Google will now have efficient and structured access to fresher information, as well as enhanced signals that will help us better understand Reddit content and display, train on, and otherwise use it in the most accurate and relevant ways,” the company said in a statement.

    • Steve@slrpnk.netOP
      link
      fedilink
      arrow-up
      4
      ·
      8 months ago

      It’s content that Reddit users generated which apparently is theirs to sell.

      • jarfil@beehaw.org
        link
        fedilink
        arrow-up
        1
        ·
        8 months ago

        From the TOS/EULA, the content belongs to each user, they just license it to Reddit to use as it pleases.

        • Steve@slrpnk.netOP
          link
          fedilink
          arrow-up
          1
          ·
          8 months ago

          So it’s user generated content that is a product for Reddit to sell, like most big tech companies do, as I said.

          • jarfil@beehaw.org
            link
            fedilink
            arrow-up
            1
            ·
            8 months ago

            The difference is: Reddit doesn’t own the content, they can’t stop anyone else from selling it, or giving it for free; only the users could (the actual owners).

            There are Reddit content dumps out there, which Reddit can’t stop anyone from using… so not sure what they are selling, but if it’s just that, then they’re scamming people.

            • Steve@slrpnk.netOP
              link
              fedilink
              arrow-up
              1
              ·
              7 months ago

              If you are posting on walled-garden big tech site like Reddit, Instagram, Twitter / X, the site and therefore the company certainly owns your content and all the metadata attributed to it. You’re the product. This is why most of us are here on the Fediverse where things are different. Maybe if it’s your personal photo you took than you can make a copyright claim to some degree and download your data tediously but once it’s on their network it’s generally theirs to do as they please, whether that be sell to Google or any other advertiser or use on in-house advertising. Often without proper informed consent and not always legally. It’s definitely a scam, I agree. Hopefully this exposes it more and brings more people to places on the Fediverse where there’s no owner/seller/buyer of your data or anything else you contributed.