• dotslashme@infosec.pub
    link
    fedilink
    English
    arrow-up
    1
    ·
    6 months ago

    Ah, more glue on pizza incoming. Personally I don’t understand taking reddit posts as a source for LLM training. It’s like they never visited reddit and think that all posts/comments are true, or even useful. Depending on the sub, sarcasm can account for anywhere from 5% to 100%.

  • mrcleanup@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    10 months ago

    Time to delete my old accounts, I guess. Is there a bit that will go through and delete all posts and comments too? That would be helpful.

  • Eager Eagle@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    10 months ago

    Well, they already made it very clear to everyone back in May that the content created by the community does not belong to the community. Anyone still using that dump deserves to be explored.

  • AutoTL;DR@lemmings.worldB
    link
    fedilink
    English
    arrow-up
    0
    ·
    10 months ago

    This is the best summary I could come up with:


    Reddit will let “an unnamed large AI company” have access to its user-generated content platform in a new licensing deal, according to Bloomberg yesterday.

    The deal, “worth about $60 million on an annualized basis,” the outlet writes, could still change as the company’s plans to go public are still in the works.

    The news also follows an October story that Reddit had threatened to cut off Google and Bing’s search crawlers if it couldn’t make a training data deal with AI companies.

    Last year, it successfully stonewalled its way out of the biggest protest in its history after changes to its third-party API access pricing caused developers of the most popular Reddit apps to shut down.

    As Bloomberg writes, Reddit’s year-over-year revenue was up by 20 percent by the end of 2023, but it was still $200 million shy of a $1 billion target it had set two years prior.

    The company was reportedly advised to seek a $5 billion valuation when it opens up for public investment, which is expected to happen in March.


    The original article contains 346 words, the summary contains 175 words. Saved 49%. I’m a bot and I’m open source!

  • Substance_P@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    10 months ago

    Brilliant, A.I does the heavy lifting takes data for free then resells access to it while us who contributed for the last decade don’t get a dime.

  • zeluko@kbin.social
    link
    fedilink
    arrow-up
    0
    ·
    10 months ago

    I dont see why someone would need this deal anyways… most is already available, and most the new stuff probably too, even without API access.
    I also expect the fediverse to be crawled and used for training, thats just the thing about publicly available stuff, it gets used, if we like it or not…

    • LWD@lemm.ee
      link
      fedilink
      arrow-up
      0
      ·
      10 months ago

      The opposite; the API to simply take comments and posts in bulk is free and open.

        • LWD@lemm.ee
          link
          fedilink
          arrow-up
          0
          ·
          10 months ago

          In theory, yes, but instances don’t ship with the ability to do that. There would need to be a change to the Lemmy code base if such a thing was to be seriously implemented.

          I’m no federation expert, so I can’t really comment on whether doing something like requiring API keys would be feasible, unfortunately.

  • 9point6@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    10 months ago

    Here comes a new wave of users, I guess

    Kinda thought they’d manage to go a bit longer than the few months they did