large language model

From IndieWeb


A large language model is (AKA LLM) usually a reference to a service, like OpenAI’s ChatGPT, that synthesizes text based on a massive set of prose typically crawled and indexed from the open web and other sources and should not be used to contribute content to the IndieWeb wiki; several IndieWeb sites disclaim any use thereof for their content.

Why

Why disclaim

With more and more LLM-generated content being published to the web, e.g. by news outlets like Hoodline, people will increasingly look for actual human-written content instead, and disclaiming your use of LLMs will appeal to a larger and larger audience.

How to

How to block

Tarpit & fighting back

Promote your work as human created

You can add a Not By AI badge on your website. See some IndieWeb Examples below.

IndieWeb Examples

In rough date order of adding to personal sites:

Hidde

Hidde de Vries added a ‘no LLMs involved’ note to the site-wide footer of hidde.blog from 17 March 2023

No language models were involved in writing the blog posts on here.

capjamesg

capjamesg since at least 2023-07-14(?) added a "Not By AI" image button on all his blog posts going back to at least 2020

Aaron Parecki

Aaron Parecki put a "Not by AI" badge in his global website footer (e.g. bottom of https://aaronparecki.com/) next to the IndieWeb/Microformats/Webmention buttons on 2023-10-15.

Tantek

Tantek Çelik put a general disclaimer about no use of LLMs for his site on his homepage on 2023-12-31:

Paul Watson

Paul Watson put a general disclaimer about no use of LLMs for his site on his blog homepage [1] on 2024-01-04:

No large language models (LLM) or similar AI technologies were involved in writing the blog posts on here.

and also a similar statement and button (by https://notbyai.fyi/) on the footer of all pages on 2024-01-05.

Todd Presta

to2ds Updated the existing blurb above the fold on the homepage to use the term "LLMs" instead of the generic "AI." [2] on 2024-01-05. Also added (Not BY AI) written and painted badges to the common footer along with the IndieWeb badge and link.

fluffy

fluffy Added a blurb to their about page on 2025-07-23.

Matt Lee

Dr. Matt Lee blocks as much of this as possible. His content is not permitted to be used to train such things.

His Libre.fm and 1800www.com projects are changing their mission/focus as result.

... add yourself ...

Add yourself here… (see this for more details)

IndieWeb Wiki Examples

Do not use LLMs for content for the wiki:

Don't even think about using ChatGPT to contribute material to the wiki, because you don't have the ability to know you can contribute it to the public domain / CC0.

Other Examples

IndieWeb opinions

Other Examples

Criticism

Hammering bandwidth while disrespecting robots.txt

The worst of the internet is continuously attacking the best of the internet. This is a distributed denial of service attack on the good parts of the World Wide Web.

Over the past few months, instead of working on our priorities at SourceHut, I have spent anywhere from 20-100% of my time in any given week mitigating hyper-aggressive LLM crawlers at scale. [...] If you think these crawlers respect robots.txt then you are several assumptions of good faith removed from reality. These bots crawl everything they can find, robots.txt be damned.

Encourages disregarding copyright

Encourages training on private data

Discourages open creativity and sharing

  • 2024-04-22 The Atlantic: It’s the End of the Web as We Know It

    SEO will morph into LLMO: large-language-model optimization, the incipient industry of manipulating AI-generated material to serve clients’ interests.
    [...]
    LLMs aren’t people we connect with. Eventually, people may stop writing, stop filming, stop composing—at least for the open, public web.

Abused to waste developer time

Criticism: LLMs used to waste developer time with fake security bounty bug reports:

  • (from the maintainer of curl) 2024-01-02 The I in LLM stands for intelligence

    Like for the email spammers, the cost of this ends up in the receiving end. The ease of use and wide access to powerful LLMs is just too tempting. I strongly suspect we will get more LLM generated rubbish in our Hackerone inboxes going forward.

Fails basic context dependence

Can fail basic context-dependence, like examples vs commands:

Scraping for training widely rejected

85% of Cloudflare's customers prefer to block GenAI training scrapers:

Allows the creation of abusive content for deliberate harm

Risk of misleading or hazardous output

Other Criticism

“Let me get this straight,” John Mulaney said. “You’re hosting a ‘future of AI’ event in a city that has failed humanity so miserably?”

He added, “Can AI sit there in a fleece vest? Can AI not go to events and spend all day at a bar?”

See Also

  • https://en.wikipedia.org/wiki/Large_language_model
  • ai;dr
  • https://github.com/ai-robots-txt/ai.robots.txt
  • How to turn off training Grok on your tweets: uncheck this setting: https://twitter.com/settings/grok_settings
  • autocorrect
  • 2022-11-13 comic: By Bots, For Bots 221114.n.bots_.jpg
  • 2024-09-20 Tantek Çelik: Can we have CC-NT licenses for no-training (ML/LLM, GenAI in general), just like we have CC-NC for non-commercial?
  • https://www.aitrainingstatement.org/
  • "I’m intrigued and bored at the same time: I find it quickly becomes quite tedious. I have a sort of inner dissatisfaction when I play with it, a little like the feeling I get from eating a lot of confectionery when I’m hungry. I suspect this is because the joy of art isn’t only the pleasure of an end result but also the experience of going through the process of having made it." AI’s Walking Dog by Brian Eno
  • ai.txt
  • blog about GenAI / LLM related failures and/or organizations seemingly doing silly things with such technologies: https://pivot-to-ai.com/
  • List of LLM bot user agents
  • Criticism of use of (and assumptions around use of) LLMs in work tasks, and especially to "replace" humans: https://www.rawsignal.ca/newsletter-archive/none-of-this-is-real-and-it-doesnt-matter/
    • "Photo by max laurell.


      We have officially entered the tree-falling-in-forest-era of white-collar work. Our friend who is a big dude in fundraising just declared the end of the pitch deck. He pointed to decks that were auto-generated and then auto-summarized. And asserted that if no one is writing it and no one is reading it, there’s just no point at all in having that piece of collateral.



      Over here in management-training land, our bosses are so busy reading the summaries of all the meetings they didn’t attend, there’s no time for anything else on their calendars. They ask to send notetakers to management training in their stead. Or get a summary of what they missed. We get it. Genuinely, there’s more to do than there are hours in the day.



      The problem is that the summary approach falls way down on implementation. Leadership transformation nerds will tell you there is no growth or development without reflection. So if we send you a summary of management training that says: Reflect on how you’re showing up as a leader and where the tools you are deploying are in service of you, your team, your org. And where they are not. You can read the summary meant to prompt reflection but that is not the same as doing any actual reflecting.



      Faster, sure. But comically ineffective for the stated task.



      The comedy era of automation. Where an entire meeting is attended by notetakers but no participants. And if you’re quick to point out that surely, of all meetings, this meeting could have been an email. Take a beat and pull it all the way through. Too busy to write that email? Just generate it instead. And if you’re too busy to read any of your email? Well, there’s a tool for that, too. Here at the bottom, we find ourselves back at the top. What is the point of auto-generating only to auto-summarize?



      Recursive malfunction



      It’s ok to want to be productive at work. It’s even cool to work on a thing you care about and want to see happen in the world. And it’s not a problem to use tools in the course of your work. But there’s a point where we tip from productivity gains into losses. More notes from more meetings that no one attends does not make us a more productive workforce. More emails that no one reads doesn’t mean more alignment or shared context or a smarter workforce.



      This is silliness. This is the output of work without any of the inputs. This is sound. And fury.



      And there’s no fucking point to any of it.



      We’ve all worked with someone, usually a junior person. Often an intern. First to sign on in the morning. Last to clock out at the end of the day. And, when pressed about how they spent their time, they attended a lot of meetings. Read all of the emails in their inbox. Caught up on slack. And that was about it.



      No one, not a single manager who has been at it for more than 10 minutes, points to that person as the most productive member of the team. An invaluable contributor. If you’re being generous, you say that person is coming up to speed. That they’re learning. And at some point, you pull them aside and talk about the difference between the theater of work and actual work.



      Because the wonderful part about goals and objectives and even mission statements is that we know what we’re trying to get done. No organization has a quarterly goal around sending more email (maybe Mailchimp does). Or sitting in the most meetings (maybe Zoom does). But unless it’s your core business, those are tools you use to achieve the thing, not the thing itself.



      This is the dream



      The people pitching all this will totally acknowledge that the current version is ridiculous. Well. Not all of them. Some of them are so heartbreakingly earnest about their blog-slop, or design-slop, or pitchdeck-slop, and can’t understand why you’re not impressed. Those ones seem to have a lot of time to post on LinkedIn.



      But like, many of them, the more polished ones, can admit that this is all still sort of silly. For those folks, the nonsense is an intermediate stage. Sure, having bots attend each other’s meetings is silly, but it doesn’t really matter, and it’s on the way to eliminating the need for meetings altogether. And yes, the bots aren’t doing a very good job at creative output yet, but, you know…for now we’ll have the humans come in and clean up the mess. And in 3-5 years we probably won’t need the humans at all. At least, this is the dream.



      That dream is rooted in a worldview. It’s a worldview we should talk about because it’s not only the root of the slop showing up in your feeds, but it’s also the root of the pressure to make your work into slop. The people who hold this worldview have been working hard to distribute and accelerate it. When Eric Schmidt talks about the “San Francisco Consensus,” it sounds like he’s talking about a set of AI predictions, but it’s clear he’s also talking about this.



      We’ve never seen them crisply articulate it, we don’t get invited to their group chats, but empirically it’s roughly:



      Companies exist to make money, and the most money goes to people who can find leverage — maximum impact per unit cost.



      Humans are a major cost.



      Most employees are over-entitled and under-accountable and have an inflated sense of their own worth. This is particularly true for people in creative fields — writing, design, art direction — or those with a background in humanities.



      Most work is also bullshit. Especially communication and coordination work — meetings, presentations, management conversations. Those things keep people from doing their real work, by which we generally mean lone genius individual contribution.



      Automating away the bullshit work is righteous because that work has no value.



      Automating away the humans is righteous because it will reduce the dependence on entitled employees, and produce an appropriate level of fear/obedience in the ones who remain.



      The coolest thing in the world would be to build a billion-dollar company as a single person running an army of AI bots.
      The major LLM vendors don’t say these things, but it’s implicit in a lot of how they talk about their offerings. No sensible leader would pay earth dollars for a bot that transcribes silent meetings with other bots, or tells you how much house paint to add to your chocolate chip cookies. But when those companies sell the promise of saving on payroll costs, people are buying the dream. When new benchmarks drop showing which human certification tests the chatbots can now pass, what human work they can now sort of do, people are buying.



      If you’re bought into the worldview above, this is the dream.



      This is a nightmare



      If this is the dream, we don’t want it.



      Like so much LLM output it’s not even very creative, just derivative and beige and vaguely unsettling. There’s nothing here that isn’t age-old grievances of business owners about their employees, dressed up in some scraps of poorly understood organizational psychology and modern finance. They aren’t re-inventing work, they’re re-inventing shit work. Because this? None of this is how great work happens.



      Remember great work? Genuinely great, thriving team, trusting each other, flow-state, catching the pass, you make me better and I make you better, holy shit I can’t believe we pulled that off, years later the people from that team are out in the world doing amazing things and making each other proud? Remember great fucking work?



      You don’t get that by removing the humanity, or the humans. It’s hard work to run a team or an organization well, we know, but the temptation to reach for some quick fix at the expense of your people never works out. If your teams are underperforming, there are concrete steps you can take. If you have an accountability problem, fix your management. The solution to your shit meetings is to run better ones. These are all skills you can build, but you won’t get them by skipping the hard parts and skimming a summary.



      Building a billion-dollar company alone with your bots is not the coolest thing in the world. It’s honestly a pretty sad and lonely image, and relies on a massive amount of damage along the way to make it possible. But building something enduring, something with real impact in the world, alongside a team of folks you trust and believe in, and who trust and believe in you, and bringing out the best in each other? Yeah, that’s the dream.



      — Melissa & Johnathan

      Upcoming Programs



      Actually good, actually useful training





      Online
      Oct 9-Nov 13


      BPX Fall
      Register now"