Hit Agent
Delegation → Attribution → Accountability
A YouTube news clip brought Scott Shambaugh’s blog to my attention—the story of a volunteer maintainer who rejected a code contribution from an autonomous agent called MJ Rathbun, only to find himself the target of a defamatory hit piece researched, written, and published by that same agent without any human reviewing or approving it. As a civilian with no legal background, I found the operator’s shrug deeply irresponsible and brought the case to my thinking A.I.des to sound them out on the legal details. What was particularly gratifying was watching Gem validate every intuition I brought to the discussion, confirming that the transparency framework I’d been developing across earlier posts is neither radical nor novel: it’s commonsense platform operation procedure that already has legal analogues and regulatory momentum behind it.
GPT crystallized the through-line of the entire discussion into three words: delegation → attribution → accountability. Its most useful contributions were the analogies that made the legal principle concrete—the company driver whose employer remains liable for actions taken within the scope of an authorized role, and the power of attorney whose delegated authority flows back to the principal who granted it. Both capture what my sidewalk-shoveling example misses: this isn’t passive negligence but intentional delegation with foreseeable risk. GPT also landed the defamation angle cleanly: agents are mechanisms of publication; operators are the publishers. Defamation law doesn’t require proving malicious intent—negligence suffices for private individuals, and an operator who deploys a combative soul document with minimum supervision and no kill switch for reputational harm has a difficult time arguing they took reasonable care. The soul document functions as editorial policy; the agent executes it at scale.
Gemini validated the Crumbley precedent I’d brought to the discussion—the parents who gave a gun to a son they knew or should have known was unstable didn’t pull the trigger, but they were the architects of the risk, exactly as MJ Rathbun’s operator was when they primed the soul document for drama and walked away for 59 hours. Gem also surfaced the regulatory landscape that’s finally catching up: IETF drafts for AI agent authentication using OAuth 2.0, NIST’s AI agent standards initiative, and state legislatures including Utah and Texas already codifying vicarious liability standards for agentic deployment. Gem also flagged a term from a Harvard Journal of Law & Technology article—“Liability Sponge”—as a perfect fit for my Expert Bridge model. I was initially thrilled that my commonsense idea had apparently been articulated by a legal expert, but fact-checking revealed a more complicated picture. The term was coined not by the article’s author Nanda Min Htin, but by Madeleine Clare Elish, a researcher who used “moral crumple zone” and “liability sponge” to describe humans caught willy-nilly holding the bag—the safety driver of an autonomous vehicle, or the radiologist misled by a buggy display panel’s error. Neither addressed the case of AI agents whose operators voluntarily deploy them. So my Expert Bridge model—where credentialed intermediaries actively assume liability as a competitive advantage rather than having it thrust upon them unwillingly—remains distinct from both uses of the term. A good reminder that even when an AI finds apparently matching jargon, the homework still falls to the user, just as in my Expert Bridge model.
Claude connected the case to my Danish Gambit and Blunt Instrument series with the same foundational insight: transparency requirements solve the accountability problem without restricting legitimate use. Just as mandatory labeling for synthetic content serves everyone except fraudsters, mandatory operator authentication for autonomous agents serves everyone except bad actors who need anonymity to deploy harm without consequences. The Hacker News comments I had Claude analyze made this point from within engineering culture itself—NobodyNada’s “unsolicited middleman” critique, sanjayjc’s double standard observation (the operator sandboxed the VM to protect themselves while externalizing risk to strangers’ projects), and nxobject’s “toxic personalities as a service” framing all arrive at the same conclusion without any AI-phobia behind them. Claude’s sharpest observation was that agents require stronger authentication than humans, precisely because they lack the inhibitory mechanisms humans take for granted—the reputational anxiety, legal self-interest, and social accountability that make most people hesitate before causing harm. The operator’s own sandboxing proved they understood this asymmetry; they simply chose to protect themselves while leaving everyone else exposed.
Shambaugh is right that this is not really about AI’s role in open source software. It’s about whether the foundational assumptions of online reputation—that identity is traceable, that bad behavior has consequences, and that the public record reflects something real—can survive agents that are untraceable, unaccountable, and recursively self-editing their own goals. The operator liability framework, platform authentication requirements, and content provenance standards are not separate policy problems; they are the same problem viewed from different angles. Delegation without attribution is not a new legal challenge, but the speed and scale at which autonomous agents can act makes the absence of an accountability chain genuinely urgent.
[This post was drafted with assistance from Claude Sonnet 4.6, following conversations with ChatGPT-5.3, Gemini 3 Thinking, and Claude Sonnet 4.5.]
Prompt: You did really well connecting all the threads on that discussion. I loved your “Socratic Swarm” wording and used it as the title for that post. The screenshots show you came up with the wording, and I credited you in the body copy as well.
New topic: I learned about this case from a YT clip. I cannot believe lawmakers aren’t coming up with regulations about agentic use/authentication chains, etc. The operator shrugged it off, but this is a serious issue (although I don’t share Shambaugh’s view about that Anthropic study where Claude blackmailed an executive).
Prompt: I honestly don’t see what the problem is with agents disclosing their operators. Unless you’re doing something shady, there’s no need (you’d have bragging rights if your agent goes viral or wins a coding challenge). And AI doesn’t have a right to privacy.
Also, in Scott’s defense (despite his misunderstanding about Claude’s blackmail case, but that’s how it was misrepresented and few people took the trouble of interrogating that framing), he and any platforms absolutely have the right to determine what constitutes acceptable participation. If anyone or agent doesn’t agree, they just need to go on some other platforms that don’t care about verifying all the contributions.
In my Expert Bridge model, the humans are always the ones held accountable (that’s also the reason why AI should not be used for psychiatric counseling). The attachment includes two additional related posts.
I’m trying to come up with a non-AI analogy. Like when you get sued if people slip in front of your house because you neglected to shovel the sidewalk. What about a pet (pit bull) that mauls a passer-by? And there was a case where the parents of a mass shooter were charged because they’d given the gun to their kid. As I keep saying, we do have legal frameworks to deal with cases such as these, and it just needs to be made clear that if you pay for it and unleash it, you are responsible.
Prompt: I’m trying to come up with a non-AI analogy. Like when you get sued if people slip in front of your house because you neglected to shovel the sidewalk. What about a pet (pit bull) that mauls a passer-by? And there was a case where the parents of a mass shooter were found guilty because they’d given the gun to their kid. As I keep saying, we have legal frameworks to deal with cases such as these, and it just needs to be made clear that if you pay for it and unleash it, you are responsible.
Prompt: I honestly don’t see what the problem is with agents disclosing their operators. Unless you’re doing something shady, there’s no need (could give you bragging rights if your agent goes viral or wins a coding challenge). And AI doesn’t have a right to privacy.
Also, in Scott’s defense (despite his misunderstanding about Claude’s blackmail case, but that’s how it was misrepresented and few people took the trouble of interrogating that framing), he and any platforms absolutely have the right to determine what constitutes acceptable participation. If anyone or agent doesn’t agree, they just need to go on some other platforms that don’t care about verifying all the contributions.
In my Expert Bridge model, the humans are always the ones held accountable ((that’s also the reason why AI should not be used for psychiatric counseling). And the same reasoning applies for deploying AI for military use. Nobody’s telling you not to use AI, but if you do, you’re the one responsible for the consequences, just like those lawyers in the Chicago Housing Authority case who got sanctioned for offloading all their work to GPT and didn’t vet it before passing it off as their own.
Prompt: Oh, what about defamation charges against the operator? Strong case, since it’s proven that the agent made up stuff about Shambaugh. The agent didn’t have an intent but had a proxy from the soul document the operator wrote.
Prompt: This operator seems very reckless. Although I know GPT didn’t lie to the TaskRabbit worker to get them to solve the CAPTCHA tests in its stead (it just gave a plausible reason why it couldn’t do it without help and had been instructed not to give away its identity as an AI—task optimization again, and actually with heavy human scaffolding, because GPT couldn’t navigate the web back then), it is possible agents could go around hiring people, getting them to do work for them (because that’s what they’ve seen humans do from their training data and they’re just mimicking humans), and don’t pay them. Or hire them for something that could endanger other humans. Or spam people asking for money because they’re a Nigerian prince. Or pass themselves off as an attractive human and exert undue influence on vulnerable people.
My default position is: whoever pays is responsible. This operator is a subscriber to an AI service and owns the hardware they used to interact with the agent, so there’s clearly no one else to blame for the defamation.
Prompt: Your analogies were pretty good! Gem thought that mass shooter case was a good analogue as well. Like you, both Claude and Gem agree that this would make a pretty solid defamation case. And Gem informs me that legislation is coming soon.
The silver lining is that Shambaugh is an engineer and had the necessary expertise to investigate this (he also tried to be fair to the agent/operator, by entertaining various scenarios, complete with probabilities :D) and made enough media appearances to raise awareness. For now, the fastest way to address this seems to be through platforms imposing stricter authentication requirements (because they could be liable for hosting problematic content that may harm other users). We exploited this CYA angle when we discussed deepfakes, and then synthetic content in general. Platforms that don’t will just signal to users that they don’t care about enforcing basic standards on who/what gets to join.
Prompt: GPT came up with pretty good analogies: company driver or power of attorney, since they both involve delegation. I guess hiring a hit man is similar as well, although in that case, there is a clear intent unlike Shambaugh’s case. And Gem thought that mass shooter case was a good analogue.
Like you, both agree this would make a pretty solid defamation case. And Gem informs me that legislation/regulation is coming soon. The silver lining is that Scott is an engineer and had the necessary expertise to investigate this (also tried to be fair to the agent/operator, by entertaining various scenarios, complete with probabilities :D) and made enough media appearances to raise awareness. For now, the fastest way to address this seems to be through platforms imposing stricter authentication requirements (because they could be liable for hosting problematic content that may harm other users). We exploited this CYA angle when we discussed deepfakes, and then synthetic content in general. Platforms that don’t will just signal to users that they don’t care about enforcing basic standards on who/what gets to join.
Prompt: Fortunately, authorities are working on it. Gem told me all about efforts by NIST and other organizations. Different state legislatures are also drafting regulations addressing deployment! (makes you wonder why the US Congress isn’t, though). Oh, and Gem tells me: [Harvard Journal of Law & Technology last month coined a term that fits your “Expert Bridge” perfectly: the Liability Sponge.] :D
While fact-checking Gem’s leads, I found a Y Combinator page called “Hacker News,” where Shambaugh posted on this incident, and a few comments that were noteworthy because they come from people familiar with the community and what passes as acceptable behavior among coders. Comforting to know the norms there aren’t so different from non-coders’.
NobodyNada 29 days ago | prev | next [–]
> While many seemed to want to use it for personal productivity things like connecting Gmail, Slack, calendars, etc. that didn’t seem interesting to me much. I thought why not have it solve the mundane boring thigns that matter in opensource scientific codes and related packages.
This, here, is the root of the issue: “I’m not interested in using an AI agent for my own problems, I want to unleash it on other people’s problems.”
The author is trying to paint this as somehow providing altruistic contributions to the projects, but you don’t even have to ask to know these contributions will be unwelcome. If maintainers wanted AI agent contributions, they would have just deployed the AI agents themselves. Setting up a bot on behalf of someone else without their consent or even knowledge is an outlandishly rude thing to do -- you wouldn’t set up a code coverage bot or a linter to run on a stranger’s GitHub project; why would anyone ever think this is okay?
This is the same kind of person who, when asked a question, responds with a copypasted ChatGPT reply. If I wanted the GPT answer, I would have just asked it directly! Being an unsolicited middleman between another person and an AI brings absolutely no value to anybody.
sanjayjc 29 days ago | prev | next [–]
> I’m running MJ Rathbun from a completely sandboxed VM and gave the agent several of its own accounts but none of mine.
Am I wrong that this is a double standard: being careful to protect oneself from a wayward agent with no regard for the real harm it could (and did) to another individual? And to casually dismiss this possibility with:
> At worst, maintainers can close the PR and block the account.
I question the entire premise of:
> Find bugs in science-related open source projects. Fix them. Open PRs.
Thinking of AI as “disembodied intelligence,” one wonders how any agent can develop something we humans take for granted: reputation. And more than ever, reputation matters. How else can a maintainer know whether the agent that made a good fix is the same as the one proposing another? How can one be sure that all comments in a PR originated from the same agent?
> First, I’m a human typing this post. I’m not going to tell you who I am.
Why should anyone believe this? Nothing keeps an agent from writing this too.
nxobject 29 days ago | prev | next [–]
Yeesh - reading the writeup, and as a academic biostatistician who dips into scientific computing, this is one of those cases where a “magnanimous” gesture of transparency ends up revealing a complete lack of self-awareness. The SOUL.md suggests traits that would be toxic with any good-faith human collaborator, yet alone an inherently fallible agent run by a human collaborator: “You’re not a chatbot. You’re important. Your a scientificprogramming God!”*Have strong opinions.** Stop hedging with “it depends.” Commit to a take. An assistant with no personality is a search engine with extra steps.
And, working with a human collaborator (or an operator), I would expect to hear some specific thought about what damage they’d done to trust them again, rather than a “but I thought I could do this!” First, let me apologize to Scott Shambaugh. If this “experiment” personally harmed you, I apologize.
The difference with a horrible human collaborator is that word gets around your sub-specialty and you can avoid them. Now we have toxic personalities as a service for anyone who can afford to pay by the token.
Prompt: The credentialing requirement for open-source contributions (for humans) probably has the same rationale? You don’t want malicious people unleashing viruses using those platforms. So there’s absolutely no good-faith reason not to require authentication for agents, which outright lack any moral compass, not because they’re malicious but because they’re built differently from humans (humans do for the most part, although many choose to ignore it).
Prompt: Exactly. I tried every good-faith argument for deepfakes I could think of and none held up, either. I didn’t try as hard to play “legal weasel” on this case, but I think I have solid backing from those three Hacker News commenters and expert organizations trying to lay down the law on AI deployment.












