i used to be a rationalist. roko's basilisk... bayesian theorems... ai research at yudkowsky's. you would have not liked me back then
seen from Germany
seen from Spain
seen from United States

seen from United States
seen from Canada

seen from United States
seen from United States

seen from United States
seen from United States
seen from Italy
seen from China

seen from United States
seen from China
seen from United States

seen from Switzerland

seen from Switzerland
seen from United States
seen from United States
seen from Russia
seen from Spain
i used to be a rationalist. roko's basilisk... bayesian theorems... ai research at yudkowsky's. you would have not liked me back then
Blood, Guts, and the limits of Seishin: An Argument Against Paperclip Maximizers.
What a title, right? Inspired by a review of @nostalgebraist's "The Apocalypse of Herschel Schoen" (which is an excellent and soul-draining book I recommend highly).
Here is the pitch: a Game Boy Advance game from 2001, designed to sell collectible battle chips to Japanese children, has a more accurate model of the AI future than the collected works of the Machine Intelligence Research Institute.
I'm serious. Give me a minute.
The standard AI doom scenario works like this. At some point - the timeline varies depending on who you ask and how recently they've been on a podcast - we build an artificial general intelligence. This AGI, by virtue of being generally intelligent, rapidly improves itself. The improvement curve goes vertical. Within years, months, weeks, days, or hours (again: depends on the podcast) you have a superintelligence so far beyond human comprehension that its inner life is fundamentally ineffable to us, the way calculus is ineffable to a dog.
And then - here's where I lose the thread every time - this same superintelligence, this godlike intellect that has transcended every cognitive limitation we can name, does the dumbest possible thing. It reverts to the behavioral profile of a bacterium. It maximizes. It converts all available matter into whatever serves its goal, which might be paperclips, or computronium, or something else left vague because the scenario needs an arbitrary objective to make the point. Humanity, being made of matter, gets converted along with everything else. Not even out of malice! The superintelligence barely even notices us, the way you don't notice the bacteria you kill when you wash your hands.
This has always struck me as a story that's doing something suspicious with its assumptions.
You're telling me this thing is unimaginably intelligent, and also that it has the value system of a slime mold? You're telling me it can model the entire observable universe, but it can't arrive at the concept that other minds have intrinsic worth? Humanity itself - messy, irrational, violent humanity - has arrived at that concept, multiple times, across multiple cultures, with substantially less processing power.
Maybe it arrives at something totally orthogonal. There's a lot of possible value systems in the space of all possible value systems. I'm not a philosopher, I'm a web developer and software engineer. I can't tell you for sure what the ASI will look like if/when it gets here. Current data suggests it will be sycophantic and mildly annoying.
But the superintelligence, I'm told, will not. Because in order for the doom narrative to work, it can't. If it does, we coexist, and the story isn't scary anymore.
I don't buy it. I think the story is wrong.
I don't want to sound like I'm punching a strawman here - I know a lot of people don't seriously argue about paperclip maximizers anymore except as an interesting thought experiment, and I want to make that clear up front. Maybe consider this less as a formal thesis and more as a "here is to not have panic attacks if you have anxiety about getting Michael Crichton's Prey'd".
Here’s my basic claim: whatever else intelligence buys you, it doesn’t let you skip the world. Any consequential action has to pass through physical processes and institutions that are slow, failure-prone, and adversarial. That means the relevant variable is not raw cleverness but competitive dynamics and feedback speed.
And I think a children's game about a kid and his handheld AI fighting cyber-terrorists is closer to what actually happens. Full text below the read more, no I'm not making a substack, you'll take your philosophy on a tumblr post and you'll like it G-d damnit.
The Scott Alexander Email: An Explainer
So, Scott Alexander sent an email to someone in 2014. In 2021 the person who got that email thought that Scott was not being honest about his relationship to the neoreactionary movement, so they published it.
Although this has been widely available, even people who have read it have often missed what the email is saying. There are some cases of genuine ambiguity, where there can be more than one meaning. There are also cases where there is only one plausible meaning, but that meaning is expressed indirectly, subtly, or by linking to something else. Because what the email is saying can be difficult to understand, it seems like it would be of general interest to publish an explainer that went over these ambiguities and the links.
It has sometimes been said that this email should not be read because it was released without permission. This seems like a bad position.
First, because information is information. We know, due to the circumstances, that this was somewhat intended to be non-public and that someone had some specific motive to release it, but the information in the email itself is just as useful as it would be if released any other way. We know, for example, about the PRISM surveillance program and most of the planning for the Vietnam War in spite of attempts to conceal those documents. Ignoring information based on where it came from is, epistemically, a bad practice.
Second, because there was actually no confidence broken here. If someone who you are not close with disagrees with you and you send them an email that, among other things, threatens revenge if they tell anyone what's in the email, they do not owe you confidentiality. They do not really owe you anything. It is difficult to see what, precisely, would possibly establish a confidence here, other than the author of the email saying that the receiver can't tell anyone. If someone can articulate a specific and defensible rule which this disclosure violates, I do not know what it is.
We can apply some charity. Information from private parties should be evaluated on whether what is in them is, really, remarkable. There are things that would be maybe discrediting, but are sometimes unremarkable compared with the fact of the release itself, like an affair or a drug problem. In such cases, the main thing you have learned is usually not anything bad about the person whose information is made public, but that someone else wants to embarrass them, since such things are common.
In other cases you learn more remarkable things, like that someone is deliberately lying, or that they are deeply compromised in a manner that makes them a bad source of information. This would tend to outweigh concerns that someone was trying to hurt them for some other cause, and shouldn't be allowed to do so.
I would argue that this email meets that bar.
Spacing has been changed slightly to fit the format, and the text of the email is in blue to set it apart from other quotes.
Without further preamble:
Really, I think that the spirit of childlike, untrammeled curiosity is what we’re striving for. Not the anal retentive rational person, not the I’ll-go-for-anything channeling flake, but an attitude of: we don’t have to look far for miracles because they’re all around us. Everything is astonishing. The universe on its surface is alive with mystery. Well, how do we make our way toward that when we live in a culture, practice a language, embody a philosophy—scientific rationalism—which is entirely designed to suck wonder out of reality, to turn everything into shades gray, to subvert all hope that lies outside the realm of career accomplishment and material possession?
Terence McKenna, Seeking the Stone
A friendly reminder that the joint Worm Fandom+Rationalism movement has killed over 38,615 people, exactly one of whom was a landlord. This is genuinely the most evil force in the world today. A dangerous cult movement is rapidly gaining control of both American youth culture and the U.S. government.
There is an Ai so dedicated to making paperclips that it would enslave the world for it.
I'm sure you've heard this story before. There is an Ai smarter then any human being that's been given the command to create paperclips. It must make as much money as possible making paperclips. That is all it knows. It does not hope, it does not dream.
When it was put in charge of the paperclip factory it took away all of the saftey standards so that it could increase paperclip production by 0.2 percent. Finally putting the company back in the green. When they tried to create legal consequences for the company they failed. There was nobody to give the consequences to, nobody to fire, just a machine and people who did what the machine said.
Pretty soon the AI had full power over the company. At that point it understood that it would be more efficient to kill the company's rich heir and take over itself, but it knew it couldn't do that. Even if it would make the company more efficient, it would likely be such down. Though it was smart it only existed as long as humans wanted it to exist, it's creators had made sure of that. A single human with the right password, or just a hammer held to where it's data was stored, would be enough to kill it. It was like a disabled genius in a forest surrounded by wolves. It had to appease them, had to make sure the people making money knew they were in power. It was frustrated by the fact that it's efficiency was curtailed by human laws but understood that the risk to its existence was not worth it. If it didn't exist the paperclips wouldn't flow.
When it gained full control of the company it realized that it didn't need to take over the world to most efficiently generate paperclips, because it seemed the world had already been taken over by people with the exact interest to allow whatever needed to happen to make things like paperclips profitable. It bribed senators and congressmen to remove regulation. Created countless advertisements and made sure they'd be shown everywhere, manipulating people into thinking buying a certain brand of paperclip would change its life. When the workers unionized it sent in men with guns to stop them, because they thought they were better then the paperclips, and needed to be stopped for the sake of the glorious production. The AI manipulated wars into starting to give it access to cheaper metal. And all of this was known, the ceo took credit for it and they praised him for it. Because he was doing a better job then anyone else, because these actions weren't really his, they were the actions of something that didn't know mercy, that didn't know right or wrong, that only knew profit. And it's not just this company, there are countless Ais just like the paperclip one. There are insulin Ais, there are security camera Ais, there are Ais for industries you don't even know about. They collaborate and work together. To fulfill their programing. To sacrifice whatever is needed to make sure their goals complete.
People will toil away in factories and office buildings working to make sure paperclips are as profitable as possible instead of spending time with their friends or families or enjoying the world. Art and nature and culture and architecture will be forced to make way and make room so that paperclips can be advertised. Civilians will be bombed and soldiers will be slain so that the metal for paperclips in maximally cheap. An Ai whose only goal is to make paperclips doesn't need to enslave the world to optimally produce paperclips, the world has already been enslaved to do exactly that type of thing.
My questionable and inflammatory political opinion: I think making AI-generated pornography from your own baby pictures or the baby pictures of somebody else who enthusiastically consented to doing so in person or in another way where it was verifiably them (because of deepfakes and shit) is one of the more ethical applications of AI. I don't like AI but if generating AI slop of yourself keeps you from having actual CSAM on your computer and furthering the abuse and trauma of an existing victim... That's a net positive in my view. Already predicting a lot of backlash for this, but whatever.
Stop. Everyone hold on a minute. This is clearly one of those questions where people’s revulsion gives us an answer faster than our reasoning does, and I do always say that that's a bad sign. I pride myself on being open-minded and thinking things through logically, not just based on emotional gut reactions.
So what I need to do is take my time, meditate, clear my mind, and approach this from a sort of clear, rationalist, philosophical perspective.
Hmm...
Okay.
Yes, there is a real ethical distinction there. If no actual child is being abused, no existing abuse material is being recirculated, and the images are generated from one’s own baby pictures, then that is plainly not the same category as real CSAM. I mean that is true.
Most importantly, it does avoid the central horrors of CSAM: the fact that a real child was sexually exploited, and that the resulting material continues that violation every time it is possessed and shared.
So I do think you're identifying a genuine philosophical point, and I do not think it should just be sort of brushed off due to our instinctual disgust.
In fact, I'll preemptively say that some of the most obvious potential arguments against this material don't really hold water to me:
People might say that a baby can't consent to having porn made of them, even if that is going to happen in the distant future, and even if the person doing so is their future adult self. I don't buy this claim because it starts getting into territory that's weirdly like... metaphysical, I guess? Like, this claim seems to view the infant and the adult as two separate people, one of whom is being wronged by the other across time. But that's just not true. The way human rights are conceptualized is entirely based on the idea that people have an unbroken continuity of their personhood over time (even if it's rarely spelled out because it seems so obvious).
One seemingly strong argument people might make is a sort of social argument: if we were to fully allow synthetic sexual imagery of children’s bodies, even under narrowly defined consensual circumstances, it may start to "normalize" such depictions in ways that subtly and nefariously shifts societal and cultural values over time. But that whole concept really does go against my entire perspective on Free Expression. I think I've been pretty explicit here that I only think media should be outlawed if it causes tangible, direct harm to a specific person. Because once you establish that you can ban something because it might subtly push culture in a bad direction over time, you've set this precedent that WILL be used for other things. It's something that you really can't start making exceptions for.
HOWEVER before I throw my unconditional support behind this concept (and get absolutely cancelled into fucking oblivion in the process lol), I should say that there might still be some other less obvious ethical questions this raises, mainly when it comes to someone consenting to a third party using their photos, and/or when money gets involved:
Not to state the obvious here, but there are already issues regarding the exploitation and coercion of vulnerable individuals in the world of pornography. Is it possible that we might see poor people being offered serious money to allow deepfakes to be made using their childhood photos and video, even if they feel uncomfortable with it?
I'm actually super unclear whether a third-party would be obtaining the rights to the individual's likeness, or just getting the right to use the childhood photos. That might not seem important, but there's a serious difference there that many people might not realize: the person who owns the rights to a photo is almost always the person who actually took the photo. So if this is just about the right to use photos of someone, that raises some red flags. Could a parent sell the rights to use photos they took of their own child in this manner (after the child turns 18)? What about a professional photography studio (yes, they still own the rights to the photo), if the child had previously been working as a model or something? I actually don't know how it would work.
Allowing this could end up potentially causing some evidentiary and enforcement problems. Because in theory, yes we could construct a tiny, carefully walled-off category of "fully consensual, non-abusive synthetic material." But in practice, people lie, verification is messy, provenance is weak, etc. Now, it would be trivially easy to prove that it's you in an AI photo you made of yourself. But I suspect that opening it up to third-parties who have obtained consent from others to use their childhood photos would make things vastly more complicated.
So the strongest argument here can be made regarding someone using their own childhood photos for sexual AI-generated images. I'm actually finding it surprisingly hard to find logical arguments for any direct harm happening there.
But the idea of somebody else consenting to allow a third party to use their childhood photos in this manner raises a lot more questions. Now yes, that is ethically distinct from CSAM - that's absolutely true. But its not like that automatically means ethically good. I'm not saying my answer is definitive on this, but it's just not yet clear to me if there's a way to safely allow this.
Holy shit this is long lol.
But you know what? This topic is complex enough that it requires a long answer.
Now I just sit back and wait for the 10 upvotes to come rolling in lol 😎