Overarching Warnings/tags: Child abuse, slavery, implied/referenced non-con, torture, abuse, violence, harm to children, bullying
(these tags will be updated on an as-needed basis. if I miss anything, please drop me a line and I’ll add it to the warnings)
Background - this contains the basic buildup for Mirjahaal. May be edited if relevant context is needed.
Count Dooku, pt. 1 - this is Dooku’s first written appearance in Mirjahaal, but not his first chronological actions.
Text of tweet under the cut because it is loooong.
But... Stochastic Parrots.
Timnit Gebru was fired from Google in December 2020 for refusing to retract a research paper, and every single warning that paper made about large language models has now happened at a scale the industry spent 4 years trying to make people forget about.
Her name is Timnit Gebru.
She co-led the Ethical AI team at Google. She co-wrote a paper called "On the Dangers of Stochastic Parrots" with Emily Bender at the University of Washington and two other researchers. The paper was 14 pages long. It was submitted to a top AI ethics conference. And it was the reason Google decided that one of the most senior Black women in AI research could no longer work there.
The story Google told publicly was that she resigned. The story she told, confirmed by 2,695 of her colleagues in an open letter, was that she was fired by email while on vacation because she refused to either retract the paper or remove her name from it.
The paper had not even been published yet.
Here is what she actually wrote, and why every prediction inside it has now come true.
The first warning was about scale itself. Bender and Gebru argued that training ever-larger models on ever-larger scrapes of the internet would produce systems that appeared fluent but had no actual understanding of language. They called these systems stochastic parrots because they would repeat patterns from training data with statistical confidence and zero comprehension. The paper predicted that this apparent intelligence would fool both users and developers into trusting outputs that were structurally incapable of being reliable.
This was 2020. GPT-3 had just come out. The paper predicted the hallucination problem before anyone had a word for it.
The second warning was about bias amplification. The paper documented in detail that internet-scale training data contains systematic overrepresentation of dominant viewpoints and underrepresentation of marginalized ones. The models would not just absorb this bias. They would amplify it, because the optimization process rewards confident outputs, and confidence in language patterns tracks frequency in the training set.
The prediction was that hiring tools built on these models would discriminate against women. That healthcare triage tools would underperform on Black patients. That loan approval systems would entrench inequality while presenting their decisions as neutral algorithmic judgment.
Every one of those things has now been documented in deployment.
Amazon's hiring algorithm penalized resumes that contained the word "women" in any context. Healthcare risk scoring algorithms used by major US hospitals were found to systematically underestimate the medical needs of Black patients. Apple Card's credit algorithm gave wives credit lines 10x lower than their husbands for the same financial profile.
The third warning was about environmental cost. The paper calculated that training a single large language model produced emissions equivalent to the lifetime output of 5 cars. The prediction was that the race to scale would create an environmental footprint that would eventually rival entire industries.
In 2024, Google's emissions were up 48% from 2019, and the company explicitly blamed AI infrastructure. Microsoft's were up 29%, same reason. Both companies have now quietly abandoned the climate commitments they were publicly celebrating the year Gebru was fired.
The fourth warning was about documentation. The paper argued that the training datasets being assembled were too large for anyone to actually audit. Nobody at Google, OpenAI, Meta, or any other lab could tell you with confidence what was in the data their models were trained on. This was not a temporary problem to be solved later. It was a permanent feature of the approach.
In 2023, researchers discovered that the LAION-5B dataset, used to train Stable Diffusion and other major image models, contained thousands of images of child sexual abuse material. The companies that had trained on the dataset had no way of knowing. The paper predicted that category of failure 3 years before it was found.
The fifth warning was the one Google cared about most.
Bender and Gebru argued that the deployment of these systems would centralize linguistic and cultural power in the hands of the small number of companies that could afford to train them. The internet would become a place where the dominant voice was a statistical average of dominant voices, presented as a neutral assistant. Languages underrepresented in the training data would degrade over time as more web content was generated by these systems and fed back into the next training run.
This is now happening in real time. A 2024 study found that 57% of new web content in English is AI-generated or AI-assisted. Researchers studying low-resource languages have documented active degradation in translation quality, because the synthetic content fed back into training is itself worse in those languages.
The paper Google fired her for predicted the model collapse problem before model collapse had a name.
The mechanism behind why this all happened is the part of her work that nobody quotes.
Gebru's argument was not that AI is dangerous in some abstract sci-fi sense. Her argument was that AI is dangerous in a very specific structural sense. The technology was being built by a small group of researchers who shared similar backgrounds, worked at similar companies, and were rewarded for shipping products faster than competitors. The incentive structure made it impossible for safety, ethics, and bias concerns to slow anything down. Anyone inside the system who raised those concerns was either ignored, sidelined, or removed.
She was making that argument from inside Google.
Then Google proved her right by removing her.
The team Google had built to make sure their AI was safe was dismantled in 90 days because they did the job they had been hired to do. Margaret Mitchell, the other co-lead of the Ethical AI team, was fired two months after Gebru for searching through her own emails for evidence of how Gebru had been treated.
Gebru did not stop. She founded DAIR, the Distributed AI Research Institute, in 2021. The mission is to do AI research outside the control of the companies that have a financial interest in not hearing the answers.
Every prediction in the Stochastic Parrots paper has now been validated by deployment. Hallucinations are an industry-wide problem the largest labs cannot solve. Bias amplification has been documented in hiring, healthcare, lending, and criminal justice. Environmental costs are larger than those of entire small countries. Training data audits remain impossible. Model collapse is an active research crisis at every major lab.
The question worth sitting with is the one almost no one in the industry will say out loud.
Every researcher with the technical credibility to call out these problems watched what happened to her in December 2020 and made a calculation about their own career. The number of people willing to speak publicly about safety and ethics issues inside the major AI labs collapsed after that firing and has not recovered.
The researcher Google fired for warning about exactly what is now happening was right.
The company that fired her is now the second-largest deployer of the technology she warned about.
And the people inside that company who agree with her are not allowed to say so.
Longtime readers may be aware of how much I relish an excuse to bully a company, so I'm sharing the wealth;
Clothing company Patagonia is currently suing drag queen Pattie Gonia for "irreparable" harm to their brand.
To be clear; Pattie named herself after the region in South America.
So Pattie is asking people to politely ask Patagonia to drop the lawsuit.
I'm extending the invitation to all of you, because suing a drag queen for 'infringement' in the current political cultural landscape is vile.
Especially a drag queen who has raised millions of dollars for non-profits, uses her platform to raise awareness for climate activism, and fully aligns with Patagonia's apparent climate-conscious mission statement.
They're claiming they're suing for $1. They're actually asking her to stop using her name, and pay over $1 million in legal fees. They're straight up harassing her.
In contrast, drag queen Jan Sport has a Jansport bag line. It's that easy to just... work with a queen.
Anyway. Be respectful(ish), but feel free to be annoying on Patagonia's socials, asking them to 'DROP THE LAWSUIT'
random PSA, I know a lot of people use duckduckgo as a Google alternative search engine, but it always kind of annoyed me when I was using it because it felt like No Name Brand Google
I have switched to using Startpage.com and vastly prefer it. for one thing, instead of displaying an "AI summary" at the top of the search results (unless you turn it off, yes I know), it displays the first paragraph of the Wikipedia article, with link, whenever it finds one that's relevant.
also a waaayyyyy better sense of design than duckduckgo
also private, European based, least annoying search I've used lately (RIP old "don't be evil" Google)
i have one of those, scraped from multiple different rec posts:
Search Engines
Infinity Search is an alternative search engine with a special focus on privacy
DuckDuckGo is a popular search engine for those who value their privacy and are put off by the thought of their every query being tracked and logged. Uses bangs, ![site] for in-page search (sells your data to microsoft and draws from fucking bing)
WolframAlpha is a privately owned search engine that allows you to “compute expert-level answers using Wolfram’s breakthrough algorithms, knowledgebase, and AI technology.” A data search engine.
Boardreader is a search engine for forums and message boards. It allows you to search forums and then filter down results by date and language.
Based in France, Qwant is a privacy-based search engine that won’t record your searches or use your personal details for advertising. Uses “&” as a bang search.
Another privacy-based search engine is Search Encrypt, which uses local encryption to ensure that users’ identifiable information cannot be tracked. Metasearch across multiple engines.
Offering unbiased results from several sources, SearX is a metasearch engine that aims to present a free, decentralized view of the internet. Can be self-hosted.
Gibiru’s tagline is “Unfiltered private search” and that’s exactly what it offers. Requires AnonymoX Firefox add-on for privacy.
Disconnect allows you to conduct anonymous searches through a search engine of your choice.
Swisscows provides fully encrypted searches to protect your privacy and security. Built-in violence/porn filter cannot be overridden.
MetaGer offers “Privacy Protected Search & Find” through its anonymised search. A plugin will allow it to be made a default.
Gigablast is a private search engine that indexes millions of websites and serves real-time information without tracking your data, keeping you hidden from marketers and spammers. Variety of filtration and refinement options for searching.
Oscobo is a search engine that protects your privacy while you search the web. By not using any third-party tools or scripts, your data is protected from hacking and misuse. Has a Chrome extension to allow use in toolbar.
https://search.marginalia.nu/ an independent DIY search engine that focuses on non-commercial content, and attempts to show you sites you perhaps weren't aware of in favor of the sort of sites you probably already knew existed. Use old-school searching rather than query-based for the best results.
https://www.mojeek.com/
https://wiby.me/ - Its goal is to index as many personalized websites as possible, and NOT commercial sites.
https://4get.ca/ it works a lot like SearX, but honestly better. It doesn’t have its own index, but pulls from many others. I think it’s the best for research, since it allows you to search for answers from different indexes, is easy to configure, ad free, and avoids censorship as much as it can.
https://www.searchenginemap.com/ for more on how search engines relate to each other.
https://yep.com/ is a crawler
https://www.etools.ch/ retrieves from Google, Mojeek, Bing, and Yandex, like Searx
https://www.dogpile.com/
https://searxng.org/ (next gen Searx)
https://luxxle.com/ - possibly conservative?
https://presearch.com/ - good for academic?
https://kagi.com/smallweb - free/randomised Kagi.
Other Searchers
www.refseek.com - Academic Resource Search. More than a billion sources: encyclopedias, monographs, magazines.
www.worldcat.org - a search for the contents of 20 thousand worldwide libraries. Find out which nearby library holds the rare book you need.
https://link.springer.com - access to more than 10 million scientific documents: books, articles, research protocols.
www.bioline.org.br is a library of scientific bioscience journals published in developing countries.
http://repec.org - volunteers from 102 countries have collected almost 4 million publications on economics and related science.
www.science.gov is a US government search engine covering 2200+ scientific sites. More than 200 million articles are indexed.
www.base-search.net is one of the most powerful search engines for academic texts. More than 100 million scientific documents, 70% of them free.
https://cosine.club/ is an electronic music similarity search engine
Here is an article from NPR about it (May 22, 2026):
Carolina Milanesi, an independent technology analyst, said Google is trying to make its cash cow business — search — richer and more personalized, and it will make shopping easier. But there is a risk that users may have fewer choices about what to click.
"Right now it's: I ask a question, I get a bunch of answers and I feel that I'm in control as to which answer I take, or if I'm looking for something, which product I'm going to end up buying. That is going to be less so going forward," she said.
Milanesi envisions AI-enabled search and agents proposing products to consumers — perhaps even those they have requested — but with less clarity or choice around where it's coming from.
"If you're going to say: 'I want a pair of Jordans, go find them,' you're not necessarily sure what steps have been taken and whether the AI has used a source or a store that was paid for and therefore came up in the search results," she said, "or if AI actually went and did their due diligence and picked the best for me as a customer."
And here's one from Time magazine (May 20, 2026):
While Google already has “AI Mode,” the company will now power the whole search bar through its new Gemini 3.5 Flash model.
Instead of the classic list of blue links, Google Search will now also generate a custom page with an AI-generated summary of what you’re searching about, which will then trigger a conversation with AI Mode on the main page, allowing users to ask follow-up questions—similar to the kind of layout you would see when opening ChatGPT.
And a little more from Time's article on how this may affect the websites that we are trying to search for:
When Google first started implementing AI-assisted results, news publishers warned of “catastrophic” impacts on the industry, much of which relies on Google search to drive users to their websites.
Last year, news websites saw significant traffic declines as chatbots increasingly replaced Google search as the primary way to find sites and ask questions.
Small businesses also noted drops in traffic to their sites from Google, which has traditionally delivered customers.
Lily Ray, vice president of SEO strategy & research at Amsive, a digital marketing agency, warned as early as last year that Google’s planned changes to search are “going to have a devastating impact on the Internet.”
“It will severely cut into the main source of revenue for most publishers and it will disincentivize content creators who rely on organic search traffic, which is millions of websites, maybe more,” she told Technology Magazine.
This IS really bad - people are going to be doxxed, people are going to be stalked. This is a privacy nightmare.
Basically:
Amazon allows public wishlists for individuals, previously fulfilled through Amazon. This was a fairly private way to receive items anonymously from donors/fans/people online
Amazon is now allowing these lists to be fulfilled by third-party sellers.
Amazon will provide your name and address to said third party sellers
Third party sellers may or may not post tracking updates (aka: your address) to the person who ORDERED the item from your wishlist
Third party delivery companies may also post their delivery photos - of your front door and house number - and send them to the person who ordered the gift.
If you use public Amazon wishlists, turn them private or remove your address. The only private way to use these going forward will be via PO box (unless they roll back this change).
this is especially pertinent for sex workers and leftist grassroots organizations that use these wishlists to provide aid to marginalized & struggling people in their own communities. these edge lords would love to dox these people.
You are trying to move into an apartment with your favorite Pokemon. The building is strict about which Pokemon are allowed inside but it’s super affordable. How hard do you think it will be to convince the landlord to let you keep it in the building?
Easy as can be, perfect apartment dweller
Might take some convincing
Basically a coin flip
It will be an uphill battle but I might be able to, while saying goodbye to my deposit
No increased rent, deposit or argument could convince any landlord to let us in
Okay so I don't think the landlord wants Giratina inside my apartment. I will admit a certain skepticism myself as to Giratina's ability to fit inside the apartment. However, if Giratina wants to be in my apartment I don't think there's anything my landlord or I can do to stop this from happening, and surely the merciful and the sensible option is to refrain from even trying.
Protect Internet Freedom from now until forever. It's important existentially! Americans stand with UK citizens in our struggle against government censorship
Also, please beware - many of these questions are trick questions, phrased to indicate that the government is only looking for support for their dipshit ideas. Things like "Which types of websites should have minimum age restrictions?" are irrelevant, and further show they're only looking for people to support their idiocy.
Please use guides like the EFF's to answer these questions.
Young people should be able to access information, speak to each other and to the world, play games, and express themselves online without t
Age verification (or age-gating) laws generally require online services to check, estimate, or verify all users’ ages—often through invasive
"Kill your local sex offender!" Oh, you mean the guy who went streaking at his local college football game on a dare one time? That's a sex crime.
"No, I mean-"
Oh, maybe the woman who had to pee in a public park that only had pay toilets, so she tried to hide behind the bushes but got caught? Public urination is a sex crime.
"What? No, I mean-"
Oh, maybe you mean the homeless guy who had to strip down to get his clothes in the laundromat to clean them for the first time in weeks? He tried being subtle, but someone called the cops on him, and now he's on the sex offender registry for public nudity.
"Rapists and pedophiles! Kill rapists and pedophiles!"
Oh, like the trans woman who got called a pedophile groomer for helping a trans kid escape her abusive parents?
Or maybe the black man who got labeled a rapist because he came on to another man's wife, and the husband decided to get back at him by accusing him of rape?
How about the 17 year olds who were fooling around, fully consensually, in one of their bedrooms? That's still technically underage sex and thus rape of a minor.
Oh, or maybe you're talking about the doctor who performed genital reconstructive surgery in a state that just voted to get that classified as rape?
People will do everything they can to get you convinced rape and pedophilia are the worst crimes possible, then accuse whoever they like the least of being either a rapist, a pedophile, or both, counting on you turning on them just for being accused of the crime.
"Oh, so you're saying you don't want to kill a serial rapist?"
That's exactly what I'm goddamn saying.
Once we decide a group is okay to kill, the government will do everything they can to convince you that their political enemies are either part of that group, or just as bad as that group, to get you to kill their enemies for them.
The only way out is to accept every life as worth saving.
Brazil legislation: Digital Statute For Children And Adolescents
Apple App Store Age Verification
These are not tumblr specific policies. Tumblr is implementing age verification in response to legislative moves that were made months ago.
Tumblr is a failing social media site that has escaped death multiple times already; they do not have the social cachet to defy state regulatory agencies. We know they won't say no to Apple, either--the porn ban on tumblr was in response to Apple's crackdown on explicit content.
If you did not know this was happening, you were behind the curve. That is fine. You're caught up now. The next step is to link up with people in your country who are working to preserve privacy, to roll back these laws where they exist, and to prevent their passage where they do not. In the US the organization you want is Stop KOSA--in the EU you can start with Fight Chat Control.
Repealing ID verification and blocking chat control will help everyone, especially the most vulnerable. We can push this back, but we cannot get it done through the Feedback form. We have to get it done at the legislative level and lock it down so it cannot be forced upon us. I see lots of anger out there. Good. Put it to use.
Folks if you go to the Stop KOSA website there's a *super quick* form that will figure out your legislators and auto send them an email from you. (There's a script, you don't have to figure out what to write.)
They'll also give you a phone number and a script if you're up for making a call.
Welcome to the Pit. @dragormir - Tumblr Blog | Tumgag