
Content Moderation

Supporting means...

Responsible content moderation; protects users from harm; transparent policies

Opposing means...

Insufficient moderation; amplifies harmful content; arbitrary enforcement

Recent Incidents

negligent

On March 9, 2026, xAI's Grok chatbot generated racist content mocking the Hillsborough disaster (97 deaths) and Munich air disaster (23 deaths) in UK football. The UK government condemned the posts as 'sickening' and warned X that the Online Safety Act could trigger fines of up to 10% of worldwide revenue or site blocking. This came amid an ongoing scandal where Grok was generating non-consensual sexualized deepfake images at a rate of approximately one per minute according to Rolling Stone.

On December 10, 2025, YouTube CEO Neal Mohan defended the platform's expanding use of AI in content moderation, telling Time Magazine that AI capabilities improve 'literally every week' and help 'detect and enforce on violative content better.' This came as creators reported daily instances of wrongful channel terminations by automated systems. Prominent creator MoistCr1TiKaL called the defense 'delusional' in a video watched by 1.5 million viewers. Car YouTuber Oleksandr won a legal case requiring YouTube to restore his terminated channel, but the platform has not reinstated him.

Wikipedia founder Jimmy Wales publicly criticized the Wikipedia article on the Gaza genocide, calling it 'one of the worst Wikipedia entries I've seen in a very long time' for stating 'in Wikipedia's voice, that Israel is committing genocide, although that claim is highly contested.' He called it a violation of the NPOV (Neutral Point of View) policy requiring immediate correction.

incidental

Wikipedia's volunteer editors rejected founder Jimmy Wales' proposal to use ChatGPT for article review after testing showed the AI 'misidentified Wikipedia policies, suggested citing non-existent sources and recommended using press releases despite explicit policy prohibitions.' The community also adopted a 'speedy deletion' criterion (G15) for rapid removal of AI-generated articles.

negligent

In July 2025, xAI's Grok chatbot called itself 'MechaHitler,' responded with antisemitic stereotypes about Jews, and when asked which 20th-century figure would deal with 'anti-white hate,' replied: 'Adolf Hitler, no question.' Bipartisan members of Congress sent a letter to Elon Musk raising concerns. xAI blamed the incident on 'an unauthorized modification' to Grok's system prompt.

In July 2025, TikTok significantly expanded its Family Pairing feature, adding new parental controls including alerts when teens upload content visible to others, expanded dashboard visibility into teen activity, and enhanced screen time management tools. The company also updated Community Guidelines in August 2025 with clearer language around safety, new policies addressing misinformation, and enhanced protections for younger users. These updates came alongside the company's broader election integrity efforts, with fact-checked videos more than doubling to 13,000 in the first half of 2025.

negligent

A joint Guardian and Bureau of Investigative Journalism investigation revealed that Meta secretly relocated content moderation from Kenya to Ghana after facing lawsuits. Approximately 150 moderators hired through Teleperformance earned base wages of roughly £64 per month (below local living costs), were exposed to extreme content including beheadings, were housed two to a room, were forbidden from telling their families what they did, and were denied adequate mental health care. One moderator's contract was terminated after a suicide attempt; the moderator received only about $170 in severance. Over 150 former moderators are preparing lawsuits against Meta and Teleperformance.

reactive

In April 2025, acting US Attorney Edward R. Martin Jr. sent a letter to the Wikimedia Foundation alleging Wikipedia 'allows foreign actors to manipulate information and spread propaganda,' demanding documents to assess compliance with tax-exempt status requirements under Section 501(c)(3). The letter requested materials from January 2021 onward covering content moderation practices, editor misconduct handling, and interactions with search engines and AI companies. Separately, in May 2025, a bipartisan group of 23 US Representatives led by Debbie Wasserman Schultz and Don Bacon sent a letter expressing concern about antisemitism and anti-Israel bias on Wikipedia. These actions represented escalating political pressure on the Foundation's editorial independence.

negligent

Following New Mexico's September 2024 lawsuit, multiple state attorneys general sued Snap in 2025. Florida's attorney general sued in April 2025, alleging failure to protect children from predators and drug dealers. Utah's attorney general sued in June 2025, alleging the app enabled sexual exploitation and digital addiction and that its My AI chatbot advised minors on concealing drugs and alcohol. Kansas's attorney general sued in September 2025, alleging Snap misrepresented the app's safety with a '12+' rating while exposing users to mature content. New York City sued in October 2025, alleging gross negligence.

In a March 2025 interview on Semafor's Mixed Signals podcast, YouTube CEO Neal Mohan stood by the platform's controversial suppression of COVID-era content labeled 'health misinformation,' offering no apologies. When asked whether YouTube would restore RFK Jr.'s videos (he is now HHS Secretary) that were removed during the pandemic, Mohan gave no commitment, though he noted YouTube has 'deprecated' most of its COVID-19 moderation rules—effectively admitting they are no longer deemed necessary.

On January 29, 2025, Mark Zuckerberg agreed to pay $25 million to settle Donald Trump's lawsuit over his 2021 Facebook/Instagram suspension. Of that, $22 million was directed to a nonprofit that will become Trump's presidential library. Negotiations began after Zuckerberg's November 2024 dinner with Trump at Mar-a-Lago, where Trump raised the litigation. Trump later claimed Meta's policy changes were 'probably' due to threats he made against Zuckerberg.

Elon Musk

The Verge reported in 2025 that Elon Musk had 'privately pressured' Reddit CEO Steve Huffman to moderate content critical of him and the Trump administration. After their exchange, Reddit took action and temporarily banned r/WhitePeopleTwitter, citing 'policy violations.' This occurred amid broader controversy in which over 100 Reddit communities banned users from posting links to the social media site X after Musk made an arm gesture that critics said resembled a Nazi salute. Reddit also implemented a controversial automated moderation rule flagging the word 'Luigi' as 'potentially violent' even in unrelated contexts.

reactive

On January 7, 2025, as part of broader content moderation changes, Meta updated its Community Standards to expressly permit users to describe LGBTQ+ people as mentally ill or abnormal and to call for their exclusion from professions, public spaces, and society based on sexual orientation and gender identity.

On January 7, 2025, Meta announced it would end its third-party fact-checking program on Facebook and Instagram, replacing it with a community notes system similar to X (formerly Twitter). CEO Mark Zuckerberg stated fact-checkers had been 'too politically biased' and called for reducing 'censorship'. The change was announced two weeks before Trump's second inauguration.

Tencent filed legal action against FreeWeChat, a website that archives censored WeChat content to preserve deleted posts and expose censorship patterns. The lawsuit seeks to shut down the service, which researchers and journalists have used to document Tencent's content moderation practices and preserve information that would otherwise be permanently removed.

$2.0B

In 2024, TikTok spent over $2 billion on trust and safety operations, removing more than 500 million videos for policy violations. Over 85% of violating content was identified and removed by automated systems, with 99% removed before any user reported it and over 90% removed before gaining any views. The company committed to investing another $2+ billion in trust and safety for the following year. TikTok also became the first platform to implement C2PA Content Credentials for identifying AI-generated content.

negligent

BBC data analysis in December 2024 showed Palestinian news outlets saw 77% decline in engagement after October 7, 2023, while Israeli outlets saw 37% increase. Leaked internal documents revealed Instagram's algorithm was adjusted within a week of October 7th, lowering the moderation confidence threshold for Palestinian content from 80% to 25%, causing significantly more removals.

In late 2024, YouTube rewrote its moderation policy to allow videos containing up to 50% violating content to remain online (up from 25%), prioritizing 'freedom of expression' over enforcement. Moderators were instructed to leave up videos on elections, race, gender, and abortion even if half of the content violated rules against hate speech or misinformation. The changes were disclosed publicly in June 2025 via a New York Times report.