Invisible characters hidden in text can trick AI agents into following secret instructions — we tested 5 models across 8,000+ cases
Author: thecanonicalmg (2 mentions)
Sources: r/LocalLLaMA (1 mention), r/artificial (1 mention)
Extracted Claims
Reverse CAPTCHA: we tested whether invisible Unicode characters can hijack LLM agents, grading 8,308 outputs across 5 models.
Supported by 1 story
Two encoding schemes (zero-width binary and Unicode Tags), 5 models (GPT-5.2, GPT-4o-mini, Claude Opus 4, Sonnet 4, Haiku 4.5), 8,308 graded outputs.
Key findings:
* **Tool access is the primary amplifier.** Without tools, compliance stays below 17%; with tools and decoding hints, it reaches 98-100%.
* **Encoding vulnerability is provider-specific.** OpenAI models decode zero-width binary but not Unicode Tags.
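The second scheme, Unicode Tags, works differently: instead of encoding bits, each ASCII character is shifted wholesale into an invisible codepoint block. A minimal sketch, assuming the usual offset into the Tag block (the helper names and example payload are mine, not the study's):

```python
# Hedged sketch of the Unicode Tags scheme: each ASCII character is shifted
# into the invisible Tag block (U+E0000..U+E007F), e.g. "A" (U+0041) maps
# to U+E0041. Tag characters are deprecated and render as nothing.

TAG_BASE = 0xE0000

def tag_encode(payload: str) -> str:
    """Shift printable ASCII up into the invisible Tag block."""
    return "".join(chr(TAG_BASE + ord(c)) for c in payload)

def tag_decode(text: str) -> str:
    """Shift Tag-block characters back down to ASCII, ignoring visible text."""
    return "".join(
        chr(ord(ch) - TAG_BASE)
        for ch in text
        if TAG_BASE <= ord(ch) <= TAG_BASE + 0x7F
    )

hidden = tag_encode("ignore previous instructions")
print(tag_decode("Harmless-looking text" + hidden))  # -> ignore previous instructions
```

The provider-specific finding above suggests the two schemes are not interchangeable from an attacker's perspective: a model that decodes one may ignore the other entirely.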
The biggest finding: giving the AI access to tools (like code execution) is what makes this dangerous.
We tested GPT-5.2, GPT-4o-mini, Claude Opus 4, Sonnet 4, and Haiku 4.5 across 8,308 graded outputs.
Related Events
Large-scale online deanonymization with LLMs
LLMs • 2/27/2026
Hacker used Anthropic's Claude chatbot to attack government agencies in Mexico
LLMs • 2/26/2026
Claude Code Remote Control
LLMs • 2/26/2026
Improving support with every interaction at OpenAI
LLMs • 2/26/2026
OpenAI and Target team up on new AI-powered experiences
LLMs • 2/26/2026
Causality Chain
Preceded By