Whitespace Characters

Unicode includes many whitespace characters that look similar but behave differently across tools.

Quick checks

Inspect pasted text for invisible or risky Unicode characters, visualize findings, and generate cleaned output entirely in your browser.

Input text

Live analysis on

Drop .txt/.csv/.json here

or click to choose a file (max 2 MB)

Symbol

Detect mixed-script and confusable lookalike tokens before trusting identifiers or domains.

Token	Scripts	Risk	Confusables	Positions
No homoglyph risks detected.

Inspect bidirectional control sequences that can hide true token order in code, configs, and logs.

Type	Risk	Range	Controls count	Details
No dangerous bidi sequences detected.

Overlay on

Collapse repeated spaces during whitespace normalization

Advanced JSON report

Include positionsInclude rangesInclude risk and reasonsInclude redacted context snippets

Positions and ranges can increase payload size.

Share link

Raw text is never included in the report or share link.

Different runtimes and libraries treat Unicode whitespace classes differently.

Not always. Some contexts require preserved line breaks or tabs for readability and semantics.

Normalize at ingestion boundaries, then keep a predictable internal format across your pipeline.