Question 1

Does the tool produce true UTF-8 binary or Unicode code-point binary?

Accepted Answer

Unicode code-point binary, not UTF-8 wire format. The letter "A" (U+0041) produces 01000001 in both representations by coincidence since ASCII is a subset of Unicode. For characters above U+007F they diverge: U+00E9 ("e with acute") is code-point 11101001 (8 bits) in this tool, but the UTF-8 byte sequence is 11000011 10101001 (16 bits). For byte-accurate network protocol work, use a hex or UTF-8-specific tool; this tool targets the educational case of showing how characters map to numeric code points in binary.

Question 2

Is my text stored or sent anywhere?

Accepted Answer

No. The tool runs in the browser, using the JavaScript String methods and a simple conversion routine. There is no fetch call, no WebSocket, no analytics event containing your text, and no localStorage persistence. You can type a password or private document into the input box and watch in DevTools Network tab - zero requests fire. The tab closing erases the state.

Question 3

Why does my decoded output differ from what I encoded?

Accepted Answer

Mismatched settings between encode and decode. The most common cause is the 8-bit padding toggle: if you encode without padding, ASCII characters become 7-bit chunks (1000001 for A), but decoding with 8-bit padding on splits the stream into 8-bit groups and produces garbage. Set padding and space-separation the same way on both sides. Second cause is pasting binary that was produced by a different tool that uses UTF-8 wire format instead of code points - the chunk boundaries will not align.

Question 4

Can I decode binary that has extra spaces or line breaks?

Accepted Answer

Yes. The binary-to-text function strips all non-digit whitespace characters with a regex, then splits on whitespace if space-separation is enabled. Newlines, tabs, and multiple spaces between groups are handled. Non-binary characters (letters, punctuation) are silently dropped, which is convenient but can hide typos - if you paste "10O01" meaning one-zero-zero-zero-one, the tool cleans it to "1001" which is a different value.

Question 5

How does this handle emoji and CJK characters?

Accepted Answer

Emoji in the Basic Multilingual Plane have code points up to U+FFFF; emoji above that (most modern faces, flags) go up to U+10FFFF and produce up to 21 bits. CJK characters in the BMP are 15-16 bits each. The text is iterated with [...text] which respects surrogate pairs, so round-trip encoding preserves all these characters. For UTF-8 byte-level analysis, use a hex-dump tool instead.

Question 6

Why does the null byte get dropped during decoding?

Accepted Answer

A chunk of all zeros represents code point 0 (U+0000 NULL), which renders invisibly and often breaks terminals and HTML. The decoder checks for zero and emits nothing to avoid confusing output. If you decode a binary stream that legitimately contains null bytes (embedded-systems datasheets, serialized C strings), this tool silently loses them; use a hex-oriented tool for that workload.

Question 7

Does this tool handle Base64 or Base32?

Accepted Answer

No - those are different encodings covered by dedicated tools on this site. Base64 (RFC 4648 section 4) maps 3 bytes to 4 characters using A-Z, a-z, 0-9, +, /; Base32 (RFC 4648 section 6) maps 5 bytes to 8 characters using A-Z and 2-7. Both are denser than raw binary. Use the Base64 Encoder/Decoder and Base32 tools on this site for those; this page is for converting between text and literal binary digits.

Question 8

Can I paste binary from a CTF challenge and get the flag?

Accepted Answer

Often yes. CTF flag encoding with space-separated 8-bit binary is one of the most common beginner puzzles, and this tool decodes them directly. Set the mode to "Binary to Text", enable 8-bit padding and space-separation as appropriate, paste the binary, and read the decoded text. If the flag is UTF-8 encoded (non-ASCII characters), the decoded output may look garbled because the tool decodes code points, not UTF-8 bytes; try splitting the binary into 8-bit groups and running through a UTF-8-aware decoder in that case.

Binary to Text

Converting Between Raw Bits and Readable Text

Unicode Code Points, UTF-8 Bytes, and Why They Differ

Why You Would Actually Need This

Pitfalls When Round-Tripping Through Binary

The Broader Binary-to-Text Encoding Landscape

Binary vs Base64, xxd, and System Tools

Frequently Asked Questions

Related tools

More Text Tools

Case Converter

Character Counter

Emoji Picker & Search

Fancy Text Generator

Find & Replace

HTML to Markdown