Question 1

Which Markdown flavor does the output target?

Accepted Answer

CommonMark, which is the modern standardised dialect documented at commonmark.org. Headings use ATX form (#), fenced code blocks use triple backticks, and link syntax matches the [text](url) form. GFM-specific features like task lists and tables are handled opportunistically but may render differently on strict CommonMark parsers. For maximum portability, keep your HTML structure simple and avoid relying on GFM extensions.

Question 2

Does it use a real HTML parser or regex?

Accepted Answer

Real parser. The input is fed to the browser's DOMParser with the text/html MIME type, producing the same DOM tree that a browser would render. A depth-first walk then emits Markdown tokens for each element. That approach handles nested structures, implicit tag closure, and unusual attribute quoting correctly - situations where a regex-based converter would silently drop content or emit malformed output.

Question 3

Are <code><script></code> and <code><style></code> blocks included in the Markdown?

Accepted Answer

No. The walker explicitly skips these elements because emitting their text would inject JavaScript or CSS into your Markdown, which every sane renderer treats as plain text - producing noisy paragraphs that clutter the content. This is especially useful when pasting an entire HTML page where the contains analytics snippets you do not want in your README.

Question 4

Is my content uploaded anywhere?

Accepted Answer

No. DOMParser is a synchronous in-process API, and the Markdown walk is a local function call. No fetch request is made during conversion, no websocket is opened, and nothing is persisted to localStorage or IndexedDB. The content you paste and the Markdown you copy both live in JavaScript memory only and are released when you close the tab.

Question 5

How are tables handled?

Accepted Answer

Simple tables with , , , and convert to the GitHub Flavored Markdown table syntax with pipe separators and a dashed separator row. Tables that use row-spans or column-spans cannot be expressed in GFM syntax, so those lose the span and emit a flat grid. If your source has complex tables, consider keeping them as HTML inside the Markdown output (both CommonMark and GFM permit raw HTML blocks).

Question 6

Does it preserve image alt text?

Accepted Answer

Yes. The alt attribute is read from each and placed inside the ![alt](src) brackets. Images without alt text (accessibility anti-pattern, unfortunately common) emit ![](src), which is valid Markdown but renders with no caption. Add meaningful alt text after conversion if it was missing in the source.

Question 7

What happens with inline links containing query strings or parentheses?

Accepted Answer

CommonMark requires URL-special characters inside parentheses to be either URL-encoded or wrapped in angle brackets. A link like [click](https://a.com/(path)) is ambiguous to parsers; the converter URL-encodes the inner parentheses to %28 and %29. Query strings with ?, &, and = pass through unmodified because they are safe in the link syntax.

Question 8

Can I convert an entire HTML page including head and nav?

Accepted Answer

Yes, but the output will contain everything including menus, footers, and metadata. If you want just the article body, paste only the relevant fragment - for example the contents of

or

. The Readability algorithm (used by Firefox Reader View) is the canonical way to extract article content; browser extensions like Markdownload combine that extraction with Markdown conversion in one step.

Question 9

How are fenced code blocks labeled with a language?

Accepted Answer

The converter reads the class attribute on the

 element inside a . Classes matching language-xxx, lang-xxx, or just xxx (common from highlighters like Prism, Shiki, and Rouge) are recognised. The language hint appears immediately after the opening triple backtick and lets syntax highlighters in your target renderer apply correct colouring.

Question 10

Will nested lists indent correctly?

Accepted Answer

Yes. Each list level adds four spaces of indentation, which is the standard CommonMark rule for nested lists. Unordered lists use - markers and ordered lists use 1. through n. . A nested ordered list inside an unordered list produces a structure like - Item\n 1. Sub item, which CommonMark parsers render correctly.

HTML to Markdown

How to Use the HTML to Markdown Converter

How the Conversion Works

When to Convert

Edge Cases

CommonMark in Context

Alternatives Worth Knowing

Frequently Asked Questions

Related tools

More Text Tools

Binary to Text

Case Converter

Character Counter

Emoji Picker & Search

Fancy Text Generator

Find & Replace