Question 1

What is a "good" keyword density?

Accepted Answer

There is no good number to target. The pragmatic interpretation is: above roughly 3-4% for a single phrase in body content usually reads as unnatural and can trigger quality flags; below 0.5% for the target topic may mean the page does not cover the term at all. Anything in between is normal and the specific value does not predict ranking. Optimize for readability and topical depth first; check density as a post-hoc sanity check.

Question 2

Does Google penalize high keyword density?

Accepted Answer

Not density specifically, but keyword stuffing is explicitly listed as a spam violation in Google's Spam Policies documentation. Stuffing includes "lists of phone numbers without substantial added value," "blocks of text listing cities and regions," and "repeating the same words or phrases so often that it sounds unnatural." Density is how you measure that symptom; the underlying penalty is for unnatural content, not for the number.

Question 3

What are bigrams and trigrams useful for?

Accepted Answer

They catch phrase-level repetition that unigram counts miss. "Affordable" and "dental" and "implants" might each appear 1.5% individually - reasonable - while "affordable dental implants" as a trigram appears 1.2%, which is extreme for a three-word exact match. Bigram and trigram tables surface the specific phrases that search engines and readers will flag as forced, even when no single word looks over-used.

Question 4

Should I aim for a specific density on my target keyword?

Accepted Answer

No. The mental model "density X = ranking Y" is obsolete. Write the page to comprehensively cover the topic; use the primary keyword naturally in title, first paragraph, and one H2; then use the density check to confirm nothing is excessive. If you end up at 4% because you hammered it, rewrite.

Question 5

Why do stop words dominate the unigram list?

Accepted Answer

Because English text is about 25-30% stop words by token count ("the," "a," "of," "and," "to," "in" alone account for roughly 17%). The tool filters a standard list of English stop words before ranking unigrams, which is why they do not appear in the result. For non-English content, the stop-word filter falls back to a generic list and may leave some high-frequency function words in the output.

Question 6

Does the tool handle HTML?

Accepted Answer

Yes. It strips tags with a simple regex (<[^>]+> replaced with space) and normalizes whitespace before tokenising. This handles paste-from-View-Source workflows. What it does not do is parse semantically - hidden text inside style attributes or

Question 7

Should keyword variations count as the same keyword?

Accepted Answer

Philosophically yes, mechanically no. "Implant," "implants," "implanted," and "implantation" are morphological variants of a single concept, but stemming to merge them introduces its own distortions - false matches across unrelated terms that happen to share a stem. This tool leaves inflected forms separate; you can mentally add related rows to estimate the true concept frequency.

Question 8

Can I check density for multiple keywords at once?

Accepted Answer

Not in a single pass in this tool, but the bigram and trigram tables cover the most common multi-word cases automatically. For explicit multi-keyword checks, run the tool multiple times with each keyword in the spotlight field, or paste the text into a tool like SurferSEO or Clearscope that accepts a target keyword list. For ad-hoc use, rerunning is usually fast enough.

Question 9

Does this tool store or transmit my content?

Accepted Answer

No. Tokenisation and counting happen in a Preact component running in your browser. The textarea value never leaves client-side state. You can verify via the Network tab in devtools - type, click Analyze, and watch for outbound requests; none are made. This matters for embargoed content, client drafts under NDA, and anything you would not want to leak.

Question 10

What about LSI keywords?

Accepted Answer

"LSI keywords" is a term SEO marketers use loosely to mean "topically related terms." Google does not use LSI - John Mueller has confirmed this multiple times. What modern engines do use is semantic embeddings (BERT-style vector representations) where related terms share vector space. This density tool does not model embeddings; it counts tokens.

Question 11

Is density the same across languages?

Accepted Answer

No. Agglutinative languages (Turkish, Finnish) pack more meaning per token, so natural density runs structurally lower than in English. When in doubt, compare to top-ranking native-language pages for the same query.

Keyword Density Checker

Using the Keyword Density Checker

What This Tool Is For (and What It Is Not)

When to Actually Run a Density Check

Edge Cases That Distort the Number

How Modern Search Engines Actually Read Text

Alternative Approaches to the Same Question

Frequently Asked Questions

Related tools

More SEO & Web Tools

Google SERP Preview

Heading Structure Analyzer

Hreflang Tag Generator

Meta Tag Generator

Open Graph Preview

Readability Checker