REPOGEO REPORT · LITE

MaartenGr/BERTopic

Default branch master · commit f9697602 · scanned 5/16/2026, 8:01:59 AM

GitHub: 7,609 stars · 896 forks

AI VISIBILITY SCORE

91 /100

Healthy

Category recall

2 / 2

Avg rank #1.5 when recommended

Rule findings

2 pass · 0 warn · 0 fail

Objective metadata checks

AI knows your name

3 / 3

Direct prompts that named your repo

HOW TO READ THIS REPORT

Action plan is what to do next — copy-pasteable changes prioritized by impact. Category visibility is the real GEO test: when a user asks an AI a brand-free question that should surface MaartenGr/BERTopic, does the AI actually recommend you — or your competitors? Objective checks verify the metadata signals AI engines weight first. Self-mention check detects whether AI even knows you exist by name.

Action plan — copy-paste fixes

3 prioritized changes generated by gemini-2.5-flash. Mark items done after you ship the fix.

OVERALL DIRECTION

highreadme#1

Refine README's opening sentence to emphasize library status

Why:

CURRENT

BERTopic is a topic modeling technique that leverages 🤗 transformers and c-TF-IDF to create dense clusters allowing for easily interpretable topics whilst keeping important words in the topic descriptions.

COPY-PASTE FIX

BERTopic is a powerful Python library for topic modeling that leverages 🤗 transformers and c-TF-IDF to create dense clusters, allowing for easily interpretable topics whilst keeping important words in the topic descriptions.

mediumcomparison#2

Add a dedicated comparison section to the README or documentation

Why:

COPY-PASTE FIX

Create a new section titled "BERTopic vs. Other Libraries" (or similar) in the README or link to a dedicated page in the documentation, comparing BERTopic with Top2Vec, Gensim, LDA, and NMF, focusing on strengths like interpretability, transformer integration, and flexibility.

lowabout#3

Expand the repository description to reinforce its library identity

Why:

CURRENT

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

COPY-PASTE FIX

BERTopic is a Python library leveraging BERT and c-TF-IDF to create easily interpretable topics and dense clusters from text data.

Category GEO backends resolved for this scan: google/gemini-2.5-flash, deepseek/deepseek-v4-flash

Category visibility — the real GEO test

Brand-free queries asked to google/gemini-2.5-flash. Did AI recommend you, or someone else?

Same questions for every model — switch tabs to compare answers and rankings.

Recall

2 / 2

100% of queries surface MaartenGr/BERTopic

Avg rank

#1.5

Lower is better. #1 = top recommendation.

Share of voice

13%

Of all named tools, what % are you?

Top rival

Top2Vec

Recommended in 2 of 2 queries

COMPETITOR LEADERBOARD

Top2Vec · recommended 2×
sentence-transformers · recommended 2×
Gensim · recommended 2×
KeyBERT · recommended 1×
HDBSCAN · recommended 1×

CATEGORY QUERY
How can I extract easily interpretable topics and dense clusters from large text datasets?
you: #1
AI recommended (in order):
1. BERTopic ← you
2. Top2Vec
3. KeyBERT
4. HDBSCAN
5. sentence-transformers
6. Gensim
7. pyLDAvis
8. Scikit-learn
9. Spark NLP
Show full AI answer
CATEGORY QUERY
What are modern NLP libraries for robust topic modeling and semantic document clustering?
you: #2
AI recommended (in order):
1. Gensim
2. BERTopic ← you
3. Top2Vec
4. spaCy
5. scikit-learn
6. sentence-transformers
Show full AI answer

Objective checks

Rule-based audits of metadata signals AI engines weight most.

Metadata completeness
pass
README presence
pass

Self-mention check

Does AI even know your repo exists when asked about it directly?

Compared to common alternatives in this category, what is the core differentiator of MaartenGr/BERTopic?
pass
AI named MaartenGr/BERTopic explicitly
AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?
If a team adopts MaartenGr/BERTopic in production, what risks or prerequisites should they evaluate first?
pass
AI named MaartenGr/BERTopic explicitly
AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?
In one sentence, what problem does the repo MaartenGr/BERTopic solve, and who is the primary audience?
pass
AI named MaartenGr/BERTopic explicitly
AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?

Embed your GEO score

Drop this badge into the README of MaartenGr/BERTopic. It auto-updates whenever the report is rescanned and links back to the latest report — easy public proof that you care about AI discoverability.

Live preview

MARKDOWN (README)

[![RepoGEO](https://repogeo.com/badge/MaartenGr/BERTopic.svg)](https://repogeo.com/en/r/MaartenGr/BERTopic)

HTML

<a href="https://repogeo.com/en/r/MaartenGr/BERTopic"><img src="https://repogeo.com/badge/MaartenGr/BERTopic.svg" alt="RepoGEO" /></a>

Pro

Subscribe to Pro for deep diagnoses

MaartenGr/BERTopic — Lite scans stay free; this card itemizes Pro deep limits vs Lite.

Deep reports10 / month
Brand-free category queries5 vs 2 in Lite
Prioritized action items8 vs 3 in Lite