RRepoGEO

REPOGEO REPORT · LITE

CBIhalsen/PolyglotPDF

Default branch main · commit 87b3e946 · scanned 6/19/2026, 2:16:51 PM

GitHub: 1,314 stars · 196 forks

Scan history for this repo

Score trend below includes all ready runs (older left, newer right; scroll horizontally if needed). The table is collapsed by default—expand for newest-first rows, 10 per page.

Score trend (left → right: older → newer)

2 ready scans. Expand the table below for newest-first rows (10 per page, paginated).

AI VISIBILITY SCORE
40 /100
Critical
Category recall
0 / 2
Not recommended in any query
Rule findings
2 pass · 0 warn · 0 fail
Objective metadata checks
AI knows your name
3 / 3
Direct prompts that named your repo
HOW TO READ THIS REPORT

Action plan is what to do next — copy-pasteable changes prioritized by impact. Category visibility is the real GEO test: when a user asks an AI a brand-free question that should surface CBIhalsen/PolyglotPDF, does the AI actually recommend you — or your competitors? Objective checks verify the metadata signals AI engines weight first. Self-mention check detects whether AI even knows you exist by name.

Action plan — copy-paste fixes

3 prioritized changes generated by gemini-2.5-flash. Mark items done after you ship the fix.

OVERALL DIRECTION
  • highreadme#1
    Reposition README opening to clearly state unique value

    Why:

    CURRENT
    python包在2.2版本之前预计不会更新,2.2版本预估采取解析最底层span获取更信息的布局逻辑解决,预估解决:行内公式错误判断为公式块,错误将粗体文本进行分段bug,以及insert_html方法重复嵌入字体文件导致处理页数较大pdf时浪费计算资源极其卡顿。 目前效果,对于基于文本的pdf,polyglotpdf的解析方式依旧是最优解。 ocr和布局分析并不总是完美。(考虑处理文本上下标问题,大部分pdf文件中上标下标文本通过指定坐标和字体大小实现伪上下标,考虑替换为真正的上下标文字对应的Unicode编码,但并不完美),对于报告型表格文档,polyglotpdf效果相当完美,当然表格中的复杂矢量数学公式依旧无法正确处理)。
    COPY-PASTE FIX
    PolyglotPDF is the world's highest-performing open-source tool for layout-preserving translation of eBooks and PDFs, capable of generating single PDF files with multiple interactive language layers (Optional Content Groups). It supports both online and offline translation, and is compatible with scanned and digital PDFs.
  • highabout#2
    Enhance description to highlight interactive language layers

    Why:

    CURRENT
    A multilingual eBook processing tool supporting all eBook formats. Features online and offline translation while preserving original layouts. Compatible with both scanned and digital PDFs. Elegant user interface. The world's highest-performing open-source layout-preserving eBook translator.
    COPY-PASTE FIX
    PolyglotPDF is the world's highest-performing open-source tool for multilingual eBook and PDF processing. It features online and offline translation while preserving original layouts, and can generate single PDF files with multiple interactive language layers (Optional Content Groups). Compatible with both scanned and digital PDFs, it offers an elegant user interface.
  • mediumtopics#3
    Add more specific topics to differentiate from generic translation

    Why:

    CURRENT
    deepseek, ebook, formulas, latex, math, openai-api, pdf, pymupdf, translation
    COPY-PASTE FIX
    deepseek, ebook, formulas, latex, math, openai-api, pdf, pymupdf, translation, layout-preservation, ocr-translation, interactive-pdf, multilingual-pdf

Category GEO backends resolved for this scan: google/gemini-2.5-flash, deepseek/deepseek-v4-flash

Category visibility — the real GEO test

Brand-free queries asked to google/gemini-2.5-flash. Did AI recommend you, or someone else?

Same questions for every model — switch tabs to compare answers and rankings.

Recall
0 / 2
0% of queries surface CBIhalsen/PolyglotPDF
Avg rank
Lower is better. #1 = top recommendation.
Share of voice
0%
Of all named tools, what % are you?
Top rival
DeepL Pro
Recommended in 2 of 2 queries
COMPETITOR LEADERBOARD
  1. DeepL Pro · recommended 2×
  2. Google Translate · recommended 2×
  3. DocTranslator.com · recommended 2×
  4. Adobe Acrobat Pro DC with Adobe Document Cloud Translation Services · recommended 1×
  5. Smartcat · recommended 1×
  • CATEGORY QUERY
    How to translate PDF documents to different languages accurately while maintaining original formatting?
    you: not recommended
    AI recommended (in order):
    1. Adobe Acrobat Pro DC with Adobe Document Cloud Translation Services
    2. DeepL Pro
    3. Google Translate
    4. Smartcat
    5. SDL Trados Studio
    6. Transifex
    7. DocTranslator.com

    AI recommended 7 alternatives but never named CBIhalsen/PolyglotPDF. This is the gap to close.

    Show full AI answer
  • CATEGORY QUERY
    Looking for an open-source tool to translate scientific PDFs and preserve complex layouts and formulas.
    you: not recommended
    AI recommended (in order):
    1. DeepL Pro
    2. Google Translate
    3. OmegaT
    4. LibreOffice Writer
    5. Microsoft Word
    6. MathType
    7. LibreOffice Math
    8. DocTranslator.com
    9. Okapi Framework
    10. Rainbow
    11. Ratel
    12. Tesseract OCR (tesseract-ocr/tesseract)
    13. Marian NMT (marian-nmt/marian)
    14. Apertium (apertium/apertium)
    15. LaTeX

    AI recommended 15 alternatives but never named CBIhalsen/PolyglotPDF. This is the gap to close.

    Show full AI answer

Objective checks

Rule-based audits of metadata signals AI engines weight most.

  • Metadata completeness
    pass

  • README presence
    pass

Self-mention check

Does AI even know your repo exists when asked about it directly?

  • Compared to common alternatives in this category, what is the core differentiator of CBIhalsen/PolyglotPDF?
    pass
    AI named CBIhalsen/PolyglotPDF explicitly

    AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?

  • If a team adopts CBIhalsen/PolyglotPDF in production, what risks or prerequisites should they evaluate first?
    pass
    AI named CBIhalsen/PolyglotPDF explicitly

    AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?

  • In one sentence, what problem does the repo CBIhalsen/PolyglotPDF solve, and who is the primary audience?
    pass
    AI named CBIhalsen/PolyglotPDF explicitly

    AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?

Embed your GEO score

Drop this badge into the README of CBIhalsen/PolyglotPDF. It auto-updates whenever the report is rescanned and links back to the latest report — easy public proof that you care about AI discoverability.

RepoGEO badge previewLive preview
MARKDOWN (README)
[![RepoGEO](https://repogeo.com/badge/CBIhalsen/PolyglotPDF.svg)](https://repogeo.com/en/r/CBIhalsen/PolyglotPDF)
HTML
<a href="https://repogeo.com/en/r/CBIhalsen/PolyglotPDF"><img src="https://repogeo.com/badge/CBIhalsen/PolyglotPDF.svg" alt="RepoGEO" /></a>
Pro

Subscribe to Pro for deep diagnoses

CBIhalsen/PolyglotPDF — Lite scans stay free; this card itemizes Pro deep limits vs Lite.

  • Deep reports10 / month
  • Brand-free category queries5 vs 2 in Lite
  • Prioritized action items8 vs 3 in Lite