REPOGEO REPORT · LITE
AI4Bharat/indicnlp_catalog
Default branch master · commit 7c608d6f · scanned 6/15/2026, 11:12:59 AM
GitHub: 636 stars · 96 forks
Action plan is what to do next — copy-pasteable changes prioritized by impact. Category visibility is the real GEO test: when a user asks an AI a brand-free question that should surface AI4Bharat/indicnlp_catalog, does the AI actually recommend you — or your competitors? Objective checks verify the metadata signals AI engines weight first. Self-mention check detects whether AI even knows you exist by name.
Action plan — copy-paste fixes
3 prioritized changes generated by gemini-2.5-flash. Mark items done after you ship the fix.
- highlicense#1Add a LICENSE file to the repository
Why:
COPY-PASTE FIXAdd a LICENSE file (e.g., MIT or Apache-2.0) to the repository root to clarify usage terms for contributors and users.
- highreadme#2Strengthen README's opening to emphasize 'catalog' and 'resource discovery'
Why:
CURRENT# :bookmark: The Indic NLP Catalog _A Collaborative Catalog of Resources for Indic Language NLP_ The **Indic NLP Catalog** repository is an attempt to **collaboratively** build the **most comprehensive** catalog of NLP datasets, models and other resources for all languages of the Indian subcontinent.
COPY-PASTE FIX# :bookmark: The Indic NLP Catalog _The Definitive Collaborative Catalog for Indic Language NLP Resources_ The **Indic NLP Catalog** is the **most comprehensive and centralized platform** for discovering and accessing NLP datasets, models, and other resources across all languages of the Indian subcontinent. It serves as a vital hub for researchers and practitioners to find, contribute, and utilize essential Indic NLP assets.
- mediumtopics#3Add more specific topics to highlight resource catalog function
Why:
CURRENTawesome-list, corpora, indian-languages, libraries, models
COPY-PASTE FIXawesome-list, corpora, indian-languages, libraries, models, resource-catalog, nlp-resources, data-discovery, indic-nlp
Category GEO backends resolved for this scan: google/gemini-2.5-flash, deepseek/deepseek-v4-flash
Category visibility — the real GEO test
Brand-free queries asked to google/gemini-2.5-flash. Did AI recommend you, or someone else?
Same questions for every model — switch tabs to compare answers and rankings.
- AI4Bharat · recommended 1×
- Hugging Face · recommended 1×
- IndicNLP Library · recommended 1×
- Google Research India · recommended 1×
- Google AI Blog · recommended 1×
- CATEGORY QUERYWhere can I find a comprehensive list of NLP datasets and models for Indian languages?you: not recommendedAI recommended (in order):
- AI4Bharat
- Hugging Face
- IndicNLP Library
- Google Research India
- Google AI Blog
- Microsoft Research India
- WMT (Workshop on Machine Translation)
- LREC (Language Resources and Evaluation Conference)
AI recommended 8 alternatives but never named AI4Bharat/indicnlp_catalog. This is the gap to close.
Show full AI answer
- CATEGORY QUERYWhat are the best available NLP libraries and corpora for various Indic language processing tasks?you: not recommendedAI recommended (in order):
- Indic NLP Library
- iNLTK
- Stanza
- spaCy
- spacy-udpipe
- NLTK
- ILCI (Indian Language Corpora Initiative) Corpus
- CIIL (Central Institute of Indian Languages) Corpora
- Universal Dependencies (UD) Corpora
- WikiSource
- Wikipedia Dumps
- IIT Bombay English-Hindi Parallel Corpus
- Samantar Corpus
AI recommended 13 alternatives but never named AI4Bharat/indicnlp_catalog. This is the gap to close.
Show full AI answer
Objective checks
Rule-based audits of metadata signals AI engines weight most.
- Metadata completenesswarn
Suggestion:
- README presencepass
Self-mention check
Does AI even know your repo exists when asked about it directly?
- Compared to common alternatives in this category, what is the core differentiator of AI4Bharat/indicnlp_catalog?passAI did not name AI4Bharat/indicnlp_catalog — likely talking about a different project
AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?
- If a team adopts AI4Bharat/indicnlp_catalog in production, what risks or prerequisites should they evaluate first?passAI named AI4Bharat/indicnlp_catalog explicitly
AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?
- In one sentence, what problem does the repo AI4Bharat/indicnlp_catalog solve, and who is the primary audience?passAI named AI4Bharat/indicnlp_catalog explicitly
AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?
Embed your GEO score
Drop this badge into the README of AI4Bharat/indicnlp_catalog. It auto-updates whenever the report is rescanned and links back to the latest report — easy public proof that you care about AI discoverability.
[](https://repogeo.com/en/r/AI4Bharat/indicnlp_catalog)<a href="https://repogeo.com/en/r/AI4Bharat/indicnlp_catalog"><img src="https://repogeo.com/badge/AI4Bharat/indicnlp_catalog.svg" alt="RepoGEO" /></a>Subscribe to Pro for deep diagnoses
AI4Bharat/indicnlp_catalog — Lite scans stay free; this card itemizes Pro deep limits vs Lite.
- Deep reports10 / month
- Brand-free category queries5 vs 2 in Lite
- Prioritized action items8 vs 3 in Lite