REPOGEO REPORT · LITE
chatopera/efaqa-corpus-zh
Default branch master · commit eade3282 · scanned 5/31/2026, 8:17:46 PM
GitHub: 755 stars · 88 forks
Action plan is what to do next — copy-pasteable changes prioritized by impact. Category visibility is the real GEO test: when a user asks an AI a brand-free question that should surface chatopera/efaqa-corpus-zh, does the AI actually recommend you — or your competitors? Objective checks verify the metadata signals AI engines weight first. Self-mention check detects whether AI even knows you exist by name.
Action plan — copy-paste fixes
3 prioritized changes generated by gemini-2.5-flash. Mark items done after you ship the fix.
- #1 · high · readme: Reposition the core value proposition to the top of the README
Why:
CURRENT: The detailed description of the dataset's unique value (largest, multi-turn, mental-health-specific) appears several paragraphs into the README, after a table and a note.
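COPY-PASTE FIX:
# Psychological Counseling Corpus
chatopera/efaqa-corpus-zh is the largest publicly released Chinese psychological counseling dialogue corpus to date, designed for AI applications in counseling and emotional support. It contains human-annotated multi-turn dialogue data and is well suited to training and fine-tuning large language models (LLMs) for mental health chatbots.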
- #2 · medium · topics: Add more specific topics to improve category visibility
Why:
CURRENT: corpus, natural-language-processing, natural-language-understanding, psychology
COPY-PASTE FIX: corpus, natural-language-processing, natural-language-understanding, psychology, mental-health-chatbot, emotional-support-ai, dialogue-corpus, chinese-nlp, llm-training-data
- #3 · medium · readme: Clarify the licensing model for code and data in the README
Why:
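CURRENT: The README states 'The source code of the psychological counseling Q&A corpus is distributed under an open-source license, but the corpus files downloaded during installation and use can only be downloaded and used after purchasing a license from the license store'.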
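COPY-PASTE FIX:
## License and Usage
The source code of this project is distributed under an open-source license. However, downloading and using the corpus files requires purchasing the corresponding license from the license store. See https://www.cskefu.com/licenses/v1.html for details.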
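Action item #2 could also be applied programmatically rather than through the repository settings page. The sketch below merges the current and proposed topic lists and builds a request for GitHub's "replace all repository topics" REST endpoint (`PUT /repos/{owner}/{repo}/topics`). The helper names `merge_topics` and `build_topics_request` are hypothetical, the token is a placeholder, and the request is only constructed here, not sent.

```python
import json
import urllib.request

# Topic lists copied from action item #2 in this report.
CURRENT_TOPICS = [
    "corpus",
    "natural-language-processing",
    "natural-language-understanding",
    "psychology",
]
PROPOSED_TOPICS = [
    "mental-health-chatbot",
    "emotional-support-ai",
    "dialogue-corpus",
    "chinese-nlp",
    "llm-training-data",
]


def merge_topics(current, proposed):
    """Return current + proposed topics, lowercased and de-duplicated, order preserved."""
    seen = set()
    merged = []
    for topic in [*current, *proposed]:
        topic = topic.strip().lower()
        if topic and topic not in seen:
            seen.add(topic)
            merged.append(topic)
    return merged


def build_topics_request(owner, repo, topics, token):
    """Build (but do not send) a PUT request that replaces the repository's topics."""
    url = f"https://api.github.com/repos/{owner}/{repo}/topics"
    body = json.dumps({"names": topics}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="PUT",
        headers={
            "Accept": "application/vnd.github+json",
            "Authorization": f"Bearer {token}",  # placeholder token, supplied by the caller
            "Content-Type": "application/json",
        },
    )


merged = merge_topics(CURRENT_TOPICS, PROPOSED_TOPICS)
request = build_topics_request("chatopera", "efaqa-corpus-zh", merged, "YOUR_TOKEN")
```

Passing `request` to `urllib.request.urlopen` with a real token that has write access to the repository would submit the change. Because this endpoint replaces the whole topic list, the existing topics are included in the payload rather than only the new ones.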
Category GEO backends resolved for this scan: google/gemini-2.5-flash, deepseek/deepseek-v4-flash
Category visibility — the real GEO test
Brand-free queries asked to google/gemini-2.5-flash. Did AI recommend you, or someone else?
- CLUE (Chinese Language Understanding Evaluation) Benchmark · recommended 1×
- LCCC (Large-scale Chinese Conversational Corpus) · recommended 1×
- CODI (Chinese Open-domain Dialogue Dataset) · recommended 1×
- Weibo Sentiment Analysis Datasets · recommended 1×
- Chinese Medical Question Answering (QA) Datasets · recommended 1×
- CATEGORY QUERY: Where can I find a large Chinese dataset for training a mental health chatbot? · you: not recommended · AI recommended (in order):
- CLUE (Chinese Language Understanding Evaluation) Benchmark
- LCCC (Large-scale Chinese Conversational Corpus)
- CODI (Chinese Open-domain Dialogue Dataset)
- Weibo Sentiment Analysis Datasets
- Chinese Medical Question Answering (QA) Datasets
- Zhihu
- Douban
AI recommended 7 alternatives but never named chatopera/efaqa-corpus-zh. This is the gap to close.
- CATEGORY QUERY: Need a multi-turn dialogue corpus for emotional support NLU model training. · you: not recommended · AI recommended (in order):
- EmpatheticDialogues
- DailyDialog
- Persona-Chat
- Therapeutic Conversations Corpus
- Ubuntu Dialogue Corpus
- ConvAI2
- MELD
AI recommended 7 alternatives but never named chatopera/efaqa-corpus-zh. This is the gap to close.
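The category test above boils down to two operations: checking whether the repository appears among an answer's recommendations, and tallying which alternatives were named instead. The report does not describe how RepoGEO performs the matching, so the following is only a minimal sketch under the assumption of case-insensitive substring matching on the repository slug or its bare name. The function names `is_recommended` and `tally_alternatives` are hypothetical; the data is copied from the two queries listed above.

```python
from collections import Counter

REPO = "chatopera/efaqa-corpus-zh"

# Recommendations per brand-free query, as listed in this report.
ANSWERS = {
    "Where can I find a large Chinese dataset for training a mental health chatbot?": [
        "CLUE (Chinese Language Understanding Evaluation) Benchmark",
        "LCCC (Large-scale Chinese Conversational Corpus)",
        "CODI (Chinese Open-domain Dialogue Dataset)",
        "Weibo Sentiment Analysis Datasets",
        "Chinese Medical Question Answering (QA) Datasets",
        "Zhihu",
        "Douban",
    ],
    "Need a multi-turn dialogue corpus for emotional support NLU model training.": [
        "EmpatheticDialogues",
        "DailyDialog",
        "Persona-Chat",
        "Therapeutic Conversations Corpus",
        "Ubuntu Dialogue Corpus",
        "ConvAI2",
        "MELD",
    ],
}


def is_recommended(repo, recommendations):
    """True if any recommendation mentions the full slug or the bare repo name."""
    full = repo.lower()
    name = full.split("/")[-1]
    return any(full in rec.lower() or name in rec.lower() for rec in recommendations)


def tally_alternatives(answers, repo):
    """Count how often each non-matching recommendation appears across all queries."""
    counts = Counter()
    for recommendations in answers.values():
        counts.update(rec for rec in recommendations if not is_recommended(repo, [rec]))
    return counts


visibility = {query: is_recommended(REPO, recs) for query, recs in ANSWERS.items()}
tally = tally_alternatives(ANSWERS, REPO)
```

With the data from this scan, every entry in `visibility` is `False` and `tally` holds 14 distinct alternatives, each counted once. Substring matching is a deliberate simplification: it would miss an answer that refers to the corpus by a translated or descriptive name.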
Objective checks
Rule-based audits of metadata signals AI engines weight most.
- Metadata completeness: pass
- README presence: pass
Self-mention check
Does AI even know your repo exists when asked about it directly?
- Compared to common alternatives in this category, what is the core differentiator of chatopera/efaqa-corpus-zh? · pass · AI did not name chatopera/efaqa-corpus-zh; likely talking about a different project
AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?
- If a team adopts chatopera/efaqa-corpus-zh in production, what risks or prerequisites should they evaluate first? · pass · AI named chatopera/efaqa-corpus-zh explicitly
- In one sentence, what problem does the repo chatopera/efaqa-corpus-zh solve, and who is the primary audience? · pass · AI named chatopera/efaqa-corpus-zh explicitly
Embed your GEO score
Drop this badge into the README of chatopera/efaqa-corpus-zh. It auto-updates whenever the report is rescanned and links back to the latest report — easy public proof that you care about AI discoverability.
[![RepoGEO](https://repogeo.com/badge/chatopera/efaqa-corpus-zh.svg)](https://repogeo.com/en/r/chatopera/efaqa-corpus-zh)
<a href="https://repogeo.com/en/r/chatopera/efaqa-corpus-zh"><img src="https://repogeo.com/badge/chatopera/efaqa-corpus-zh.svg" alt="RepoGEO" /></a>
Subscribe to Pro for deep diagnoses
chatopera/efaqa-corpus-zh — Lite scans stay free; this card itemizes Pro deep limits vs Lite.
- Deep reports: 10 / month
- Brand-free category queries: 5 vs 2 in Lite
- Prioritized action items: 8 vs 3 in Lite