REPOGEO REPORT · LITE
metabase/dataset-generator
Default branch main · commit 182a8f4e · scanned 6/12/2026, 1:28:05 PM
GitHub: 763 stars · 46 forks
Action plan is what to do next — copy-pasteable changes prioritized by impact. Category visibility is the real GEO test: when a user asks an AI a brand-free question that should surface metabase/dataset-generator, does the AI actually recommend you — or your competitors? Objective checks verify the metadata signals AI engines weight first. Self-mention check detects whether AI even knows you exist by name.
Action plan — copy-paste fixes
3 prioritized changes generated by gemini-2.5-flash.
- high · readme · #1 · Reposition the README's opening to emphasize AI-driven, BI-focused data generation
  CURRENT:
    # AI Dataset Generator
    **Generate realistic datasets for demos, learning, and dashboards. Instantly preview data, export as CSV or SQL, and explore with Metabase.**
  COPY-PASTE FIX:
    # AI Dataset Generator
    **Generate realistic, AI-powered datasets for demos, learning, and dashboards. Use conversational prompts to create multi-table schemas, instantly preview data, export as CSV or SQL, and explore directly with Metabase.** It's specifically designed to integrate seamlessly with Metabase for immediate data exploration and dashboard building.
- high · topics · #2 · Add relevant topics to improve categorization
  COPY-PASTE FIX:
    ai-data-generation, synthetic-data, metabase, business-intelligence, data-visualization, dataset-generator, llm, conversational-ai, data-modeling, sql-export, csv-export
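The topic list in fix #2 can also be applied programmatically. The following is a minimal sketch, assuming the GitHub REST endpoint `PUT /repos/{owner}/{repo}/topics` and a personal access token with permission to edit the repository. The helper names `validate_topics` and `build_topics_request` are hypothetical and do not come from the report. The sketch only builds the request; it does not send it.

```python
import json
import re
import urllib.request

# Topics copied from the COPY-PASTE FIX for item #2.
TOPICS = [
    "ai-data-generation", "synthetic-data", "metabase",
    "business-intelligence", "data-visualization", "dataset-generator",
    "llm", "conversational-ai", "data-modeling", "sql-export", "csv-export",
]

# GitHub topic rules: start with a lowercase letter or digit, contain only
# lowercase letters, digits, and hyphens, and be at most 50 characters long.
TOPIC_PATTERN = re.compile(r"^[a-z0-9][a-z0-9-]{0,49}$")
MAX_TOPICS = 20  # GitHub allows at most 20 topics per repository.


def validate_topics(topics):
    """Return the topics that break the naming rules above (hypothetical helper)."""
    return [t for t in topics if not TOPIC_PATTERN.match(t)]


def build_topics_request(owner, repo, topics, token):
    """Build, but do not send, the request that replaces all topics on a repo."""
    invalid = validate_topics(topics)
    if invalid:
        raise ValueError(f"invalid topic names: {invalid}")
    if len(topics) > MAX_TOPICS:
        raise ValueError(f"too many topics: {len(topics)} > {MAX_TOPICS}")
    body = json.dumps({"names": topics}).encode("utf-8")
    return urllib.request.Request(
        f"https://api.github.com/repos/{owner}/{repo}/topics",
        data=body,
        method="PUT",
        headers={
            "Accept": "application/vnd.github+json",
            "Authorization": f"Bearer {token}",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` with a real token would send it. Note that this endpoint replaces the full topic set rather than appending to it, so any existing topics worth keeping should be included in the list.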
- medium · comparison · #3 · Add a 'How is this different?' section to the README
  COPY-PASTE FIX:
    ## How is this different from Faker or Mockaroo?
    While tools like Faker and Mockaroo are excellent for generating simple, randomized data, the AI Dataset Generator offers a more sophisticated, AI-driven approach. It allows you to:
    - **Use natural language prompts** to define complex business scenarios and multi-table schemas.
    - **Generate contextually realistic data** that reflects specific business types, not just random values.
    - **Export data directly compatible with BI tools** like Metabase, including one-click launch for immediate exploration.
    - **Focus on relational data** suitable for dashboards and analytical use cases, rather than just individual fields.
Category GEO backends resolved for this scan: google/gemini-2.5-flash, deepseek/deepseek-v4-flash
Category visibility — the real GEO test
Brand-free queries asked to google/gemini-2.5-flash. Did AI recommend you, or someone else?
- Faker · recommended 2×
- Mockaroo · recommended 1×
- chancejs/chance · recommended 1×
- faker-ruby/faker · recommended 1×
- GenerateData.com · recommended 1×
- CATEGORY QUERY: How to quickly generate realistic sample datasets for application development and testing?
  You: not recommended
  AI recommended (in order):
  - Faker
  - Mockaroo
  - Chance.js (chancejs/chance)
  - Data Faker (faker-ruby/faker)
  - GenerateData.com
  - Synthea (synthetichealth/synthea)
  - Postman Mock Servers
AI recommended 7 alternatives but never named metabase/dataset-generator. This is the gap to close.
- CATEGORY QUERY: What AI tools can create synthetic data and export as SQL or CSV for analysis?
  You: not recommended
  AI recommended (in order):
  - Gretel.ai
  - Synthetic Data Vault (SDV)
  - Mostly AI
  - Synthesized
  - Faker
AI recommended 5 alternatives but never named metabase/dataset-generator. This is the gap to close.
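The recommendation counts in this section can be reproduced from the two ranked answers. The sketch below is an illustration only: it assumes each tool counts once per query and that names are compared exactly as written in the lists. How RepoGEO actually normalizes or tallies names is not stated in the report, and `tally_recommendations` is a hypothetical helper.

```python
from collections import Counter

REPO = "metabase/dataset-generator"

# Ranked answers copied from the two category queries in this report.
ANSWERS = {
    "sample datasets for development and testing": [
        "Faker", "Mockaroo", "Chance.js (chancejs/chance)",
        "Data Faker (faker-ruby/faker)", "GenerateData.com",
        "Synthea (synthetichealth/synthea)", "Postman Mock Servers",
    ],
    "AI tools for synthetic data with SQL or CSV export": [
        "Gretel.ai", "Synthetic Data Vault (SDV)", "Mostly AI",
        "Synthesized", "Faker",
    ],
}


def tally_recommendations(answers):
    """Count how many queries recommended each tool (once per query)."""
    counts = Counter()
    for recommended in answers.values():
        counts.update(set(recommended))
    return counts


counts = tally_recommendations(ANSWERS)
repo_recommended = any(REPO in name for names in ANSWERS.values() for name in names)

# Print tools from most to least recommended, then the repo's own status.
for name, n in counts.most_common():
    print(f"{name} · recommended {n}×")
print(f"{REPO} recommended: {repo_recommended}")
```

Under these assumptions Faker is the only tool named in both answers, and metabase/dataset-generator appears in neither.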
Objective checks
Rule-based audits of metadata signals AI engines weight most.
- Metadata completeness: warn
- README presence: pass
Self-mention check
Does AI even know your repo exists when asked about it directly?
- Compared to common alternatives in this category, what is the core differentiator of metabase/dataset-generator?
  pass · AI named metabase/dataset-generator explicitly
AI answers can be confidently wrong. Read for accuracy: does it match your actual tech stack, audience, and differentiator?
- If a team adopts metabase/dataset-generator in production, what risks or prerequisites should they evaluate first?
  pass · AI named metabase/dataset-generator explicitly
- In one sentence, what problem does the repo metabase/dataset-generator solve, and who is the primary audience?
  pass · AI did not name metabase/dataset-generator; likely talking about a different project
Embed your GEO score
Drop this badge into the README of metabase/dataset-generator. It auto-updates whenever the report is rescanned and links back to the latest report — easy public proof that you care about AI discoverability.
<a href="https://repogeo.com/en/r/metabase/dataset-generator"><img src="https://repogeo.com/badge/metabase/dataset-generator.svg" alt="RepoGEO" /></a>