Field-Trusted GenAI Documentation
Tableau Workbook Knowledge Platform
Built an end-to-end documentation platform: parse TWB/TWBX into condensed metadata, two-pass Amazon Bedrock generation (draft + metadata-grounded refine), rule-based validator for calc/param/datasource coverage, versioned doc store (S3 pattern), and MCP-style retrieval tools. Local demo uses template mock; real Bedrock via shared portfolio_aws library + CDK-deployed Lambda and Secrets Manager.
Challenge
Tableau workbooks accumulate calculated fields, parameters, and dashboards that only original authors understand—blocking self-service analytics, safe GenAI answers, and consistent metric definitions across teams.
Solution
Built an end-to-end documentation platform: parse TWB/TWBX into condensed metadata, two-pass Amazon Bedrock generation (draft + metadata-grounded refine), rule-based validator for calc/param/datasource coverage, versioned doc store (S3 pattern), and MCP-style retrieval tools. Local demo uses template mock; real Bedrock via shared portfolio_aws library + CDK-deployed Lambda and Secrets Manager.
Impact Metrics
Results
- • 4/4 validation checks passed on demo workbook
- • Two-pass doc generation with grounding against metadata.json
- • MCP-style tools: list_workbooks, get_workbook_doc, search_docs
- • CDK stack: API Gateway → Lambda → Bedrock Converse → S3
- • No AWS credentials required for default local mock path
Business Impact
Turns tribal Tableau knowledge into field-trusted documentation that humans and agents can query—foundation for governed GenAI over BI assets.
Architecture
Sibling to Tableau → QuickSight Migration Assistant; shares metadata ingest layer.
Metadata → Trusted Docs
A proven approach combining statistical rigor, automation, and AWS best practices.
Parse & Condense
Extract data sources, calculated fields, parameters, sheets, and dashboards into minimal metadata.json.
Two-Pass Generation
Bedrock Converse draft, then refine pass constrained to metadata—reduces hallucinated field names.
Rule Validator
Non-LLM checks: calc coverage, param names, datasource references, required markdown sections.
Publish & Retrieve
Versioned S3-style layout; keyword search and MCP read tools for agents and analysts.
Scale & Scope
Technology Stack
- Python TWB XML parser
- Amazon Bedrock Converse API
- AWS CDK (Lambda, API Gateway, S3, Secrets Manager)
- portfolio_aws shared library
- MCP-style read server
Need a similar solution?
Let's replicate this success within your organization with a tailored engagement plan.