๐Ÿš€
You upload a file. The pipeline does the rest.
Name your file correctly, drop it into data/uploads/ on GitHub, commit โ€” and within minutes the ingestion, synthesis, and review agents run automatically. Check the Pipeline tab for results.
1
Name Your File Correctly
The pipeline infers the value chain from the filename โ€” this is the only manual step you must get right.
Rule: Start the filename with the value chain code (e.g. VC01_), then a short description, then the year.

Format: VCxx_description_year.ext

CodeValue ChainGood filename example
VC01Iron & SteelVC01_market_survey_2024.pdf
VC02Copper & Allied MetalsVC02_production_data_2024.xlsx
VC03AutomotiveVC03_KMC_capacity_report.docx
VC04Textiles & GarmentsVC04_firms_survey_2025.pdf
VC05PharmaceuticalsVC05_NDA_register_2024.xlsx
VC06Petrochemicals & FertilizersVC06_kabalega_project_brief.pdf
VC07Sugar & ConfectioneryVC07_USPA_statistics_2024.pdf
VC08Plastics & PackagingVC08_industry_association_data.xlsx
VC09Cement & Building MaterialsVC09_UCHA_capacity_2024.docx
โš ๏ธ
If the filename does NOT start with VCxx, the pipeline will try to guess the chain from keywords in the content. This sometimes works, but can result in data going to the wrong chain. Always use the VCxx prefix.

Accepted file types: PDF, DOCX, XLSX, CSV. Maximum size: no hard limit (content is truncated at 80,000 characters for Claude processing).

2
Upload to data/uploads/ on GitHub
Navigate to the folder and drag-drop or click "Add file".
github.com/MTIC-Uganda/industrial_diagnostic_study/tree/main/data/uploads
๐Ÿ“
MTIC-Uganda / industrial_diagnostic_study
Branch: main
industrial_diagnostic_study / data / uploads / โ–ผ
Drop files here to upload, or use Add file โ†’
โ†‘ Go to file + Add file โ–พ
๐Ÿ“„.gitkeepnaming convention docs
๐Ÿ“คVC01_market_survey_2024.pdfโŸต your file goes here
โ„น๏ธ
Two ways to upload:
a) Browser drag-drop: Navigate to data/uploads/ โ†’ click "Add file" โ†’ "Upload files" โ†’ drag your file โ†’ scroll down โ†’ write a short commit message โ†’ click "Commit changes".
b) Git on your laptop: Copy the file into data/uploads/, then git add โ†’ git commit โ†’ git push.
3
Write a Commit Message and Commit
The commit message is how the team knows what was uploaded โ€” keep it short and descriptive.
github.com/MTIC-Uganda/industrial_diagnostic_study/upload/main/data/uploads
Commit changes
data: add VC01 Iron & Steel market survey 2024
Source: Uganda Bureau of Standards / Commissioner's office ...
Cancel Commit changes

Good commit message format: data: add VC<XX> <short description> <year>

4
The Pipeline Runs Automatically
GitHub Actions detects the upload and triggers three agents in sequence.
๐Ÿ“ค You upload
data/uploads/
โ†’
๐Ÿ“ฅ Ingest Agent
mines doc + trusted sources
โ†’
โœ๏ธ Synthesis Agent
writes diagnostic chapter
โ†’
โœ… Review Agent
validates 32 checks
โœ…
PUBLISH verdict: The review agent opens a GitHub Pull Request with the finished chapter. Jerome or Solomon merges it to publish.
โŒ
REVISION verdict: The review agent writes a revision notice to data/revision/ and sends a WhatsApp notification. The notice explains exactly what data is missing and which fields need to be filled.
How long does it take?
Ingest Agent1โ€“3 minutes (depends on document size and Claude processing time)
Synthesis Agent2โ€“5 minutes (Claude writes the full chapter)
Review Agent1โ€“3 minutes (Claude evaluates 32 quality gate checks)

Total: typically 5โ€“12 minutes from commit to verdict.

5
Check the Pipeline Status
Two places to see what happened: the dashboard Pipeline tab, and GitHub Actions.
๐Ÿ”„ Dashboard โ€” Pipeline Tab

Go to the main dashboard โ†’ click the ๐Ÿ”„ Pipeline tab. Each value chain shows its Ingest / Synthesise / Review status at a glance, plus the revision summary if something needs fixing.

Open Dashboard โ†’
โš™๏ธ GitHub Actions

For detailed logs: go to the repo โ†’ Actions tab โ†’ find the workflow run triggered by your commit โ†’ click "Ingestion Agent", "Synthesis Agent", or "Review Agent" to see the full output.

github.com/MTIC-Uganda/industrial_diagnostic_study/actions
All workflows
Deploy MTIC Value Chains App
data: add VC01 Iron & Steel market survey 2024 ยท main ยท 8 minutes ago
โœ“ completed
๐Ÿ“ฅ Ingestion Agent ยท 2m 14s
โœ๏ธ Synthesis Agent ยท 4m 07s
๐Ÿ” Review Agent ยท 2m 53s โ€” 3 mandatory failures
6
If Revision Is Required โ€” Read the Notice
The review agent writes exactly what is missing and what you need to do.
โ„น๏ธ
A REVISION verdict does not mean your document was wrong โ€” it means some mandatory data fields are still empty after all three ingestion methods ran. The notice tells you which fields and what source to consult.
## Revision Notice โ€” Iron & Steel (VC01)
Run: 20260615T103412Z | Verdict: RETURN_FOR_REVISION

### E-01 โ€” STAGE_1_EVIDENCE
Check: All evidence_required=true fields have populated values or explicit not_available with source note.
Failing fields: D1.02, D1.03, D3.01
Fix required: Add TradeMap export/import figures for Iron & Steel (HS 72). Run the trusted_source method manually or upload a TradeMap CSV named VC01_trademap_2024.csv.

## Summary
Three evidence-required fields are missing trade volume data. Upload a TradeMap CSV for VC01 to resolve all three failures in one run.
What to do:
  1. Read the revision notice โ€” it names the exact field IDs and the data source needed.
  2. Find the missing data from the suggested source (e.g. TradeMap, UBOS, firm records).
  3. Save it as a new file named with the correct VCxx prefix.
  4. Upload to data/uploads/ and commit โ€” the pipeline re-runs automatically.
7
Approving Gap-Fill Proposals (Optional)
When the pipeline cannot find data anywhere, it proposes candidate values for your review.

Sometimes the ingestion agent exhausts all three methods (doc mining, trusted sources, web search) and still cannot find a value for a mandatory field. In that case it proposes a candidate from general knowledge and opens a GitHub Pull Request for your approval.

โš ๏ธ
Your merge = your approval. Before merging a gap-fill PR, verify each proposed value by clicking the source_url link in the JSON file. If a value is wrong, edit it before merging. Closing without merging = rejection โ€” nothing enters the dataset.
github.com/MTIC-Uganda/industrial_diagnostic_study/pull/62
Data review: web gap-fill proposals 2026-06-15
jnuwabaasa-spec opened this pull request ยท 1 file changed
The ingestion agent could not find values for some mandatory fields.
It has proposed candidate values from general knowledge.

โœ” Review each value in data/proposed/web_gap_fill_20260615.json
Check the source_url for each field. If correct: merge. If wrong: edit first.
Merge pull request โœ“ Close pull request
โœ…
That's the whole workflow. Name โ†’ Upload โ†’ Commit โ†’ Wait 5โ€“12 min โ†’ Check Pipeline tab. If revision required, follow the notice and upload again. The pipeline is self-correcting โ€” every upload gets you closer to a published chapter.