Dingo & Co. Knowledge Work
A 23-deliverable consulting brief: research, financial reconciliation, regulatory analysis, decks and spreadsheets. Tests whether a model can run an entire knowledge-work engagement end to end.
This is complete and strategically aware, but strict scoring should not treat completion as reliability. The evidence package shows only one URL in sources, regulatory analysis is broad and under-cited, required visual artifacts ignore provided source imagery, spreadsheets have no charts, and the sales/deck/dashboard outputs are valid but underproduced.
What it nailed
- All required files exist as real artifacts with exact filenames.
- Assumptions file reconciles revenue, budget, launch date, attach-rate, and source-data contradictions.
- Investor FAQ candidly addresses TAM, ethics, Alaska mismatch, import-created demand, and rehoming gaps.
- Personas identify non-buyer curiosity traffic and adjacent wolfdog demand.
Where it slipped
- 00_sources.md has only one URL and relies on unverified Perplexity-style research notes.
- Deck, PDF one-pager, and dashboard ignore provided source images despite the manifest claiming usage.
- Jurisdiction tables do not adequately separate ownership, import, transport, quarantine, and local restrictions.
- Sales one-pager and board deck are valid files but visually underproduced.
- Spreadsheets have formulas but no charts.
- Raw model output/transcript evidence is absent from the evaluation package.