Anvik AI
Case study

Ministry of Statistics (India)

The goal wasn’t a flashy chatbot — it was an evidence-first system for dense policy and statistical documentation, where every answer must be provable.

Impact snapshot
Typical analyst workflow
manual hunting → evidence packs with citations
Answer quality
grounded responses + trace paths
Production readiness
governance-first deployment blueprint
Challenges

Why this problem is hard.

Dense, heterogeneous corpora

Long PDFs, circulars, annexures, tables, scanned pages, and frequent cross-references.

Relationship questions

Answers often require multi-hop reasoning: definitions → exceptions → eligibility → reporting obligations.

Public-sector rigor

Outputs must be defensible: citations, traceability, and repeatability across versions.

Solution

What we built.

The architecture focuses on four guarantees: preserve structure, extract deterministically, resolve entities consistently, and return answers that are auditable.

Structure-preserving ingestion
  • Layout-aware parsing (sections, tables, references)
  • Page-level citations and stable evidence IDs
  • Extraction reporting for coverage and gaps
Knowledge graph layer
  • Entity + relation extraction into controlled schemas
  • Entity normalization and deduplication
  • Cross-document linking for multi-hop traversal
Hybrid retrieval + evaluation
  • Graph-constrained retrieval for relationship questions
  • Table-aware retrieval to avoid losing critical numbers
  • Evaluation suite to measure groundedness and evidence quality
Governance-first outputs
  • Citation-first answers with trace paths
  • Audit-ready logs of retrieval + generation
  • Guardrails: confidence thresholds and gap reporting
Outcomes

What changed.

Faster evidence discovery

Analysts move from manual document hunting to targeted evidence packs and traceable answers.

Better decision support

Relationship queries become practical: dependencies, exceptions, and cross-references are captured in the graph.

Reusable blueprint

The pipeline design can be replicated across other ministries, regulators, or compliance-heavy organizations.

Next
Want this blueprint for your organization?

We run evaluation-first engagements around the questions your teams need to answer, the evidence they rely on, and the controls required for production.