The 7 Pillars of Truth
The architecture implements a seven-stage pipeline to convert unstructured RFP documents into structured data. It establishes clear lineage from raw requirements to validated capabilities, ensuring traceability and evidence-based verification throughout the process.
Atomization Logic
Before semantic understanding, we must achieve Structural Integrity. The system employs a "Docling" layout parser to capture metadata context (page numbers, section headers) before breaking content into atoms.
Why this matters: A standard LLM sees a "soup of text." By parsing the layout first, we know that "10 days" isn't just a numberβit's a constraint belonging to Section 4.1.
Each atom keeps a verbatim anchor to the legal source while being stored in an append-only ledger. Amendments create new atoms that supersede prior versions, Q&A items clarify active requirements, and any unmatched references trigger an orphan check for human review.
"The Contractor shall provide a dedicated Project Manager who possesses a current PMP certification... and who must be available to report on-site within ten (10) days of contract award."
The Garbage-In Problem
Not all data is equal. The Hygiene Protocol calculates a composite Quality Vector based on three factors: Source Verification, Time Decay (Freshness), and Usage Frequency.
Signed contracts and audited financials.
*Subject to time decay.
Submitted proposals and resumes < 6 months old.
Marketing slicks, wiki drafts, and outdated assets.
Two-Pass Reconciliation
Finding the right evidence requires balancing speed and accuracy. We separate retrieval from adjudication using a Vector Triangulation phase followed by a Critic Agent review.
The Filtering Funnel
Phase A: Vector (The Net)
Rapidly scans thousands of assets using Cosine Similarity.
"Find anything mentioning Oracle and Postgres."
Phase B: Critic (The Filter)
Applies deep reasoning to check the Verbatim Anchor.
"Discard results that don't explicitly show migration FROM Oracle TO Postgres."
The Evidence Interlock
Complex requirements rarely map to a single document. The system creates a Composite Proof (or "Zipper") that clusters multiple validated assets.
Interactive Demo: Select a scenario to see how the system clusters evidence or flags failure when an interlock is missing.
Asset Selection Logic
X: Freshness (Days) | Y: Relevance Score | Color: Trust Tier
Operational Gateways
Automation is not abdication. The system implements hard-coded Human-in-the-Loop checkpoints. This is not just a "review"βit's a control plane where human rejection forces the AI to loop back and retry.
Gateways also resolve orphaned amendments and approve supersedes/clarifies lineage links before any requirement is treated as binding.
Interactive Demo: See what the human operator actually controls at each stage of the lifecycle.
"Contractor must be US Citizen and possess TS/SCI."