This archive is designed as a discovery layer — a starting point for research, not a final answer. Think of it as a map that helps you find documents faster, but you still need to read the documents themselves.
What Makes This Different
Traditional Government Sites
Linear search
Bureaucratic navigation
Static PDFs
Limited connections
This Archive
Graph-based exploration
Entity relationships
Timeline visualization
Multi-modal search (text, video, audio)
Interactive discovery
The Right Way to Use This Tool
DO
Use it to discover documents and connections
Verify with primary sources (original court filings, DOJ releases)
Cross-reference findings across multiple documents
Follow the citation links to original sources
Think critically about co-occurrence vs. causation
DON'T
Assume co-occurrence means causation
Take entity connections as proof of wrongdoing
Skip verification with original documents
Treat frequency as evidence
Draw conclusions without primary source confirmation
“No inference without primary source confirmation. That's how journalists and prosecutors think.”
Who This Is For
Researchers investigating specific connections
Journalists following document trails
Investigators mapping relationships
Technical minds who think in graphs and patterns
Anyone who wants to explore the documents efficiently
Understanding the Data
What's Included
1,321,030 documents from official releases
2,291 videos with AI-generated transcripts
152 audio files with speech-to-text transcription
230,000 entities extracted from documents
Network relationships mapped from co-occurrence
Timeline data extracted from document dates
What This Means
Documents are real (from DOJ, House Oversight, court records)
Entities are extracted via AI (may have errors)
Connections show co-occurrence (not proof of relationship)
Transcripts are AI-generated (may have inaccuracies)
Dates are extracted (may be filing date, not event date)
Always Ask
Is this co-occurrence or documented interaction?
Is this primary evidence or just mention?
What is the context of the document?
Was this released via court order or media disclosure?
Is the date the event date or the filing date?
Verification Sources
Always cross-reference findings with official sources: