Autonomous AI agents that batch analyze legal contracts for missing provisions, unusual terms, and risk flags - powered by AWS Strands Agents SDK and Amazon Bedrock AgentCore.
🚧 Status: Core agents stable · Dashboard live · AgentCore deployment ready
M&A and financing due diligence requires reviewing 10–100+ contracts to identify missing clauses, non-standard terms, and cross-document inconsistencies. Manual review is slow, expensive, and error-prone.
Factor AI deploys a system of autonomous AI agents that collaboratively analyze batches of legal documents:
- Ingest PDFs and DOCX files, extracting and chunking provisions
- Detect provision types using pattern matching and AI classification
- Score risk levels against configurable rubrics
- Identify missing critical clauses via gap analysis
- Compare provisions across documents for inconsistencies
- Generate structured risk reports with Excel and HTML export
- Guard every reasoning step with a financial circuit breaker and Arize Phoenix telemetry
┌─────────────────────────────────────────────────────────┐
│ FACTOR AGENT SYSTEM │
│ │
│ ┌───────────────────────────────────────────────────┐ │
│ │ Coordinator Agent │ │
│ │ Receives document batch, plans analysis strategy, │ │
│ │ delegates to specialist agents, assembles report │ │
│ └──────────┬──────────┬──────────┬─────────────────┘ │
│ │ │ │ │
│ ┌────────▼──┐ ┌─────▼─────┐ ┌─▼──────────┐ │
│ │ Ingestion │ │ Analysis │ │ Knowledge │ │
│ │ Agent │ │ Agent │ │ Agent │ │
│ │ • Parse │ │ • Detect │ │ • RAG │ │
│ │ • Chunk │ │ • Score │ │ • Classify │ │
│ │ • Extract │ │ • Gaps │ │ • Citations│ │
│ └───────────┘ │ • Compare │ └────────────┘ │
│ └───────────┘ │
│ ┌─────────────┐ │
│ │ Reporting │ │
│ │ Agent │ │
│ │ • Reports │ │
│ │ • Excel │ │
│ │ • HTML │ │
│ └─────────────┘ │
│ │
│ ┌───────────────────────────────────────────────────┐ │
│ │ FINANCIAL-GUARDRAIL & TELEMETRY HARNESS │ │
│ │ Circuit Breaker │ Budget Tracker │ Loop Detector │ │
│ │ GuardedBedrockModel → audits token I/O per step │ │
│ │ OpenTelemetry/OTLP → Arize Phoenix (traces UI) │ │
│ └───────────────────────────────────────────────────┘ │
│ │
│ ┌───────────────────────────────────────────────────┐ │
│ │ Bedrock AgentCore Runtime │ Memory │ Gateway │ │
│ │ Policy │ Observability │ Identity │ │
│ │ Amazon Bedrock (Foundation Models) │ │
│ │ S3 │ DynamoDB │ CloudWatch │ Cognito │ │
│ └───────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────┘
| Agent | Role | Tools | Status |
|---|---|---|---|
| 🎯 Coordinator | Orchestrates pipeline, delegates tasks, assembles results | ingest_documents, analyze_provisions, search_knowledge, generate_report |
✅ Stable |
| 📄 Ingestion | Parses PDF/DOCX, chunks into provisions | parse_pdf, parse_docx, chunk_provisions |
✅ Stable |
| 🔍 Analysis | Detects types, scores risk, finds gaps, compares | detect_provision_type, score_risk, find_gaps, compare_across_documents |
✅ Stable |
| 📚 Knowledge | Searches synthetic KB, classifies domains, extracts citations | search_synthetic_knowledge, classify_domain, extract_citations |
✅ Stable |
| 📊 Reporting | Builds reports, exports Excel/HTML | build_risk_report, export_excel, export_html |
✅ Stable |
- 🔍 Provision Detection - 14 provision types identified via anchor patterns
- 📊 Risk Scoring - Configurable rubrics with weighted signals (0–10 scale)
⚠️ Gap Analysis - Standard checklists for NDAs, leases, loans, mergers, employment, license, and supply agreements- 🔄 Cross-Document Comparison - Inconsistency detection across governing law, liability caps, termination terms
- 📚 RAG Knowledge Search - Synthetic legal knowledge base (Taylor658/synthetic-legal)
- 📋 Structured Reports - Executive summary, risk assessment, gap analysis, comparison results
- 📥 Export - Excel (with disclaimer tab) and HTML (with disclaimers on every page)
- ⚡ SSE Streaming - Real-time analysis progress via Server-Sent Events
- 💰 Financial Circuit Breaker - Per-session token budget enforced at every reasoning step; agents are hard-halted before runaway cost
- 🔁 Reasoning Loop Detection - Sliding-window detector halts agents stuck in repetitive, high-cost cycles
- 📡 Phoenix Telemetry - OpenTelemetry traces exported to a self-hosted Arize Phoenix instance for per-step token auditing
- 🛡️ Session Isolation - Cedar policies enforce per-user data access
- 🔒 Upload Validation - File type enforcement (PDF, DOCX, DOC, TXT) with size limits
- 🌐 Production CORS - Configurable origin restrictions for production deployments
- 🧹 Automatic Cleanup - Uploaded files are removed after analysis completes
factor/
├── src/factor/ # Python backend
│ ├── agents/ # Strands Agent definitions
│ ├── harness/ # Financial-guardrail & Phoenix telemetry harness
│ ├── tools/ # @tool decorated functions
│ ├── knowledge/ # ChromaDB vector store + dataset loader
│ ├── models/ # Pydantic data models
│ ├── aws/ # Bedrock, AgentCore, S3, Cognito
│ ├── reporting/ # HTML report templates
│ ├── db/ # Session store (thread-safe)
│ ├── app.py # FastAPI + SSE streaming
│ └── config.py # pydantic-settings
├── src/frontend/ # React 18 + TypeScript + Vite
│ └── src/
│ ├── components/ # Upload, Analysis, Report, shared
│ ├── hooks/ # useUpload, useAnalysis, useAgentStream
│ ├── api/ # API client
│ └── types/ # TypeScript types
├── tests/ # pytest test suite
├── scripts/ # Seed KB, generate samples, deploy, benchmark
├── policies/ # Cedar policy files
├── data/ # Provision definitions, risk rubric, samples
├── infra/ # AWS CDK stacks
├── docker-compose.yml # Self-hosted Arize Phoenix (telemetry)
└── docker/ # API + Frontend Dockerfiles, docker-compose
When agents autonomously batch-analyze 100+ legal documents, the loop of reading, extracting, and comparing can lead to runaway token consumption. The harness wraps Bedrock AgentCore execution to audit token input/output at every discrete reasoning step and acts as a financial circuit breaker — ensuring the operating cost of the AI never outpaces the value of the analysis.
Agent step → GuardedBedrockModel → CircuitBreaker.check()
├── SessionBudget (token cost accounting)
└── LoopDetector (repetitive-cycle detection)
↓ trip → BudgetExceededError / ReasoningLoopError
OpenTelemetry span (gen_ai.usage.*) → GuardrailSpanProcessor → Arize Phoenix (OTLP)
| Component | Responsibility |
|---|---|
SessionBudget |
Accumulates input/output token cost per session (Sonnet pricing: $3 / $15 per 1M) |
LoopDetector |
Sliding-window detection of repeated reasoning actions |
CircuitBreaker |
Combines budget + step-limit + loop checks; raises to hard-halt the agent |
GuardedBedrockModel |
Proxy around BedrockModel that checks the breaker before every invocation |
FinancialGuardrail |
Singleton registry of per-session circuit breakers |
GuardrailSpanProcessor |
Extracts token counts from OTel spans and feeds the breaker; exports to Phoenix |
When a breaker trips, the /api/v1/analyze SSE stream emits a guardrail_halt event with the session's cost, step count, and trip reason.
# Launch the Phoenix telemetry UI + OTLP collector
docker compose up -d phoenix
# Phoenix UI: http://localhost:6006
# OTLP gRPC: localhost:4317| Env Var | Default | Description |
|---|---|---|
PHOENIX_ENABLED |
true |
Enable OTLP export to Phoenix |
PHOENIX_OTLP_ENDPOINT |
http://localhost:6006/v1/traces |
Phoenix trace collector endpoint |
GUARDRAIL_ENABLED |
true |
Enable the financial circuit breaker |
GUARDRAIL_SESSION_BUDGET_USD |
5.0 |
Hard cost ceiling per analysis session |
GUARDRAIL_MAX_STEPS |
200 |
Maximum reasoning steps per session |
GUARDRAIL_LOOP_WINDOW |
10 |
Sliding window size for loop detection |
GUARDRAIL_LOOP_THRESHOLD |
5 |
Repeat count within window that trips a loop |
GUARDRAIL_INPUT_COST_PER_1M |
3.0 |
Input token price (USD per 1M) |
GUARDRAIL_OUTPUT_COST_PER_1M |
15.0 |
Output token price (USD per 1M) |
- ✅ Python 3.11+
- ✅ Node.js 20+
- ✅ AWS account with Bedrock access
- ✅ AWS CLI configured (
aws configure)
# Clone the repository
git clone https://github.com/ATaylorAerospace/Factor-AI.git
cd Factor-AI
# Create virtual environment
python -m venv .venv && source .venv/bin/activate
# Install Python dependencies
pip install -r requirements.txt
# Seed the knowledge base
python scripts/seed_knowledge_base.py
# Start the API server
uvicorn src.factor.app:app --reload --port 8000cd src/frontend
npm install
npm run devpytest tests/ -v --cov=src/factor| Layer | Technology | Purpose |
|---|---|---|
| 🤖 Agent Framework | Strands Agents SDK | Model-driven agents with @tool |
| 🧠 Foundation Model | Amazon Bedrock (Anthropic Sonnet) | Reasoning + tool-use |
| ⚡ Agent Runtime | Bedrock AgentCore Runtime | Serverless execution |
| 💾 Agent Memory | Bedrock AgentCore Memory | Persistent context |
| 🔧 Agent Gateway | Bedrock AgentCore Gateway | MCP tool access |
| 🛡️ Agent Policy | Bedrock AgentCore Policy (Cedar) | Action boundaries |
| 📊 Observability | Bedrock AgentCore + OTEL | Tracing + dashboards |
| 📡 AI Telemetry | Arize Phoenix (OTLP, self-hosted) | Per-step token auditing + trace UI |
| 💰 Cost Guardrail | Custom circuit breaker harness | Budget + loop-detection hard halt |
| 🔐 Identity | Bedrock AgentCore Identity / Cognito | Authentication |
| 🔢 Embeddings | sentence-transformers | Vector embeddings |
| 📚 Vector Store | ChromaDB (local) / Bedrock KB (prod) | Dataset indexing |
| ☁️ Storage | Amazon S3 | Document storage |
| 🗄️ Metadata | Amazon DynamoDB | Session + results |
| 📄 Doc Parsing | PyMuPDF + python-docx + pdfplumber | Text extraction |
| 🖥️ Frontend | React 18 + TypeScript + Vite + Tailwind | Dashboard |
| 📋 Export | openpyxl + Jinja2 | Reports |
| 🏗️ IaC | AWS CDK (Python) | Infrastructure |
| 🔄 CI/CD | GitHub Actions | Quality + deploy |
⚠️ CRITICAL: THIS IS A SYNTHETIC DATASET - ALL CONTENT IS ARTIFICIALLY GENERATEDFactor's knowledge base is powered by the Taylor658/synthetic-legal dataset on HuggingFace (140,000 rows, MIT License).
ALL text in this dataset is synthetically generated and IS NOT legally accurate. All citations, statutes, case references, legal problems, verified solutions, and pairings are synthetic constructs created through template-based randomization. No citations, statutes, or case references in this dataset are real.
This dataset exists for research, experimentation, and model training only.
| Method | Endpoint | Description |
|---|---|---|
POST |
/api/v1/analyze |
Upload documents (PDF, DOCX, DOC, TXT) + stream agentic analysis |
GET |
/api/v1/sessions/{id} |
Session status + results |
GET |
/api/v1/sessions/{id}/trace |
Agent reasoning trace |
GET |
/api/v1/reports/{session_id} |
Structured report |
GET |
/api/v1/reports/{session_id}/export |
Download Excel/HTML |
GET |
/api/v1/sessions/{id}/budget |
Real-time guardrail budget + token status |
GET |
/api/v1/guardrail/status |
Guardrail config + all active sessions |
GET |
/api/v1/knowledge/search |
Search synthetic KB |
GET |
/api/v1/knowledge/domains |
List legal domains |
GET |
/api/v1/health |
Health check |
# Run all tests with coverage
pytest tests/ -v --cov=src/factor
# Run specific test modules
pytest tests/test_tools/ -v # Tool tests
pytest tests/test_agents/ -v # Agent tests
pytest tests/test_knowledge/ -v # Knowledge base testsTests cover:
- ✅ Each
@toolfunction independently with assertions - ✅ Agent creation with mocked Bedrock responses
- ✅ Synthetic dataset loading and metadata
- ✅ ChromaDB vector store operations
- ✅ Provision detection, scoring, and gap analysis
- ✅ Cross-document comparison and inconsistency detection
- ✅ Domain classification across 13 legal domains
- ✅ Citation extraction (cases, statutes, regulations)
- ✅ Report building, Excel export, and HTML export
- ✅ Financial guardrail: budget accounting, loop detection, circuit breaker trips
- ✅ All outputs label synthetic content
# Start both API and frontend services
docker compose -f docker/docker-compose.yml up --build# Deploy infrastructure
cd infra && cdk deploy --all
# Deploy agent configuration
python scripts/deploy_agentcore.py --env productionContributions are welcome! Please see the issue templates for bug reports and feature requests.
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'feat: add amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
MIT © 2026 A Taylor See LICENSE for details.
