⚖️ Factor AI - Agentic AI Legal Due Diligence Platform 🔍

Autonomous AI agents that batch analyze legal contracts for missing provisions, unusual terms, and risk flags - powered by AWS Strands Agents SDK and Amazon Bedrock AgentCore.

🚧 Status: Core agents stable · Dashboard live · AgentCore deployment ready

🤔 The Problem

M&A and financing due diligence requires reviewing 10–100+ contracts to identify missing clauses, non-standard terms, and cross-document inconsistencies. Manual review is slow, expensive, and error-prone.

💡 The Solution

Factor AI deploys a system of autonomous AI agents that collaboratively analyze batches of legal documents:

Ingest PDFs and DOCX files, extracting and chunking provisions
Detect provision types using pattern matching and AI classification
Score risk levels against configurable rubrics
Identify missing critical clauses via gap analysis
Compare provisions across documents for inconsistencies
Generate structured risk reports with Excel and HTML export
Guard every reasoning step with a financial circuit breaker and Arize Phoenix telemetry

🏛️ Architecture

┌─────────────────────────────────────────────────────────┐
│                    FACTOR AGENT SYSTEM                   │
│                                                         │
│  ┌───────────────────────────────────────────────────┐  │
│  │            Coordinator Agent                       │  │
│  │  Receives document batch, plans analysis strategy, │  │
│  │  delegates to specialist agents, assembles report  │  │
│  └──────────┬──────────┬──────────┬─────────────────┘  │
│             │          │          │                      │
│    ┌────────▼──┐ ┌─────▼─────┐ ┌─▼──────────┐         │
│    │ Ingestion │ │ Analysis  │ │ Knowledge  │         │
│    │   Agent   │ │   Agent   │ │   Agent    │         │
│    │ • Parse   │ │ • Detect  │ │ • RAG      │         │
│    │ • Chunk   │ │ • Score   │ │ • Classify │         │
│    │ • Extract │ │ • Gaps    │ │ • Citations│         │
│    └───────────┘ │ • Compare │ └────────────┘         │
│                  └───────────┘                          │
│    ┌─────────────┐                                      │
│    │  Reporting   │                                     │
│    │    Agent     │                                     │
│    │ • Reports   │                                     │
│    │ • Excel     │                                     │
│    │ • HTML      │                                     │
│    └─────────────┘                                      │
│                                                         │
│  ┌───────────────────────────────────────────────────┐  │
│  │  FINANCIAL-GUARDRAIL & TELEMETRY HARNESS           │  │
│  │  Circuit Breaker │ Budget Tracker │ Loop Detector  │  │
│  │  GuardedBedrockModel → audits token I/O per step   │  │
│  │  OpenTelemetry/OTLP → Arize Phoenix (traces UI)    │  │
│  └───────────────────────────────────────────────────┘  │
│                                                         │
│  ┌───────────────────────────────────────────────────┐  │
│  │  Bedrock AgentCore Runtime │ Memory │ Gateway      │  │
│  │  Policy │ Observability │ Identity                 │  │
│  │  Amazon Bedrock (Foundation Models)                │  │
│  │  S3 │ DynamoDB │ CloudWatch │ Cognito              │  │
│  └───────────────────────────────────────────────────┘  │
└─────────────────────────────────────────────────────────┘

🤖 Agent System

Agent	Role	Tools	Status
🎯 Coordinator	Orchestrates pipeline, delegates tasks, assembles results	`ingest_documents`, `analyze_provisions`, `search_knowledge`, `generate_report`	✅ Stable
📄 Ingestion	Parses PDF/DOCX, chunks into provisions	`parse_pdf`, `parse_docx`, `chunk_provisions`	✅ Stable
🔍 Analysis	Detects types, scores risk, finds gaps, compares	`detect_provision_type`, `score_risk`, `find_gaps`, `compare_across_documents`	✅ Stable
📚 Knowledge	Searches synthetic KB, classifies domains, extracts citations	`search_synthetic_knowledge`, `classify_domain`, `extract_citations`	✅ Stable
📊 Reporting	Builds reports, exports Excel/HTML	`build_risk_report`, `export_excel`, `export_html`	✅ Stable

✨ Core Capabilities

🔍 Provision Detection - 14 provision types identified via anchor patterns
📊 Risk Scoring - Configurable rubrics with weighted signals (0–10 scale)
⚠️ Gap Analysis - Standard checklists for NDAs, leases, loans, mergers, employment, license, and supply agreements
🔄 Cross-Document Comparison - Inconsistency detection across governing law, liability caps, termination terms
📚 RAG Knowledge Search - Synthetic legal knowledge base (Taylor658/synthetic-legal)
📋 Structured Reports - Executive summary, risk assessment, gap analysis, comparison results
📥 Export - Excel (with disclaimer tab) and HTML (with disclaimers on every page)
⚡ SSE Streaming - Real-time analysis progress via Server-Sent Events
💰 Financial Circuit Breaker - Per-session token budget enforced at every reasoning step; agents are hard-halted before runaway cost
🔁 Reasoning Loop Detection - Sliding-window detector halts agents stuck in repetitive, high-cost cycles
📡 Phoenix Telemetry - OpenTelemetry traces exported to a self-hosted Arize Phoenix instance for per-step token auditing
🛡️ Session Isolation - Cedar policies enforce per-user data access
🔒 Upload Validation - File type enforcement (PDF, DOCX, DOC, TXT) with size limits
🌐 Production CORS - Configurable origin restrictions for production deployments
🧹 Automatic Cleanup - Uploaded files are removed after analysis completes

📁 Repository Layout

factor/
├── src/factor/              # Python backend
│   ├── agents/              # Strands Agent definitions
│   ├── harness/             # Financial-guardrail & Phoenix telemetry harness
│   ├── tools/               # @tool decorated functions
│   ├── knowledge/           # ChromaDB vector store + dataset loader
│   ├── models/              # Pydantic data models
│   ├── aws/                 # Bedrock, AgentCore, S3, Cognito
│   ├── reporting/           # HTML report templates
│   ├── db/                  # Session store (thread-safe)
│   ├── app.py               # FastAPI + SSE streaming
│   └── config.py            # pydantic-settings
├── src/frontend/            # React 18 + TypeScript + Vite
│   └── src/
│       ├── components/      # Upload, Analysis, Report, shared
│       ├── hooks/           # useUpload, useAnalysis, useAgentStream
│       ├── api/             # API client
│       └── types/           # TypeScript types
├── tests/                   # pytest test suite
├── scripts/                 # Seed KB, generate samples, deploy, benchmark
├── policies/                # Cedar policy files
├── data/                    # Provision definitions, risk rubric, samples
├── infra/                   # AWS CDK stacks
├── docker-compose.yml       # Self-hosted Arize Phoenix (telemetry)
└── docker/                  # API + Frontend Dockerfiles, docker-compose

💰 Financial-Guardrail & Telemetry Harness

When agents autonomously batch-analyze 100+ legal documents, the loop of reading, extracting, and comparing can lead to runaway token consumption. The harness wraps Bedrock AgentCore execution to audit token input/output at every discrete reasoning step and acts as a financial circuit breaker — ensuring the operating cost of the AI never outpaces the value of the analysis.

How it works

Agent step → GuardedBedrockModel → CircuitBreaker.check()
                                       ├── SessionBudget   (token cost accounting)
                                       └── LoopDetector    (repetitive-cycle detection)
                                       ↓ trip → BudgetExceededError / ReasoningLoopError
OpenTelemetry span (gen_ai.usage.*) → GuardrailSpanProcessor → Arize Phoenix (OTLP)

Component	Responsibility
`SessionBudget`	Accumulates input/output token cost per session (Sonnet pricing: $3 / $15 per 1M)
`LoopDetector`	Sliding-window detection of repeated reasoning actions
`CircuitBreaker`	Combines budget + step-limit + loop checks; raises to hard-halt the agent
`GuardedBedrockModel`	Proxy around `BedrockModel` that checks the breaker before every invocation
`FinancialGuardrail`	Singleton registry of per-session circuit breakers
`GuardrailSpanProcessor`	Extracts token counts from OTel spans and feeds the breaker; exports to Phoenix

When a breaker trips, the /api/v1/analyze SSE stream emits a guardrail_halt event with the session's cost, step count, and trip reason.

Start Phoenix (self-hosted)

# Launch the Phoenix telemetry UI + OTLP collector
docker compose up -d phoenix

# Phoenix UI:        http://localhost:6006
# OTLP gRPC:         localhost:4317

Configuration

Env Var	Default	Description
`PHOENIX_ENABLED`	`true`	Enable OTLP export to Phoenix
`PHOENIX_OTLP_ENDPOINT`	`http://localhost:6006/v1/traces`	Phoenix trace collector endpoint
`GUARDRAIL_ENABLED`	`true`	Enable the financial circuit breaker
`GUARDRAIL_SESSION_BUDGET_USD`	`5.0`	Hard cost ceiling per analysis session
`GUARDRAIL_MAX_STEPS`	`200`	Maximum reasoning steps per session
`GUARDRAIL_LOOP_WINDOW`	`10`	Sliding window size for loop detection
`GUARDRAIL_LOOP_THRESHOLD`	`5`	Repeat count within window that trips a loop
`GUARDRAIL_INPUT_COST_PER_1M`	`3.0`	Input token price (USD per 1M)
`GUARDRAIL_OUTPUT_COST_PER_1M`	`15.0`	Output token price (USD per 1M)

🏁 Getting Started

Prerequisites

✅ Python 3.11+
✅ Node.js 20+
✅ AWS account with Bedrock access
✅ AWS CLI configured (aws configure)

Installation

# Clone the repository
git clone https://github.com/ATaylorAerospace/Factor-AI.git
cd Factor-AI

# Create virtual environment
python -m venv .venv && source .venv/bin/activate

# Install Python dependencies
pip install -r requirements.txt

# Seed the knowledge base
python scripts/seed_knowledge_base.py

# Start the API server
uvicorn src.factor.app:app --reload --port 8000

Frontend

cd src/frontend
npm install
npm run dev

Run Tests

pytest tests/ -v --cov=src/factor

🛠️ Technology Stack

Layer	Technology	Purpose
🤖 Agent Framework	Strands Agents SDK	Model-driven agents with @tool
🧠 Foundation Model	Amazon Bedrock (Anthropic Sonnet)	Reasoning + tool-use
⚡ Agent Runtime	Bedrock AgentCore Runtime	Serverless execution
💾 Agent Memory	Bedrock AgentCore Memory	Persistent context
🔧 Agent Gateway	Bedrock AgentCore Gateway	MCP tool access
🛡️ Agent Policy	Bedrock AgentCore Policy (Cedar)	Action boundaries
📊 Observability	Bedrock AgentCore + OTEL	Tracing + dashboards
📡 AI Telemetry	Arize Phoenix (OTLP, self-hosted)	Per-step token auditing + trace UI
💰 Cost Guardrail	Custom circuit breaker harness	Budget + loop-detection hard halt
🔐 Identity	Bedrock AgentCore Identity / Cognito	Authentication
🔢 Embeddings	sentence-transformers	Vector embeddings
📚 Vector Store	ChromaDB (local) / Bedrock KB (prod)	Dataset indexing
☁️ Storage	Amazon S3	Document storage
🗄️ Metadata	Amazon DynamoDB	Session + results
📄 Doc Parsing	PyMuPDF + python-docx + pdfplumber	Text extraction
🖥️ Frontend	React 18 + TypeScript + Vite + Tailwind	Dashboard
📋 Export	openpyxl + Jinja2	Reports
🏗️ IaC	AWS CDK (Python)	Infrastructure
🔄 CI/CD	GitHub Actions	Quality + deploy

⚠️ Synthetic Dataset Disclaimer

⚠️ CRITICAL: THIS IS A SYNTHETIC DATASET - ALL CONTENT IS ARTIFICIALLY GENERATED

Factor's knowledge base is powered by the Taylor658/synthetic-legal dataset on HuggingFace (140,000 rows, MIT License).

ALL text in this dataset is synthetically generated and IS NOT legally accurate. All citations, statutes, case references, legal problems, verified solutions, and pairings are synthetic constructs created through template-based randomization. No citations, statutes, or case references in this dataset are real.

This dataset exists for research, experimentation, and model training only.

📊 API Endpoints

Method	Endpoint	Description
`POST`	`/api/v1/analyze`	Upload documents (PDF, DOCX, DOC, TXT) + stream agentic analysis
`GET`	`/api/v1/sessions/{id}`	Session status + results
`GET`	`/api/v1/sessions/{id}/trace`	Agent reasoning trace
`GET`	`/api/v1/reports/{session_id}`	Structured report
`GET`	`/api/v1/reports/{session_id}/export`	Download Excel/HTML
`GET`	`/api/v1/sessions/{id}/budget`	Real-time guardrail budget + token status
`GET`	`/api/v1/guardrail/status`	Guardrail config + all active sessions
`GET`	`/api/v1/knowledge/search`	Search synthetic KB
`GET`	`/api/v1/knowledge/domains`	List legal domains
`GET`	`/api/v1/health`	Health check

🧪 Testing

# Run all tests with coverage
pytest tests/ -v --cov=src/factor

# Run specific test modules
pytest tests/test_tools/ -v          # Tool tests
pytest tests/test_agents/ -v         # Agent tests
pytest tests/test_knowledge/ -v      # Knowledge base tests

Tests cover:

✅ Each @tool function independently with assertions
✅ Agent creation with mocked Bedrock responses
✅ Synthetic dataset loading and metadata
✅ ChromaDB vector store operations
✅ Provision detection, scoring, and gap analysis
✅ Cross-document comparison and inconsistency detection
✅ Domain classification across 13 legal domains
✅ Citation extraction (cases, statutes, regulations)
✅ Report building, Excel export, and HTML export
✅ Financial guardrail: budget accounting, loop detection, circuit breaker trips
✅ All outputs label synthetic content

🚀 Deployment

Docker

# Start both API and frontend services
docker compose -f docker/docker-compose.yml up --build

AWS CDK (AgentCore)

# Deploy infrastructure
cd infra && cdk deploy --all

# Deploy agent configuration
python scripts/deploy_agentcore.py --env production

🙏 Contributing

Contributions are welcome! Please see the issue templates for bug reports and feature requests.

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'feat: add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

👤 Author

A Taylor · 2026

Name		Name	Last commit message	Last commit date
Latest commit History 90 Commits
.github		.github
data		data
docker		docker
docs		docs
infra		infra
policies		policies
scripts		scripts
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTING.MD		CONTRIBUTING.MD
LICENSE		LICENSE
README.md		README.md
SECURITY.MD		SECURITY.MD
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

⚖️ Factor AI - Agentic AI Legal Due Diligence Platform 🔍

🤔 The Problem

💡 The Solution

🏛️ Architecture

🤖 Agent System

✨ Core Capabilities

📁 Repository Layout

💰 Financial-Guardrail & Telemetry Harness

How it works

Start Phoenix (self-hosted)

Configuration

🏁 Getting Started

Prerequisites

Installation

Frontend

Run Tests

🛠️ Technology Stack

⚠️ Synthetic Dataset Disclaimer

📊 API Endpoints

🧪 Testing

🚀 Deployment

Docker

AWS CDK (AgentCore)

🙏 Contributing

👤 Author

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

⚖️ Factor AI - Agentic AI Legal Due Diligence Platform 🔍

🤔 The Problem

💡 The Solution

🏛️ Architecture

🤖 Agent System

✨ Core Capabilities

📁 Repository Layout

💰 Financial-Guardrail & Telemetry Harness

How it works

Start Phoenix (self-hosted)

Configuration

🏁 Getting Started

Prerequisites

Installation

Frontend

Run Tests

🛠️ Technology Stack

⚠️ Synthetic Dataset Disclaimer

📊 API Endpoints

🧪 Testing

🚀 Deployment

Docker

AWS CDK (AgentCore)

🙏 Contributing

👤 Author

📄 License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages