FairProp Inspector

The Compliance Layer for Real Estate AI Agent.

FairProp Inspector is a high-performance, latency-critical inference engine designed to detect Fair Housing Act (FHA) violations in real-time. Unlike legacy regex-based solutions, FairProp leverages Small Language Models (SLMs) fine-tuned on compliance datasets to understand context, nuance, and intent.

Built for the On-Device AI era, it runs efficiently on edge hardware while maintaining privacy-first architecture.

graph TD
    A[FHA Rules & Heuristics] --> B[Synthetic Generator <i>(GPT-4o Distillation)</i>]
    B --> C[(Synthetic Dataset)]
    C --> D[ModernBERT Fine-tuning <i>(BF16 / FlashAttention)</i>]
    D --> E{Model Serialization}
    E --> F[PyTorch Checkpoint]
    E --> G[ONNX Export <i>(Quantized)</i>]
    G --> H[Edge Inference <i>(Browser/Embedded)</i>]
    F --> I[Compliance API/Platform]

Part of the FairProp AI Platform ecosystem.

🚀 Key Features

SOTA Architecture: Powered by ModernBERT, delivering 8192 context length and Flash Attention backend.
Edge-Native: Optimized for ONNX Runtime export, enabling sub-20ms latency on CPU.
Data Engine: Includes a synthetic data generation pipeline (scripts/generate_synthetic.py) utilizing LLM distillation (GPT-4o) to bootstrap compliance supervision.
Privacy-First: No data leaves your infrastructure. Full compliance checks happen locally.

🛠️ Installation

From Source (Recommended)

git clone https://github.com/ZheWang-stack/FairProp-Inspector.git
cd FairProp-Inspector
pip install -e .

Direct from GitHub

pip install git+https://github.com/ZheWang-stack/FairProp-Inspector.git

Note

PyPI package coming soon! For now, please install from source.

📊 Performance Comparison

FairProp Inspector bridges the gap between simple regex rules and expensive cloud APIs:

Method	Latency	Accuracy	Privacy	Cost
Regex Rules	<1ms	~65%	✅ Local	Free
Cloud API (GPT-4)	800ms	~95%	❌ Cloud	$$$$
FairProp Inspector	~18ms	~94%	✅ Local	Free

Benchmarks run on Intel i7-12700K CPU with ONNX Runtime optimization.

⚡ Quick Start

Get started in 30 seconds:

from src.inference.predict import predict

# Detect FHA violations instantly
text = "No kids under 12 allowed"
label, confidence = predict(text, "artifacts/model")

print(f"{label}: {confidence:.1%}")
# Output: NON_COMPLIANT: 99.8%

Try it now:

python examples/quickstart.py

📚 Examples

Explore our ready-to-run examples:

Quick Start - 5 lines of code to get started
Edge Inference - Production-ready with error handling and batch processing
Batch Processing - Efficiently process multiple property listings
Jupyter Tutorial - Interactive notebook with visualizations and performance analysis

See examples/README.md for detailed usage instructions.

🏗️ Architecture

The Inspector Pipeline

Our pipeline moves away from "black box" APIs to measurable, controllable local inference.

Synthetic Distillation: We use gpt-4o to generate "Edge Case" violations (e.g., subtle steering like "Perfect for active adults").
Training: We fine-tune ModernBERT-base using bf16 precision and gradient checkpointing.
Inference: The model classifies text segments as COMPLIANT vs NON_COMPLIANT with probability calibration.

💻 Usage

1. Training (Fine-tuning)

Train the inspector on your proprietary or synthetic data.

# Uses Flash Attention & BF16 automatically if specific hardware is detected
python src/trainer/train.py --data data/processed/synthetic.json --epochs 5 --batch_size 16

2. Synthetic Data Generation

Bootstrap your dataset using our chain-of-thought distillation script.

export OPENAI_API_KEY="sk-..."
python src/generator/generate_data.py --count 1000 --output data/processed/synthetic_train.json

3. Inference

python src/inference/predict.py "No kids under 12 allowed in the specialized quiet zone."
# Output: [NON_COMPLIANT] 98.4% Confidence

📖 Documentation

Training Guide - Complete guide to training custom models with GPT-4 prompt templates
Benchmarks - Performance comparison and accuracy testing
Examples - Ready-to-run code samples
ROADMAP - Project development plan and quarterly goals
CHANGELOG - Version history and release notes

🤝 Contributing

We welcome contributions from the community! Please see:

CONTRIBUTING.md - Contribution guidelines and code standards
CODE_OF_CONDUCT.md - Community guidelines
Issue Templates - Bug reports and feature requests

📄 License

This project is licensed under the MIT License.

Built with ❤️ for Fair Housing Compliance
⭐ Star us on GitHub | 🐛 Report Bug | 💡 Request Feature

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github		.github
benchmarks		benchmarks
docs		docs
examples		examples
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
index.html		index.html
inspector.py		inspector.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
robots.txt		robots.txt
sitemap.xml		sitemap.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FairProp Inspector

🚀 Key Features

🛠️ Installation

From Source (Recommended)

Direct from GitHub

📊 Performance Comparison

⚡ Quick Start

📚 Examples

🏗️ Architecture

The Inspector Pipeline

💻 Usage

1. Training (Fine-tuning)

2. Synthetic Data Generation

3. Inference

📖 Documentation

🤝 Contributing

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

ZheWang-stack/FairProp-Inspector

Folders and files

Latest commit

History

Repository files navigation

FairProp Inspector

🚀 Key Features

🛠️ Installation

From Source (Recommended)

Direct from GitHub

📊 Performance Comparison

⚡ Quick Start

📚 Examples

🏗️ Architecture

The Inspector Pipeline

💻 Usage

1. Training (Fine-tuning)

2. Synthetic Data Generation

3. Inference

📖 Documentation

🤝 Contributing

📄 License

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages