Labs

We build before we advise

Active projects where we test architecture decisions, benchmark real hardware, and publish what we learn. These inform our solutions and our product thinking.

Active, started Feb 2026

Governed AI Routing Pipeline

How do you route enterprise prompts to the right model without sending PII to the cloud? We built a rules-first router with statistical PII detection and a small language model (SLM) fallback. It covers three domains (healthcare, finance, telecom) and 60 synthetic prompts, with a full audit trail.

PII recall: 98.2%
Routing accuracy (hybrid): 95%
Prompts needing SLM: 5%
PII recall in SLM-only mode: 0%
Presidio, spaCy NER, Qwen 2.5 1.5B, Ollama, CPU-only, GDPR

This work is part of our exploration while building neurelay.ai, an AI governance platform for enterprise tool access control.
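
To make the rules-first idea concrete, here is a minimal sketch of how such a router could be structured. It is an illustration under stated assumptions, not the pipeline's actual code: the regex patterns stand in for a real detector such as Presidio or spaCy NER, and the names `PII_PATTERNS`, `DOMAIN_RULES`, `Decision`, `route`, and `slm_classify` are all hypothetical. PII detection stays in the deterministic layer; the SLM is consulted only when no domain rule matches.

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Optional

# Hypothetical regexes standing in for a real PII detector (assumption).
PII_PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "US_SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-. ]\d{3}[-. ]\d{4}\b"),
}

# Hypothetical keyword rules per domain (assumption).
DOMAIN_RULES = {
    "healthcare": ("patient", "diagnosis", "prescription"),
    "finance": ("invoice", "loan", "portfolio"),
    "telecom": ("roaming", "sim card", "data plan"),
}


@dataclass
class Decision:
    target: str            # "local" if PII was found, otherwise "cloud"
    domain: str
    decided_by: str        # "rules" or "slm"
    pii_types: List[str]


def detect_pii(prompt: str) -> List[str]:
    """Return the sorted names of all PII patterns found in the prompt."""
    return sorted(name for name, pat in PII_PATTERNS.items() if pat.search(prompt))


def match_domain(prompt: str) -> Optional[str]:
    """Return a domain only when exactly one rule set matches."""
    text = prompt.lower()
    hits = [d for d, kws in DOMAIN_RULES.items() if any(k in text for k in kws)]
    return hits[0] if len(hits) == 1 else None


def route(prompt: str, slm_classify: Callable[[str], str], audit_log: list) -> Decision:
    """Rules first; fall back to the SLM only for ambiguous domains."""
    pii = detect_pii(prompt)
    domain = match_domain(prompt)
    decided_by = "rules"
    if domain is None:
        domain = slm_classify(prompt)  # stub for a local SLM call
        decided_by = "slm"
    decision = Decision("local" if pii else "cloud", domain, decided_by, pii)
    audit_log.append(decision)  # every decision is recorded
    return decision
```

Calling `route(prompt, slm_classify=my_local_model, audit_log=log)` returns a `Decision` and appends it to `log`. In this sketch the SLM never decides whether a prompt contains PII, which matches the 0% PII recall measured in SLM-only mode.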

Active, started Mar 2026

Edge AI on Constrained Hardware

Real-time voice AI running fully offline on commodity hardware. We built the streaming speech-to-text infrastructure missing from the local AI ecosystem — sub-second transcription over WebSocket, with voice activity detection, interruption handling, and multi-turn conversation. No cloud APIs, no GPU, no network required.

Best end-to-end latency: 951 ms
Faster than batch: 40-50%
Cloud dependencies: 0
Hardware platforms: 3

Tested on Apple Silicon (M1 Max), Intel x86 (i7 20-core, CPU-only), and NVIDIA Jetson (8GB edge). The pipeline delivers natural conversation with interruption handling across all platforms.

Edge inference, Air-gapped, Voice + SLM, Privacy-first, Streaming STT, WebRTC
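
As an illustration of the voice activity detection step, the sketch below splits a stream of audio frames into utterances using a simple energy threshold. This is an assumption for illustration only: the write-up does not say which VAD the pipeline uses, and `rms`, `segment_utterances`, `threshold`, and `hangover` are hypothetical names and values.

```python
import math
from typing import Iterable, Iterator, List, Sequence


def rms(frame: Sequence[int]) -> float:
    """Root-mean-square energy of one frame of PCM samples."""
    return math.sqrt(sum(s * s for s in frame) / len(frame)) if frame else 0.0


def segment_utterances(
    frames: Iterable[Sequence[int]],
    threshold: float = 500.0,  # hypothetical energy threshold
    hangover: int = 3,         # silent frames that end an utterance
) -> Iterator[List[Sequence[int]]]:
    """Yield each utterance as soon as `hangover` silent frames follow speech."""
    current: List[Sequence[int]] = []
    silent_run = 0
    for frame in frames:
        if rms(frame) >= threshold:
            current.append(frame)
            silent_run = 0
        elif current:
            # Keep short pauses inside the utterance until the hangover expires.
            current.append(frame)
            silent_run += 1
            if silent_run >= hangover:
                yield current[: len(current) - silent_run]  # drop trailing silence
                current, silent_run = [], 0
    if current:
        yield current[: len(current) - silent_run]
```

Because the generator yields each utterance as soon as the pause is detected, a downstream transcriber could start work while the speaker is still mid-conversation rather than waiting for a complete recording.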
In progress

Private Document Intelligence

On-premise document retrieval combining OCR, vector search, and structured retrieval pipelines. A local SLM handles intent detection and classification, so no data leaves the perimeter. We are testing how far retrieval quality can be pushed without cloud AI.

Hybrid retrieval, Local SLM, OCR + vector search, Privacy-first
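
One common way to combine results from a vector search and a structured retrieval pipeline is reciprocal rank fusion (RRF). The sketch below is an assumption for illustration: the project has not published how it merges results, and the function name and document IDs are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def reciprocal_rank_fusion(rankings: Sequence[Sequence[str]], k: int = 60) -> List[str]:
    """Merge ranked lists of document IDs; each list adds 1 / (k + rank) per document."""
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


vector_hits = ["a", "b", "c"]      # hypothetical vector search ranking
structured_hits = ["b", "d", "a"]  # hypothetical structured retrieval ranking
fused = reciprocal_rank_fusion([vector_hits, structured_hits])
```

RRF needs only ranks, not comparable scores, so it can merge retrievers whose scoring scales differ.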

Write-up coming soon.

Working on a similar problem?

We build these so we understand the problems deeply. Let's discuss yours.

Start a conversation