01 — Proprietary, security-focused enterprise client
Enterprise Multimodal AI Analysis Platform
A production system for ingesting, indexing, and querying large-scale unstructured data — audio, text, transcripts — via natural-language LLM queries over the corpus. Architected and built from the ground up, deployed on-premise for a security-focused enterprise client with strict data-sovereignty requirements.
Designed and implemented the full pipeline end to end: ingestion, transcription, embedding, vector indexing, query orchestration, and role-based access. Inference runs entirely on local hardware via Ollama; analytics and event aggregations are split across Apache Druid and ClickHouse, with a vector store for similarity search over the embedded corpus. The system is proprietary and not publicly demonstrable.
Stack Python Apache Druid ClickHouse Ollama Vector search