Hey everyone,
I wanted to share NexusOS v2.0, a self-hosted data factory project I've been working on to capture raw system telemetry and turn it into open-source training data. It runs entirely on local consumer hardware with a 100% zero-cost, zero-dependency stack.
The Architecture
The ingestion engine uses pure Python standard libraries to ingest complex server telemetry, maps out system crashes, and passes them through a local Ollama inference loop to fully scrub private data identifiers (PII). From there, it structures the raw logs into optimized binary Parquet formats.
Zero Dependencies: Orchestrated entirely via Python standard libraries—zero external third-party SDK or cloud API overhead.







