Data architecture and platforms
Valery Frolov
I am a data architect. I write about building and running data platforms, mostly from years of working on data infrastructure at Wix.
Featured writing
Notes from platform work.
A few essays on data architecture, platform operations, cost, ownership, and the parts of the system that are easy to ignore.
When Iceberg Says 321 GB and S3 Says 46 TB
Table metadata can tell one story while object storage tells another. Orphan files are a budget, reliability, and compliance risk.
Who Owns This Table?
Most data lake cost problems are ownership problems wearing a storage bill costume.
The Full Rewrite Anti-Pattern in Data Lakes
Full rewrites are often the most expensive way to make a small logical change.
Operating lens
How I look at data platforms.
Tables, storage, pipelines, catalogs, and cost reports all tell part of the story. Good platform work is connecting those views early enough to make better decisions.