Parquet has been the lakehouse file format of choice for nearly half a decade. But we are starting to see other contenders that are optimized more for lower latency like lance https://github.com/lancedb/lance
Yes, I've been considering that since the beginning. Website (backend in C++, frontend in Svelte) takes priority, because this solution is good enough so far, and I'd really like to have access to my ZK on my phone. Probably not a website meant for the open internet: I have a server at home + use Wireguard VPN, so my phone can connect to local services/sites at all times.
I'm one of the contributors. We created a DSL in the language to describe the data and create tests. You can then use that data description to validate against json, csv, avro... One of the neat things we came up with was the concept of a data trace which is like a stack trace but is a path through the data to a particular error.