Hacker News new | past | comments | ask | show | jobs | submit login

    -> Kafka-connect -> Snowflake -> SQL/sf-tasks -> Snowflake -> Looker
    -> Alooma        ->
    -> custom        -> 
Using Kafka-connect, we're able to serve up near real-time (2-5 mins) insights on device generated events.

We probably need to use some kind of ETL tool to replace custom SQL and sf-tasks. Unfortunately, we haven't been able to find a tool that handles this in a non-batch (even if it's micro-batching) form. Snowflake change-streams and tasks allows us to ETL in a streaming-like fashion.

We're ingesting everything from raw/transformed/aggregated events, micro-service DBs (as fast as they sprout up), netsuite/salesforce, mixpanel, MySQL, MongoDB... Billions of rows of data across multiple data-source accessible to internal and external customer in a matter of seconds. It's been an incredible challenge, especially with only a team of 2-5 people.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: