Hi HN,
I've been developing Portkey Gateway, an open-source AI gateway that's now processing billions of tokens daily across 200+ LLMs. Today, we're launching a significant update: integrated Guardrails at the gateway level.
Key technical features:
1. Guardrails as middleware: We've implemented a hooks architecture that allows guardrails to act as middleware in the request/response flow. This enables real-time LLM output evaluation and transformation.
2. Flexible orchestration: The gateway can now route requests based on guardrail verdicts. This allows for complex logic like fallbacks to different models or prompts based on output quality.
3. Plugin system: We've designed a modular plugin system that allows integration of various guardrail implementations (e.g., anthropic/constrained-llm, microsoft/guidance).
4. Stateless design: The guardrails implementation maintains the gateway's stateless nature, ensuring scalability and allowing for easy horizontal scaling.
5. Unified API: Despite the added complexity, we've maintained our unified API across different LLM providers, now extended to include guardrail configurations.
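To make the hooks-as-middleware and verdict-based routing ideas concrete, here is a minimal sketch in TypeScript. All names here (Verdict, Guardrail, routeWithGuardrails, the model IDs) are illustrative, not the actual Portkey Gateway API:

```typescript
// Illustrative sketch only -- types and names are hypothetical,
// not the real Portkey Gateway interfaces.
type Verdict = { pass: boolean; score: number };
type Guardrail = (text: string) => Promise<Verdict>;

interface RouteConfig {
  primaryModel: string;
  fallbackModel: string;
  guardrails: Guardrail[];
}

// Run each guardrail hook in order over the model output;
// route to the fallback model if any check fails.
async function routeWithGuardrails(
  output: string,
  config: RouteConfig
): Promise<string> {
  for (const check of config.guardrails) {
    const verdict = await check(output);
    if (!verdict.pass) {
      return config.fallbackModel; // e.g. retry the request elsewhere
    }
  }
  return config.primaryModel;
}
```

Because each guardrail is just an async function, deterministic checks (regex) and slower LLM-based evaluators plug into the same pipeline.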
Implementation details:
* The guardrails are implemented as async functions in the request pipeline.
* We use a combination of regex and LLM-based evaluation for output validation.
* The system supports both pre-processing (input modification) and post-processing (output filtering/transformation) guardrails.
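As a rough illustration of the pre-processing and post-processing split, here are two hypothetical guardrails in TypeScript: a deterministic input-modification step and a regex-based output check. The function names and patterns are made up for this example and are not Portkey's actual implementation:

```typescript
// Hypothetical examples only, not the actual Portkey guardrail code.

// Pre-processing guardrail: modify the input before it reaches the model,
// e.g. redact email addresses.
function redactEmails(input: string): string {
  return input.replace(/[\w.+-]+@[\w-]+\.[\w.]+/g, "[REDACTED_EMAIL]");
}

// Post-processing guardrail: filter the output, e.g. reject responses
// that appear to contain an API-key-like token.
async function checkNoSecrets(output: string): Promise<boolean> {
  return !/sk-[A-Za-z0-9]{20,}/.test(output);
}
```

An LLM-based evaluator would have the same async signature as `checkNoSecrets`, just with a model call inside instead of a regex test.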
Performance impact:
* Latency increase is minimal (<50ms) for most deterministic guardrails.
* We've implemented caching mechanisms to reduce repeated evaluations.
* Because the gateway runs on the edge, guardrail checks execute close to the user, avoiding extra round trips to a centralized server.
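A caching layer like the one mentioned above could be sketched as follows. This is a toy per-instance memo table, assumed for illustration; the gateway's real caching is more involved, and because it is only an optimization it does not break the stateless design:

```typescript
// Illustrative only: a tiny in-memory cache of guardrail verdicts,
// keyed by the evaluated text, so repeated inputs skip re-evaluation.
class VerdictCache {
  private store = new Map<string, boolean>();

  async evaluate(
    text: string,
    guardrail: (t: string) => Promise<boolean>
  ): Promise<boolean> {
    const cached = this.store.get(text);
    if (cached !== undefined) return cached; // cache hit: no re-evaluation
    const verdict = await guardrail(text);
    this.store.set(text, verdict);
    return verdict;
  }
}
```

For expensive LLM-based guardrails, a cache hit turns a model round trip into a map lookup, which is where most of the latency savings come from.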
Challenges we're still tackling:
* Balancing strict guardrails with maintaining model creativity
* Standardizing evaluation metrics across different types of guardrails
* Handling guardrail false positives/negatives effectively
We believe this approach of integrating guardrails at the gateway level provides a powerful tool for managing LLM behavior in production environments.
The code is open-source, and we welcome contributions and feedback. We're particularly interested in hearing about specific use cases or challenges you've faced in implementing reliable LLM systems.
Detailed documentation: https://portkey.wiki/guardrails
What are your thoughts on this approach? Are there specific guardrail implementations or orchestration patterns you'd like to see added?