I wrote redb (https://github.com/cberner/redb) using mmap, initially. However, I later removed it and switched to read()/write() with my own user space cache. I'm sure it's not as good as the OS page cache, but the difference was only 1.2-1.5x performance on the benchmarks I cared about, and the cache is less than 500 lines of code.
Also, by removing mmap() I was able to remove all the unsafe code associated with it, so now redb is memory-safe.
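For anyone curious what a user-space cache at that scale looks like: here's a minimal sketch of an LRU page cache in safe Rust. This is hypothetical illustration, not redb's actual code; the type and method names (`PageCache`, `get`, `insert`) are mine, and a real implementation would back misses with `read()` and handle dirty pages.

```rust
use std::collections::{HashMap, VecDeque};

/// Minimal user-space page cache sketch (hypothetical, not redb's actual code).
struct PageCache {
    capacity: usize,
    pages: HashMap<u64, Vec<u8>>, // page number -> page bytes
    lru: VecDeque<u64>,           // front = least recently used
}

impl PageCache {
    fn new(capacity: usize) -> Self {
        PageCache { capacity, pages: HashMap::new(), lru: VecDeque::new() }
    }

    /// Look up a cached page; on a miss the caller would fall back to read().
    fn get(&mut self, page: u64) -> Option<&Vec<u8>> {
        if self.pages.contains_key(&page) {
            // Move the page to the back of the queue (most recently used).
            self.lru.retain(|&p| p != page);
            self.lru.push_back(page);
            self.pages.get(&page)
        } else {
            None
        }
    }

    /// Insert a page read from disk, evicting the LRU page if the cache is full.
    fn insert(&mut self, page: u64, data: Vec<u8>) {
        if self.pages.len() >= self.capacity {
            if let Some(victim) = self.lru.pop_front() {
                self.pages.remove(&victim);
            }
        }
        self.lru.retain(|&p| p != page);
        self.lru.push_back(page);
        self.pages.insert(page, data);
    }
}
```

The `retain` scan makes `get` O(n) in cache size, which is fine for a sketch; a production cache would use an intrusive list or a clock/second-chance scheme instead.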
500 lines of code is still 500 lines of added complexity. For LMDB that'd be a ~7% increase in LOC, and it would also introduce the need for manual cache configuration and tuning (further increasing complexity for the end user), plus a 50% performance loss? Doesn't seem like a good tradeoff to me.
Yep, we've been using GPUs for quite a while (even before the alpha support in Kube), both the K80s in Azure and some Pascals in our own clusters. With the support in Kube now it's quite seamless.
Late reply, but Kube meant Kubernetes not Kubeflow.
Alpha GPU support landed in 1.6 if my memory serves me right.
Before that you had to do a bunch of stuff manually to make GPUs work, mostly around scheduling.
Since 1.6, Kubernetes will automatically detect the GPUs on your node and thus correctly assign the workloads where they fit.
Kubeflow is an abstraction layer on top of that which helps a lot when you want to do things such as distributed TensorFlow training. It also helps with simpler jobs, e.g. by (almost) removing the need to manually mount the NVIDIA drivers from the host into the container.
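To make the scheduling part concrete, requesting a GPU in a pod spec looks roughly like this (hypothetical pod/container names; `nvidia.com/gpu` is the device-plugin resource name that later replaced the 1.6-era `alpha.kubernetes.io/nvidia-gpu` alpha resource):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: gpu-job            # hypothetical name
spec:
  containers:
  - name: trainer
    image: tensorflow/tensorflow:latest-gpu   # example image
    resources:
      limits:
        nvidia.com/gpu: 1  # scheduler places the pod on a node with a free GPU
```

With that limit set, the scheduler only considers nodes whose device plugin has advertised unallocated GPUs, which is the "correctly assign the workloads where they fit" part mentioned above.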
Ya, I'll add that to our list of feature requests. Would definitely like to put some more features in, to help people with blog posts or other research in the car space.
Ah yes, it would seem that green cars are going out of fashion, dunno who would want a purple car =P
Yep, totally agree with this. Last month we spent an hour or two going through the schema, removing any fields that didn't need to be stored, and making sure that only fields we actually query on had index=true. I didn't measure before-and-after results, but qualitatively it seemed faster afterwards.
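For reference, that kind of trimming looks something like this in a Solr-style schema.xml (assuming Solr here since the comment mentions stored/indexed flags; field names are made up):

```xml
<!-- Hypothetical fields: index only what you query on, store only what you return -->
<field name="make"       type="string" indexed="true"  stored="true"/>
<field name="price"      type="pint"   indexed="true"  stored="true"/>
<field name="raw_source" type="string" indexed="false" stored="false"/>
```

Dropping `indexed` on fields you never filter or sort by shrinks the index and speeds up writes, and dropping `stored` on fields you never return shrinks the stored document data.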