the_svd_doctor
on Aug 2, 2023
on:
VkFFT: Vulkan/CUDA/Hip/OpenCL/Level Zero/Metal Fas...
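FFT is memory bound: it does on the order of N log N flops over on the order of N bytes of data, so the arithmetic intensity is only about log N flops per byte. GPU HBM has much higher bandwidth than CPU DRAM, so an FFT is generally much faster on a GPU.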
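To put rough numbers on that, here is a minimal back-of-the-envelope sketch. It assumes the conventional 5 N log2 N flop estimate for a complex radix-2 FFT, single-precision complex data (8 bytes per element), and an idealized transform that reads and writes the array exactly once. The function names and the two bandwidth figures are made up for illustration; they are not measurements of any specific hardware.

```python
import math


def fft_arithmetic_intensity(n, bytes_per_element=8):
    """Flops per byte for a complex FFT of length n (idealized model).

    Assumes ~5 * n * log2(n) flops and one read plus one write of the
    whole array, which is a lower bound on real memory traffic.
    """
    flops = 5.0 * n * math.log2(n)
    bytes_moved = 2.0 * n * bytes_per_element
    return flops / bytes_moved


def bandwidth_bound_time(n, bandwidth_bytes_per_s, bytes_per_element=8):
    """Lower bound on runtime (seconds) if memory traffic is the only cost."""
    bytes_moved = 2.0 * n * bytes_per_element
    return bytes_moved / bandwidth_bytes_per_s


if __name__ == "__main__":
    n = 2**20
    # Hypothetical round-number bandwidths, for illustration only.
    dram_bw = 100e9   # assumed CPU DRAM bandwidth, bytes/s
    hbm_bw = 1000e9   # assumed GPU HBM bandwidth, bytes/s

    print("flops per byte:", fft_arithmetic_intensity(n))
    print("DRAM-bound time (s):", bandwidth_bound_time(n, dram_bw))
    print("HBM-bound time (s):", bandwidth_bound_time(n, hbm_bw))
```

Under this model the intensity grows only logarithmically with N, so the runtime bound scales almost directly with memory bandwidth rather than with peak flops.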