If you want to poke around and have fun, Bela [1] is a pretty good way to get started. It comes with a prebuilt framework: you just push a piece of C(++) that runs in a Xenomai real-time audio thread, with predigested signals from all the peripherals.
[1] https://bela.io/
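
To give a feel for what that "piece of C(++)" looks like, here is a rough, self-contained sketch of a block-based audio callback. It is not Bela's actual API: FakeContext, SineState and the field names are made-up stand-ins so the thing compiles on a desktop. On the board itself, as far as I recall, you fill in setup(), render() and cleanup() in a render.cpp and the framework hands you a context with the buffers already filled.

```cpp
// Self-contained sketch of a per-block audio callback.
// FakeContext, SineState and their fields are hypothetical stand-ins,
// NOT Bela's real types; they only imitate the general shape.
#include <cmath>
#include <cstddef>
#include <vector>

struct FakeContext {
    std::size_t frames;          // samples per block
    float sampleRate;            // Hz
    std::vector<float> analogIn; // one normalised 0..1 control value per frame
    std::vector<float> audioOut; // mono output buffer, one sample per frame
};

struct SineState {
    float phase = 0.0f; // radians, carried over between blocks
};

// Called once per block by the (here simulated) real-time thread.
// Only arithmetic on preallocated buffers: no allocation, locks or I/O.
void render(FakeContext& ctx, SineState& st) {
    const float twoPi = 6.28318530718f;
    for (std::size_t n = 0; n < ctx.frames; ++n) {
        // Map the 0..1 control input to an oscillator frequency of 110..880 Hz.
        float freq = 110.0f + ctx.analogIn[n] * 770.0f;
        st.phase += twoPi * freq / ctx.sampleRate;
        if (st.phase >= twoPi) st.phase -= twoPi;
        ctx.audioOut[n] = 0.5f * std::sin(st.phase);
    }
}
```

To try it off-board, fill a FakeContext with, say, 16 frames and a constant analogIn of 0.5, call render() once, and look at audioOut. The point is mostly the shape: the peripherals arrive as plain floats in a buffer, and your code is just the loop body.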