Repls are container-based. We control how much resources each repl can use and so we can control how it much it costs. We do swap out idle repls and wake them up on the first request that comes in (heroku style).
How do you start up these containers so quickly anyways? Do you keep some running hot, or delegate the startup to something like amazon's ECS? I've long wanted to run integration tests en masse in containers, but never figured out how to solve issues around long startup times.