I have a startup idea which is heavily dependent on sophisticated machine learning models. I’ve come across API vendors that offer the ML models I need as a service and perform the function I need perfectly. However, they are incredibly expensive, so expensive that my product is no longer feasible from a cost perspective, a user would never buy it at this price.
This can be circumvented by building out the ML models myself, however, the money required to train the models is huge (multiple millions). What should I be doing in this situation?
Here are my options as I see them, if you feel there’s another route please don’t hesitate to add:
- Leverage the expensive APIs until I raise enough money to train my own models.
- Start with a subpar model that is trained on a limited dataset (this runs the risk of damaging the perception of how good the product can be).
- Create a landing page, gather interest, raise money and then train my own model.
From this experience, I’m starting to think once a competitor has an ML model trained and developed to catch up would be insanely difficult. How do other startups that are dependent on ML get started given the costs are so high to train models to be effective?
If you do potentially have a viable idea and it turns out there is >0 people out there willing to pay for it, then you can experiment with cheaper models and optimizations as necessary. ('People want our product we sell at a loss so much we may go bankrupt' is a good problem to have.) Also, consider that as time passes, your problem may be solved for you: lots of people moan and whine about the OA API and wrung their hands about how no one would ever be able to afford to train their own GPT-3 - but here we are, just over 2 years later, and you have a wealth of alternatives in API or FLOSS model, like Jurassic or GPT-J/Neo-20b or YALM or OPT or BLOOM or... Even if none of those work for you or can be finetuned or something, it is also now easier than ever to train your own: countless bugs have been worked out, better training recipes documented, newer better GPUs come out (A100s are no longer rare, and H100s are coming soon), and older GPUs themselves are enjoying a pricing correction.