Hacker News new | past | comments | ask | show | jobs | submit login

Is it possible for example to send you monthly revenue numbers for my startup for the last two years (24 data points) and have yoghurt predict the next two years of monthly revenue?



If the model is autoregressive you can only forecast N steps ahead. Any further forecasting will be based on these generated near-future forecasts. In English, no. See https://www.youtube.com/watch?v=tJ-O3hk1vRw#t=01h16m


Thanks for posting this talk by Jeffrey Yau. I am 9 minutes into it and can't stop watching. He explains things very easily and clearly.


Its very simple to use Yoghurt, just upload the data and rest it does automatically. 24 data points is less to make any accurate prediction. You need more data points. However, Yoghurt currently supports 1 week prediction only and very soon we will be adding prediction upto 1 Month and plus.


Excuse my ignorance, but how does 1 week fit into the equation? Why does the time scale (x-axis) matter? I.E. if I pass 180 points of revenue (y-axis) does it matter if they were sampled each day or each hour in terms of forecasting?


They probably take into account day-specific trends , such as if the data shows sales are usually lower on a Monday than a Tuesday, they would take that into account in the forecast. This is as far as I understand.

So, assuming they are doing this, the time scale does matter. What I am trying to say is that these solutions (like prophet) are opinionated and that is why they can get accurate, as they are taking into account these time-scale specific trends.

But being opinionated means that they are assuming stuff about your data. For example saying that the number of sales you make in a day is a function of or correlated to the day of the week is probably a reasonable statement. However if you move away from sales and marketing, and try to forecast say the number of seismic events in a day, nature doesn't care if it's a Monday or Tuesday or holiday. So any such correlation the program is able to find out and use in forecasting would be incorrect. Like maybe there are more earthquakes on Monday than any other day in a particular dataset, but that would just be incidental and doesn't mean earthquakes are more likely to occur in future on Mondays. It's not a good example but there could be other such cases where such assumptions could be wrong.


Yes it would matter. Our algorithm(SandDune)is built around measuring data on a daily basis at this stage. It takes daily input data and predicts the next week's data on a daily basis. If you give it 180 daily data points, it will predict next 7 data points.


I see. Any plans to support monthly data? That is more useful for financial models.


We are working on it and will be live with it soon.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: