Hmm, one thing people forget when it comes to Pandas vs SQL is that for the kind of data you would use in pandas, being 1 day and sometimes even 1 week or more out of date doesn't make a big difference.
So say you are building some complicated data processing pipeline and you base it on a bunch of SQL queries. You are hitting the database every single time you run it.
Whereas if you just pull the tables and do most (or at least some) of the merging and processing in pandas, you can load the data in the middle of the night, and that is the only time you are hitting the db.
I have personally experienced big analytical SQL queries hitting the db at busy times...
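
To make that concrete, here is a rough sketch of the pattern, not a definitive implementation. The in-memory SQLite database, the `customers` and `orders` tables, and the helper names `snapshot_tables` and `build_report` are all made up for illustration; in real life the snapshot step would run from a nightly scheduled job against your actual database.

```python
import sqlite3

import pandas as pd


def snapshot_tables(conn, table_names):
    """Read each table once and return them as {name: DataFrame}."""
    return {
        name: pd.read_sql_query(f'SELECT * FROM "{name}"', conn)
        for name in table_names
    }


def build_report(tables):
    """Do the joining and aggregating in pandas, without touching the db."""
    merged = tables["orders"].merge(
        tables["customers"], on="customer_id", how="left"
    )
    return merged.groupby("region", as_index=False)["amount"].sum()


# Stand-in database (hypothetical schema and data, for illustration only).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (customer_id INTEGER, region TEXT);
    CREATE TABLE orders (order_id INTEGER, customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'north'), (2, 'south');
    INSERT INTO orders VALUES (1, 1, 10.0), (2, 1, 5.0), (3, 2, 7.5);
    """
)

# The only point where the database is hit: one plain SELECT per table.
tables = snapshot_tables(conn, ["customers", "orders"])
conn.close()

# Everything from here on runs on the in-memory snapshot, as often as you like.
report = build_report(tables)
print(report)
```

The db only sees cheap full-table reads at a quiet hour, and you can rerun or tweak `build_report` all day without sending another query.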