Hacker News new | past | comments | ask | show | jobs | submit | pklee's comments login

https://www.joelonsoftware.com/2001/07/21/good-software-take... - Reference. I read this for the first time. Thank you for sharing this.


This is pure gold !! Thank you so much eugene and gang for doing this. For those of them which I have encountered, I can 100 % agree with them. This is fantastic !! So many good insights.


Have you considered marker. Does a very good job of turning PDF into markdown. - https://github.com/VikParuchuri/marker


This is pretty cool. I tried with dolly and then I tried with T5-base, both of it did not give me result. It broke for me. Has anyone tried it ?


This is fantastic. This is what we do on our platform. + Copy and keep what worked and what did not so you can fine tune :)


If this is using OpenAI which it seems is what it is using, It is only sending column headers / column names. Not the data. If you are concerned about column names, you could also mask it on the way out and back in. If you are looking for an end to end database connect and query, please reach out to me.


GPT3 model generates a SQL. You can sqldf on top of your data.table. We will be demo'ing at one of the events shortly. BTW, you could do somewhat similar with other LLMs such as GPTJ and GPT NEOX if you have worked with them


is GPTJ/NEOX good enough to generate code? tried it with SQL and it was really disappointing


They are decently good, I could not find major differences for the cases I was trying. The key is to control the temperature. Make sure it is low, otherwise the randomness increases tremendously. Infact you can feed the same input from openAI into NEOX and it generates results. There are many NEOX open playgrounds that allow you the control the temperature etc.


Cool! Will check it out thanks.


Very well said !! data is code


2 chapters in. Very good and makes you think. Truly impressive way of teaching how data is pivotal. The introduction reminds me of Peter Norvig's talk. I also like the point below of data is code, it is very profound.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: