> a lot of it is not worth memorizing because it's not important.
Until it is.
If you're not familiar with how NULLs are compared, at least in mysql, it will bite you right in the rectum when you start setting up things like unique constraints on nullable columns.
> I hear this complaint often but in reality I have encountered fairly little content in my day to day that has felt fully AI generated?
How confident are you in this assessment?
> straight drivel
We're past the point where what AI generates is "straight drivel"; every minute, it's harder to distinguish AI output from actual output unless you're approaching expertise in the subject being written about.
> a team of copywriters who pumped out mountains the most inane keyword packed text designed for literally no one but Google to read.
And now a machine can generate the same amount of output in 30 seconds. Scale matters.
> every minute, it's harder to distinguish AI output from actual output unless you're approaching expertise in the subject being written about.
So, then what really is the problem with just including LLM-generated text in wordfreq?
If quirky word distributions will remain a "problem", then I'd bet that human distributions for those words will follow shortly after (people are very quick to change their speech based on their environment, it's why language can change so quickly).
Why not just own the fact that LLMs are going to be affecting our speech?
> So, then what really is the problem with just including LLM-generated text in wordfreq?
> Why not just own the fact that LLMs are going to be affecting our speech?
The problem is that we cannot tell what's a result of LLMs affecting our speech, and what's just the output of LLMs.
If LLMs result in a 10% increase of the word "gimple" online, which then results in a 1% increase of humans using the word "gimple" online, how do we measure that? Simply continuing to use the web to update wordfreq would show a 10% increase, which is incorrect.
When my employer tried to make us come back to the office, I flat out told them - school dropoff is at 8:30, the next train is at 8:50, so the earliest I can be in the office is 10, if every train arrival is precisely synchronized, which it never is.
And since pick-up is at 4:30, it means I have to leave the office at 3:00 at the absolute latest, lest I incur significant monetary penalties - as well as the ire of the people who care for my child - for late pick-up.
So sure, if you want me in the office for five hours a day - one of which is going to be taken up by going to lunch with my coworkers, since face-time is so important - I'll be there.
Unless there are significant delays, which there usually are, in which case I'll be there for maybe four hours a day, because I'll also have to leave early since delays in the morning frequently mean delays in the evening.
The people who care for your child should themselves tell their employer, in ire, that the earliest they can be at the office is 10, and that they have to leave by 3 at the absolute latest, since they have train commutes themselves.
reply