Hacker News new | past | comments | ask | show | jobs | submit login

This is an interesting piece.

Its easy to see how for negative numbers the softmax operator could simply refrain from making a decision

e.g. ``` sum(softmax1[-100, -100, -100]) ~= 1e-43 ```

But is there any basis to assume commas and whitespaces will be negatively correlated with other tokens?




Consider applying for YC's W25 batch! Applications are open till Nov 12.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: