I don't think the narrative makes sense. It was clear from way back in 2016 that training would take a ton of resources. Researchers had already been sucked into FAANG labs because those labs had the data, the compute, and the money. There was never a viable way for a true non-profit to build world-changing, deep learning-based AI models.
Seen in the rearview mirror, the whole narrative screams of self-importance and duplicity. GPT-2 was too dangerous, and only they were trustworthy enough to possess it. They were trustworthy because they were a non-profit, with "interests aligned with humanity". This charade continued until just a few months ago.
My bet: they use formal methods (like an interpreter running code to validate, or a proof checker) in a loop.
This would explain: a) their improvement being mostly on the "reasoning, math, code" categories and b) why they wouldn't want to show this (its not really a model, but an "agent").
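Pure speculation on my part, but the loop could look something like this sketch: a model proposes code, an interpreter runs it against checks, and failures are fed back for another attempt. All names here are invented, and the "model" is a hard-coded stand-in so the loop is runnable.

```python
def propose_solution(prompt, feedback=None):
    """Stand-in for a model call: returns a buggy candidate first,
    then a corrected one once it receives failure feedback."""
    if feedback is None:
        return "def add(a, b):\n    return a - b"  # buggy first attempt
    return "def add(a, b):\n    return a + b"      # corrected retry

def validate(code, tests):
    """Execute the candidate in a scratch namespace and run the checks.
    Returns None on success, or a description of the first failure."""
    ns = {}
    try:
        exec(code, ns)
        for expr, expected in tests:
            if eval(expr, ns) != expected:
                return f"{expr} != {expected}"
    except Exception as e:
        return repr(e)
    return None

def solve(prompt, tests, max_rounds=3):
    feedback = None
    for _ in range(max_rounds):
        code = propose_solution(prompt, feedback)
        feedback = validate(code, tests)
        if feedback is None:
            return code  # a candidate the checker accepted
    raise RuntimeError("no verified solution found")

tests = [("add(2, 3)", 5), ("add(-1, 1)", 0)]
verified = solve("write add(a, b)", tests)
```

The interesting property is that the validator, not the model, decides when to stop, which is why improvements would show up mostly in checkable domains like math and code.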
I think it could be some of both. With access to the chain of thought, you would be able to see what the agent is correcting or adjusting for, letting you compile a library of the vectors the agent is aware of, and of gaps that could be exploitable. Why expose the fact that you're working to correct for a certain political bias and not another?
Jokes aside, carpentry is an amazing complementary hobby for software related work. There's something about drawing on a piece of paper and using your hands to make it a reality that scratches a lot of itches.
Sadly, entrepreneurship doesn't leave much free time for carpentry.
Getting into manual machining is another good one. If you go into it with even a hint of a "move fast and break stuff" mentality you'll find yourself mostly just breaking a lot of stuff. It's good practice for being deliberate in your actions.
Yes, I originally thought of building AI workflows with an existing DAG package or creating my own.
But I think DAGs don't scale, or rather, as you point out, it quickly becomes easier to reason about code than about a graph.
I thought maybe a setup like PyTorch's would work: one that lets you write ordinary code and, once you run or compile it, builds a graph for you to see. But I remain unconvinced.
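To make the idea concrete, here's a toy sketch of "write normal code, get a graph": values carry a node id, and every operation records itself in a tracer while the function runs, loosely in the spirit of how PyTorch builds its autograd graph. All class and function names are made up for illustration.

```python
class Tracer:
    """Records each operation as a (node_id, op, input_ids) triple."""
    def __init__(self):
        self.nodes = []

    def node(self, op, inputs, value):
        nid = len(self.nodes)
        self.nodes.append((nid, op, inputs))
        return Var(self, nid, value)

class Var:
    """A value that knows its tracer, so arithmetic on it is traced."""
    def __init__(self, tracer, nid, value):
        self.tracer, self.nid, self.value = tracer, nid, value

    def __add__(self, other):
        return self.tracer.node("add", (self.nid, other.nid),
                                self.value + other.value)

    def __mul__(self, other):
        return self.tracer.node("mul", (self.nid, other.nid),
                                self.value * other.value)

def f(x, y):
    return x * y + x  # ordinary-looking code, no graph API in sight

t = Tracer()
x = t.node("input", (), 3)
y = t.node("input", (), 4)
out = f(x, y)
# out.value is 15; t.nodes now holds the traced graph:
# [(0, 'input', ()), (1, 'input', ()), (2, 'mul', (0, 1)), (3, 'add', (2, 0))]
```

You keep the ergonomics of plain code and still get a graph to inspect afterwards; the unconvincing part is everything this toy ignores, like control flow that differs between runs.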
I like the PyTorch comparison, and I've seen DSPy position itself as "PyTorch for prompting".
I also think the actor model is a natural fit for AI agents. It shares some similarities with Pregel (message passing) and has some differences (there are no super-steps of graph execution; each actor has its own thread).
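A minimal sketch of that fit, using only the standard library: each actor owns a mailbox and its own thread, and actors interact purely by message passing, with no shared state and no global super-step barrier. The actor names and handlers are invented; in a real system the handlers would wrap model calls.

```python
import queue
import threading

class Actor:
    """An actor: a mailbox plus a dedicated thread draining it."""
    def __init__(self, name, handler):
        self.name = name
        self.handler = handler
        self.mailbox = queue.Queue()
        self.thread = threading.Thread(target=self._run, daemon=True)
        self.thread.start()

    def send(self, msg):
        self.mailbox.put(msg)

    def _run(self):
        while True:
            msg = self.mailbox.get()
            if msg is None:  # poison pill shuts the actor down
                break
            self.handler(self, msg)

results = queue.Queue()

def writer_handler(actor, msg):
    # Final stage: publish a result.
    results.put(f"report: {msg}")

def researcher_handler(actor, msg):
    # Pretend this is an LLM call producing a finding,
    # then hand it off to the writer actor.
    writer.send(f"notes on {msg}")

writer = Actor("writer", writer_handler)
researcher = Actor("researcher", researcher_handler)

researcher.send("actor models")
report = results.get(timeout=2)  # "report: notes on actor models"
```

Because each actor runs independently, there is nothing like Pregel's synchronized rounds: the writer starts processing as soon as a message arrives, regardless of what any other actor is doing.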
I definitely dislike state machines for most use cases.[2] I think we can learn a lot about good AI agent paradigms from game programming, and I enjoyed this article on game state: https://gameprogrammingpatterns.com/state.html
At the end, they mention that game AI doesn’t often use state machines anymore, because the structure they impose is limiting.
Also, the folks behind Temporal are anti-state-machine:
[1]: our goal is to be an open-source alternative to the OpenAI Assistants API, not to compete with LangGraph, but there is overlap
[2]: I understand that LangGraph is not a state machine
Yea, I think it clicked for me when I saw the DSPy documentation: maybe that's a good balance between code-like and graph-like, though I still find DSPy overly obfuscated.
I just use magicplan (a freemium app) to create floor plans by scanning with my phone. It won't be as accurate as a proper 3D scanner, but prospective renters aren't going to notice. You can, of course, edit the floor plan afterwards.
We bring human-like decision making into game mechanics.
We do this via behavior systems that rely on a combination of generative AI and planning algorithms. The most obvious use case is AI NPCs, but game masters, storytellers, opponents, and a myriad of other game mechanics can be greatly enhanced by the same core technology.
We are looking for a game developer who has deep experience with game design and is a proficient programmer. Hence the title.
Your role would primarily be to build experimental games that use the tech.
To apply, just send me an email at ramon@clementine.games with the title "HN Hiring - Technical Designer".
About us:
Well, me, for now. I am Ramon, the founder, working from San Francisco. You can find more about me on my personal website (http://ramondario.com/). I will be joined by an AI engineer in September.
I got some funding from A16Z Speedrun and am assembling a team of AI/ML engs and game devs.
p.s. If you find this interesting, but are not a game developer, feel free to contact me regardless!