"- Workflows are systems where LLMs and tools are orchestrated through predefined code paths.
- Agents, on the other hand, are systems where LLMs dynamically direct their own processes and tool usage, maintaining control over how they accomplish tasks."
What Anthropic calls a "workflow" in the above definition is what most of the big enterprise software companies (Salesforce, ServiceNow, Workday, SAP, etc.) are building and calling AI Agents.
What Anthropic calls an "agent" in the above definition is what AI Researchers mean by the term. It's also something that mainly exists in their labs. Real world examples are fairly primitive right now, mainly stuff like Deep Research. That will change over time, but right now the hype far exceeds the reality.
I think Anthropic's definition of workflows doesn't match how the term is used in modern tooling. Temporal, for instance (disclaimer: my employer), allows completely dynamic logic in agentic workflows, letting the LLM choose what to do next. It can even be very dynamic (e.g., eval some code), though you may want it to operate on a limited set of "tools" you make available.
The problem with all of these AI-specific workflow engines is they are not durable, so they are process-local, suffer crashes, cannot resume, don't have good visibility or distribution, etc. They often only allow limited orchestration instead of full code freedom, support only one language, and so on.
>The problem with all of these AI-specific workflow engines is they are not durable, so they are process-local, suffer crashes, cannot resume, don't have good visibility or distribution, etc. They often only allow limited orchestration instead of full code freedom, support only one language, and so on.
My biased answer, because I work at Temporal[0], is to use an existing workflow solution that solves all of these problems instead of reaching for a solution that doesn't help with any of these but happens to be AI specific. Most agentic AI workflows are really just microservice orchestrations; the only "AI" involved is prompting an HTTP API that uses AI on its end. So use a good solution for "agentic X" whether that X is AI or any other orchestration need.
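To make that concrete, here's a rough sketch of what the durable version looks like with Temporal's Python SDK. The call_llm activity and the "FINISH:" convention are made-up placeholders for illustration, not anything Temporal- or AI-specific:

    from datetime import timedelta
    from temporalio import activity, workflow


    @activity.defn
    async def call_llm(prompt: str) -> str:
        # Placeholder: in practice this is just an HTTP call to a model provider.
        return "FINISH: " + prompt[:40]


    @workflow.defn
    class AgentWorkflow:
        @workflow.run
        async def run(self, task: str) -> str:
            history = [task]
            while True:
                # Each LLM call is a retried, durable activity; if the worker
                # crashes, the workflow resumes from history instead of restarting.
                decision = await workflow.execute_activity(
                    call_llm,
                    "\n".join(history),
                    start_to_close_timeout=timedelta(minutes=2),
                )
                if decision.startswith("FINISH:"):
                    return decision.removeprefix("FINISH:").strip()
                history.append(decision)

Registering the workflow and activity with a worker is the same as any other Temporal app; nothing in the loop itself is AI-specific.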
To me, a workflow is a predetermined set of steps that are followed based on fixed logic. Agents should have some agency to determine which steps in the workflow to perform next, without them being predetermined by fixed logic.
PocketFlow calls itself "agentic" due to its "agentic coding" paradigm (AI agents like Cursor building apps), but this is about development, not runtime behavior. At runtime, it's a workflow system. This stretches Anthropic's definition, where "agentic" implies dynamic LLM control during execution. I think this is where the misunderstanding stems from.
This was an LLM-generated response, which was pretty stupid in bringing up agentic coding. But it's still correct that PocketFlow does not align with Anthropic's definition of what an "Agent" is.
I follow Mr. Huang, read/watch his content, and also plan to use PocketFlow in some cases. That's a preamble, because I don't agree with this assessment: I think agents as nodes in a DAG workflow is _an_ implementation of an agentic system, but it is not the kind of system I most often interact with (e.g. Cursor, Claude + MCP).
Agentic systems can be simply the LLM + prompting + tools[1]. LLMs are more than capable (especially chain-of-thought models) of breaking down problems into steps, analyzing which tools to use, and then executing the steps in sequence. All of this is done with the model in the driver's seat.
I think the system described in the post needs a different name. It's a traditional workflow system with an agent operating on individual tasks. It's more rigid in that the workflow is set up ahead of time. Typical agentic systems are largely undefined or defined via prompting. For some use cases this rigidity is a feature.
> Agentic systems can be simply the LLM + prompting + tools[1]. LLMs are more than capable (especially chain-of-thought models) of breaking down problems into steps, analyzing which tools to use, and then executing the steps in sequence. All of this is done with the model in the driver's seat.
Sort of, kind of. It's still a directed graph. Dynamically generated graph, but still a graph. Your prompted LLM is the decision/dispatch block. When the model decides to call a tool, that's going from the decision node to another node. The tool usually isn't another LLM call, but nothing stops it from being one.
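A minimal plain-Python sketch of that framing (the call_llm stub and tool names are hypothetical): the prompted model is the dispatch node, and each tool call is an edge out to another node and back.

    def call_llm(context: str) -> dict:
        # Stand-in for a real model call that returns a tool-use style decision,
        # e.g. {"action": "search", "input": "..."} or {"action": "finish", "answer": "..."}.
        return {"action": "finish", "answer": f"(stub answer for: {context[:40]})"}

    TOOLS = {
        "search": lambda query: f"search results for {query!r}",
        "calculator": lambda expr: str(sum(map(float, expr.split("+")))),  # toy tool
    }

    def run_agent(task: str) -> str:
        context = task
        while True:
            decision = call_llm(context)               # the decision/dispatch node
            if decision["action"] == "finish":
                return decision["answer"]
            tool = TOOLS[decision["action"]]           # edge to a tool node...
            context += "\n" + tool(decision["input"])  # ...then back to the dispatcher

    print(run_agent("What is 2 + 2?"))

The graph isn't written down anywhere, but it's still there: decision node, tool nodes, and edges chosen at runtime.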
The "traditional workflow" exists because even with best prompting, LLMs don't always stick to the expected plan. It's gotten better than it used to, so people are more willing to put the model in the driving seat. A fixed "ahead of time" workflow is still important for businesses powering products with LLMs, as they put up a facade of simplicity in front of the LLM agentic graph, and strongly prefer for it to have bounded runtime and costs.
(The other thing is that, in general, it's trickier to reason about code flow generated at runtime.)
Kind of. This explanation feels pedantic—like calling my morning routine a dynamically generated graph (which it technically is). Others have pointed this out, but the industry seems split. Workflows like those described in the article resemble Airflow jobs, making them, well, workflows.
Corporate buzzwords have co-opted "Agent" to describe workflows with an LLM in the loop. While these can be represented as graphs, I'm not convinced "Agent" is the right term, even if they exhibit agentic behavior. The key distinction is that workflows define specific rules and processes, whereas a true agent wouldn’t rely on a predetermined graph—it would simply be given a task in natural language.
You're right that reasoning about runtime is difficult for true agents due to their non-deterministic nature, but different groups are chipping away at the problem.
In my opinion, the split is between the people who want their tools to be called Agents so they can make more on AI hype, and the people who know better than to call a simple pre-defined software workflow an “agent”. It is harder to get large investments for “my program just calls an LLM” these days.
I have to agree this is a bit too simple to be anything of substance. That is not what agentic really means. This is basically plugging ChatGPT into Zapier.
When you work with agentic LLMs you should worry about prompt chaining, parallel execution, decision points, loops, and more of these complex decisions.
People who didn't know what was in the first article shouldn't use PocketFlow and should go with n8n or even Zapier.
Let me clarify: this tutorial focuses on the technical internal implementation of the agent (e.g., OpenAI agent, Pydantic AI, etc.), rather than the UI/UX of the agent-based products that end users interact with.
The newest generation of agents[0] aren't implemented this way; the model itself is trained to make decisions and a plan of action rather than an explicitly programmed workflow tree.
No I'm referring to the newest generation of agentic models one of which I linked to. These are not fully released but it is where the newest generation of research is headed.
That's what I am talking about as well. The low-level implementation of an agent isn't necessarily a rigid graph, and I'd actually argue it's explicitly not this.
This link is also referring to the nodes as agents. So it's a system of agents interacting to produce an outcome. I'm not saying this system is bad, just that I think it deserves another name rather than calling the whole system an "Agent". It's many agents working in a coordinated fashion.
Hey folks! I just posted a quick tutorial explaining how LLM agents (like OpenAI Agents, Pydantic AI, Manus AI, AutoGPT or PerplexityAI) are basically small graphs with loops and branches. For example:
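Something like this minimal plain-Python sketch (not PocketFlow's actual API; the node names and shared state dict are made up for illustration):

    # Each node does some work and names the next node; "decide" can branch or loop.
    def decide(state):
        state["steps"] += 1
        return "answer" if state["steps"] >= 2 else "search"

    def search(state):
        state["notes"].append("looked something up")
        return "decide"                      # loop back to the decision node

    def answer(state):
        state["result"] = f"done after {state['steps']} steps"
        return None                          # terminal node

    NODES = {"decide": decide, "search": search, "answer": answer}

    def run(start="decide"):
        state, node = {"steps": 0, "notes": [], "result": None}, start
        while node:
            node = NODES[node](state)        # follow the edge the node chose
        return state["result"]

    print(run())  # -> "done after 2 steps"

The real frameworks dress this up with retries, shared stores, and async execution, but the skeleton is the same small graph with loops and branches.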
Minor comment: do you mean "LLM Agents Are Simply Graphs". Personally, I'd drop the adjective to "LLM Agents are Graphs" as I think it sounds better, but the plural is needed.
It would be interesting to dig deeper into the "thinking" part: how does an LLM know what it doesn't know / how to fight hallucinations in this context?
Thank you - really interesting looking read, thanks for crafting the deep explanation, with links to actual internal code examples. Also, thanks for not putting it behind the Medium paywall
It is hard to put a pin in this one because there are so many things wrong with this definition. There are agent frameworks that are not rebranded workflow tools too. I don't think this article helps explain anything except putting the intended audience in the same box of mind we've been stuck in since the invention of programming - i.e. it does not help.
Forget about boxes and deterministic control and start thinking of error tolerance and recovery. That is what agents are all about.
> There are agent frameworks that are not rebranded workflow tools too.
To me "workflow" is just what agent means: the rules under which an automated action occurs. Without some central concept "agent" just a magic wand that does stuff that may or may not be what you want it to do. If we can't use state machines at all I'm just going to go out and say LLMs are a dead end. State machines are the bread and butter of reliable software.
> Forget about boxes and deterministic control and start thinking of error tolerance and recovery.
First you'd have to define what an error even is. Then you're just writing deterministic software again (a workflow), just with less confidence. Nice for stuff with low risk and confidence to begin with (e.g. semantic analysis, where errors tend to wash out in aggregate), but not for stuff acting on my behalf.
LLMs are cool bits of software, but I can't say I see much use for "agents" whose behavior is not well-defined and whose non-determinism isn't formally bounded.
It’s getting pedantic, but the key idea is that Agents can solve problems traditional state machine-based workflows couldn't.
Your point is moot since many of these modern workflows already use LLMs as gating functions to determine the next steps.
It’s a different way of approaching problems, and while the future is uncertain, LLMs have moved beyond being just "cool software" to becoming genuinely useful in specific domains.
Hmm, maybe you are referring to something specific with "workflow". I'm envisioning a visual graph with a UI for each node and connection, or maybe a makefile on the other end of the spectrum. What are you envisioning?
Anyway, LLMs will remain at "cool software" like other niche-specific patterns until I see something general emerge. You'd have to pitch LLMs pretty savvily to show it as a clear value-add. Engineers are extremely expensive, so LLMs need to have a very low error rate to be integrated into the revenue path of a product to not incur higher costs or a lower-quality service. I still see text- and code-generation for immediate consumption by a human (or possibly classification to be reviewed by a human) as the only viable use cases today. It's just way too easy to manipulate them with standard English.
> Hmm, maybe you are referring to something specific with "workflow". I'm envisioning a visual graph with a UI for each node and connection, or maybe a makefile on the other end of the spectrum. What are you envisioning?
In job orchestration systems, workflows are structured sequences of tasks that define how data moves and transforms over time. Workflows are typically defined as Directed Acyclic Graphs (DAGs) but they don't have to be. I don't believe I am referring to anything more specific than how orchestration systems generally use them. LLM-based agents shift the focus from rigidly defined transitions to adaptable problem-solving mechanisms. They don’t replace state machines entirely but introduce a layer where strict determinism isn’t always necessary or even desirable.
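A hedged sketch of the fixed-DAG shape I mean (the task names and toy data dict are made up): every node and edge is known before anything runs, and execution is just topological order.

    from graphlib import TopologicalSorter

    # A fixed DAG: every task and every edge is declared before execution starts.
    TASKS = {
        "extract":   lambda data: data["raw"].strip(),
        "transform": lambda data: data["extract"].upper(),
        "load":      lambda data: f"stored: {data['transform']}",
    }
    EDGES = {"transform": {"extract"}, "load": {"transform"}}  # task -> dependencies

    def run_workflow(raw: str) -> str:
        data = {"raw": raw}
        for task in TopologicalSorter(EDGES).static_order():
            data[task] = TASKS[task](data)
        return data["load"]

    print(run_workflow("  hello  "))  # -> "stored: HELLO"

An LLM-directed agent effectively drops the fixed EDGES table and decides the transitions at runtime instead.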
> Anyway, LLMs will remain at "cool software" like other niche-specific patterns until I see something general emerge. You'd have to pitch LLMs pretty savvily to show it as a clear value-add. Engineers are extremely expensive, so LLMs need to have a very low error rate to be integrated into the revenue path of a product to not incur higher costs or a lower-quality service. I still see text- and code-generation for immediate consumption by a human (or possibly classification to be reviewed by a human) as the only viable use cases today. It's just way too easy to manipulate them with standard English.
I get the skepticism, especially about error rates and reliability. But the “cool software” label underestimates where this is heading. There’s already evidence of LLMs being useful beyond text/code-gen (e.g., structured reasoning in research, RAG-enhanced search, or dynamically adapting workflows based on complex input). The real shift isn’t just about automation but about adaptive automation, where LLMs reduce the need for brittle, predefined paths.
Of course, the general-use case is still evolving, and I agree that direct, high-stakes automation remains a challenge. But dismissing LLM-driven agents as just niche tools ignores their growing role in augmenting traditional software paradigms.
Why forget about boxes and deterministic control and start thinking of error tolerance and recovery?
I know, that LLMs are statistical models, but can you not use patterns to enforce a deterministic outcome? (Single responsibility for each agent, retrying llm calls, rephrasing prompts, etc?)
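For instance, something along these lines (a hedged sketch; call_llm and the schema are made up): validate the model's output against a fixed schema and retry with a rephrased prompt, so the surrounding code sees a deterministic contract even though individual calls aren't deterministic.

    import json

    def call_llm(prompt: str) -> str:
        # Placeholder for a real model call.
        return '{"sentiment": "positive"}'

    def classify_sentiment(text: str, max_retries: int = 3) -> str:
        prompt = (
            f"Classify the sentiment of {text!r}. "
            'Reply only with JSON like {"sentiment": "positive"} or {"sentiment": "negative"}.'
        )
        for _ in range(max_retries):
            raw = call_llm(prompt)
            try:
                out = json.loads(raw)
                if out.get("sentiment") in {"positive", "negative"}:
                    return out["sentiment"]  # conforms to the contract: deterministic downstream
            except json.JSONDecodeError:
                pass
            # Rephrase and retry instead of propagating a malformed answer.
            prompt += "\nYour last reply was invalid. Return only the JSON object."
        raise ValueError("model never produced a valid classification")

    print(classify_sentiment("I love this"))  # -> positive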
Hey, sorry for the confusion. This tutorial is focusing on the low-level internals of how agents are implemented—much like how intelligent large language models still boil down to matrix multiplications at their core.
> This tutorial is focusing on the low-level internals of how agents are implemented
We have very different definitions of what "low-level" means. Exact opposites in fact. "Low-level" means in the inner workings. Like a low-level language is assembly (some consider C low-level but this is debatable), whereas Python would be high-level.
I don't think this tutorial is "near the metal" of LLMs nor do I think it should be considering it is aimed at "Dummies". Low-level would really need to get into the inner workings of the processing, probing agents, and getting into the weeds.
> We have very different definitions of what "low-level" means.
Does it really matter if you can understand them? Waiting for strongly opinionated engineers to finish their pedantic spiels (...even when they're wrong or there is no obvious standard of correctness) when everyone already understands each other is one of the most miserable parts of being in this industry.
I—and I emphatically don't include the above poster in this view, as it takes continual & repeated behavior to accrue such a judgement—see this as a small tantrum, essentially, from people who never learned to regulate their emotions in professional spaces. I don't understand why this sort of bickering is considered acceptable behavior in the workplace or adjacent spaces. It's rude, arrogant, trivially avoidable with a slight change in tone and rhetoric, and it makes you look like an asshole unless you're 100% right and approach it with good humor.
Yes even if I can understand them it matters. We should correct ourselves and enable better communication moving forward. I would also say that it too is rude, arrogant, and makes you look like an asshole if you are using words incorrectly and then defending that usage. One must conclude that either you have too much ego to correct yourself or you are intentionally misleading people.
Why do you see this among engineers frequently? Well, because it's the job of an expert to be concerned with nuance and details. The low-level, in fact. This requires high precision in communication too. The back and forth you see as bickering also ends up getting those details communicated. The reason is that much of what's being intended is implicit. So the other approach is to use a lot of words. Unfortunately, when you do that you are often ignored.
I think "low-level" is relative to what's being discussed. Low-level for LLMs would have to do with how transformer layers are implemented (self-attention layer, layer norms, etc.) whereas low-level for agents would be the graph structure.
Although I personally don't think the graph implementation for agents is necessarily as established or widely standardized, it's helpful to know about why such an implementation was chosen and how it works.
> the inner workings of the processing, probing agents, and getting into the weeds
These feel to me like empty words... "inner workings of the processing"? You can say that about anything.
I'm not quite sure I agree, but I do get your point. Why I don't quite agree is that the agents are communicating and thus the "in the weeds" part is getting into how that communication is being processed. Which is what makes or breaks agents. How they interpret one another and respond. There needs to be some mech interp for me to really think of something as low-level. I'll put emphasis on the in the weeds part. Nuance and details are critical parts to a low-level conversation.
> You can say that about anything.
That is true. But it is also true that you can approach any topic from low-level or high-level. So I'm not sure I get your point here.
What I meant was, the phrase "inner workings of the processing" doesn't really mean anything at all. i.e. it doesn't convey any useful information about what you're trying to say.
> How they interpret one another and respond.
That sounds like it just falls back to "how LLMs work". It's the wrong level of abstraction in this case, because it's one level down from the topic being discussed here.
Certainly it means something. Alone it says little but in both previous comments there are other words to provide context and even explicitly communicate that I mean you need to be looking at the tokens and token passing. How the LLMs communicate. The low-level details in how that communication operates.
> because it's one level down
So we're in agreement?
Aren't we after the "low-level"? That's this whole conversation... yes, it is a level down, that's my whole point. Just as my original analogy with assembly being a level down from C. Working at the metal, as they say. In the weeds.
I honestly don't know how to respond because I'm saying "this is too high-level" and you're arguing "you're too low-level". I'm sorry, but when you do stuff at the low-level you in fact have to crouch down and put your face to the ground. The lower the better. You're trying to see something very small; we're not trying to observe mountains here.
Despite the memes, this reductionism is not exactly insightful. Why stop there? Matrix multiplication is just a bunch of dot products, which in turn are just cosines and magnitudes. What insights were generated from this?
The reductionism is insightful when it comes to providing an implementation with those specific details in mind.
In the case of LLMs knowing it does boil down to matrix multiplication is insightful and useful because now you know what kind of hardware is best suited to executing a model.
What is actually not insightful or useful is believing LLMs are AGI or conscious.
Belief is generally not insightful or useful by definition.
Then again, I don't think anyone who can follow this article believed that LLMs were conscious to begin with, so I'm not sure what your point is. You're preaching on behalf of a demographic that won't read this article to begin with, and presumably the people who are reading it can see how useless, distracting, and unproductive this reductionism is.
I believe this was precluded by the hedging of people who could follow the article. I have a difficult time imagining a person who can both understand how current LLMs work and still buy into Kurzweil.
Pursue the hypothesis? Sure. But belief is a different beast entirely. It's not even clear AGI is a meaningful concept yet, and I'd bet my life savings everyone reading this comment in 2025 will die before it's answered. Skepticism is the barometer.
I really agree with this. I think it has been bad for a lot of people's understanding when they have trivialized ML to "just matrix multiplications" (or GEMMs). This does not help differentiate AI/ML from... well... really any data-processing algorithm. Matrices are fairly general structures in mathematics and you can formulate almost anything as one. In fact, this is a very common way to parallelize or speed up programs (e.g. numpy vectorization).
We wouldn't call least squares, even a bunch of them, ML, nor would we call rasterization or ray tracing ML. Fundamentally all these things are "just GEMMs". It also does not make apparent any differentiation from important distinctions like linear networks, CNNs, or Transformers. It brushes off a key element, the activation function, which is necessary for neural nets to do non-linear transformations! And what about the residual units? These are one of the most important factors in enabling deep learning, and they're "just" addition. So do we say it's all just matrix addition, since we can convert multiplication to addition?
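To make that concrete, here is a toy numpy sketch of a transformer-style feed-forward block (shapes and init are arbitrary): the matmuls are there, but it's the non-linearity and the residual addition that keep it from collapsing into a single linear map.

    import numpy as np

    rng = np.random.default_rng(0)
    d_model, d_ff = 8, 32
    W1, W2 = rng.normal(size=(d_model, d_ff)), rng.normal(size=(d_ff, d_model))

    def ffn_block(x: np.ndarray) -> np.ndarray:
        hidden = np.maximum(x @ W1, 0.0)   # matmul + ReLU: the non-linearity matters
        return x + hidden @ W2             # residual connection: "just" addition, but crucial

    x = rng.normal(size=(4, d_model))      # a toy batch of 4 token vectors
    print(ffn_block(x).shape)              # (4, 8)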
There is such a thing as oversimplification and I worry that we have hyper-optimized (over-optimized) for this. So I agree, saying they just "boil down to matrix multiplications" is fundamentally misleading. It provides no insight and only serves to mislead people.
It’s kind of like the different levels of abstraction.
For example, for software projects, the algorithmic level is where most people focus because that’s typically where the biggest optimizations happen. But in some critical scenarios, you have to peel back those layers—down to how the hardware or compiler works—to make the best choices (like picking the right CPU/GPU).
Likewise, with agents, you can work with high-level abstractions for most applications. But if you need to optimize or compare different approaches (tool use vs. MCP vs. prompt-based, for instance), you have to dig deeper into how they’re actually implemented.
If you can reduce complex matrix multiplications into simpler terms, then you may be able to focus the training based on those constraints to increase performance/efficiency.
The agentic AI capabilities of chatbotkit.com have nothing to do with workflows.
The graph rendering is simply for illustrative purposes, mostly to cater to people who think in terms of graphs, but the underlying mechanics are not nodes and edges and a flow that goes from one to the next.
Everything that was previously just called automation or pipeline processing on top of an LLM is now the buzzword "agents". The hype bubble needs constant feeding to keep from imploding.
Anthropic[0] and Google[1] are both pushing for a clear definition of an “agent” vs. an “agentic workflow”
tl;dr from Anthropic:
> Workflows are systems where LLMs and tools are orchestrated through predefined code paths.
> Agents, on the other hand, are systems where LLMs dynamically direct their own processes and tool usage, maintaining control over how they accomplish tasks.
Most “agents” today fall into the workflow category.
The foundation model makers are pushing their new models to be better at the second, “pure” agent, approach.
In practice, I’m not sure how effective the “pure” approach will work for most LLM-assisted tasks.
I liken it to a fresh intern who shows up with amnesia every day.
Even if you tell them what they did yesterday, they’re still liable to take a different path for today’s work.
My hunch is that we’ll see an evolution of this terminology, and agents of the future will still have some “guiderails” (note: not necessarily _guard_rails), that makes their behavior more predictable over long horizons.
Let me clarify: we are discussing how the Agent is internally implemented, given LLM calls and tools. It can be built using a graph, where one node makes decisions that branch out to tools and can loop back.
The workflow can vary. For example, it can involve multiple LLM calls chained together without branching or looping. It can also be built using a graph.
I know the terms "graph" and "workflow" can be a bit confusing. It’s like we have a low-level 'cache' at the CPU level and then a high-level 'cache' in software.
Yes, the difference is that in the “pure” agent approach, the model is the only thing directing what to do.
In a sense there’s still a graph of execution, but the graph isn’t known until the “agent” runs and decides what tools to use, in what order, and for how long.
There is no scaffold, just LLM + MCP (or w/e) in a loop.
Great write up! In my opinion, your description likely accurately models what AI agents are doing. Perhaps the graph could be static or dynamic. Either way - it makes sense! Also, thank you for removing the hype!
I found it understandable and clear. PocketFlow looks cool, although the magic with the - and >> operators seems a bit obtuse... Also, I think "simply" is a trap - an agent might be modeled by a graph, but that graph can be arbitrarily complex.
https://www.anthropic.com/engineering/building-effective-age...
"- Workflows are systems where LLMs and tools are orchestrated through predefined code paths.
- Agents, on the other hand, are systems where LLMs dynamically direct their own processes and tool usage, maintaining control over how they accomplish tasks."
What Anthropic calls a "workflow" in the above definition is what most of the big enterprise software companies (Salesforce, ServiceNow, Workday, SAP, etc.) are building and calling AI Agents.
What Anthropic calls an "agent" in the above definition is what AI Researchers mean by the term. It's also something that mainly exists in their labs. Real world examples are fairly primitive right now, mainly stuff like Deep Research. That will change over time, but right now the hype far exceeds the reality.