More

drbojingle · 2026-01-07T15:06:51 1767798411

Prototyping.

drbojingle · 2026-01-05T06:02:57 1767592977

Location: St John's, NL, Canada Remote: yes Willing to relocate: No Tech: JS/TS, Node.js, React, Linux, Laravel/PHP, Django/Python, NVIM (yes, I be liking dem hotkeys), and plenty of AI tooling (Claude Code, Gemini, Cursor) Résumé/CV: https://drive.google.com/file/d/1ImCaLmoaCpjtPmKECjD3uR7j3J0... email: c.ryan@mun.ca

I have over a decade of experience in development and several years in leadership as well. I'm particularly fond of start-ups, and I like to build things in my spare time.

Recently, in my free time, I've been delving into vibe coding with Claude Code. A couple of my results thus far are here:

https://data-atlas.net - A simple JSON Schema-based form builder https://afteractions.net - A collaborative retrospective tool for agile teams

Among other AI projects I undertake.

drbojingle · 2025-12-30T22:54:51 1767135291

What's the difference between using llms now vs the first half of 2025 among the best users?

simonw · 2025-12-30T22:59:44 1767135584

Coding agents and much better models. Claude Code or Codex CLI plus Claude Opus 4.5 or GPT 5.2 Codex.

The latest models and harnesses can crunch on difficult problems for hours at a time and get to working solutions. Nothing could do that back in ~March.

I shared some examples in this comment: https://news.ycombinator.com/item?id=46436885

William_BB · 2025-12-30T23:27:56 1767137276

Ok I will bite.

Every single example you gave is in a hobby project territory. Relatively self-contained, maintainable by 3-4 devs max, within 1k-10k lines of code. I've been successfully using coding agents to create such projects for the past year and it's great, I love it.

However, lots of us here work on codebases that are 100x, 1000x the size of these projects you and Karpathy are talking about. Years of domain specific code. From personal experience, coding agents simply don't work at that scale the same way they do for hobby projects. Over the past year or two, I did not see any significant improvement from any of the newest models.

Building a slightly bigger hobby project is not even close to making these agents work at industrial scale.

rjzzleep · 2025-12-31T05:10:28 1767157828

I think that in general there is a big difference between javascript/typescript projects big or small and other projects that actually address a specific project domain. These two are not the same. The same claude code agent can create a lot of parts of a function web project, but will struggle providing anything functional but a base frame for you to build on if you were to create a new SoC support in some drone firmware.

The problem is that everyone working on those more serious projects knows that and treats LLMs accordingly, but the people that come from the web space come in with the expectation that they can replicate the success they have in their domain just as easily, when oftentimes you need to have some domain knowledge.

I think the difference simply comes down to the sheer volume of training material, i.e. web projects on github. Most "engineers" are actually just framework consumers and within those frameworks llms work great.

simonw · 2025-12-31T00:00:23 1767139223

Most of the stuff I'm talking about here came out in November. There hasn't been much time for professional teams to build new things with it yet, especially given the holidays!

qweiopqweiop · 2025-12-31T09:03:41 1767171821

For what it's worth, I'm working with it on a huge professional monorepo, and the difference was also stark.

reactordev · 2025-12-31T05:14:10 1767158050

For what it’s worth, I have Claude coding away at Unreal Engine codebase. That’s a pretty large c++ codebase and it’s having no trouble at all. Just a cool several million lines of C++ lovely.

drbojingle · 2025-12-31T03:30:27 1767151827

Everything is made of smaller parts. I'd like to think we can sub divide a code base into isolated modules at least.

tracker1 · 2025-12-31T18:52:35 1767207155

Depends on what kinds of problems you're solving...

I'd put it in line with monolith vs microservices... You're shifting complexity somewhere, if it's on orchestration or the codebase. In the end, the piper gets paid.

Also, not all problems can be broken down cleanly into smaller parts.

devin · 2025-12-31T04:32:09 1767155529

In the real world, not all problems decompose nicely. In fact, I think it may be the case that the problems we actually get paid to solve with code are often of this type.

drbojingle · 2026-01-01T15:38:10 1767281890

Problems like?

baq · 2025-12-31T00:30:10 1767141010

That’s right, but it also hints at a solution: split big code bases into parts that are roughly the size of a big hobby project. You’ll need to write some docs to be effective at it, which also helps agents. CICD means continuous integration continuous documentation now.

bccdee · 2025-12-31T01:49:50 1767145790

Splitting one big codebase into 100 microservices always seems tempting, except that big codebases already exist in modules and that doesn't stop one module's concerns from polluting the other modules' code. What you've got now is 100 different repositories that all depend on each other, get deployed separately, and can only be tested with some awful docker-compose setup. Frankly, given the impedance of hopping back and forth between repos separated by APIs, I'd expect an LLM to do far worse in a microservice ecosystem than in an equivalent monolith.

majormajor · 2025-12-31T01:05:35 1767143135

I wonder if anyone has tried this thing before, like... micro-projects or such... ;)

rjzzleep · 2025-12-31T05:12:22 1767157942

It's not the size that's the issue, it's the domain that is. It's tempting to say that adding drivers to Linux is hard because Linux is big, but that's not the issue.

oooyay · 2025-12-31T02:52:29 1767149549

I worked at Slack earlier this year. Slack adopted Cursor as an option in December of 2024 if memory serves correctly. I had just had a project cut due to a lot of unfortunate reasons so I was working on it with one other engineer. It was a rewrite of a massive and old Python code base that ran Slack's internal service catalog. The only reason I was able to finish rewrites of the backend, frontend, and build an SLO sub-system is because of coding agents. Up until December I'd been doing that entire rewrite through sixteen hour days and just pure sweat equity.

Again, that codebase is millions of lines of Python code and frankly the agents weren't as good then as they are now. I carefully used globbing rules in Cursor to navigate coding and testing standards. I had a rule that functioned as how people use agents.md now, which was put on every prompt. That honestly got me a lot more mileage than you'd think. A lot of the outcomes of these tools are how you use them and how good your developer experience is. If professional software engineers have to think about how to navigate and iterate on different parts of your code, then an LLM will find it doubly difficult.

epolanski · 2025-12-31T08:40:59 1767170459

Cool, but most developers do mundane stuff like glueing APIs and implementing business logic, which require oversight and review.

Those crunching hard problems will still review what's produced in search of issues.

generic92034 · 2025-12-31T10:16:14 1767176174

What is (in general) mundane about business logic? This can be highly complex, with deep process integration all over your modules.

epolanski · 2025-12-31T20:47:58 1767214078

Which is why it requires detailed oversight.

mkozlows · 2025-12-31T00:22:11 1767140531

I was going back and looking at timelines, and was shocked to realize that Claude Code and Cursor's default-to-agentic-mode changes both came out in late February. Essentially the entire history of "mainstream" agentic coding is ten months old.

(This helps me understand better the people who are confused/annoyed/dismissive about it, because I remember how dismissive people were about Node, about Docker, about Postgres, about Linux when those things were new too. So many arguments where people would passionately talk about all those things were irredeemably stupid and only suitable for toy/hobby projects.)

HarHarVeryFunny · 2025-12-31T03:54:45 1767153285

The entire history of RL-trained "reasoning models" from o1 to DeepSeek_R1 is basically just a year old!

drbojingle · 2025-12-31T03:52:25 1767153145

Are there techniques though? Tech pairing? Something we know now that we didn't then? Or just better models?

simonw · 2025-12-31T04:43:03 1767156183

Lots of technique stuff. A common observation among LLM nerds is that if the models stopped being improved and froze in time for a year we could still spend all twelve months discovering new capabilities and use-cases for the models we already have.

drbojingle · 2025-12-31T13:39:57 1767188397

Any specifics you'd recommend?

drbojingle · 2025-12-27T02:09:48 1766801388

A well designed feature IS considerate of time and attention. Why would I want a game on 20 fps when I could have it on 120? The smoothness of the experience increases my ability to use the experience optimally because I don't have to pay as much attention to it. I'd prefer if my interactions with machines were as smooth as my interactions driving a car down a empty dry highway mid day.

Prehaps not everyone cares but I've played enough Age of Empires 2 to know that there are plenty of people who have felt value gains coming from shaving seconds off this and that to get compound games over time. It's a concept plenty of folks will be familiar with.

DrewADesign · 2025-12-27T02:44:14 1766803454

Sure, but without unlimited resources, you need to have priorities, and everything has a ‘good enough’ state. All of this stuff lies on an Eisenhower chart and we tend to think our concerns fall into the important/urgent quadrant, but in the grand scheme of things, they almost never do.

jfengel · 2025-12-27T17:26:37 1766856397

Isn't there a limit to human perception, well below 120 fps?

Perhaps 120fps might result in a better approximation of motion blur.

8note · 2025-12-27T03:31:58 1766806318

i still prefer 15fps for games. if theyre putting the fps any higher, its not considerate of my time and attention

i have to pay less attention to a thing that updates less frequently. idle games are the best in that respect because you can check into the game on your own time rather than the game forcing you to pay attention on its time

drbojingle · 2025-12-14T21:32:50 1765747970

I recently built a simple JSON schema form builder for my own purposes. I'm going to expand on it with the ability to send forms via email, handle bigger and more complex forms and then tackle document parsing. https://data-atlas.net for anyone into that kind of thing.

drbojingle · 2025-11-27T16:57:18 1764262638

No, there needs to be control over the algorithms that get used. You ought to be able to tune it. There needs to be a Google fuu equivalent for social media. Or, instead of one platform one algorithm, let users define the algorithm to a certain degree, using llms to help with that and then you can allow others to access your algorithms too. Asking for someone Facebook to tweak the algorithm is not going to help imo.

rcxdude · 2025-11-27T17:11:09 1764263469

IMO there should not be an algorithm. You should just get what you have subscribed to, with whatever filters you have defined. There are better and worse algorithms but I think the meat of the rot is the expectation of an algorithm determining 90% of what you see.

drbojingle · 2025-11-27T20:55:14 1764276914

Dude that's an algorithm.

rcxdude · 2025-11-27T23:21:57 1764285717

Not in the sense that it's commonly used in this context. It's not a recommendation algorithm pulling from the whole platform based on what you're doing, it's far more controllable, deterministic process which only does what you explicitly request.

drbojingle · 2025-11-26T20:32:49 1764189169

Come with me And you'll see A world of OSHA violations

zzrrt · 2025-11-26T21:07:20 1764191240

TFW you realize real-life Tony Stark is actually Willy Wonka.

jzb · 2025-11-26T21:34:01 1764192841

Musk is Justin Hammer.

bdcravens · 2025-11-26T23:00:30 1764198030

I guess that the Tony Stark of this world is Wang Chuanfu (CEO of BYD)

drbojingle · 2025-11-22T15:22:51 1763824971

If we died off we wouldn't be here.

__MatrixMan__ · 2025-11-23T14:22:07 1763907727

Gene expression is more complicated than that.

drbojingle · 2025-11-09T18:04:05 1762711445

If it's not open source then don't make it public but if you want to lose your fear, just do it. It's not like it matters that much.

drbojingle · 2025-11-04T13:55:43 1762264543

If they can vibe code it they can vibe disassemble it and vibe small PR it.