Did GitHub Copilot increase my productivity?

marcus_holmes · 2024-05-13T04:43:08 1715575388

Years ago, over a decade ago now, I was a .Net developer. Microsoft introduced Entity Framework, their new way of handling data in .Net applications. Promises made, promises believed, we all used it. I was especially glad of Lazy Loading, where I didn't have to load data from the database into my memory structures; the system would do that automatically. I could write my code as if all my memory structures were populated and not worry about it. Except, it didn't work consistently. Every now and again a memory structure would not be populated, for no apparent reason. Digging deep into technet, I found a small note saying "if this happens, then you can check whether the data has been loaded by checking the value of this flag and manually loading it if necessary" [0]. So, in other words, I have to manually load all my data because I can't trust EF to do it for me. [1]

Long analogy short, this is where I think AI for coding is now. It gets things wrong enough that I have to manually check everything it does and correct it, to the point where I might as well just do it myself in the first place. This might not always be the case, but that's where I feel it is right now.

[0] Entity Framework has moved on a lot since then, and apparently now can be trusted to lazily load data. I don't know because...

[1] I spat the dummy, replaced Windows with Linux, and started learning Go. Which does exactly what it says it does, with no magic. Exactly what I needed, and I still love Go for this.

nercury · 2024-05-13T07:20:42 1715584842

Pardon me for the tangent (just a general comment not directed to OP).

What I have learned over the years is that the only way to properly use ORM is as a fancy query tool. Build the query, fetch/update data, MOVE THE DATA to separate business objects. Don't leave ORM entities shared across the sea of objects!

Phew, thanks, I got that off my chest.

arrowsmith · 2024-05-13T08:34:27 1715589267

I wouldn't have believed you until I moved from ActiveRecord (Rails's ORM) to Ecto (Elixir/Phoenix's data mapping library which is decidedly not an ORM.) It's a million times better and I'm never going back.

atonse · 2024-05-14T02:50:30 1715655030

Ecto is hands down my favorite part of the elixir ecosystem.

It’s so elegant and the Lego blocks (query, schema, change set, repo) can be mixed and matched in different ways.

I’ve even used schemas and change sets to validate API requests and trivially provide very nice, clean, and specific errors, while getting perfectly typed structs when things are validated.

cultofmetatron · 2024-05-13T12:24:25 1715603065

same, I wish more libraries would go the ecto design route. my ecto queries map pretty close to 1:1 with the sql counterpart. no guessing what the output is going to look like. I spend my time debugging the query and not trying to get the orm to output he query I want.

freedomben · 2024-05-13T16:50:54 1715619054

Yes, same experience here. I felt (and still feel) that ActiveRecord is one of if not the best ORMs out there, but it was always a source of debugging and performance optimizations/pain, and the trick was basically taking hand-written SQL and trying to get ActiveRecord to generate that. After briefly moving to node.js full time I actually got very anti-ORM, although query building libraries all sucked too which left me unsure of what the best approach is.

Then I learned Ecto/Phoenix and this is truly the best way. Ecto is so close and translateable to raw SQL that there's little to no friction added by it, but it handles all the stuff you don't want to have to do by hand (like query building, parameterization, etc). Ecto is a real breath of fresh air and I find myself writing quick scripts that hit a database in Elixir just so I can use Ecto! I also love how easy Ecto makes it to model database tables that were originally defined by another language/framework or even by hand. Trying to do that with ActiveRecord or another ORM is usually a recipe for extreme pain, but with Ecto it's so easy.

arrowsmith · 2024-05-13T14:01:19 1715608879

Yeah, I hear some people say that they find Ecto.Query confusing, and I think it's because they never learned SQL properly. That's understandable because it's possible to use something like ActiveRecord for years without ever learning to write even a simple SQL query. But if you have a good grasp of SQL then Ecto.Query is trivial to learn - it's basically just SQL in Elixir syntax.

cultofmetatron · 2024-05-14T08:00:13 1715673613

> it's basically just SQL in Elixir syntax.

its sql in elixir syntax with a bunch of QOL improvements.

for one thing, I can seperate my subqueries into separate variables

``` sub_q = from(l in Like) |> where([l], l.user_id == ^user_id) |> select([l], %{ user_id: l.user_id, likes_count: count(l.id) }) |> group_by([l], l.user_id)

main_query = from(u in User) |> join(:left, [u], likes_count in ^subquery(sub_q), on: likes_count.user_id == u.id, as: :likes_count) |> select([u, likes_count: l], %{ name: u.name, likes: l.likes_count, }) |> where([u], u.id == ^user_id)

user = main_query |> Repo.one()

```

Being able to think directly in sql lets you perform optimal queries once you understand sql. and imho, this much cleaner than tha equivalen sql to write. it also takes care of input sanatization and bindings.

noduerme · 2024-05-13T07:41:32 1715586092

adding an off the shelf ORM layer creates so much more opacity and tech debt than writing queries I don't understand why anyone would willingly put one into their stack. Sure, they're neat although I don't even know if they save time. There's something very satisfying about well-crafted queries. And is it ever really well crafted if you can't tweak them to improve their their execution plan? I've never had a client or boss who asked to use an ORM framework. I suspect it's something people think looks cool - treating SQL as OOP - until they run into a problem it can't solve.

[edit] for instance, I have a case where I use a few dozen custom queries on timers to trawl through massive live data and reduce it into a separate analytics DB. Using everything from window functions to cron timers to janky PHP code that just slams results from separate DBs together to provide relatively accurate real-time results. At the end from that drastically reduced set in the analytics DB... sure, I'm happy to let the client summarize whatever they want with Metabase. But those things just couldn't be done with an ORM, and why would I want to?

nercury · 2024-05-13T08:16:06 1715588166

Yes, I would not put it just anywhere. But I have few rules about ORMs:

- Proper DB design first. You should be able to remove the ORM and DB should still function as intended. This means application-side cascade operations or application-side inheritance is banned.

- No entities with magical collections pointing to each other. In other words, no n to n relations handled by ORM layer. Create in-between table, for gods sake. Otherwise it becomes incredibly confusing and barely maintainable.

- Prefer fetching data in a way that does not populate collections. In other words, fetch the most fine-grained entity and join related data. Best if you craft special record entities to fetch data into (easy with EF or Doctrine).

- Most ORMs allow you to inspect what kind of queries you create. Use it as query building tool. Inspect queries often, don't do insane join chains and other silly stuff.

I would use ORM in one kind of app: where I would work with data that shares records that might need to be either inserted or updated, and there is several nesting levels of this kind of fun. You know, you need to either insert or update entity, if it exists, you should update, and then assign related entities to it, if it does not, then you should insert, and assign related entities to the newly created id. The ORM can easily deal with that, and on top of that it can do efficient batched queries, which would be really annoying and error-prone to hand-craft.

If the app does not require this kind of database with these kind of relations, I would not use ORM.

thaumasiotes · 2024-05-13T13:14:47 1715606087

> No entities with magical collections pointing to each other. In other words, no n to n relations handled by ORM layer. Create in-between table, for gods sake. Otherwise it becomes incredibly confusing and barely maintainable.

So, I have a database that looks like this. My method was to lay out the database myself, by hand, and then use EF's facility to generate EF code from an existing database. The bridge table was recognized as being nothing but the embodiment of a many-to-many relation and the EF code autogenerated the collections you don't like.

Is this a problem? If you do things the other way around, the ORM creates the same table and it's still there in the database. It isn't possible not to create the bridge table. Why is that case different?

nercury · 2024-05-14T07:01:26 1715670086

This is more of a preference for bridge to be visible in application. Also the bridge may seem simple at first, but it also may gain associated data, like created_at, order, etc.

LaGrange · 2024-05-13T11:13:34 1715598814

> adding an off the shelf ORM layer creates so much more opacity and tech debt than writing queries I don't understand why anyone would willingly put one into their stack.

Simple: because if I don't, I'm going to spend the rest of my career explaining why I didn't to people extremely skeptical of that decision. Meanwhile even people like me tend to just shrug and quietly go "oh, an ORM? Well, that's the price of doing the job."

Also, ORMs are an endless source of well-paid jobs for people who actually learned relational algebra at some point in their lives, and that's not a compliment to ORMs.

lozenge · 2024-05-13T08:52:27 1715590347

ORM is not for writing analytics queries. It's for your CRUD operations. Something like Django Admin would be impossible without an ORM. You create tables for your business logic and customer support or whoever can just browse and populate them.

couchand · 2024-05-13T11:28:12 1715599692

Wouldn't standard ANSI SQL's information_schema be sufficient to build such an interface? I'm struggling to see how an ORM is necessary.

DangitBobby · 2024-05-13T14:58:40 1715612320

I consider an ORM to be any SQL generating API, without which it would indeed be impossible to have a generic Admin class to make Admin views in Django.

randomdata · 2024-05-13T15:33:36 1715614416

Funny how ORM no longer means Object-Relational Mapping.

DangitBobby · 2024-05-13T15:47:14 1715615234

What should I call a program that generates SQL, executes it, and stores the result in a tuple, object, or whatever data structure in the programming language that I'm using? Does it magically stop being an ORM the second I use a tuple instead of a class instance, or is it now an ORM plus another nameless type of program? Are tuples also objects?

randomdata · 2024-05-13T15:59:37 1715615977

Whatever you want. It's your life.

Traditionally, though, SQL generation was known as query building. The query was executed via database engine or database driver, depending on the particulars. ORM, as the name originally implied, was the step that converted the relations into structured objects (and vice versa). So, yes, technically if you maintain your data as tuples end to end you are not utilizing ORM. Lastly, there once was what was known as the active record pattern that tried to combine all of these distinct features into some kind of unified feature set.

But we're in a new age. Tradition has gone out the window. Computing terms have no consistency to speak of, and not just when it comes to databases. Indeed, most people will call any kind of database-related code ORM these days. It's just funny that ORM no longer means object-relational mapping.

marcus_holmes · 2024-05-14T06:22:10 1715667730

I think the core thing that ORMs do is create a 1:1 mapping between the data structures in the database (that are, or should be, optimised for storage) and the data structures in the application (that are, or should be, optimised for the application business logic).

ORMs create this false equivalence (and in this sense, so does Django's admin interface despite using tuples instead of classes). I can see the sense of this, vaguely, for an admin interface, but it's still a false equivalence.

freedomben · 2024-05-13T16:56:20 1715619380

I agree with you, but I do think there's a little fuzziness between full-blown ORM and a tuple-populating query builder in some cases. For example Ecto, which can have understanding of the table schema and populate a struct with the data. It's just a struct though, not an object. There's no functions or methods on it, it's basically just a tuple with a little more organization.

randomdata · 2024-05-13T20:12:31 1715631151

> It's just a struct though, not an object. There's no functions or methods on it

Object-relational mapping was originally coined in the Smalltalk world, so objects were in front of mind, but it was really about type conversion. I am not sure that functions or methods are significant. It may be reasonable to say that a struct is an object, for all intents and purposes.

A pendant might say that what flimsy definition Kay did give for object-oriented programming was just a laundry list of Smalltalk features, meaning that Smalltalk is (probably) the only object-oriented language out there, and therefore ORM can only exist within the Smalltalk ecosystem. But I'm not sure tradition ever latched onto that, perhaps in large part because Kay didn't do a good job of articulating himself.

freedomben · 2024-05-14T20:43:33 1715719413

Thanks for the thoughts, that's a good point. It certainly makes sense that the "object" merely needs typed properties to qualify.

DanielHB · 2024-05-14T07:48:58 1715672938

Most queries are pretty trivial, ORMs are great for 90% of queries. As long as you don't try to bend the ORM query system to do very complicated queries it is fine. Most (all?) ORMs allow raw queries as well so you can mix both.

On top of that most ORMs have migrations, connection management, transaction management, schema management and type-generation built-in.

Some ORMs have inherently bad design choices though, like lazy loading or implicit transaction sharing between different parts of the code. Most modern ORMs don't really have that stuff anymore.

kaba0 · 2024-05-13T08:20:06 1715588406

How do you map rows to objects? How do you insert into/update rows in your databases? These are the basic problems ORMs solve splendidly. They are for OLTP workloads, and have deliberate escape hatches to SQL (or some abstraction over it, like JPQL in java-land).

I just fail to see what else would you do, besides implementing a bug-ridden, half-ORM yourself.

blksv · 2024-05-13T09:12:46 1715591566

Rows are tuples, not objects, and treated as such throughout the code. Only the needed data is selected in the form most appropriate to the task at hand, constructed in a hand-written sql query, maybe even taylored to the DB/task specifics. Inserts/updates are also specific to the task, appropriately grouped, and also performed using plain sql. Data pipelines are directly visible in the code, all DB accesses are explicit.

couchand · 2024-05-13T11:33:09 1715599989

This. The right way to structure database access is a result type per tuple, not an object type per table.

kaba0 · 2024-05-13T12:21:29 1715602889

ORMs don’t mandate mapping the whole table either, you are free to create multiple entities per table/view.

DangitBobby · 2024-05-13T15:03:00 1715612580

Maybe we need to use a different acronym than ORM, because to me the thing we can all agree we need is code that emits SQL. If you can't agree that projects need generated SQL because SQL is dog water for composition, then we can't really agree on anything.

blksv · 2024-05-13T18:57:55 1715626675

Probably so: I can't agree with that particular inference.

1. Very often we need generated SQL because writing SQL for primitive CRUD operations is hell tedious and error-prone (as well as writing UI forms connected to these CRUD endpoints, so I prefer to generate them too).

2. Structured Query Language being very poorly structured is indeed a huge resource drain when developing and maintaining complex queries. PRQL and the like try to address this, but that's an entirely different level of abstraction.

3. Unfortunately, when efficiency matters we have to resort to writing hand-optimized SQL. And this usually happens exactly when we terribly need a well-composing query language.

PeterisP · 2024-05-13T18:26:18 1715624778

I'd argue that "code that emits SQL" is never an inherent need but a possible development time-saver - we need code that emits SQL in those cases (and only those cases) where it saves a meaningful amount of development time compared to just writing the SQL.

sgarland · 2024-05-13T11:18:09 1715599089

Every RDBMS has multiple connector libraries that solve this for you, without requiring the overhead of a full ORM.

DangitBobby · 2024-05-13T15:04:35 1715612675

If the connector library solves this problem then the connector library is an ORM.

sumtechguy · 2024-05-13T12:12:58 1715602378

That is exactly where ORMs help. The problem is all of the other stuff with it. When most people just need a simple mapper. Not something to build their SQL statements for them (which seems to be why most people pick it).

But that comes to the second problem. Most devs I meet seem to be deathly allergic to SQL. :)

One project I had a dev come to me asking me to look at a bug in the thing. Having never seen that particular ORM before I was able to diagnose what was wrong. Because MS ORMs have the same issues over and over (going back to the 90s). You better read those docs! Because whatever they did in this stack will be in their next one when they abandon it in place 3 years from now.

to11mtm · 2024-05-13T22:12:02 1715638322

> These are the basic problems ORMs solve splendidly.

Depends on the ORM.

I have noticed that typically, 'unit of work' type ORMs (EFCore and Hibernate/NHibernate as examples) prevent being 'true to the ORM' but 'efficient'.

i.e. Hibernate and EFCore (pre 7 or 8.0ish) cannot do a 'single pass update'. You have to first pull the entities in, and it does a per-entity-id update statement.

> I just fail to see what else would you do, besides implementing a bug-ridden, half-ORM yourself.

Eh, you can do 'basic' active-record style builders on top of dapper as an afternoon kata, if you keep feature set simple, shouldn't have bugs.

That said, I prefer micro-ORMs that at most provide a DSL for the SQL layer. less surprises and more concise code.

bad_username · 2024-05-13T07:50:01 1715586601

For me the biggest reason is automated database initialization and migration. After defining or updating the ORM model, I don't have to worry about manually CREATing and ALTERing tables as the model evolves.

This is compatible with the OC suggestion of using ORMs as a "fancy query builder" and nothing more, which I strongly support.

devjab · 2024-05-13T09:31:39 1715592699

You always have to worry about your model changes if you run at any sort of scale. Some ORMs will get it right most of the time, but the few times they don’t will really bite you in the ass down the line. Especially with the more “magical” ORMs like EF where you might not necessarily know how it build your tables unless you specifically designed them yourself.

This is where migrations also become sort of annoying. Because if you use them. Then it is harder to fix the mistakes since you can’t just change your DB without using the ORM or you’ll typically break your migration stream or at least run into a lot of troubles with it.

And what is the plus side of having a code-first DB really? You can fairly easily store those “alter table” changes as you go along and have full availability of history in a very readable way that anyone, including people not using C#, Java, Python.

Which is the other issue with ORMs. If you have multiple consumers of your data. Then an ORM most likely won’t consider that as it alters your “models”.

For a lot of projects this is a non-issue, especially at first. Then 10 years down the line, it becomes a full blown nightmare and you eventually stop using the ORM. After spending a lot of resources cleaning up your technical debt.

DangitBobby · 2024-05-13T15:08:53 1715612933

> And what is the plus side of having a code-first DB really? You can fairly easily store those “alter table” changes as you go along and have full availability of history in a very readable way that anyone, including people not using C#, Java, Python.

The benefits should be obvious if you've used ORMs. They are an object that represents your database data in code rather than in a table where you can't touch it. If you have code that brings data from a database into code, congratulations, you've implemented part of an ORM. Having the data model defined "in code" treats the code as first-class instead of the SQL, which makes sense from an ergonomics perspective, you will spend much more time with the code objects than you will the SQL schemas. Either way, you will have two versions: a SQL version and a code version. You might as well get both from writing one.

If you can read alter table in SQL, you can probably read migrations.AddField in Python, and whatever the equivalent is in the other languages. I still am waiting with bated breath for the problems with much maligned (by some) ORMs to arrive.

devjab · 2024-05-13T16:18:49 1715617129

The only area of development where ORMs haven’t been the cause for at least some trouble in my career has been with relatively small and completely decoupled services. Even here I’ve had to replace countless ORMs with more efficient approaches as the service eventually needed to be build with C/C++. That being said, I don’t think any of these should have been build without the ORM. The rewrite would have been almost as much of a hassle if there hadn’t been an ORM after all.

I’m not really against ORMs as such. I’m not a fan of code-first databases for anything serious, but as far as CRUD operations goes I don’t see why you wouldn’t use an ORM until it fails you, which it won’t in most cases, and in those cases where it does… well similar to what I said earlier you just wouldn’t have build it to scale from the beginning anyway, and if you had and it turned out it didn’t need to scale then you probably wasted a lot of developer resources to do so.

noduerme · 2024-05-13T08:07:50 1715587670

I'm not sure if you're talking about creating and altering model tables or if you mean ORMs provide safety in case underlying tables are modified. I'd argue that well-built queries should be resistant to alteration of the underlying tables, and that views and functions and stored procedures already exist to both red flag breaking changes and also to combine and reduce whatever you need without relying on third party code in another language layer to do the lifting.

blksv · 2024-05-13T08:57:05 1715590625

Doesn't it also mean that any non-trivial migration (e.g. which requires data transformation or which needs to be structured to minimize locking) has to be defined elsewhere, thus leaving you with two different sources for migrations, plus some (ad-hoc) means to coordinate the two?

(I would say that it is conceptually perverse for a client of a system to have authority over it. Specifically, for a database client to define its schema.)

marcus_holmes · 2024-05-13T07:55:42 1715586942

Agree completely, as does most of the Go community :) Newbie gophers are regularly told to learn some SQL and stop trying to rebuild ActiveRecord in Go ;)

But in .Net, EF is still the most common way of accessing data (I have heard, because I stopped using it over a decade ago).

lomase · 2024-05-13T10:33:01 1715596381

EF is the common way of saving data.

rafaelmn · 2024-05-13T07:25:57 1715585157

That doesn't really help you with EF because there's plenty of stuff shared at context level. So depending on the order of queries in the context the same query can return different data.

I hate EF and everything it stands for. :)

nercury · 2024-05-13T07:35:29 1715585729

Yeah, in a web app, one context per request. In desktop app... I have never used EF there.

specialist · 2024-05-13T19:07:03 1715627223

> use ORM is as a fancy query tool

As an alternative to query-by-example (QBE)?

https://en.wikipedia.org/wiki/Query_by_Example

LandR · 2024-05-13T08:57:59 1715590679

Well, this month we had to debug an issue where EF was NOT populating fields on classes from the db, that it definitely should have been!

So it still seems flakey. I've never worked a single job that chose EF that didn't end up regretting it. Either from it being unreliable, migration hell or awful performance.

"It allows you to treat your database like an in-memory enumerable"

Then devs go and do exactly that and wonder why performance is so terrible...

I hate EF.

neonsunset · 2024-05-13T09:08:22 1715591302

Which version?

LandR · 2024-05-13T09:22:02 1715592122

We are currently on the latest.

We had an issue last week where we had an obect like

    public class Foo
    {
        public List<Bar> Bars { get; set; }
    }

We'd query for some Foos, like:

    await _dbContext.Foos.ToListAsync();

and some amount of them would have Bars be an empty list where it should definitely be populated from the db. And it wasn't even consistent, sometimes it would populate thousands, sometimes it would populate a handful and then just stop populating Bars.

No errors, no exceptions, just empty lists where we'd expect data.

And so often we have to debug and see what SQL its actually generating, then spend time trying to get it to generate reasonable sql, when if we were using sprocs we could just write the damn sql quicker.

Another issue we have is the _EFMIgratoinsHistory table.

Sometimes we will deploy and get a load of migration errors, as it tries to run migrations its already ran... SO they all fail and then the API doesn't come back up... The fix ? TUrn it off and on again and it stops trying to re-run migrations its already ran!

aayjaychan · 2024-05-13T09:45:26 1715593526

navigation properties are not loaded automatically, because they can be expensive. you need to use `.Include(foo => foo.Bars)` to tell EF to retrieve them.

EF tries to be smart and will fix up the property in memory if the referenced entities are returned in separate queries. but if those queries don't return all records in `Foo.Bar`, `Foo.Bar` will only be partially populated.

this can be confusing and is one of the reasons i almost never use navigation properties when working with EF.

LandR · 2024-05-13T09:52:33 1715593953

We have those, and when I say inconsistent I mean inconsistent on the same query / exact same line of code on the same database.

e.g. stick a breakpoint, step over, see in the debugger that it was not populating everything it should. Then run it again, do the same and see different results. Exact same code, exact same db, different results.

5000 results back from the db, anything between 5000 and a handful were only fully correctly populated.

viraptor · 2024-05-13T11:03:21 1715598201

If that happens with the correct `.Include()`, you really should raise an issue with EF, trying to reproduce it. If it's not a random mistake in your code, that's a really big deal.

thaumasiotes · 2024-05-13T10:08:31 1715594911

Like your parent said, the same line of code will or won't populate the navigation property depending on whether EF is already tracking the entity that belongs there (generally because some other earlier query loaded it). You get different behavior depending on the state of the system; you can't look at "one line of code" in isolation unless that line of code includes every necessary step to protect itself against context sensitivity.

neonsunset · 2024-05-13T09:35:45 1715592945

EF Core 8? Inconsistent behavior is not expected.

Assuming you haven't missed to add an .Include[0], please consider submitting an issue(s) to https://github.com/dotnet/efcore

[0] https://learn.microsoft.com/en-us/ef/core/querying/related-d...

tremon · 2024-05-13T09:55:39 1715594139

Are you saying that in previous versions inconsistent behaviour is the expected outcome?

moron4hire · 2024-05-13T12:28:23 1715603303

I've been using Entity Framework for the last 5 years and have not encountered this issue, as long as I've got all my Includes specified correctly.

There is also the AutoIncludeAttribute that you can specify on entity fields directly to always include those fields for every query.

My main complaints with EF are that the scaffolding and migration commands for the CLI tool are nearly impossible to debug if they error during the run.

But when they run right, they save me a ton of time in managing schema changes. Honestly, I consider that part worth all the rest.

There can also be some difficulty getting queries right when there are cyclical references. Filtering "parent" entities based on "child" ones in a single query can also be difficult, and also can't be composed with Expression callbacks.

But in any difficult case, I can always fall back on ADO.NET with a manual query (there are also ways of injecting manual query bits into EF queries). Which is what we'd be doing without EF, so I don't get the complaints about EF "getting in the way".

rspeele · 2024-05-13T13:56:34 1715608594

Lazy loading was a mistake in EF. A lot of apps had awful performance due to lazy loading properties in a foreach loop creating N+1 queries to the database. It would be fine in dev with 50-100 rows and a localhost SQL and blow up in prod with 1000s of rows and a separate Azure SQL.

Also if you relied on lazy loading properties after the DbContext had been disposed (after the using() block) you were out of luck.

With old EF we would turn off lazy loading to make sure devs always got an exception if they hadn’t used .Include() to bring the related entities back in their initial query. Querying the database should always be explicit not lurking behind property getters.

Fortunately with EF core MS realized this and it’s off by default. EF with wise use of .Include and no lazy loading is a pretty good ORM!

tracker1 · 2024-05-13T19:28:03 1715628483

I switched to Dapper a long time ago, with explicit SQL queries and really haven't looked back.

Quothling · 2024-05-13T09:21:17 1715592077

> [0] Entity Framework has moved on a lot since then, and apparently now can be trusted to lazily load data

To some degree. If you're using it for anything serious you're still going to help it along a lot. It's rather easy to do so, however, and I certainly wouldn't consider writing your own code as fast or easy as simply telling EF how you want it to do certain things.

I'm not an overall fan of EF. I especially dislike how it's model builder does not share interoperability with other .Net libraries which also use it. I also don't really like the magic .Net does behind the scenes. EF as a whole has been one of the better ORMs for any language since .net core. I'd still personally much prefer something like Rust's diesel, but whenever I have to work with C# I tend to also use EF.

to11mtm · 2024-05-13T22:16:34 1715638594

You might want to try Linq2Db, it is much closer to Diesel in how it works (More SQL DSL with parameter+reader mapper, less Unit-of-work ORM).

FWIW, it can actually work 'on top' of an Existing EF Core context+mappings (i.e. if you have an existing project and want to migrate, or just need the better feature-set for a specific case.) or you can get pretty close to 'yolo by convention' depending on case. In general though it's a lot less ceremony to start messing around.

Foobar8568 · 2024-05-13T09:55:58 1715594158

Diesel is not an ORM and a typical rust library...

kevthecoder · 2024-05-13T12:36:47 1715603807

The strapline seems to suggest it is an ORM (I've not used Diesel yet):

>Diesel: A safe, extensible ORM and Query Builder for Rust

https://github.com/diesel-rs/diesel

BaculumMeumEst · 2024-05-13T11:14:40 1715598880

Even more important than the question of productivity is that this turns a joyous activity into a depressing probabilistic shitshow where you describe what you're trying to do and hope for the best. Instead of feeling engaged and challenged, you're just annoyed and frustrated. No thanks!

dmix · 2024-05-13T13:33:41 1715607221

> this is where I think AI for coding is now. It gets things wrong enough that I have to manually check everything

This might be dependant on the programming language, some languages are way more popular and have way more questions on StackOverflow and Reddit and repos on github, so the answers will be better.

When I use copilot for JS it's right 90% of the time.

And where it's 'wrong' it's usually just stuff it skipped over because it didnt have proper context.

nurumaik · 2024-05-13T11:03:02 1715598182

>It gets things wrong enough that I have to manually check everything it does and correct it

Thing is, reading code is way faster than writing code

deely3 · 2024-05-13T21:02:26 1715634146

Huh, for me its always vice versa.

When I'm not sure in the someones code I have to double or triple check it be sure that I understand it correctly and to verify that there no somehow hidden missed steps or side effects.

naasking · 2024-05-13T12:04:07 1715601847

> Long analogy short, this is where I think AI for coding is now. It gets things wrong enough that I have to manually check everything it does and correct it, to the point where I might as well just do it myself in the first place.

Even if that were true, reading code is reasonably faster than typing it out and then reading it again to check it.

datavirtue · 2024-05-13T15:43:20 1715615000

I'm left wondering what AI everyone are using. I can prompt copilot and it gives me exactly what I need. Sure, if I barf out a lazy, half-baked prompt it yields a waste of time.

My problem is running into it's limitations, mostly around resources. I have tried giving it larger tasks and it takes bloody forever.

"Given this unstructured data, create CSV output for all platforms, with each line containing the manual, and model, ignoring the text in parenthesis."

Works great except for God-awful performance and stopping half way through. I had to break out each section and paste it into the prompt and let it work on small pieces. We need to get to the next level with this, especially for paying customers.

More concerning is that I see a clear pattern in smaller companies of hiring seniors and turning them loose with AI assistants instead of hiring junior devs. The prospect is attractive to nearly every stakeholder and the propensity to put off hiring "until next quarter" in light of this is a constant siren song. There is a lot of gravity pulling in this direction with the short-term thinking and distractions that are thoroughly soaked into the business world these days. Supposedly, one third of Gen Z (20-25 yrs old) are sitting at home, up from 22% in 1990.

I'm one of those seniors happily putting off hiring, but I find the situation and it's wider impact on the future very unnerving.

thegrim33 · 2024-05-13T23:10:55 1715641855

Well, having AI transform some data into a certain CSV format is orders of magnitude simpler and more straightforward of a programming task than what I try to use it for.

A lot of the discrepancy between people's experiences is simply due to the fact there's there's a massive range of programming complexity/difficulty that people can be trying to apply AI to. If your programming is mostly lower complexity stuff, non-critical stuff, or simply defined stuff, it obviously works better.

I try to use AI when I get stuck on a hard problem/algorithm, hoping that it can provide an answer/solution to unblock me. But when I'm stuck the problem I'm facing is so complicated that there's no chance at all that AI is actually going to be able to help me with it. I see absolutely no point in using AI when I already know how to solve a problem, I just solve it. I only turn to it when I need help, and it can never help me.

kamaal · 2024-05-13T09:00:11 1715590811

>>It gets things wrong enough that I have to manually check everything it does and correct it, to the point where I might as well just do it myself in the first place.

I have had personal experience with this. And seen others telling me as well. These AI things often suggest wrong code, or even with bugs. If you begin your work by assuming AI is suggesting the correct code you can go hours, to even days debugging things in the wrong place. Secondly when you do arrive at a place where you find the bug in the AI generated code, it can't seem to fix or even modify it, because it misses context in which the which itself generated at the first place. Thirdly the AI itself can interpret your questions in a way you didn't mean.

As of now AI generated code is not for serious work of any kind.

My guess a whole new paradigm of programming is needed, where you will more or less talk to AI in a programming language itself, some what like lisp. I mean a proper programming language at a very abstract level, which can be interpreted in only one possible meaning, and hence not subject to interpretation.

cess11 · 2024-05-13T10:03:54 1715594634

"My guess a whole new paradigm of programming is needed, where you will more or less talk to AI in a programming language itself, some what like lisp. I mean a proper programming language at a very abstract level, which can be interpreted in only one possible meaning, and hence not subject to interpretation."

Code generation is quite old though, and also quite common, also outside the Lisp-family. When doing non-trivial systems development in Java you tend to use it a lot, especially with XML as an intermediary, abstracted language.

DanielHB · 2024-05-13T14:41:11 1715611271

> I was especially glad of Lazy Loading, where I didn't have to load data from the database into my memory structures; the system would do that automatically.

oh god, I have used Java with Hibernate a lot and once I read "Lazy Loading" I didn't even need to finish reading the post.

foxyv · 2024-05-13T15:12:51 1715613171

I've always found that it's easier to code something from scratch, than to review and fix someone else's code, and that's been my experience with Copilot up to this point. I'm not sure if it's better than just writing code from scratch productivity wise, but it makes coding kind of unpleasant for myself.

One thing I've found about Copilot is that it introduces me to novel ways to solve problems and more obscure language features. It makes me a better coder because I'm constantly learning. But do I want to be spending my time learning or do I want to make that deadline that's coming up?

whywhywhywhy · 2024-05-13T11:41:40 1715600500

I feel pre November “dev day” 90% of the time I could trust GPT4 output to just work but post downgrades the increased amount of times I’ve copy and pasted then seen the error and realized there’s unfinished placeholder stuff, straight up parts not done or previous code removed that was important.

Just means I now spend a lot of time rewriting it which I could have just done in the first place but now I’ve wasted time asking GPT too.

smokel · 2024-05-13T08:41:11 1715589671

A key difference between database mapping and interactive AI tools is the position of the user.

I would not be enthusiastic about a system where I receive database query results for review, before delivering them to an end user somewhere on this planet. However, I am more than happy to get some extra help in communicating code from my brain to a compiler.

specialist · 2024-05-13T19:03:27 1715627007

Object-Relational Mappers purport to mitigate the impedance mismatch between object-oriented and relational data structures.

For your analogy to hold, what is the impedance mismatch between programming and Copilot?

Foobar8568 · 2024-05-13T09:54:27 1715594067

From what I remember, lazy loading wasn't part of EF for a long while and even longer for navigational properties. I am not even sure if it was part of EF4.

BurningFrog · 2024-05-13T14:32:24 1715610744

> this is where I think AI for coding is now. It gets things wrong enough that I have to manually check everything

The good way to do this is to write good unit tests for the code.

Which we should be doing anyway!

bluefirebrand · 2024-05-13T14:12:16 1715609536

> does exactly what it says it does, with no magic

I strongly believe that software we develop should feel like magic to the users

The tools that we use to build them should not

kaba0 · 2024-05-13T08:29:09 1715588949

> and started learning Go. Which does exactly what it says it does, with no magic

Too little abstraction is just as bad as too much.

jimbokun · 2024-05-13T12:49:51 1715604591

The amount of abstraction available in Go is just about right. It gives you higher level constructs, while still being reasonably straightforward to predict memory and CPU performance and behavior.

alrs · 2024-05-13T09:48:10 1715593690

Assembly is too little abstraction. Go is not.

neonsunset · 2024-05-13T09:52:26 1715593946

It is. Oh, and also, Go managed to screw up even the assembly, inventing portable but actually not dialect that uses ugly bits of AT&T syntax, custom operator precedence and in practice is non-portable, forcing you to mix Go-only mnemonics (which might even collide with opcode names on certain platforms), supported opcodes of target platform, and BYTE literals for opcodes it doesn't support, making a lot of your preliminary (N)ASM knowledge useless. Isn't that magnificent?

Gee, I wonder if there's a better way to do so that is not such a lazy job. But doing it properly, like .NET does, is supposedly too much effort!

alrs · 2024-05-13T09:54:59 1715594099

It would be great if C# could be discussed as a language on its merits, but Microsoft has been a terrible steward. It's too bad.

neonsunset · 2024-05-13T11:58:10 1715601490

So you are saying it's even worse than suing anyone over using the language like a certain Java-related company or laying off people off the core language team like a certain Dart-related company?

neonsunset · 2024-05-13T08:53:16 1715590396

And it’s by far the best one now that all the issues that made old EF unsound were solved in EF Core.

Good developers know to appreciate that and wouldn’t want to touch Go ecosystem afterwards with a 10 feet pole.

klauserc · 2024-05-13T09:14:11 1715591651

I find GitHub Copilot close to useless for production code. The worst, most obscure bugs I've had to debug in the last year were all in Copilot-written code. It _looks_ plausible, but it makes extremely subtle mistakes. Occasionally, you have repetitive sections of code where it can copy&adapt lines from the context, but that's about it.

It's a different story for test code. Test code is often formulaic and "standardized" (given/when/then). For instance, I find myself writing the first test case and Copilot can come up with additional test cases. Or I might write the method name ( FeatureUnderTest_Scenario_ExpectedOutcome) and Copilot provides the implementation.

I have not found any value in Copilot chat.

couchand · 2024-05-13T11:35:55 1715600155

Test code is code. It's as much of a burden as every other piece of code you are troubled with, so you must make it count. If you're finding it repetitive and formulaic, take that opportunity to identify the next refactoring.

Just churning out more near copies is not a good answer.

AlexandrB · 2024-05-13T15:17:45 1715613465

The problem with refactoring test code is twofold:

1. It can make it harder to see what's actually being tested if there are too many layers of abstraction in the test.

2. Complex test code can have significant bugs of its own that can result in false passes. What tests the test code?

Thus I generally see repetitive or copy/pasted test code as a necessary evil a lot of the time.

munksbeer · 2024-05-14T08:37:39 1715675859

Absolutely this! I was very guilty of over complicating test code to use abtractions and reduce boilerplate, but it certainly resulted in code which you could not always tell what was being tested. And, you'd result in nonsensical tests when the next developer added tests but didn't look deeply to see what the abstractions were doing.

I now find it is best to be very explicit in the individual test code about what the conditions are of that specific test.

lolinder · 2024-05-13T13:53:23 1715608403

> If you're finding it repetitive and formulaic, take that opportunity to identify the next refactoring.

It doesn't really matter how many helper functions you extract from your test code, in the end you have to string them together and then make assertions, and that part will always be repetitive and formulaic. If you've extracted a lot of shared code, then it might look something like "do this high-level business thing and then check that this other high-level business thing is true". But that is still going to need to be written a dozen times to cover all the test cases, and you're still going to want test names that match the test content.

There's a certain amount of repetition and formulaism that will never go away and that copilot is very good at.

causal · 2024-05-13T14:46:16 1715611576

LLMs are pretty good at anything that follows a pattern, even a really complex pattern. So unit tests often take a form similar to the n-shot testing we do with LLMs, a series of statements and their answers (or in the case of unit tests, a series of test names and their tests). It makes sense to me that LLMs would excel here and my own experience is that they are great at taking care of the low-hanging fruit when it comes to testing.

nasmorn · 2024-05-13T14:56:49 1715612209

I agree. A very high impact change I made for an application my team is working on was allowing easy creation of test cases from production data. We deal with almost unknowable upstream data and cheaply testing something that was not working out has reduced the time to find bugs tremendously

tracker1 · 2024-05-13T19:36:08 1715628968

I've found Github Copilot to be pretty great for boilerplate code... It's a tossup for anything much more complex.

devjab · 2024-05-13T09:45:53 1715593553

I think automated tests are the one area that LLMs will truly improve productivity (and overall code quality). It’ll likely also lead to a lot of tests that actually tests nothing, but as a whole, it’ll hopefully be capable of both generating and updating tests if you give it some good inputs to do it on. Documentation is another area where I have high hopes. In the ideal world people update it as they change things. In reality, however, well…

Then there is the design side of things. I really feel bad for designers of Icons now that you can get some really good one really fast by tasking one of the image generating AIs.

I’m not sure LLMs will ever really be capable helpers as far as programming goes. Well I guess it’s two part, they can help with trivial tasks, but they can’t help with anything related to the actual work of generating business value with code. It’s two-sided of course. They certainly allow a lot of people write functioning, though really shitty, code. Which is a huge benefit for a lot of programming tasks where it doesn’t really matter that it’s inefficient and well terrible. We’ve already seen our more digitally inclined employees make great things with power apps, most of which are eventually replaced by more robust software as they scale. But we also see small Python programs helping out with tiny personal tasks around our offices, and while IT operations aren’t too happy it’s generating a lot of individual value that wasn’t there before.

collyw · 2024-05-14T12:19:02 1715689142

If the code isn't doing anything special, it spits out decent enough code (I am using the paid version of ChatGPT with the various customization).

As someone who spends 80% of his time in the backend, I find it great for JavaScript whereas it's not so good for Django which I know pretty well.It can still be useful though and is often faster than looking up docs for specific things.

jtbayly · 2024-05-13T13:09:21 1715605761

Yes, I just used ChatGPT to write me some code to iterate through a CSV and add each row to a system via its API.

It wrote a python app. It hard coded the API key and the CSV file. And then it told me to pass the file name as an argument. lol.

I just asked it to fix that and tested with a two line csv. Worked like a charm and saved me quite a bit of time trying to figure a few new things out.

But a proper programmer would have been slowed down by this, for sure.

datavirtue · 2024-05-13T16:17:59 1715617079

A test that tests nothing is redundant and therefore is not a test. I have seen people make claims about "useless tests" when they are not able to reason about the coverage. You should be using a tool to gauge test coverage. Tests should be proving accuracy and precision. It's easy to conflate those or lose sight of one.

portaouflop · 2024-05-13T09:48:24 1715593704

I think few if any designers rely on icons for income. There have been thousands of free icons around before genai.

kqr · 2024-05-13T11:18:18 1715599098

> It’ll likely also lead to a lot of tests that actually tests nothing

So pair it with mutation testing!

shaky-carrousel · 2024-05-13T09:44:19 1715593459

Copilot for me is very useful with all the scaffold boring code. It sometimes helps with problems, but I have to guide it, and be very precise with my request.

zahrc · 2024-05-13T10:27:53 1715596073

And even then it happens to ignore context or queries from start or halfway through. I'd rather spend the time coding then trying to bruteforce it to give me the answer I need.

packetlost · 2024-05-13T13:59:20 1715608760

I've found Supermaven to be substantially better than Copilot. The latency is near instant and the results are mostly confined to a line or 2 where the success rate is higher. Meanwhile I agree that Copilot was less than useless for me. Actively hurt my workflow and made things harder.

zarathustreal · 2024-05-13T14:50:14 1715611814

If you’re not using a language that can properly support algebraic structures and randomized property-based testing you’re essentially getting no guarantees about your code from tests. You wrote the code, you wrote the tests, they’re equally likely to be incorrect.

munksbeer · 2024-05-14T08:40:26 1715676026

I find this to be a bit of a meaningless point. What are you actually trying to say?

collyw · 2024-05-14T12:22:35 1715689355

Humble-brag about how he uses "a language that can properly support algebraic structures and randomized property-based testing" whatever the hell that is.

Personally I use python and solve real world problems.

zarathustreal · 2024-05-14T11:33:14 1715686394

My statement is clear and straightforward, I’m not sure how to put it any other way. LLM-generated tests don’t make sense as a concept because there are only roughly five properties you actually need to write tests for if you’re writing tests that actually provide any guarantees.

munksbeer · 2024-05-14T12:16:46 1715689006

Apologies, but I understand the English words you're typing but I'm still not sure of the intent you're trying to convey to everyone. You're conversing in a very rigid style which isn't sympathetic to how people typically interact.

I could just leave the discussion I guess, but in the interest of discourse, I don't find your statement meaningful because we're not all working in languages that I think you refer to. Our unit tests are absolutely not perfect and don't offer perfect guarantees, as we're fallible and will write fallible code.

And as such, I just don't understand what point you're trying to make by saying that LLM generated tests are no good because they can't offer perfect guarantees.

zarathustreal · 2024-05-16T23:15:46 1715901346

Ah that makes sense to me, I see where I misunderstood you. When you say you don’t understand what you mean is that you do understand but you disagree with the point.

I’m on mobile so it’s hard to reference what I previously said but I’m assuming my statement needs to be weakened a bit to be correct. What I meant to say was that unit tests provide essentially no value because they can’t offer perfect guarantees, which is probably different than what I originally said. I’m assuming I just said “they offer no value” which is probably false in some cases for some people and some teams depending on their definition of value. My point was that unit tests do not make sense insofar as their purpose is to provide guarantees about the behavior of code because the information they provide does not meet the standard definition of “a guarantee”. For the above mentioned people/teams/situations/value definitions, they may make sense.

Hope that clarifies what I was trying to say.

Regarding languages, algebraic structures can be implemented in any Turing complete language. Likewise with property-based testing (with, eg randomized inputs across the domain). I’d be willing to guess it’s just a matter of education and/or desire keeping most developers from using it.

yuppiepuppie · 2024-05-13T09:34:43 1715592883

Assuming you work on a team with pull requests and code review, how much do you also put blame on that process?

muglug · 2024-05-13T01:17:20 1715563040

GitHub touts a 55% improvement in coding speed based on a study conducted in-house that tested the participants’ ability to paste a pre-written prompt and then check the output: https://github.blog/2022-09-07-research-quantifying-github-c...

The effectiveness of a Copilot-like tool trialed at FB showed that 8% of code contributed by participants was sourced from suggestions, but the latter study made no promise about coding velocity: https://arxiv.org/abs/2305.12050. In my own experience the time taken to review machine-generated suggestions often eats into developer time.

This is not a critique of LLMs in general — I’ve found ChatGPT really great for kicking off greenfield projects in well-known languages and frameworks.

yreg · 2024-05-13T01:43:02 1715564582

My personal feeling is that utilising LLM assistance often isn't faster, but it can take less "stamina", tiring me less.

rjst01 · 2024-05-13T07:53:40 1715586820

I've had the opposite experience - copilot will pop up a suggestion that's wildly wrong and I'll expend energy processing and discarding it.

heyoni · 2024-05-13T02:18:30 1715566710

Especially when renaming variables that are “immune” to normal refactoring. Copilot handles that pretty well and I don’t have to spend all that focus on such a menial task.

eru · 2024-05-13T02:42:19 1715568139

Yes, I often find that copilot is pretty good at picking up on the pattern of refactorings that I am doing.

Especially those that are a bit tedious and almost mechanical, but not quite mechanical enough to do with a simple search-and-replace.

skydhash · 2024-05-13T02:58:55 1715569135

Vim and text motions should do the trick.

dartos · 2024-05-13T04:00:29 1715572829

Sure, but doing the refactor the first time, without thinking about how to record a vim macro, then just hitting tab to have copilot do the same change over and over is a lower friction experience

eru · 2024-05-13T05:10:29 1715577029

Totally depends on what kind of refactoring you want to do; and how well vim's commands map to your language's syntax, too. (Or whether your vim has special support for your language's syntax.)

Semaphor · 2024-05-13T08:24:29 1715588669

Sidebar, but what does "“immune” to normal refactoring" mean?

tomduncalf · 2024-05-13T09:08:00 1715591280

I would guess they mean stuff which isn’t just “rename symbol” or whatever, like changing code from one pattern to another slightly different one. For example, I’ve used LLMs to “change this if statement to a switch”, which I don’t think VS Code can do as an automatic refactor using the native tools.

Semaphor · 2024-05-13T09:15:08 1715591708

That’s not variables, though. But maybe it’s something similar, like some variables being used in strings?

> which I don’t think VS Code can do as an automatic refactor using the native tools.

Yet another point for using an IDE over a texteditor with some bolted on IDE features, because the Jetbrains tools even suggest such refactorings ;)

pradn · 2024-05-13T17:03:56 1715619836

I find it far more tiring because I don't have "micro-breaks" where I'm slowly typing code. I just have to be in a serious "check the logic" mode for a longer period of time.

It also makes it a lot easier for juniors to chuck random code at seniors for review.

Kiro · 2024-05-13T07:38:48 1715585928

That's a great point. I'm definitely procrastinating less.

wouldbecouldbe · 2024-05-13T07:52:50 1715586770

Yeah although the frustration of restarting after failed & failed attempts, or introducing bugs in parts of code you didn't want updated is also tiring.

KaiserPro · 2024-05-13T08:22:25 1715588545

I have a "top class" LLM based autocomplete provided by work.

At first it was a massive pain because I didn't realise it wasn't a "proper" autocomplete(intellisense is probably the king in that regard), and get hit with a large number of hallucinated functions.

This was really hard for me, as I'm slightly dyslexic, which means spotting plausible but bullshit completions is very hard. (I suspect its hard for everyone else too). Worse still, at the time the linter/type inspector was/is very slow so only ran on save/execution.

However its both improved in the last year significantly, and I have got used to it. For me there are a few techniques that help me:

1) It has a recency bias. Which is great for when you're jumping about in code making changes

2) It rewards proper variable names

3) Your comments should say what, and why your doing what you are doing.

2 & 3 should be obvious and you should be doing it anyway. But it really re-enforces that.

However I would really like some UI changes so that I can make _better_ use of the LLM plugin.

1) a completely different colour to indicate that its an LLM suggestion (bonus points for giving a confidence as well)

2) a different keystroke to accept the suggestion. (bonus for partial selection)

scaryclam · 2024-05-13T10:22:18 1715595738

Coding speed really is a horrible metric. I can code really quickly, but it doesn't mean I'm doing anything productive or correct.

I'd rather slower coding speed, properly written as it provides higher overall velocity. And velocity should take into account the refactoring that happens months or even years later. Crappy code can look really fancy, and even be bug free, but if it's overengineered and hard to change, it can create long change times or even a full stop in development later in the products lifecycle.

And that's on top of developers losing the understanding of how something actually works. If AI helps create the code that would have been written without AI, then great, but I don't observe that happening, and the code has never been a better idea than then dev could have done without it.

12907835202 · 2024-05-13T01:23:23 1715563403

This last sentence is the most important. You can describe your dB schema in words and get laravel migration, models, controllers and if you wanted policies, form requests etc all near perfect. You can get a V1 in 10-20 minutes then go about handling all the actual logic

mlinhares · 2024-05-13T02:12:19 1715566339

So rails scaffold from 20 years ago but with a chance of hallucinations?

dudus · 2024-05-13T03:42:46 1715571766

It's very easy to find multiple solutions to one problem. The power of LLM is that it's one solution to many problems.

marcus_holmes · 2024-05-13T04:32:25 1715574745

But is it actually a solution to any of them?

The "chance of hallucinations" is the tricky bit - if I have to manually check everything it does in case it's hallucinating, then it's not actually a solution. It's not saving me time (as TFA says).

perfmode · 2024-05-13T06:36:47 1715582207

while you’re in doubt, i’ll be speeding out

marcus_holmes · 2024-05-13T08:04:55 1715587495

Speed on, I'll enjoy the slo-mo crash footage later :)

LtWorf · 2024-05-13T09:31:44 1715592704

What about the problem of needless pollution?

dudus · 2024-05-13T16:43:17 1715618597

I thought we had moved on from cryptocurrencies

ben_jones · 2024-05-13T02:18:11 1715566691

Shhh we’re raising trillions for this technology remember?

rm_-rf_slash · 2024-05-13T01:29:10 1715563750

Yeah the copilot doesn’t need to be integrated into every keystroke, just able to analyze the context and kickstart the code. Getting up to speed with new libraries is so much easier with AI instead of the bad old days of trial-and-error-and-marked-as-duplicate

erhaetherth · 2024-05-13T07:30:19 1715585419

I especially like when I'm looking for a particular method on a class I've never used before and it just makes one up for me as though it exists.

what-the-grump · 2024-05-13T11:26:13 1715599573

It’s adding features that should have existed for you! It can also make up the documentation for the method while you are at it.

gtirloni · 2024-05-13T02:22:52 1715566972

Can Copilot analyze my whole codebase these days or just the file I've opened? Honest question as I've stopped to use it a long time ago.

theshrike79 · 2024-05-13T07:50:29 1715586629

With VSCode at least you can say @workspace to Copilot and it'll take it into account.

No idea if it feeds all of the code to the LLM or just parts, but it's pretty good at interpreting what the code does

dathinab · 2024-05-13T12:36:26 1715603786

It guaranteed can't have all of your code in the LLM context, or even all of your file (in case of longer, through not even quite long files).

Through it could do stuff like go through your repository(ies) and generate embedding for sections of it and then have a vector database + retrieval argumented generation (RAG) system.

threecheese · 2024-05-13T21:51:08 1715637068

A lead from one of the GH Copilot-adjacent companies was interviewed recently, and that’s precisely what they are doing. They generate embedding of “local” code based on AST (up the stack and sideways, if you know what I mean), and take into account runtime and library versions when doing inference. Sounded like a very interesting challenge.

theshrike79 · 2024-05-13T17:33:20 1715621600

Yea, that's the way they're going based on the low-memory models they're building.

Basically all the LLM needs to do is translate human writing to some format that a "normal" service can use. That can then leverage the existing spotlight system that's pretty decent at searching stuff on the phone anyway.

Then it'll report it back to the LLM which translates whatever format back to something humans can process.

Basically "less shit Siri"

selcuka · 2024-05-13T02:32:54 1715567574

It seems to be hit and miss, at least with the current PyCharm integration. In some cases it can infer information using other files, in some cases it can't (even if they are open in other tabs).

dagw · 2024-05-13T12:24:49 1715603089

As someone who switches between PyCharm and VSCode, I find that Copilot seems to work better in VSCode for some reason. Nothing major, but the suggestions I get just seem slightly more relevant to my code base, and are more often what I wanted.

Although I could be hallucinating the whole thing.

muglug · 2024-05-13T03:43:50 1715571830

Whole-codebase understanding is beyond current coding assistants’ capabilities. To do that you need to add in traditional static analysis. But I’m sure people are working on it!

skywhopper · 2024-05-13T09:48:08 1715593688

The last entity I would trust for trustworthy data on how much Copilot helps programmers is GitHub. They obviously skew the scenarios and metrics to find the best possible number to report.

I think the truth as revealed in your comment and many others is that it all depends on the context. For repetitive code or boilerplate, or starting new projects in well-known frameworks and languages, it probably does increase velocity. The other context factor is the programmer themselves. Their familiarity with the problem domain, the language, and the framework all matter, as does the personality and coding style of the individual.

claytongulick · 2024-05-13T14:51:04 1715611864

It also depends on what tools a developer is already using as a productivity booster.

For example, I use vanilla web components which have some boilerplate for new components.

I already have a simple vscode snippet that does the job well, with no hallucinations [1]. I've experimented with llms doing the same thing with not great results.

It took me longer to explain what I wanted than it did for me to just write that snippet. Doing it repeatedly and waiting for the results definitely didn't increase my speed, though I was impressed that it was eventually able to figure it out (vanilla web components with lit-html renderer isn't a super common technique). Also, I prefer the pythonic approach of snake-case local variable names. Getting the llm to do that in a js project where it's not super common was another whole iteration.

We've had code generation tools around for decades to deal with repetitive boilerplate tasks. Maybe they aren't quite as capable as when the llm "gets it right" - but I wonder with these productivity claims how they are measured.

Are they starting from zero and a new developer? Or comparing against an experienced developer with proficiency with lots of workflow enhancers like templates, snippets, vim macros, etc...

[1] https://gist.github.com/claytongulick/e4251c1b27b22c25fb68f3...

noobermin · 2024-05-13T05:40:39 1715578839

So I am a scientist and I've generally resisted using llms for my work. My general critique reading others' experience is this sounds like what is needed is better abstractions if all it helps people with is doing boilerplate that needs to be checked. Also no need for "better abstractions" to be some ethereal thing, this just means better standardised libraries and frameworks. May be if the word "abstractions" sounds too bespoke may be what is needed is better tools and tooling.

The standardised bit seems more difficult than it should be as it's essentially a social problem with development in general, but that also is a problen AI does not address, it's merely a bandaid over the problem.

thomashop · 2024-05-13T07:37:55 1715585875

It takes literally 10 minutes to try it out. I'd suggest trying it instead of recycling other people's opinions.

AnarchismIsCool · 2024-05-13T00:39:50 1715560790

Local maxima go BRRRRRRR

AI will get there eventually, but this current paradigm seems increasingly only useful for spam and shitty clip art. Even so, everyone is throwing absurd amounts of investment capital at it in the hopes that something useful will happen. It's a pretty clear depiction of the investor class being so detached from the technical reality of what they're investing in that they just sit around lighting billions on fire thinking that they're getting richer instead of poorer. Go ahead and buy more H100s though...

simonw · 2024-05-13T00:49:11 1715561351

I've been finding this stuff genuinely useful for two years now, across Copilot and ChatGPT and Claude 3 Opus and similar tools.

Either I'm a dimwit, easily conned by hype and shiny tools to the point that I can imagine benefits for two years that simply aren't there... or there's something to them.

aimazon · 2024-05-13T01:07:08 1715562428

I don’t think you’re a dimwit but I read your post[1] with an example and I am curious to whether you feel you’re losing something by telling the LLM it’s wrong and to try again, rather than going through the exploratory/iterative learning process yourself. For example, would you have known to ask about GeoJSON if you had not come across and learned about it pre-LLM? More succinctly: do you feel you’re learning more or less or an equal amount when using LLMs?

[1] https://simonwillison.net/2024/Mar/22/claude-and-chatgpt-cas...

simonw · 2024-05-13T01:26:11 1715563571

Absolutely I could have learned more from that particular project if I'd spent more time with it rather than getting the LLM to do the work... but that's why I like it as an example: since it was effectively a distraction (a "side quest") the alternative wasn't learning more, it was not doing it at all (and learning nothing).

I'm able to get really great results out of LLMs because I have 20+ years of experience helping me know what questions to ask of them.

I do feel like my rate of learning has increased though, because I'm much more likely to try out a completely new technology when I know an LLM can flatten the learning curve for me a bit.

gtirloni · 2024-05-13T02:26:27 1715567187

> I do feel like my rate of learning has increased

It's the same for me. Lots of experience knowing what to ask for. It does a better job in summarizing knowledge and getting me a relatively coherent explanation. Much faster than using Google Search to find and connect the dots from dozens of pages.

I just don't see much benefit in its reasoning and code assistance features besides basic stuff.

skydhash · 2024-05-13T03:30:42 1715571042

> I'm able to get really great results out of LLMs because I have 20+ years of experience helping me know what questions to ask of them.

While a direct answer is nice, I like an iterative/explorative process because of all the things I pick up alongside it. An example is when I was working on an epub reader for macOS (side project). I wanted to a native layout engine instead of a webview and I decided to go with muPDF. This has lead me to know more about text layout and rendering, and embedding C inside Swift than I would if I just have direct answers for every problem (if I'd know the correct questions in the first place).

I accumulate side quests like that until I can do a nice experiment to learn as much as I can for a particular domain space.

BoorishBears · 2024-05-13T04:31:33 1715574693

You're talking right past them though: you were working on a side quest you had interest and bandwidth to iterate on.

I agree with the parent comment, my pipeline is already saturated with side quests, I'm already iterating on a bunch of random fun work, and side-project work and work work. So often times the most "LLM-heavy" projects of mine are things I straight up would not do if I didn't have something to get the ball rolling other than more of my own free time which is already in short supply.

Hopping from direct answer to direct answer isn't where I find wonder/fun in programming anyways, but sometimes you don't have bandwidth for the side quest to be fun or wonderous.

yawaramin · 2024-05-13T02:45:26 1715568326

> I have 20+ years of experience helping me know what questions to ask of them.

That seems rather unlikely given that LLMs didn't exist more than seven years ago, much less 20+.

Loocid · 2024-05-13T03:30:11 1715571011

They mean they have 20 years experience in programming, which helps them ask good questions.

simonw · 2024-05-13T03:31:54 1715571114

Yeah, that's what I meant. With LLMs having deep domain knowledge in a field helps enormously in terms of prompting them as productively as possible.

yawaramin · 2024-05-13T19:41:14 1715629274

So, to maximize the usefulness of LLMs, requires expert level domain knowledge in a field? Sounds like a useful tool for highly trained experts. For the average Joe Developer, perhaps not.

erhaetherth · 2024-05-13T07:37:36 1715585856

I was trying to fetch key-value pairs out of a database using PHP+PDO the other day and I knew there was a nice easy to do it but I couldn't remember how. Something about fetchAll, maybe PDO::FETCH_GROUP|PDO::FETCH_COLUMN.... what was it?

So I asked a couple LLMs. They wrote out loops for me to format the data how I wanted. I could have copy-and-pasted that in and it would probably have worked. But I felt there was something better yet, so back to Google I go.

It's `PDO::FETCH_KEY_PAIR`. It's built-in. But oddly kind of hard to find unless you know the right thing to search for, and "key pair" was not springing to my mind.

Point is, if you just let the LLMs do your work you won't even find these better ways of doing things. And I'm quite afraid of what the LLMs are going to do to Google, Stackoverflow and documentation in general. In a couple users it'll be ungoogleable too.

fzeroracer · 2024-05-13T08:05:35 1715587535

As noted LLMs can only give you what you ask for, but for a lot of problems what you ask for isn't what you need; it's two or three steps removed. And LLMs can't tell you that you're doing something wrong; unlike curmudgeonly users on SO or in various forums/channels.

My gut feeling is that we're going to enter into a 'dark age' of coding where a lot of previously available resources are going to be ransacked and made hard to find in favor of big corporation owned LLMs. It's already having an extremely bad effect on search in general; we're potentially only a few fights away from sites like SO having users leave en masse. That's why I think having a strong network of engineers to talk with will become more important than ever, almost a return to the IRC days.

DrSiemer · 2024-05-13T08:33:58 1715589238

Strong disagree. I have learned so many new things since I started using LLM's. Some of them I'm actually embarrassed to admit, because I should have known about them a decade ago.

If you work in a small company and you are the most experienced developer, you don't often get feedback on how you can improve things.

The trick is, quite simply: just ask. I regularly dump some code I wrote in a language model and then ask what can be done better.

I would never do that in any online space, because first, I don't wait an answer maybe some day, I need an answer NOW. And second, I prefer to avoid being called a fool.

fzeroracer · 2024-05-13T16:23:22 1715617402

This is precisely the wrong way to engage with LLMs. If you are asking it 'what can be done better', it'll spit out something. That something isn't necessarily better or not because it has no concept of 'better' or 'worse'.

DrSiemer · 2024-05-16T18:57:08 1715885828

Ah, so that's why all my code has become more concise and efficient and I've learned countless new tricks that I did not know before and probably would have never found without LLM's.

Too bad I'm "engaging with them wrong", I could have sworn it was helping me.

Seriously though, claiming LLM's don't have any higher level understanding of right and wrong and then extrapolating that to "they cannot possibly be used to improve things" is a very stubborn refusal of the fact that the most logical answer to the question "what can be improved here" is... actual improvements.

fzeroracer · 2024-05-19T21:05:16 1716152716

They do not have any higher level understanding of right and wrong. You lead the model on by telling it to improve something, so it will rework the code in question and tell you that it's an improvement. Regardless or not if it is. Coding is about 70% subjective, 30% objective when it comes to figuring out improvements because the majority of improvements deal with business logic and things specific to your domain.

noduerme · 2024-05-13T01:12:07 1715562727

You seem to be writing from a lost world when people enjoyed coding because they enjoyed learning new things with every project. I think that's sadly something only enjoyed at work by a very small minority these days.

eru · 2024-05-13T02:44:17 1715568257

I don't think those 'good old days' ever existed. Or rather, the situation hasn't really gotten worse.

noduerme · 2024-05-13T07:26:25 1715585185

I probably wasn't clear I still live in that world, both professionally and in personal projects. But my perception is that the vast majority of devs and engineers do not.

simonw · 2024-05-13T01:27:11 1715563631

I very much aim to learn something new with every project. The thing I like most about working with LLMs is that they let me take on MORE projects!

thefaux · 2024-05-13T05:11:54 1715577114

You are creating a false dichotomy. Very few people are saying there are no benefits, but many have reasonable concerns both about the current efficacy of these tools as well as the expectation of continued exponential growth which is driving much of the current hype.

What if we have already passed the inflection point where the exponential growth transitions to an s-curve? That would mean that this technology on its own would only get marginally better than is today. Maybe 2, 4 or even 10x better than it is today, but not 100x or 1000x. To break those barriers, we would need further innovations beyond just throwing more gpus and training data at the problem.

I personally am short on LLMs because I believe it is much more likely that we have already crossed the inflection point or will soon. Again, they are impressive but ultimately I think that LLMs will be at best a footnote in history if they are even remembered at all in a few hundred years. But of course I could be wrong.

simonw · 2024-05-13T10:47:02 1715597222

I've been generally staying away from the whole "imagine what this stuff could do next!" side of the discourse.

My personal opinion there is that if all research froze today it would still take us years to figure out all of the potential use-cases and applications for the models we have access to right now, and how best to apply them.

vertis · 2024-05-13T09:00:34 1715590834

No I'm with you. I don't use copilot though for undirected autocomplete, I use tools that allow me to give instructions and diff the results, Cursor and Aider are my current defaults, both with Claude 3 now[0]. Always looking for alternatives though.

I think it's always going to be a YMMV experience with LLMs. I'm an extreme generalist with 23 years of experience so it compliments my strengths and weaknesses. I know how to program across 20 odd languages but most of them I need to look stuff up if I'm not using it frequently enough. Now though, don't really need to look stuff up.

For me there are two groups, those that want to use LLMs and are pushing it forward and those that are would prefer that LLMs not exist and want it to be hype that goes away.

Having followed the hype cycles of VR/AR and crypto I can feel a difference. Both of those felt like a solution in search of a problem. The true believers wanting it to be something like Ready Player One with VR (fun story, Oculus/Facebook actually handed out copies of the novel at Oculus Connect 2 in 2015).

The hype around LLMs seems different; like applying a new solution to all the existing problems to see where it helps, and that's just with the current iteration.

[0]: I'd prefer to be using local/open equivalents but the capabilities are still lacking.

tymscar · 2024-05-13T05:04:13 1715576653

You are clearly not a dimwit, and you don’t only have way more experience than most people here, you also have some amazing projects under your belt.

However I can’t help but notice that the vast majority of blogposts, talks, tweets, and basically everything else you do now is around LLMs. Do you not think that’s indicative of this being “hyped” and “shiny tools”?

ToValueFunfetti · 2024-05-13T05:54:16 1715579656

If hype just means people are excited about it and talking about it, it should be a positive signal about the merit of the thing. I think people get the wrong idea by measuring P(X is good | people are excited about X) and finding it to be low. But P(X is good | people are not excited about X) is vastly lower, and 'hype' as here used is not really discrediting.

thaumasiotes · 2024-05-13T11:24:52 1715599492

> I think people get the wrong idea by measuring P(X is good | people are excited about X) and finding it to be low. But P(X is good | people are not excited about X) is vastly lower

Only if you're including things that don't exist. If you restrict yourself to things that people might conceivably talk about, that probability is actually very high, for the simple reason that bad things are exciting specifically because they are bad.

Here are some boring things:

- bread

- sunlight

- air

- water

- parents

Note that starvation, darkness, suffocation, dehydration, and being orphaned are all much more exciting than their opposites.

ToValueFunfetti · 2024-05-13T12:42:02 1715604122

Poor phrasing on my part; 'good' should have been 'worth talking about'.

simonw · 2024-05-13T10:44:25 1715597065

I'm writing a lot about LLMs at the moment because they're the focus of much of my work.

I don't see that as a hype thing - in the past I've had other topics I've focused on, not because of hype but because those were the topics I was spending the most time with.

tucnak · 2024-05-13T08:40:54 1715589654

Deep learning is the most important thing to happen to philosophy since Socrates / Wittgenstein.

mhuffman · 2024-05-13T12:27:20 1715603240

I would be very interested in you providing an example of Deep Learning offering more cultural or historical impact than Socrates / Wittgenstein.

tucnak · 2024-05-13T17:31:15 1715621475

I mean, already ChatGPT alone has made more cultural and historical impact per capita than the aforementioned gentlemen, simply because nobody cares about philosophy. But that was never my point. My point is: you can think of uncle Witt kind of like "postmodern" twist on Socrates, i.e. they're both effectively talking about the same thing: determining optimal form for computing language. Wittgenstein had made considerable progress here, i.e. language games, and now we finally get means to literally compute these. To me it's absolutely clear; this is the most important thing to happen in philosophy.

selcuka · 2024-05-13T02:39:23 1715567963

I basically say the same when people ask me if the AI bubble is going to burst. While I agree that the current push for LLMs should be taken with a grain of salt, I don't think it's a bubble either. I have been finding it useful too, and I don't think I will stop using it because it will look less shiny in the future.

BlueTemplar · 2024-05-13T14:40:10 1715611210

The dot com was a bubble that burst spectacularly enough, yet we still use the World Wide Web, and even some of those websites.

anonzzzies · 2024-05-13T01:07:32 1715562452

Nah, it seems a lot of people are using them wrong? OP seems to also fall in that category. I sit next to many people in pair programming situations and it’s quite weird to see very smart programmers using gpt or copilot; they are the ones shouting ‘stochastic parrot’ on forums and yet expect some kind of mind reading magic when using these tools. When I show that you can put a comment for your function, like you should do anyway, and it writes the function, it usually clicks. Many still find it ‘faster and easier’ to write the function themselves and that’s fine, but, like OP, doing things like entering ???? and generally expecting a useful result will not be optimal.

ern · 2024-05-13T04:04:31 1715573071

You make an interesting point. I once worked with someone who was brilliant, but had gotten into the field via a math degree, and we were building an enterprise-software product.

We tried pair-programming, and he cracked within a few minutes. He couldn't articulate his thought processes verbally. Sounds similar to the scenario you're describing.

blibble · 2024-05-13T01:13:23 1715562803

I wouldn't feel too bad, there's orders more magnitude more money put into this con vs. the last one (NFTs)

eru · 2024-05-13T02:46:25 1715568385

Your counting of 'cons' seems pretty idiosyncratic.

You know there have been lots of other technologies people have been working on in the meantime and concurrently? Some of them more questionable, some of them less.

015a · 2024-05-13T01:26:04 1715563564

Yeah; its concerning how depressingly similar Sam's recent language is to, say, SBF's in his prime. Waxing poetically with faux-intellectualism about the sheer addressable market his business will capture (all of humanity) (and funded by the government) [1]. You can just listen to key players in the space, how their language has changed over the past year, and recognize that we're probably very near a local maxima / plateau / AI winter.

Reality is: AI is startlingly expensive. Stupid, stupid expensive. Microsoft is building "Stargate", their $100B AI-focused supercomputer [2]. The money to build that isn't coming from selling AI services. Its coming from their old, boring, tried and true businesses; Windows, Office, Azure, and M365. Xbox is dying before our eyes. Their tried and true businesses make great money but aren't growing. Its up to AI to prove to investors that their P/E multiple is justified.

[1] https://twitter.com/tsarnick/status/1789107043825262706

[2] https://www.tomsguide.com/ai/meet-stargate-the-dollar100-bil...

viking123 · 2024-05-13T09:17:17 1715591837

I always mix SBF and Altman in my mind, and have to spend few seconds thinking which was which..

But IMO the current AI hype has some almost religious elements to it

n_ary · 2024-05-13T15:27:18 1715614038

> It's a pretty clear depiction of the investor class being so detached from the technical reality

I think, their ROI model is much different and subtle. Few decades back, everything ran on-premises and costed ok ... Then a huge investment had been shovelled into fancy frameworks and pushing strange paradigms (microservices, distributed systems, monoliths whatever flavour works for the tech giants) so now people just put together whatever and needs massive _cloud_ to run basic things that used to run on few MHz devices.

In the end, the ROI is NOT from these models, those are just gambles in case one actually becomes a king maker. The actual ROI is from the _cloud_ where innocent people will rent expensive hardware to try and utilise these models and funneling wealth to the actual investments(cloud operators, hardware vendors) which are already part of portfolio of these investors. Basically, $1MM invested in random shiny startup will inspire other shiny startups to also spend similar in race to become unicorn, all the while the cloud operators and GPU vendors are laughing to the bank preparing their shareholders(usually same investors) dividends.

noduerme · 2024-05-13T01:09:17 1715562557

I think the nomenclature of calling generative AI an "AI" is just hype and that leads to disappointment. After all, it's much more exciting for investors than calling it the world's most expensive textual/audio/visual autocomplete. But that's really what this generatoon currently is.

And it can feel like magic, because we're pattern-seeking and pattern-matching creatures, so something that seems to intuit the pattern we're looking for (even if imperfectly) can feel quite a bit like reasoning with another human. To be a bit less generous to humans, it's often the same as talking with another human, because we spend much of our lives writing boilerplate or making small talk or otherwise just kind of on autopilot and answering patterns with expected responses without seriously engaging our complex reasoning. (Do you have to really think about how to map-filter-reduce a dataset anymore? Or think seriously about how you really are when someone says "how are you"? We don't usually expect other people to, and most of the time we're satisfied by exchanging recognizable / somewhat coherent language patterns which we can count on each other to fill in the gaps of. But this is not the intelligence in human intelligence).

The corporate hype machine works overtime to hype it as a solution, but that's just what they do. That's all spin. We see through it now, and we're getting to the point where people are asking the only important question about this tech, which is, how much will you pay for a nerfed autocomplete to do X?

tylerchilds · 2024-05-13T02:35:29 1715567729

“how much will you pay for a needed autocomplete to do X?”

i think this is a super important for anyone playing in the LLM space to calculate.

currently “AI” providers are selling electricity at a loss to demonstrate product “value”, so even while asking questions is “free” today, there’s actually a finite resource under the hood that needs to have the bill paid in the end.

an estimate, but from historic trends, free bills come due in around 7 years

eru · 2024-05-13T02:49:50 1715568590

Even _if_ we don't see our current 'AIs' become more intelligent, I am very certain that we will see the amount of electricity needed to produce 2024-state-of-the-art results drop dramatically over the next few years. People are only just figuring out how any of this works, and they are happy to get any reasonable results at all. They aren't really competing on costs like energy efficiency, yet.

---

However I don't really think that 'if' will come to pass: I expect that we will still see lots of advances in the 'intelligence' of these models, so people will compete on these, and worry about electricity usage afterwards.

mistrial9 · 2024-05-13T13:47:46 1715608066

> The corporate hype machine works overtime to hype it as a solution, but that's just what they do.

the CoPilot product has billed hundreds of millions of dollars, and is billing right now. Secondly they copied all the GPL code and put it into a mixer, like a bitcoin mixer in a legal way.

Sage words about "what they do" ring hollow while the only measure that counts is money. This situation needs legal action.

noobermin · 2024-05-13T05:44:14 1715579054

People are either misusing this meme or it makes no sense.

CaptainFever · 2024-05-13T09:38:13 1715593093

It's likely an ironic usage.

dartos · 2024-05-13T04:02:41 1715572961

I don’t really think AI will get there the way we’re going about it.

Statistical models will always be limited by their data and how edge case filled reality is.

mondrian · 2024-05-13T08:54:15 1715590455

It's definitely useful, but in higher level chat use cases, not so much high-precision generation (yet). Rubber ducking, brainstorming, and search-like use cases are definitely a level above what we had.

mhh__ · 2024-05-13T00:56:57 1715561817

> spam

The first 100 lines of any project is spam (usually 0 entropy), so AI is a nice win.