More

brycelarkin · 2025-03-29T00:10:19 1743207019

Who’s the target market for this?

Tagging documents isn’t really a big problem. Companies typically only have a finite, handful of contract templates that the admin tags manually.

brycelarkin · 2025-03-01T00:26:25 1740788785

Nice! This was one of the missing parts of the Vercel AI SDK.

grantsingleton · 2025-03-12T13:53:24 1741787604

Yeah this is what I wanted the AI SDK to do but didn't have it and surprisingly no one built anything for it yet.

brycelarkin · 2024-11-27T00:33:55 1732667635

They increased prices, but looks like they finally are revamping the service. This is probably the biggest update in years.

brycelarkin · 2024-09-27T22:45:05 1727477105

I can see Matt’s point of view. Data transfer fees are expensive, especially at WordPress scale. Automattic probably covers a lot of that cost that wordpress.org is incurring and wants WP Engine to pay their fair share.

WP Engine also seems to do some other “not in good faith” things such as change the woocommmerce Stripe attribution from wordpress.org to their own Stripe account.

While the legal dispute is on trademark, I think it’s really on WP Engine profiting on wordpress.org without giving back. It’s not illegal, but blacklisting WP Engine isn’t illegal either.

Automattic is essentially subsidizing a private equity backed company. I’d be upset and frustrated too if I was in Matt’s position.

If you support WP Engine, you’re supporting Silver Lake Private Equity.

x0x0 · 2024-09-27T23:31:07 1727479867

If Matt had wanted to, for example, tell WPEngine they're own their own for hosting these services with a cutoff date at least 4 weeks in the future, then fair play. Or to demand to bill them some reasonable cost share of the price to operate (and let's not play games, a reasonable cost for the humans involved, which likely far exceeds servers or bandwidth), then still fair play.

To pitch a tantrum, cut them off with no notice, then gloat about it online... oof. I can't understand why anyone would want to be in business with someone like that.

itsdrewmiller · 2024-09-27T23:00:52 1727478052

This position rests pretty heavily on the idea that Wordpress.com is subsidizing wordpress.org, which is a charitable foundation that accepts donations. Do you have any specific reason to believe this is true? I don’t recall seeing that complaint directly here but maybe I missed it.

brycelarkin · 2024-09-27T23:09:25 1727478565

A quick look at the Automattic site (https://automattic.com/about/) says they dedicate 5% of company time to WP open source.

Their GitHub has over 1000 repos, likely all WP related. https://github.com/Automattic

I don’t doubt Automattic is not putting in their fair share into the WP community.

itsdrewmiller · 2024-09-30T00:48:20 1727657300

When I say subsidizing I mean financially subsidizing (i.e. paying for wordpress.org bandwidth costs) - that's the only way the comment makes sense, and I am not inclined to believe it to be true without evidence. As the other response notes though wordpress.org isn't even the foundation apparently, so is wordpress.com subsidizing wordpress.org? Have they asked WPEngine to cover their costs?

FireBeyond · 2024-09-27T23:09:35 1727478575

> Wordpress.com is subsidizing wordpress.org, which is a charitable foundation that accepts donations

Nope. There goes Matt, muddying the waters. Contrary to many people (including myself's belief) wp.org is very explicitly, in his words, NOT the Foundation, just a benevolent gift of his to the community. You could be excused, given all the links to donate to the Foundation and accompanying text, or the fact that it lives on the Foundation's ASN, though.

brycelarkin · 2024-09-18T03:48:02 1726631282

Those last 30 seconds of the video, trying to get the demo to work for a group of people is so relatable.

brycelarkin · 2024-07-19T04:26:57 1721363217

This article just looks like an ad for Bloomberg’s docuseries.

brycelarkin · 2024-07-10T20:21:11 1720642871

Separate Github profiles for work and personal.

solardev · 2024-07-10T20:27:37 1720643257

Is it normal for people to have separate profiles like that on Github?

vunderba · 2024-07-11T02:11:59 1720663919

I certainly do - my work GitHub profile is attached to my work email address whereas my personal GitHub is attached to my personal email address.

it's a complete separation of responsibilities in the same way that I have a work laptop and a personal laptop.

switch007 · 2024-07-10T22:59:21 1720652361

Not normal enough. Would you do all your personal emailing on your work email? I don't know why so many people have a blind spot when it comes to GitHub

At $JOB, it's mandated to have a separate GitHub account

al_borland · 2024-07-11T05:27:38 1720675658

Yes. My work GitHub account is via SSO using my work creds. My personal GitHub account is much older and uses my personal email address.

Of course, we can’t (or shouldn’t) be uploading any of our code to GitHub for work. We have self-hosted Bitbucket servers. The corporate GitHub accounts are strictly used for Copilot.

If/when I leave the company, I’m pretty sure they don’t want all the code tied to my personal account. That sounds like a nightmare. I don’t want that either.

OutOfHere · 2024-07-10T20:35:54 1720643754

Yes, of course. In fact it is a pretty bad idea to use the same profile for an employer. Always treat an employer as disposable, along with the respective GitHub username for it.

ab0aa907 · 2024-07-10T22:30:32 1720650632

I would say most devs use same GitHub profile for work and personal projects. GitHub provides pretty easy way to join various organizations with same handle. It almost seems exception when a developer creates a new profile for work.

OutOfHere · 2024-07-11T02:06:20 1720663580

The practice can vary significantly by the employer. Some employers prefer the usernames be new and unique to the employer, whereas others don't care. It's obvious that it's a major liability for the employee to reuse the username for personal and professional organizations.

solardev · 2024-07-11T15:47:22 1720712842

Yeah, exactly. Personally I always thought that was a better policy (to have separate profiles) but I've actually had senior devs in the org tell me to just use my personal one and add my work email to it, so that's what I did. Seemed strange to me but I didn't question it too much. Maybe I should've, lol.

solardev · 2024-07-11T15:48:27 1720712907

Thanks all. You've convinced me. I'll set up a new profile for the next time I need to work with a company.

brycelarkin · 2024-07-05T23:16:07 1720221367

For postgres, you want to use “bigint generated always as identity” instead of bigserial.

inopinatus · 2024-07-06T04:26:34 1720239994

I agree and apologise for writing bigserial out of ossified habit. Identity columns hew to the SQL standard, bigserial is a pg-ism. I actually do still use it when the ORM’s DDL generator prefers as much (looking at you Rails), there’s no gain from fighting the middleware.

For other readers: the two are almost the same in behaviour, they differ primarily in that identity columns don’t need a USAGE grant on their associated sequence, and the ALWAYS makes it harder (but despite the term not actually impossible) to override the default value on row insert.

thiht · 2024-07-06T07:58:20 1720252700

westurner · 2024-07-06T01:36:26 1720229786

Why “bigint generated always as identity” instead of bigserial, instead of Postgres' uuid data type?

Postgres' UUID datatype: https://www.postgresql.org/docs/current/datatype-uuid.html#D...

django.db.models.fields.UUIDField: https://docs.djangoproject.com/en/5.0/ref/models/fields/#uui... :

> class UUIDField: A field for storing universally unique identifiers. Uses Python’s UUID class. When used on PostgreSQL and MariaDB 10.7+, this stores in a uuid datatype, otherwise in a char(32)

> [...] Lookups on PostgreSQL and MariaDB 10.7+: Using iexact, contains, icontains, startswith, istartswith, endswith, or iendswith lookups on PostgreSQL don’t work for values without hyphens, because PostgreSQL and MariaDB 10.7+ store them in a hyphenated uuid datatype type.

From the sqlalachemy.types.Uuid docs: https://docs.sqlalchemy.org/en/20/core/type_basics.html#sqla... :

> Represent a database agnostic UUID datatype.

> For backends that have no “native” UUID datatype, the value will make use of CHAR(32) and store the UUID as a 32-character alphanumeric hex string.

> For backends which are known to support UUID directly or a similar uuid-storing datatype such as SQL Server’s UNIQUEIDENTIFIER, a “native” mode enabled by default allows these types will be used on those backends.

> In its default mode of use, the Uuid datatype expects Python uuid objects, from the Python uuid module

From the docs for the uuid Python module: https://docs.python.org/3/library/uuid.html :

> class uuid.SafeUUID: Added in version 3.7.

> safe: The UUID was generated by the platform in a multiprocessing-safe way

And there's not yet a uuid.uuid7() in the uuid Python module.

UUIDv7 leaks timing information ( https://news.ycombinator.com/item?id=40886496 ); which is ironic because uuids are usually used to avoid the "guess an autoincrement integer key" issue

avg_dev · 2024-07-06T02:13:05 1720231985

Just noting, the commenter you replied to said:

> use “bigint generated always as identity” instead of bigserial.

The commenter you are replying to was not saying anything about whether to use UUIDs or not; they just said "if you are going to use bigserial, you should use bigint generated always as identity instead".

westurner · 2024-07-06T15:10:05 1720278605

The question is the same; why would you use bigint instead of the native UUID type?

Why does OT compare text and UUID instead of char(32) and UUID?

What advantage would there be for database abstraction libraries like SQLalchemy and Django to implement the UUID type with bigint or bigserial instead of the native pg UUID type?

Merad · 2024-07-06T16:13:06 1720282386

Best practice in Postgres is to use always use the text data type and combine it with check constraints when you need an exact length or max length.

See: https://wiki.postgresql.org/wiki/Don't_Do_This#Text_storage

Also, I think you're misunderstanding the article. They aren't talking about storing a uuid in a bigint. They're talking about have two different id's. An incrementing bigint is used internally within the db for PK and FK's. A separate uuid is used as an external identifier that's exposed by your API.

westurner · 2024-07-06T19:29:38 1720294178

What needs to be stored as text if there is a native uuid type?

Chapter 8. Data Types > Table 8.2. Numeric Types: https://www.postgresql.org/docs/current/datatype-numeric.htm... :

> bigint: -9223372036854775808 to +9223372036854775807

> bigserial: 1 to 9223372036854775807

2*63 == 9223372036854775807

Todo UUID /? postgres bigint UUID: https://www.google.com/search?q=postgres+bigint+uuid :

- UUIDs are 128 bits, and they're unsigned, so: 2*127

- "UUID vs Bigint Battle!!! | Scaling Postgres 302" https://www.scalingpostgres.com/episodes/302-uuid-vs-bigint-...

"Reddit's photo albums broke due to Integer overflow of Signed Int32" https://news.ycombinator.com/item?id=33976355#33977924 re: IPv6 addresses having 64+64=128 bits

FWIW networkx has an in-memory Graph.relabel_nodes() method that assigns ints to unique node names in order to reduce RAM utilization for graph algorithms: https://networkx.org/documentation/stable/reference/generate...

Merad · 2024-07-06T20:46:49 1720298809

Many people store UUID's as text in the database. Needles to say, this is bad. TFA starts by proposing that it's bad, then does some tests to show why.

I'm not quite sure what all the links have to do with the topic at hand.

westurner · 2024-07-07T20:02:46 1720382566

Which link are you concerned about the topicality of, in specific?

Shouldn't we then link to the docs on how many bits wide db datatypes are, whether a datatype is prefix or suffix searchable, whether there's data leakage in UUID namespacing with primary NIC MAC address and UUIDv7, and whether there will be overflow with a datatype less wasteful than the text datatype for uuids when there is already a UUID datatype for uuids that one could argue to improve if there is a potential performance benefit

brycelarkin · 2024-06-12T18:47:46 1718218066

Really cool!

When should you use pgvector vs pgvector scale?

Is there any discussion about getting this added as a supported AWS RDS extensions?

akulkarni · 2024-06-16T13:10:25 1718543425

pgvectorscale only makes pgvector better. The primary developer-facing improvement is the introduction of the StreamingDiskANN index type.

So we would recommend using both from the start. There is no cost (technical or financial) for doing so.

There is discussion about getting this added to AWS RDS (as well as other PostgreSQL providers), but too early to share anything.

brycelarkin · 2024-05-18T22:10:11 1716070211

Another model I’ve seen is a combination of subscription and usage where there’s a monthly base and then additional charge for usage overages. Example is Sendgrid.