brycelarkin's comments

Who’s the target market for this?

Tagging documents isn’t really a big problem. Companies typically have only a handful of contract templates, which the admin tags manually.


Nice! This was one of the missing parts of the Vercel AI SDK.


Yeah this is what I wanted the AI SDK to do but didn't have it and surprisingly no one built anything for it yet.


They increased prices, but looks like they finally are revamping the service. This is probably the biggest update in years.


I can see Matt’s point of view. Data transfer fees are expensive, especially at WordPress scale. Automattic probably covers a lot of that cost that wordpress.org is incurring and wants WP Engine to pay their fair share.

WP Engine also seems to do some other “not in good faith” things, such as changing the WooCommerce Stripe attribution from wordpress.org to their own Stripe account.

While the legal dispute is on trademark, I think it’s really on WP Engine profiting on wordpress.org without giving back. It’s not illegal, but blacklisting WP Engine isn’t illegal either.

Automattic is essentially subsidizing a private equity backed company. I’d be upset and frustrated too if I was in Matt’s position.

If you support WP Engine, you’re supporting Silver Lake Private Equity.


If Matt had wanted to, for example, tell WPEngine they're on their own for hosting these services with a cutoff date at least 4 weeks in the future, then fair play. Or to demand to bill them some reasonable cost share of the price to operate (and let's not play games, a reasonable cost for the humans involved, which likely far exceeds servers or bandwidth), then still fair play.

To pitch a tantrum, cut them off with no notice, then gloat about it online... oof. I can't understand why anyone would want to be in business with someone like that.


This position rests pretty heavily on the idea that Wordpress.com is subsidizing wordpress.org, which is a charitable foundation that accepts donations. Do you have any specific reason to believe this is true? I don’t recall seeing that complaint directly here but maybe I missed it.


A quick look at the Automattic site (https://automattic.com/about/) says they dedicate 5% of company time to WP open source.

Their GitHub has over 1000 repos, likely all WP related. https://github.com/Automattic

I don’t doubt Automattic is putting their fair share into the WP community.


When I say subsidizing I mean financially subsidizing (i.e. paying for wordpress.org bandwidth costs) - that's the only way the comment makes sense, and I am not inclined to believe it to be true without evidence. As the other response notes though wordpress.org isn't even the foundation apparently, so is wordpress.com subsidizing wordpress.org? Have they asked WPEngine to cover their costs?


> Wordpress.com is subsidizing wordpress.org, which is a charitable foundation that accepts donations

Nope. There goes Matt, muddying the waters. Contrary to what many people (myself included) believed, wp.org is very explicitly, in his words, NOT the Foundation, just a benevolent gift of his to the community. You could be excused for thinking otherwise, though, given all the links to donate to the Foundation and the accompanying text, or the fact that it lives on the Foundation's ASN.


Those last 30 seconds of the video, trying to get the demo to work for a group of people, are so relatable.


This article just looks like an ad for Bloomberg’s docuseries.


Separate GitHub profiles for work and personal.


Is it normal for people to have separate profiles like that on GitHub?


I certainly do - my work GitHub profile is attached to my work email address whereas my personal GitHub is attached to my personal email address.

it's a complete separation of responsibilities in the same way that I have a work laptop and a personal laptop.


Not normal enough. Would you do all your personal emailing on your work email? I don't know why so many people have a blind spot when it comes to GitHub

At $JOB, it's mandated to have a separate GitHub account


Yes. My work GitHub account is via SSO using my work creds. My personal GitHub account is much older and uses my personal email address.

Of course, we can’t (or shouldn’t) be uploading any of our code to GitHub for work. We have self-hosted Bitbucket servers. The corporate GitHub accounts are strictly used for Copilot.

If/when I leave the company, I’m pretty sure they don’t want all the code tied to my personal account. That sounds like a nightmare. I don’t want that either.


Yes, of course. In fact it is a pretty bad idea to use the same profile for an employer. Always treat an employer as disposable, along with the respective GitHub username for it.


I would say most devs use the same GitHub profile for work and personal projects. GitHub provides a pretty easy way to join various organizations with the same handle. It almost seems like an exception when a developer creates a new profile for work.


The practice can vary significantly by the employer. Some employers prefer the usernames be new and unique to the employer, whereas others don't care. It's obvious that it's a major liability for the employee to reuse the username for personal and professional organizations.


Yeah, exactly. Personally I always thought that was a better policy (to have separate profiles) but I've actually had senior devs in the org tell me to just use my personal one and add my work email to it, so that's what I did. Seemed strange to me but I didn't question it too much. Maybe I should've, lol.


Thanks all. You've convinced me. I'll set up a new profile for the next time I need to work with a company.


For postgres, you want to use “bigint generated always as identity” instead of bigserial.


I agree and apologise for writing bigserial out of ossified habit. Identity columns hew to the SQL standard, bigserial is a pg-ism. I actually do still use it when the ORM’s DDL generator prefers as much (looking at you Rails), there’s no gain from fighting the middleware.

For other readers: the two are almost the same in behaviour, they differ primarily in that identity columns don’t need a USAGE grant on their associated sequence, and the ALWAYS makes it harder (but despite the term not actually impossible) to override the default value on row insert.
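To make the difference concrete, here is a minimal sketch of the two column definitions plus the override clause mentioned above. The table names are made up for illustration, and the SQL is PostgreSQL syntax held in Python strings; nothing here is executed against a database.

```python
# Hypothetical table names; the SQL below is PostgreSQL syntax and is only
# stored in strings here, not executed.

# The older, Postgres-specific spelling.
BIGSERIAL_DDL = """
CREATE TABLE orders_serial (
    id bigserial PRIMARY KEY
);
"""

# The SQL-standard identity column.
IDENTITY_DDL = """
CREATE TABLE orders_identity (
    id bigint GENERATED ALWAYS AS IDENTITY PRIMARY KEY
);
"""

# With GENERATED ALWAYS, an INSERT that supplies its own id is rejected
# unless it carries this explicit override clause.
OVERRIDE_INSERT = """
INSERT INTO orders_identity (id)
OVERRIDING SYSTEM VALUE
VALUES (42);
"""


def statements() -> list[str]:
    """Return the three statements, trimmed, in the order shown above."""
    return [s.strip() for s in (BIGSERIAL_DDL, IDENTITY_DDL, OVERRIDE_INSERT)]
```

The strings could be passed to whatever driver you already use; the point is only how the identity form and its override clause are written.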


Why?


Why “bigint generated always as identity” or bigserial, rather than Postgres' uuid data type?

Postgres' UUID datatype: https://www.postgresql.org/docs/current/datatype-uuid.html#D...

django.db.models.fields.UUIDField: https://docs.djangoproject.com/en/5.0/ref/models/fields/#uui... :

> class UUIDField: A field for storing universally unique identifiers. Uses Python’s UUID class. When used on PostgreSQL and MariaDB 10.7+, this stores in a uuid datatype, otherwise in a char(32)

> [...] Lookups on PostgreSQL and MariaDB 10.7+: Using iexact, contains, icontains, startswith, istartswith, endswith, or iendswith lookups on PostgreSQL don’t work for values without hyphens, because PostgreSQL and MariaDB 10.7+ store them in a hyphenated uuid datatype type.

From the sqlalchemy.types.Uuid docs: https://docs.sqlalchemy.org/en/20/core/type_basics.html#sqla... :

> Represent a database agnostic UUID datatype.

> For backends that have no “native” UUID datatype, the value will make use of CHAR(32) and store the UUID as a 32-character alphanumeric hex string.

> For backends which are known to support UUID directly or a similar uuid-storing datatype such as SQL Server’s UNIQUEIDENTIFIER, a “native” mode enabled by default allows these types will be used on those backends.

> In its default mode of use, the Uuid datatype expects Python uuid objects, from the Python uuid module

From the docs for the uuid Python module: https://docs.python.org/3/library/uuid.html :

> class uuid.SafeUUID: Added in version 3.7.

> safe: The UUID was generated by the platform in a multiprocessing-safe way

And there's not yet a uuid.uuid7() in the uuid Python module.

UUIDv7 leaks timing information ( https://news.ycombinator.com/item?id=40886496 ); which is ironic because uuids are usually used to avoid the "guess an autoincrement integer key" issue
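A rough sketch of what "leaks timing information" means in practice: the top 48 bits of a UUIDv7 are a Unix timestamp in milliseconds, so anyone holding the id can read the creation time back out. The helper names below are made up, and the UUID is assembled by hand following the RFC 9562 layout rather than by any library call.

```python
import secrets
import uuid
from datetime import datetime, timezone


def make_uuid7(unix_ms: int) -> uuid.UUID:
    """Hand-build a UUIDv7-shaped value (illustrative helper, not a stdlib API).

    Layout: 48-bit timestamp | 4-bit version | 12 random bits |
            2-bit variant | 62 random bits.
    """
    value = (unix_ms & ((1 << 48) - 1)) << 80  # millisecond timestamp on top
    value |= 0x7 << 76                         # version nibble = 7
    value |= secrets.randbits(12) << 64        # rand_a
    value |= 0b10 << 62                        # RFC variant bits
    value |= secrets.randbits(62)              # rand_b
    return uuid.UUID(int=value)


def uuid7_timestamp(u: uuid.UUID) -> datetime:
    """Recover the creation time embedded in a UUIDv7."""
    unix_ms = u.int >> 80
    return datetime.fromtimestamp(unix_ms / 1000, tz=timezone.utc)


if __name__ == "__main__":
    u = make_uuid7(1_700_000_000_000)
    print(u, "was created at", uuid7_timestamp(u))
```

Whether that matters depends on whether the creation time of a row is something you mind exposing to whoever sees the id.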


Just noting, the commenter you replied to said:

> use “bigint generated always as identity” instead of bigserial.

The commenter you are replying to was not saying anything about whether to use UUIDs or not; they just said "if you are going to use bigserial, you should use bigint generated always as identity instead".


The question is the same; why would you use bigint instead of the native UUID type?

Why does OT compare text and UUID instead of char(32) and UUID?

What advantage would there be for database abstraction libraries like SQLalchemy and Django to implement the UUID type with bigint or bigserial instead of the native pg UUID type?


Best practice in Postgres is to always use the text data type and combine it with check constraints when you need an exact length or max length.

See: https://wiki.postgresql.org/wiki/Don't_Do_This#Text_storage

Also, I think you're misunderstanding the article. They aren't talking about storing a uuid in a bigint. They're talking about having two different IDs. An incrementing bigint is used internally within the db for PKs and FKs. A separate uuid is used as an external identifier that's exposed by your API.
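Here is a small sketch of that two-ID pattern. It uses Python's built-in sqlite3 purely as a stand-in so it runs anywhere; in Postgres the internal key would be a bigint identity column and the external one a native uuid column. Table and function names are made up for illustration.

```python
import sqlite3
import uuid

# In-memory SQLite as a stand-in for Postgres (assumption for runnability).
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
CREATE TABLE customers (
    id        INTEGER PRIMARY KEY,   -- internal key, used for joins and FKs
    public_id BLOB NOT NULL UNIQUE,  -- external id, 16 raw UUID bytes
    name      TEXT NOT NULL
);
CREATE TABLE orders (
    id          INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customers(id)
);
""")


def create_customer(name: str) -> uuid.UUID:
    """Insert a customer and return only the external identifier."""
    public_id = uuid.uuid4()
    conn.execute(
        "INSERT INTO customers (public_id, name) VALUES (?, ?)",
        (public_id.bytes, name),
    )
    return public_id


def add_order(customer_public_id: uuid.UUID) -> int:
    """Resolve the external id to the internal key once, then use the integer FK."""
    row = conn.execute(
        "SELECT id FROM customers WHERE public_id = ?",
        (customer_public_id.bytes,),
    ).fetchone()
    if row is None:
        raise LookupError("unknown customer")
    cur = conn.execute("INSERT INTO orders (customer_id) VALUES (?)", (row[0],))
    return cur.lastrowid
```

The API only ever sees the uuid; the incrementing integer never leaves the database.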


What needs to be stored as text if there is a native uuid type?

Chapter 8. Data Types > Table 8.2. Numeric Types: https://www.postgresql.org/docs/current/datatype-numeric.htm... :

> bigint: -9223372036854775808 to +9223372036854775807

> bigserial: 1 to 9223372036854775807

2**63 - 1 == 9223372036854775807

Todo UUID /? postgres bigint UUID: https://www.google.com/search?q=postgres+bigint+uuid :

- UUIDs are 128 bits, and they're unsigned, so the range is 0 to 2**128 - 1

- "UUID vs Bigint Battle!!! | Scaling Postgres 302" https://www.scalingpostgres.com/episodes/302-uuid-vs-bigint-...
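The ranges above are easy to check in Python, whose integers are arbitrary precision. This is just the arithmetic; the constant and function names are made up for the example.

```python
import uuid

# Signed 64-bit range, matching the bigint row quoted from the Postgres docs.
BIGINT_MIN = -(2**63)
BIGINT_MAX = 2**63 - 1

# A UUID is an unsigned 128-bit value.
UUID_MAX = 2**128 - 1


def fits_in_bigint(n: int) -> bool:
    """True if n can be stored in a signed 64-bit column."""
    return BIGINT_MIN <= n <= BIGINT_MAX


if __name__ == "__main__":
    print(BIGINT_MAX)                # the bigint upper bound
    print(uuid.UUID(int=UUID_MAX))   # the largest UUID, all f's
    print(fits_in_bigint(UUID_MAX))  # a full UUID does not fit in a bigint
```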

"Reddit's photo albums broke due to Integer overflow of Signed Int32" https://news.ycombinator.com/item?id=33976355#33977924 re: IPv6 addresses having 64+64=128 bits

FWIW networkx has an in-memory Graph.relabel_nodes() method that assigns ints to unique node names in order to reduce RAM utilization for graph algorithms: https://networkx.org/documentation/stable/reference/generate...


Many people store UUIDs as text in the database. Needless to say, this is bad. TFA starts by proposing that it's bad, then does some tests to show why.

I'm not quite sure what all the links have to do with the topic at hand.


Which link are you concerned about the topicality of, in specific?

Shouldn't we then link to the docs on how many bits wide db datatypes are, whether a datatype is prefix or suffix searchable, whether there's data leakage in UUID namespacing with primary NIC MAC address and UUIDv7, and whether there will be overflow with a datatype less wasteful than the text datatype for uuids, when there is already a UUID datatype for uuids that one could argue to improve if there is a potential performance benefit?


Really cool!

When should you use pgvector vs pgvectorscale?

Is there any discussion about getting this added as a supported AWS RDS extension?


pgvectorscale only makes pgvector better. The primary developer-facing improvement is the introduction of the StreamingDiskANN index type.

So we would recommend using both from the start. There is no cost (technical or financial) for doing so.

There is discussion about getting this added to AWS RDS (as well as other PostgreSQL providers), but it's too early to share anything.


Another model I’ve seen is a combination of subscription and usage, where there’s a monthly base fee and then an additional charge for usage overages. SendGrid is an example.
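As a rough sketch of how that kind of bill is computed (the function name and every number in the usage note are invented for illustration and are not any vendor's actual pricing):

```python
from decimal import Decimal


def monthly_charge(
    base_fee: Decimal,
    included_units: int,
    overage_rate: Decimal,
    units_used: int,
) -> Decimal:
    """Base subscription plus a per-unit charge for usage beyond the included quota."""
    overage_units = max(0, units_used - included_units)
    return base_fee + overage_rate * overage_units
```

For example, with a hypothetical 20 base fee, 50,000 included units, and a 0.001 overage rate, using 62,000 units costs 20 plus 12,000 times 0.001, which is 32; staying under the quota costs just the base 20.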

