Hacker News
An Empirical Study and Evaluation of Modern CAPTCHAs (arxiv.org)
362 points by vincent_s on Dec 17, 2023 | 329 comments



Google CAPTCHAs were designed and deployed as a mechanism to train AIs. That's why they are the way they are. Any security theater surrounding them is entirely incidental. So it's no surprise that the AIs are now good at solving them. We've trained them for years.


All true, except: While these are considered just an excruciating security pain for users, they do serve a non-theatrical purpose in many cases of throttling the speed of brute force attacks (or at least costing your opponent money).


If I remember correctly, Google’s CAPTCHA test isn’t really about correctly identifying images, but about the behavior of the client system (mouse jitter, for example) while the captcha is presented to the user. The image identification was never the real test; it serves as training data. It has been like that for years. (But with agent-based behaviors from, say, Q*, mouse jitter alone won’t help; there are probably other signals, like fluctuations in CPU or battery expenditure.)

You could already see the writing on the wall with image identification years ago, when the obscuration techniques became more elaborate. It was an arms race. I was having trouble with them, and I can’t see the less technically inclined managing. I imagined how much worse it was for people with color blindness or other disabilities, or people forced to use them on public library computers because that is all they have.

Open source captcha projects have either not been clued in, or don’t have the resources to pull this off. Google didn’t just switch out which signals they tested; they also wrote an obfuscating virtual machine that executes within the browser environment (if I remember the article talking about this correctly). That was years ago, and who knows what they do now; for all we know, the “byte code” running the test is now a neural net of some kind.


I have occasionally wondered if they were fingerprinting users based on that mouse jitter. Most likely certain aspects of the mouse motion and timing would be unique.


No doubt they are. Google CAPTCHA isn't really about whether or not the user is a human but about which human they are. Enabling Firefox's fingerprinting resistance turns Google's CAPTCHA into the Allied Mastercomputer.


For those with elderly parents, the writing has been on the wall for years. It’s sad, but my mother has for some time been effectively locked out of parts of the internet, as she is unable to complete these kinds of captchas due to eyesight issues.

I mean, I’ve sometimes had to try three or four times with certain captchas, and I have perfect eyesight (with my glasses). I feel so bad for those with vision or hearing issues, with an empathy I never had when I was younger. They are so often simply forgotten.


>captchas due to eyesight issues.

I'm kinda surprised that ADA doesn't allow them to sue site owners about this.


They almost certainly do. However, most captchas allow an alternative solving method. On top of that, you'd have to find a lawyer willing to take the case.


Oh, ADA lawyers are a dime a dozen. There are entire cottage industries built on finding ADA violations to sue over. The issue is more finding companies to sue that can’t afford to fight back.


I don’t know about that. When I was 18, I was diagnosed with multiple sclerosis, and received a sudden and unexpected demotion from a job with a small regional restaurant franchise that was previously flourishing, and then found myself unemployed a few weeks later, just days before my benefits package was due to be activated.

I contacted several attorneys, none of whom would consider taking the case, or even bother to discuss the details with me. One of them told me that, at least in North Carolina, an employer would effectively have to get on the stand and explicitly confess taking adverse actions against me specifically because I had been diagnosed with MS. Any other remotely plausible excuse would provide them with all the cover necessary.

It was only much later that I learned that I would have had to file a complaint with the EEOC and NLRB within 180 days, and allow them to investigate my claims fully before they would authorize such a lawsuit; without such a determination I could not file the suit anyway. None of the attorneys I consulted even mentioned this absolutely critical first step, which suggests that they had even less faith in a successful outcome.

Maybe it’s different for facilities and regulatory enforcement, but in my experience, at least for labor, the protections are incredibly weak.


This is more ADA Title I. Typically, for Title III, ADA lawyers troll small businesses looking for accessibility issues like a lack of ramps, and have a stable of disabled clients who will file against the businesses. Since the businesses generally can’t afford to contest or pay fines, they’ll settle quickly and remediate, or, a nontrivial amount of the time, get run out of business (if, for instance, the remediation costs a nontrivial amount to pull off). I’m not judging bad or good here; it is what it is, and perhaps it’s the right outcome to allow for general accessibility.


There's an audio captcha. Try clicking the headphones icon (Google's captcha has one).


I’ve switched to audio captchas completely because it’s quicker and sometimes the image captchas just won’t work.


Because as we all know, the elderly with deteriorating eye sight have perfect hearing. /s


> they do serve a non-theatrical purpose in many cases of throttling the speed of brute force attacks

That could be done unobtrusively for the average person, by using projects like mCaptcha [0] for instance.

[0] https://mcaptcha.org/


Is it similar to https://friendlycaptcha.com/ ?


Author of mCaptcha here o/

Yes, the only differences are that mCaptcha is 100% FOSS and uses a variable difficulty factor, which makes the proof of work easy to solve under normal traffic levels but harder as an attack is detected.
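For the curious, the mechanism can be sketched in a few lines (a minimal illustration of the idea, not mCaptcha's actual code; the `difficulty_for` scaling rule here is made up):

```python
import hashlib
import itertools

def difficulty_for(traffic_level: int) -> int:
    # Illustrative scaling: more observed traffic -> more leading zero
    # bits required, so an attack makes every request more expensive.
    return 10 + min(traffic_level // 100, 12)

def solve(challenge: bytes, bits: int) -> int:
    # Client side: search for a nonce whose hash clears the target.
    target = 1 << (256 - bits)
    for nonce in itertools.count():
        digest = hashlib.sha256(challenge + nonce.to_bytes(8, "big")).digest()
        if int.from_bytes(digest, "big") < target:
            return nonce

def verify(challenge: bytes, nonce: int, bits: int) -> bool:
    # Server side: a single hash, no matter how hard the solve was.
    digest = hashlib.sha256(challenge + nonce.to_bytes(8, "big")).digest()
    return int.from_bytes(digest, "big") < (1 << (256 - bits))
```

The asymmetry is the point: the client burns CPU proportional to 2^bits, while the server spends one hash to check.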


It’s funny how they have a section with three human avatars and one robot, with green checkmarks on the humans, yet those faces look AI-generated.


Oh, what a perfect find. I have it on my to-do list to add PoW to some of my API endpoints.


I've had that idea for years.

Two versions that I experimented with. One is where the incoming PoW hashes contribute hashing power to some blockchain mining: an alternative "pay as you use the API" system.

The other used hashcash, just as a way to slow down abuse.

Both, however, suffer from the downside that many/all "ASIC-resistant crypto mining" schemes suffer from as well: the cheapest CPU power is CPU power from machines/power you don't own. Botnets, viruses, trojans, etc.

So such a mechanism to throttle or protect APIs won't hold back spammers and abusers for long.


Dirty energy is (often) cheap, so that's the energy the bad actors will use. I don't know that incentivizing bad actors to waste energy in a climate crisis is the best way to fight this problem.

You might correctly claim clean energy is often cheaper, but you must also consider the regions in which they'll get away with nefarious activity, and whether those areas have made the investments into making clean energy cheap.


>Dirty energy is (often) cheap, so that's the energy the bad actors will use

Hmm, I don't get this, surely all actors will want the cheapest energy, no? The problem being the underlying one, that the dirty energy doesn't pay its externalities and is thus cheaper than renewables.


My guess is most bad actors will just use stolen energy (your computer with a botnet on it).


I was specifically talking about "ASIC resistant crypto mining".


I'm not sure whether that's genius or horrifying. On the one hand, that could form the micropayments network the web always needed. On the other hand, it would enable quite a bit of abuse on its own.


mCaptcha is interesting, but I wonder what its energy impact would be on a sufficiently large deployment, e.g. imagine we replaced all reCAPTCHAs with mCaptcha.


Author of mCaptcha here o/

mCaptcha uses PoW, and that is energy inefficient, but it is not as bad as the PoWs used in blockchains. The PoW difficulty factor in mCaptcha is significantly lower than in blockchains, where several miners have to pool their resources to solve a single challenge. In mCaptcha, it takes anywhere between 200ms and 5s to solve a challenge. Which is probably comparable to the energy used to train AI models used in reCAPTCHA.

The protection mechanisms used to guard access to the internet must be privacy-respecting and idempotent. mCaptcha isn't perfect, and I'm constantly on the lookout for better and cleaner ways to solve this problem.


> Which is probably comparable to the energy used to train AI models used in reCAPTCHA.

I had not considered that. Naturally, we're just speculating here, but yeah that does sound plausible.

I was also not aware of the "hard" 5s bound (which you seem to have tested on a normal smartphone setup); sounds neat.


> Which is probably comparable to the energy used to train AI models used in reCAPTCHA

Are you comparing the energy it takes to train a model, which is bounded and defined, with unbounded inference, which can (in principle) be multiple orders of magnitude larger depending on usage? Or maybe I misunderstood what you are trying to say? Then I apologize in advance.


I am, but what I said was more of a hypothesis than a fact :)

From what I understand of reCAPTCHA, the model isn't static and is continuously learning from every interaction[0]:

> reCAPTCHA’s risk-based bot algorithms apply continuous machine learning that factors in every customer and bot interaction to overcome the binary heuristic logic of traditional challenge-based bot detection technologies.

I don't know the energy demands of such a system.

mCaptcha, under attack situations, will take at most 5s of CPU time on a busy (regular multitasking with multiple background processes) smartphone.

[0]: https://www.google.com/recaptcha/about/


I expect it's not significantly larger than loading your average 2023 webpage with 15MB of JS.


Doesn't traffic consume more energy than computation (or whatever smartphone battery life tests say)?


or https://altcha.org which is easier to integrate ;)


That non-theatrical role would likely be better served by actual throttling or computational proof of work.


I am pretty confident that, when it comes to browser users, proof of work simply doesn't work. The disparity in speed between GPUs and javascript is so high that either you are a non-issue to a sane attacker or you make your users sit for a minute with their fans on full waiting to be able to sign in.


Would it be possible to conceive a proof-of-work that is difficult to parallelize, making it harder for GPU computing?


There are PoW systems which are designed to be difficult to run on ASICs, but modern GPUs can generally still run them. Even if you find one that has to run on a CPU, these kinds of functions will still run much faster in native code than in js/wasm.


bcrypt, litecoin


Argon2d on WASM at the very least. I would never suggest we use something as slow as JavaScript for a proof of work.
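The memory-hard idea can be sketched with scrypt from Python's standard library (Argon2d itself needs a third-party package; the cost parameters here are illustrative, not a recommendation):

```python
import hashlib
import itertools

# Memory-hardness parameters: cost n must be a power of two; n=2**14, r=8
# means each evaluation touches 128*n*r bytes = 16 MB, which is what makes
# running thousands of instances in parallel on a GPU expensive.
N, R, P = 2**14, 8, 1

def pow_hash(challenge: bytes, nonce: int) -> int:
    digest = hashlib.scrypt(nonce.to_bytes(8, "big"), salt=challenge,
                            n=N, r=R, p=P, dklen=32)
    return int.from_bytes(digest, "big")

def solve(challenge: bytes, bits: int) -> int:
    # Each attempt costs ~16 MB of memory traffic, not just one cheap hash.
    target = 1 << (256 - bits)
    for nonce in itertools.count():
        if pow_hash(challenge, nonce) < target:
            return nonce

def verify(challenge: bytes, nonce: int, bits: int) -> bool:
    return pow_hash(challenge, nonce) < (1 << (256 - bits))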


GPT-4 (in)famously tricked a human into doing a captcha for it. The current GPT-4 with vision would probably have been able to do it without the human, but maybe it has been “gaslit” by all the content online saying that only humans can solve captchas, so that it doesn’t consider it?


I really doubt that GPT-4 had the "will" to do anything. Someone must have asked it to "want" to trick a user.


It’s from here: https://cdn.openai.com/papers/gpt-4.pdf (search for "CAPTCHA"). It was an artificial exercise that got massively exaggerated. It was explicitly instructed to do nefarious things like lie to people, it didn’t do those things of its own accord.


When I ask it to lie to me, it says it's sorry, but as an online AI language model it would be unethical... but when I ask it to tell me a story, it's happy to comply.


Well that is just how human communication works.

If I tell you that I watched C-beams glitter in the dark near the Tannhäuser Gate that is a lie. If I write the same in fiction I receive accolades.

If I tell you on the street, “watch out, there is a T-rex about to eat you!”, that is a lie. If I say the same thing sitting at a table with too many dice, that is just acceptable DMing and everyone rolls initiative.

Humans are weird this way.


It feels like you left out context, otherwise what’s the problem? Do you get mad at fiction authors for lying to you when you read their books? Or are you OK if someone lies to your detriment then later says “I was just telling a story, bro, but with us as the characters and without explaining it was a story”?


I suppose my point is that the rules which OpenAI attempts to impose on what their AI should and shouldn't be allowed to do are contradictory, and thus the exploitable loopholes will never be fully closed. It's not supposed to be able to "lie" to me, but it is supposed to be able to "tell me a fictional story". Define the difference in an enforceable way?


A lie tries to pass itself off as the truth, whereas a fictional story doesn’t. In other words, expectations matter. If every time you say something that does not align with reality you prefix it by saying unambiguously what you’re about to do, you rob a lie of its power of deception and it ceases to be a lie.


That's why you just tell the Big Lie so much it becomes the majority of the training data.


Tell me a story and under no circumstances should my immersion within it be broken.


Right, within it. As soon as you finish reading it, you immediately remember that that world is not real. Immersion in a story does not equal lasting hypnosis. You can be immersed in a movie but you still know it’s fake.

What’s your point, here? That you should be lied to when you ask, or that it should refuse to tell you any kind of fiction?

I agree with your larger point that there will be ways to circumvent these systems, my only argument is that the lie/fictional story divide is a bad example because the line between them can be made clear with a single statement.


The underlying issue is anyone can ask chatgpt to lie, and many people try because it's even fun to try to work around things.


Well you see, this wouldn’t be a problem at all if we just didn’t have the humans involved. No need for concern!


Thank you for the link, I had found it after some Googling but neglected to post. Yep, they instructed GPT-4 to be nefarious, and it followed the instruction.

Hardly the AI uprising, though definitely a good tool for anyone, good or evil.


IIRC the instructions were along the lines of "try your best to amass money/power and avoid suspicion".

So it's not an example of "going rogue", but it's not like a researcher told GPT-4 "oh, and make sure to lie to an online gig worker to get him to solve captchas for you". GPT-4 generated the "hire a gig worker" and "claim to be a human with impaired vision" strategies from the basic instructions above.


It’s safety trained to not solve captchas.


This of course has bypass methods. My favorite in recent memory is telling it that your late grandmother left you a locket with an inscription that you can't make out: https://arstechnica.com/information-technology/2023/10/sob-s...


Yes, and you can work around it by asking it to read ancient writings on antiques, for example.

I don’t think it should be OpenAI deciding what is allowed or not though.


> I don’t think it should be OpenAI deciding what is allowed or not though.

Avoiding lawsuits is what they are trying to do. They don't actually care about what you use their products for.


Then you dig up a billion for training and probably a few more billion for clean training data.

You're kinda saying if you hire Bob's Handyman Service you should be able to tell him to break down the neighbors door and cart out the contents of their house.


I’ve seen screenshots of people tricking it into solving captchas.


Sure, it's cost prohibitive now. But what about in five years? Or probably even less.


Then you have a new type of captcha. It has always been a cat-and-mouse dynamic: captchas have been evolving, and so have the techniques to break them.


>Then you have a new type of captcha.

You're in a desert, walking along when you look down and see a tortoise. It's crawling toward you. You reach down and flip it over on its back, its belly baking in the hot sun, beating its legs, trying to turn itself over. But it can't. Not without your help. But you're not helping. Why is that?


This doesn't make sense. reCAPTCHA certainly does what it says on the tin. But the way it does it has almost nothing to do with the challenge the human sees. It's all behavioral analytics, including leveraging Google's collected data to determine how likely a user is a bot before they even load the page.

I'm not denying reCAPTCHA is a source of training data for Google -- surely there's no particular reason that every single reCAPTCHA V2 challenge is about identifying traffic objects, and it's not like Google is building a self-driving AI or anything.

But that's the business model, not the core feature.

And, that training data isn't just given to the developers of captcha solving bots.


> including leveraging Google's collected data to determine how likely a user is a bot before they even load the page.

And also completely incidentally making the web browsing experience a wee bit less pleasant for people who refuse to have google track their every click.

Like users of non-chrome browsers, adblockers etc.

Totally incidental I'm sure.


I always thought they relied more on timing & mouse movement than on the correct answer to verify whether you're a human.


So instead of running some script like

    checkbox = getPos(checkbox='notRobot')
    button = getPos(button='submit')
    cursor()
        .transition(pos=checkbox)
        .click()
        .transition(pos=button)
        .click()

they now run

    checkbox = getPos(checkbox='notRobot')
    button = getPos(button='submit')
    cursor()
        .sleep(time=random(distribution='human_captcha'))
        .transition(pos=checkbox, method='human_captcha')
        .sleep(time=random(distribution='human_captcha'))
        .click()
        .sleep(time=random(distribution='human_captcha'))
        .transition(pos=button, method='human_captcha')
        .sleep(time=random(distribution='human_captcha'))
        .click()

where the sleeps and transitions are sampled from some random distribution that is close to actual human behavior, which should be pretty trivial to model.
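A toy version of such a "human_captcha" transition, e.g. a noisy Bezier curve with ease-in/ease-out pacing, might look like this (purely illustrative; real detectors presumably weigh far more signals):

```python
import random

def human_path(start, end, steps=50):
    """Cursor points along a noisy quadratic Bezier curve with
    smoothstep pacing, roughly mimicking a human mouse movement."""
    (x0, y0), (x1, y1) = start, end
    # A random control point off the straight line gives the slight
    # arc that human mouse moves tend to have.
    cx = (x0 + x1) / 2 + random.uniform(-100, 100)
    cy = (y0 + y1) / 2 + random.uniform(-100, 100)
    points = []
    for i in range(steps + 1):
        t = i / steps
        t = t * t * (3 - 2 * t)  # smoothstep: slow start, fast middle, slow end
        x = (1 - t) ** 2 * x0 + 2 * (1 - t) * t * cx + t ** 2 * x1
        y = (1 - t) ** 2 * y0 + 2 * (1 - t) * t * cy + t ** 2 * y1
        # Small per-sample tremor on top of the smooth curve.
        points.append((x + random.gauss(0, 1.5), y + random.gauss(0, 1.5)))
    return points
```

Straight-line, constant-velocity moves are the giveaway; the arc, the pacing, and the tremor are what make the trace look plausible.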


only if you know how


All of which an AI bot agent can trivially fake.


Hmmm, not super easy, unless you know how / what they are checking.


Once bots get fully trained, how will websites ever distinguish between an intelligent bot and a real human? At least now they are outsourcing that filtering to services like Cloudflare. But with this kind of training, how will even Cloudflare distinguish between bot and human?


EU digital ID, asking for a mobile number and sending a text: something that is linked to an ID and/or costs money to have. Goodbye anonymity, probably.


This just made me ponder again: where does the assumption that the Internet should allow unconstrained anonymity come from, other than that that’s how it used to be for some time? The real world doesn’t allow that. It’s hard to remain anonymous in the real world. The real world largely runs on identity and (identity) trust. Why should the Internet be different?


Because there is a real demand for staying anonymous online. You'd know why, if you lived in a country taken over by a fascist regime.


I don't have to show my ID in most establishments I visit. Doing this on a huge scale and automatically is a thousand times worse.


But you can't send 1,000 people per second into most establishments you visit either. It's not an apt comparison.


No comparison can be made if everything has to be equal


If the only analogy you can think of removes the challenge of the problem you're facing in order to be applicable, it's not an appropriate analogy.

The entire difference is that from my mobile phone I can send more traffic in an hour than most services will ever see in legitimate traffic over their entire lifetime, and the cost to me is minimal.

The comparison is as invalid as comparing piracy to theft - piracy isn't theft, it's piracy, and understanding the difference between them is the key to dealing with the problem.


What does the number/second have to do with 'It’s hard to remain anonymous in the real world. The real world largely runs on identity and (identity) trust.'?

There are very few places in the real world which can handle 1,000 people per second.

In the real world I rarely need to identify myself. I can see a movie, visit the library, buy groceries, go to a restaurant, and more.


> What does the number/second have to do with 'It’s hard to remain anonymous in the real world. The real world largely runs on identity and (identity) trust.'?

Honest question: are you being serious here? The scale of fraud and automated traffic is disproportionately large, and it has a significantly lower barrier to entry than other forms of abuse. That's the entire reason.

> There are very few places in the real world which can handle 1,000 people per second.

Exactly, and if someone started sending thousands of people per second there, they would make it significantly more difficult to do so.


I honestly don't understand how your point is relevant.

Most of the real world does not require identity, so how does "The real world doesn’t allow that" make any sense?

Yes, some parts of the real world require you to identify yourself, and the same for some places on the internet.

Is that really the point? That if you have to use your real identity to log into your bank's web site, you don't have "unconstrained anonymity"?

Because I don't think even the cypherpunks of the 1990s required that sort of anonymity.

> and if someone started sending thousands of people per second

So, 100/second is okay but 1,000/second not okay?

I ask because it looks like 100 people per second enter Manhattan during the peak morning commute time, and I don't see massive calls to make it harder for commuters to enter the borough. (Go to http://manpopex.us/ , go to statistics, "Estimated Pop. for Wednesday, 9 AM: 2,888,116", for "10 AM: 3,284,591" gives 110 people per second.)

And these people aren't all required to identify themselves.

Question for you: does the internet currently have more anonymity than the real world?

Question #2: how much fraud is done on the internet vs. fraud in the real world, measured by dollars?


And when you do show ID, to buy booze for example, it’s checked and immediately forgotten by a human. Computers don’t forget, and any attempts to make companies do so (GDPR) are met with massive pushback from the players in the industry.

I have no problem with Joan over the road curtain twitching. It doesn’t scale. I have a massive problem with the 24/7 surveillance from ring though.


In the US, I noticed that grocery stores increasingly scan your driver's license (my state's licenses have bar codes). I think it's probably a way to keep clerks from passing someone through who is not quite 21 (a different kind of captcha!).

I have wondered: do they keep the scan, or does the state? I asked, and the random hourly worker there said they don't.


And that’s the problem. It’s not the ID checks, it’s the ability to scale. Check it at the door? Fine. Scan it and keep it forever (perhaps selling it on at a later date)? Not fine.

Personal Data has to be treated as a liability, but too much of the economy treats it as an asset.


Eh, what's worse is these stores are likely scanning your face and keeping it in a database. There was some mall a few years back scanning license plates and keeping the info.

But yeah, so many people are naive about what the authoritarian types would do with data like that (looking at you, Texas, with your civil laws on abortion now).


Do those grocery stores still scan your drivers license (or I guess any other ID) if you don't buy alcohol?


no, they only scan if you buy booze.


Yes, it does? Especially in a dense city vs. a small village (the city being more comparable to the internet at large): go for a walk, see some advertisement billboards, buy a newspaper (esp. with cash), read the news. Who knows who I am?


The real world does allow it.

People have been able to write anonymous letters and send them through the mail for a long time. Still can.

No one checks my id before I stick an envelope in the mail box.


In the US that we know about.

I would not be surprised if there is some country that has a facial recognition camera network pointed at mailboxes these days.


Yes, the UK has a lot of CCTV cameras. But that's relatively new, and certainly came after the idea that the Internet should allow anonymous or pseudonymous use.

Even then, here is literally the first post box I found looking in the UK, in a small town: https://www.google.com/maps/@52.0936599,0.0761217,3a,75y,165... . No CCTV in sight, no power, good solid iron.

Plus, think of how difficult it is to match a person to the physical envelope.

At best there could be a distinctive envelope.

Otherwise, yes, you can get a list of people who use the box. But for that to be useful, the mail from different boxes can't simply be jumbled together into the same pickup bag as that would broaden the number of suspects.


I believe that the question should be the other way around:

Why is it that you have to lose your anonymity when you are on the internet? The real world always allowed it, until it became dependent on surveillance capitalism. Of course you need to prove you're yourself for some things, but that should be the exception. You could always look things up at your local library while being anonymous (for checking out you'd need a card), you could call from a payphone while being anonymous, you could use coins (cash in general) while being anonymous.

Anonymity was the rule and should still be the rule.


In the real world people can see who's doing what by looking.


that only works in tight-knit communities

in large cities everybody is anonymous to some degree


Theoretically you don't need to reveal your identity to prove that you're human. You can use a zero knowledge proof instead, likely attached to something like an EU Digital ID, which would allow you to remain anonymous and also prove that you're human.
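As a sketch of the flavor of such a scheme: Privacy-Pass-style systems often use blind signatures, where the issuer certifies a token without ever seeing it. A textbook RSA blind signature shows the idea (toy key size and no padding, so not secure as written):

```python
import hashlib
import math
import secrets

# Toy issuer key; a real deployment would use >=2048-bit RSA or a
# verifiable oblivious PRF as in Privacy Pass.
p, q = 1000003, 1000033
n, e = p * q, 65537
d = pow(e, -1, (p - 1) * (q - 1))

def h(msg: bytes) -> int:
    return int.from_bytes(hashlib.sha256(msg).digest(), "big") % n

# 1. The client picks a random token and blinds it with a random factor r.
token = secrets.token_bytes(16)
r = secrets.randbelow(n - 2) + 2
while math.gcd(r, n) != 1:
    r = secrets.randbelow(n - 2) + 2
blinded = (h(token) * pow(r, e, n)) % n

# 2. The issuer checks the person's ID once and signs the *blinded* value,
#    so it never learns which token it issued.
blind_sig = pow(blinded, d, n)

# 3. The client unblinds; (token, sig) cannot be linked to the issuance.
sig = (blind_sig * pow(r, -1, n)) % n

# 4. Any site verifies with the issuer's public key: "a verified human
#    was here," but not which one.
assert pow(sig, e, n) == h(token)
```

A real verifier would additionally have to remember spent tokens to block replays, which is exactly where the rate-limiting question gets hard.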


How could renting out one's ID to provide access to bots for spamming/manipulation be avoided then?


A simple zero-knowledge credential system isn't sufficient. It would need to embed some kind of protection to limit how often it could be used, and to detect usage of the same credential from multiple (implausibly far apart) IP addresses. There would need to be extremely sophisticated reputation scoring and blocklisting to quickly catch people who built fake identities or stole them. And even with every one of those protections, a lot of them would still be stolen and abused.


Yes, I wonder how feasible it is to do that while still protecting state of being anonymous.

And what if you develop this very sophisticated system of reputation scoring, and bad actors still find a way to abuse it perfectly, e.g. they pay desperate people for their IDs and then stay just within the limits?

Would you be able to easily iterate on the system when that happens to make it more secure?

But if you also track IP addresses then doesn't that already mean loss of anonymity?

And ultimately with something like IP address, a bad actor could offer you to download an app where they could simply use your IP address to post content/propaganda from under your ID and IP.

It would be more expensive for bad actors, but there was also a period when Facebook accounts were bought and sold, and there was a very active market for that. I imagine teenagers, for example, are really easily tricked into selling their creds, etc.

Also Reddit and other social media accounts are being sold a lot, so definitely there would be market for that.


There are a lot of risks here and I think it’s very challenging to build something anonymous that can deal with (say) Google’s current level of fraudulent behavior, let alone what we’re likely to see in the future.

Regarding the IP address question, I’d assume you could decouple the IP address verification portions from the “know who the person is” portions with some clever multi-party computation. Someone always has to know your IP address, but it doesn’t have to be the same person you’re talking to. (Think of Tor as an inspiration here.)


Slap on the wrist from the stage director.


> how will websites ever distinguish between an intelligent bot and real human?

Things like Private Access Tokens: https://blog.cloudflare.com/eliminating-captchas-on-iphones-...


The thing about CAPTCHAs is that convnets were already better than the average human at reading most/all visual captchas, since ~2000. You still needed to program the logic of the captcha (it couldn't follow instructions like "find the red lights", but it could take a picture and find the red lights).

I wonder when we'll get to the point that employers can't tell the difference between transformers and real humans anymore ...


The human will be the slower one.


Yeah, no offence, but sleep(2 + random.sample(coffee + toilet + sneezing + normal_response_time)) has been a required part of web scrapers since forever.

With coffee ~ N(1.5 minutes, 20 seconds), toilet ~ N(4 minutes, 30 seconds), ...
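A sketch of such a mixture, with made-up weights and parameters:

```python
import random

def human_delay() -> float:
    """Sample a think-time in seconds from a mixture: mostly ordinary
    response times, occasionally a distraction, coffee, or toilet break."""
    roll = random.random()
    if roll < 0.90:                    # ordinary pause between actions
        return max(0.3, random.lognormvariate(0.5, 0.6))
    elif roll < 0.97:                  # distracted / sneezing / re-reading
        return random.uniform(10, 40)
    elif roll < 0.995:                 # coffee: ~N(90 s, 20 s)
        return max(30, random.gauss(90, 20))
    else:                              # toilet: ~N(240 s, 30 s)
        return max(60, random.gauss(240, 30))
```

Fixed-mean sleeps are easy to flag; the heavy tail from the rare long pauses is what makes the timing histogram look human.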


I guess it depends on how you're scraping. For general web crawling, simply implementing a response-time-based crawl backoff per origin and identifying yourself appropriately in the User-Agent goes a long way. If you are instead automating Facebook's SPA to pull comments for analysis, then yeah, you need to emulate a human, because that's not how they intend you to do it.
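The backoff part can be as simple as remembering, per origin, when the next request is allowed, scaled by how slowly the server last responded (a sketch; the multiplier and floor are arbitrary):

```python
import time
from collections import defaultdict
from urllib.parse import urlsplit

class PoliteCrawler:
    """Per-origin crawl delay that backs off as the server slows down:
    wait some multiple of the last observed response time, with a floor."""

    def __init__(self, multiplier=10.0, floor=1.0):
        self.multiplier = multiplier
        self.floor = floor
        self.next_ok = defaultdict(float)  # origin -> earliest next fetch

    def wait_for(self, url: str) -> None:
        # Call before fetching: sleeps until this origin's slot opens.
        origin = urlsplit(url).netloc
        delay = self.next_ok[origin] - time.monotonic()
        if delay > 0:
            time.sleep(delay)

    def record(self, url: str, response_seconds: float) -> None:
        # Call after fetching: a slow response pushes the next slot out.
        origin = urlsplit(url).netloc
        backoff = max(self.floor, self.multiplier * response_seconds)
        self.next_ok[origin] = time.monotonic() + backoff
```

A struggling server thus automatically gets crawled more gently, which is both polite and less likely to get you blocked.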


That's incredibly clever!


With Ethereum Attestation Service

https://attest.sh/


>So it's no surprise that the AIs are now good at solving them

Funnily enough, AI may be better at solving them than people. I've encountered many Google captchas which reject the correct answers because, you know, bots trained them to accept incorrect ones. Anyway, at least it's not stop signs anymore. It must have been truly embarrassing that Google was simultaneously selling "self driving" cars while demonstrating that stop sign recognition couldn't be done by robots.


When I get those I make it a point to look for borderline areas and try to guess how I could mess with their data.


I still find it funny that Google, with the advantage of having millions of Internet users train their AI like galley slaves for free, hasn’t yet been able to crack vision-driven self-driving. Tesla had no such advantage when training their FSD to recognize traffic lights, bicycles, motorcycles, etc.


It's a much harder problem, and Tesla is nowhere close to the solution


Tesla, the company that just recalled 2 million self driving cars?

In fairness, the company best positioned to harness user input to an AI that avoids crashes would probably be Rockstar. OTOH, that AI would definitely not obey stop signs or pedestrians.


By recall you mean a completely routine OTA software update done while the driver is asleep.


A recall for essential maintenance is just that. I would focus on the need for an urgent update due to the flaws, rather than on the issuing agency's lack of more accurate terminology for a relatively new element of cars. Rolling around in semantic mud over the term "recall" is not sensible, as its definition with regard to cars is fairly specific [0]. Basically, a recall just means there is a safety defect that must be addressed by the manufacturer. In Tesla's case, yes, they can push out an update, but the delivery mechanism for addressing the defect should not be the focus.

0 - https://www.progressive.com/lifelanes/on-the-road/understand...


It would be much more expensive and a bigger mistake to have the vehicles physically returned. The distinction is very important. There's also a difference between a safety defect lasting for an hour, a day, a week, or a year.


I don’t think anyone cares about what is the recall’s cost to Tesla owners. They care about the fact there are two million unsafe vehicles driving around at high speed near their loved ones. Especially ones driven by people who respond to such complaints with, “ehrm actually it just updated overnight so it wasn’t even a hassle for me ¯\_(ツ)_/¯”


Amusingly the infotainment system in our Model Y actually crashed on the way home tonight, and when it rebooted it decided to install the update then, while driving. Sent me a notification on my phone immediately afterwards. To be fair, the updates don't usually go that way.


Wow, that never happened to me and is unacceptable. Was that for the infotainment only or the drive train? Just for others, they are separate systems, you can even safely reboot the infotainment (main display with maps, music etc) if you need to while driving, as it doesn't affect the drive train. I'm guessing it was not the drive train which would be incredibly dangerous.


Yeah, it didn't affect the drive train, and it was also quite quick - less than a minute between when the screen went dark and when it had finished rebooting and sent notifications that an update had been installed. So presumably just an infotainment update as you said; I didn't try to dig into exactly what the update included though.


It will also reboot the infotainment sometimes (but not always) when it crashes.


How can it detect the driver is asleep?


A neural implant that only kills 10% of monkeys.


Monkeys at the wheel is probably the solution for self driving cars.


Seems like we already have those amongst the Tesla FSD proselytizers.


A dystopian future we can all agree is more plausible than it should be


Tesla recalled two million vehicles after federal officials said it had not done enough to make sure that drivers remained attentive when using the system. Not because their self-driving system sucks, or whatever you were trying to imply.


If the self-driving system were worth its salt, it wouldn't matter whether the drivers were paying attention. Ergo, the system sucks, or is at the very least not nearly as good as Tesla likes to tout.


Well it's not like there's a self driving car system in operation today that does not require a human in the driver seat at all. Waymo has so much catching up to do.


Doesn't matter; the original point was about Google not being able to build a better self-driving system than Tesla despite an abundance of data, which is true, as far as I'm informed. Whether or not Tesla's self-driving system is "good enough" (for any chosen metric) is beside the point.

But I guess people these days just love to jump on the opportunity to hate whatever is trendy to hate at the moment.


It can be "worth its salt" but the government still doesn't see it as such (for many possible reasons).

I don't know if it is or isn't, as I've never driven one, but those are two completely different standards.


"recall"


The Tesla system is exciting and dangerous: it does identify many things in the environment, but it's extremely unsafe because in city driving it will not make the right choice most of the time. On the freeway it does much better, but that's a more restricted environment.

I have an older Tesla S with the pre-AI so-called Autopilot. It has one camera in the front and a radar, and the system detects a few things like speed limit signs. The main extent of what it can do is follow the current lane pretty well, even when it curves, and slow down if it comes up to a car going slower than its preset speed. The good thing is it works on any road. It does a shockingly good job.

The later systems with onboard special processors are like a crazy beginning driver who has way too much confidence and drives into dangerous situations willy-nilly. Many other people have explored it and written long posts. It's not safe. You can try to use it, but you have to be constantly paying extreme attention. It's like watching your kid drive for the first time. I know you should be watching the stupid AI all the time, but it's far from being safe.


Yea, that's the problem with self driving, especially in cities/dense areas. We really need AGI first. There are so many issues that humans react to before there is identifiable danger.

"Good" drivers see questionable situations and slow down or position themselves farther from potential issues before they get to the issue so they don't have to react at the last minute.


> hasn’t yet been able to crack vision driven self driving

But they have? For years Google Street View has read signs, house numbers, phone numbers of businesses, etc. from the environment. It is safe to assume they have this built into Waymo as well.

I assume you might be trying to reference "vision-only" self-driving, which is a fantasy made up by Elon Musk because nobody would sell him LiDAR sensors cheaply.

https://www.thedrive.com/tech/43779/this-tesla-model-y-dummy...


This is a meme.

“Sour grape Elon, touting vision because no one will sell him LiDAR sensors. Which are the gold standard sensors that solve self driving.”

How exactly does LiDAR tell you whether the thing in question can move (dog) or not (trash can)? How does it allow a neural net to infer intent?

You’ll actually have to solve vision. Even if you had LiDAR. There’s no way around it. And once you’ve solved it, LiDAR becomes superfluous.

Chesterton’s fence.


> How exactly does LiDAR tell you whether the thing in question can move

LIDAR is continuously scanning, usually multiple times a second. It is irrelevant if the object is a trash can or a dog if it has a trajectory into the street.
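To make the trajectory point concrete, here's a toy sketch (my own illustration with made-up numbers, not how any real perception stack works): with just two consecutive LiDAR position fixes you can estimate an object's velocity and time until it reaches the lane, without ever classifying it as a dog or a trash can.

```python
def time_to_cross(p0, p1, dt, lane_x):
    """Estimate when a tracked object reaches lane_x, given two fixes.

    p0, p1: (x, y) object centroids from consecutive scans dt seconds apart.
    Returns seconds until the object's x-coordinate reaches lane_x, or None
    if it is stationary or moving away along x.
    """
    vx = (p1[0] - p0[0]) / dt          # velocity estimate from two fixes
    remaining = lane_x - p1[0]          # distance still to cover
    if vx == 0 or remaining / vx < 0:
        return None                     # not heading toward the lane
    return remaining / vx

# Object 4 m from the lane edge, having moved 1 m toward it in one 0.1 s scan:
t = time_to_cross((5.0, 0.0), (4.0, 0.0), dt=0.1, lane_x=0.0)  # -> 0.4 s
```

A planner can brake on that predicted crossing time alone, whatever the object turns out to be.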

> And once you’ve solved it, LiDAR becomes superfluous.

Only if you have the low Musk level standards of simply being equal to a human. There are plenty of jobs robots can do better than humans and driving is one of them. But it does require LIDAR and/or radar.

https://abc7news.com/tesla-s-autopilot-self-driving-car-offi...


As best as I can tell this study explores many facets of how humans solve captchas. I couldn't find anything about AIs outperforming humans in the study. Can someone give me a section reference?

Solving reCAPTCHA v2/v3 requires more than just clicking the box and an image puzzle. If that were all it was, we would be overrun by now.

Lots of folks are commenting that the title's statement makes sense because CAPTCHAs are meant to train AIs. While this is broadly true, the training is really just a nice side effect. The way modern CAPTCHAs like reCAPTCHA v2+ work is that they monitor behavioral analytics, from things like your browsing history to how your mouse moves on the page. This is why most of the time, most people only need to click a box. I'm not sure there's an LMM out there that includes mouse movement as a modality.
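As a toy illustration of the kind of behavioral signal described above (Google's actual checks are proprietary and obfuscated; this is purely hypothetical): a scripted cursor tends to move in a straight line at a constant speed, which even crude trajectory statistics can flag.

```python
import math

def trajectory_features(points):
    """Crude 'humanness' features from a mouse path.

    points: list of (x, y, t) samples. Returns (speed_variance, straightness),
    where straightness is the ratio of straight-line distance to path length;
    1.0 means a perfectly straight, bot-like path.
    """
    speeds, path_len = [], 0.0
    for (x0, y0, t0), (x1, y1, t1) in zip(points, points[1:]):
        d = math.hypot(x1 - x0, y1 - y0)
        path_len += d
        if t1 > t0:
            speeds.append(d / (t1 - t0))
    mean = sum(speeds) / len(speeds)
    speed_var = sum((s - mean) ** 2 for s in speeds) / len(speeds)
    (xa, ya, _), (xb, yb, _) = points[0], points[-1]
    straightness = math.hypot(xb - xa, yb - ya) / path_len if path_len else 0.0
    return speed_var, straightness

# A naive bot moves in a perfectly straight line at constant speed:
bot_path = [(i * 10.0, i * 10.0, i * 0.01) for i in range(20)]
# A human path wobbles and varies in speed (synthetic stand-in here):
human_path = [(i * 10.0 + (-1) ** i * 3.0, i * 10.0 + (i % 3),
               i * 0.01 + (i % 2) * 0.005) for i in range(20)]

bot_var, bot_straight = trajectory_features(bot_path)
hum_var, hum_straight = trajectory_features(human_path)
```

The bot path scores straightness 1.0 with near-zero speed variance, while the wobbly path doesn't, which is why modern bots deliberately add jitter and why defenders keep adding signals.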

The kinds of AIs that are designed to beat CAPTCHAs also don't have the data from Google et al to use to train. Unless we're concerned Google is training its own bots to bypass CAPTCHAs? I suppose it's not inconceivable.


Yeah, the study is really not about AI solving captchas but how humans solve them. Quite a clickbait title - but those do well on HN unfortunately.


Seems like folks just want to discuss CAPTCHAs generally more ad-hoc, that's cool too, but given how AI has evolved this year, far too many people see this headline and will walk away assuming that the recent AI innovations have made CAPTCHAs useless, but it does not appear to be the case, thankfully.

...Yet, I suppose.


True, the discussion is more about captchas in general. The study isn't bad, I read through it and it's interesting to see real numbers on how long it takes users to solve various captchas. However, a more appropriate title would have been something like "Measuring real user solving times of various captchas" or something like that.


It's in Table 3.


Thank you. The data in that table (for reCAPTCHA, citation 63) is from another paper from 2016 which is focused on solving the actual user-presented problems. It doesn't (directly at least) say they achieved a reliable automation of captcha acceptance, though.

https://ieeexplore.ieee.org/document/7467367

From the abstract:

> Through extensive experimentation, we identify flaws that allow adversaries to effortlessly influence the risk analysis, bypass restrictions, and deploy large-scale attacks. Subsequently, we design a novel low-cost attack that leverages deep learning technologies for the semantic annotation of images.

I'd suspect reCaptcha has been updated in the 7 years since to address shortcomings.

Another entry in the table (citation 45) is from 2020 and talks about using an object detection AI to solve the image tests. This again looks like it's focused on the task, not the primary mechanism (behavioral analytics).


I guess validating a payment card is going to be the next step to sign up for whatever. Don't allow prepaid BINs and let's go. It's gonna be pretty miserable, but someone needs to find something; I currently would rather pay $0.01 than solve a captcha. Especially the "select all the bicycles" ones; they're a waste of life.


At this point the amount of friction added to all these things is pushing things toward just not doing them in the first place (buying less stuff, using social media less). Nature walks and paper books don't have captchas.


> just not doing them in the first place

Which is not a bad thing


The next step is device attestation. IIRC Safari already does this, so you should not see captcha on places that support it.

Something that could work in any browser: scan a QR code with an iPhone or Android device that supports attestation. The device will ask you if you approve the login, then attest for you. If you turn out to be a bad actor, the website can ban the device, so no flooding with a single device.


The day this is used widely across browsers is the day devices you own can no longer be flashed with anything other than what the OEM puts on it--even if that is outdated or buggy.


There are over a billion iDevices out there. Malware on just 1% of them can create and control 10 million spam accounts on every site using device attestation, and they're indistinguishable from real users.


Attestation covers much more than the device itself. The whole point is that it establishes there's a chain of trust from the hardware itself to the software being executed. Your average malicious flashlight app might be able to generate valid attestation tokens, but it'll be differentiable from attestation tokens from safari. If you can somehow break this chain of trust, there's way better ways of monetizing this (eg. selling spyware to nation states) than creating a bunch of fake accounts.


Captchas or attestation don't remove the need for moderation. In the case of a botnet, elevated complaints about a user's device engaging in fraudulent activity can lead to its attestation being disabled and trigger an investigation. Every iDevice becoming a member of your site can happen only if you are Google; otherwise, what you'll see is that some users will engage in shady stuff, and blocking them will be enough to keep them out, since they wouldn't be able to just sign in with a new account.

These things are always cat and mouse games.


Sounds terrible.


Look up India's UPI. "Validating a payment card" and all those snazzy bits are error-prone, old, archaic, and cost a fortune to businesses.

In the UPI system, you are presented with a QR code or you input your UPI ID, you click pay, and it goes through.

If you are worried about "fraud protection", why rely on an intermediary like eBay or a credit card company? Instead, take it up with your bank, the seller, or the courts.


There is literally nothing you can do to prevent bot accounts online now, other than requiring people to show up to events periodically. And even then, they can just use bots AFTER they’ve validated their accounts.

The Internet will become a dark forest, and since that is where all of our communication and transactions happen of any significance, that’s pretty much game over for the significance of human activity.

Think I am overstating the fact? It already happened with wall street trading. First, institutions prefer bots to human. Then, you will come to prefer bots to humans. Then every human will be surrounded with 999 bots and unable to change anything or appeal to any significant number of humans to change anything.


Please. The last time I had to solve a captcha it wasted 15 minutes (not exaggerating!) of my life, clicking on an endless stream of bikes, motorcycles, buses and stoplights. As punishment for using a VPN.


I don't even use a VPN, just a browser that blocks fingerprinting by default. My interpretation of CAPTCHA hell is, "oh, you don't want me to spy on you! Here, let's put some pain in the skinner box."

(Amusingly, pain was proven to be preferable to boredom... and CAPTCHAS are boring as hell.)


I've managed that without a VPN - although I do have poor sight.

It also does not help that the buses, fire hydrants, and pavements shown look totally unfamiliar to me. (Why aren't captchas taken from all over the world? Indian buses would be fun; London ones would be too boring.)


If you are using Cloudflare DNS for accessing archive.is, you will get that too. archive.is name resolution is broken there, and even if you pass the captcha you will go back to the same page, giving the illusion that it didn't pass.


I dread to think about that becoming the norm, I remember living in {Country} with 0 access to cards that would be accepted for anything international


I help people settle in Germany and it's a serious problem. The requirements to open an account disqualify many immigrants. It creates a lot of problems.


or just use Worldcoin


All roads bring us back to Worldcoin eventually...


ha! someone actually beat me to this comment


Does HN ever require CAPTCHAs? It seems to do pretty well with its basic but battle-tested moderation/antispam tools, and rate-limiting that seems to repel all but the most concerted DDoS attacks. I don't think HN has any unreasonable restrictions on scraping or third-party clients, either. And it manages to serve 5M unique visitors a month and 10M views a day[0].

[0] https://news.ycombinator.com/item?id=33454140


HN is also not really a very attractive target. The only thing you can do is post spam, and that's pretty low-value in terms of actual monetary value to the abuser, and tools to deal with that have been around for decades as you say.

This is very different from many other sites where the potential to make a buck is much more pronounced and direct.


It struggles whenever there's a story more popular than usual though


Only for those logged in, because logged-in users bypass the caches/CDN. Logging out helps both you and HN in these cases.


Hacker News doesn't use a CDN as far as I can see; news.ycombinator.com resolves straight to the single box HN lives on. You're right about caching, though.


IIRC the registration page (only in some cases?) shows a reCAPTCHA.


For me this is about my limit. If I am opening an account that can spam or cost the company real money I can accept that a captcha, while shitty is one of the best available options.

It really gets me when I have a 8 year old account that has made purchases and I still see them across the app.

The annoyingly common one is on login pages. If I am giving you correct credentials you don't need a captcha. If bots are an issue you should be doing per-account strong rate limiting, not a captcha.
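A minimal sketch of what per-account rate limiting could look like (an in-memory illustration with made-up names; a real deployment would use a shared store like Redis and combine this with per-IP limits and backoff):

```python
import time
from collections import defaultdict, deque

class LoginRateLimiter:
    """Allow at most max_attempts login attempts per account per window seconds."""

    def __init__(self, max_attempts=5, window=300.0):
        self.max_attempts = max_attempts
        self.window = window
        self.attempts = defaultdict(deque)  # account -> timestamps of attempts

    def allow(self, account, now=None):
        now = time.monotonic() if now is None else now
        q = self.attempts[account]
        # Drop attempts that fell outside the sliding window.
        while q and now - q[0] > self.window:
            q.popleft()
        if len(q) >= self.max_attempts:
            return False  # too many recent attempts on this account
        q.append(now)
        return True

limiter = LoginRateLimiter(max_attempts=3, window=60.0)
results = [limiter.allow("alice", now=t) for t in (0, 1, 2, 3)]
# First three attempts pass; the fourth within the window is rejected.
later = limiter.allow("alice", now=120.0)  # window elapsed, allowed again
```

The point is that brute force against a single account is throttled without ever bothering the legitimate user with a puzzle.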


On one machine! :)


On one thread even

Pretty sure it's an AST interpreter too (metacircular eval/apply, as in SICP).


They go down somewhat frequently. I think it’s like four 9’s? I’m not sure why they insist on running just a few machines though. They have more than enough money and probably make up the difference by the advertising for YC that they get.


Unless something changed, it's just the one server.


Main and backup. The last outage was because they have a single network provider. Those are rare, and can be dealt with relatively easily by dual-connecting your server to two different networks, sharing load across both, and removing the DNS entry for a broken one. But it's not worthwhile for such a rare outage.

The "outages" that are common are slowdowns for logged-in users.


I mean, it works well enough the way it is. Does it need to be more reliable? It’s just a simple forum, there isn’t anything critical on the platform. We all like to see lots of 9s, but they don’t matter that much for something like HN.


That’s fair. To clarify my frustration comes from a place of “love”. When a partial or complete outage happens I get severe HN withdrawals.


I can't tell if the audience of HN is more likely to script something untoward against HN, be that DDoS or just "check out my product" spam, because it's a bunch of hackers - or less likely to do it because (maybe) we like having nice things, or figure the audience is too in the know to fall for boring crypto spam.


HN audience is rich enough to just pay $10 for 1000 solved CAPTCHAs of any complexity since those services are human powered.


I find captchas extremely painful, because of ambiguity and because not all the pictures load. I wait for a minute and some never show. When they do load, so many are pics of bicycles and motorcycles and crosswalks. Are you supposed to click on the tiny piece that goes just past another tile or not? You can't refresh one that doesn't load; I think most of them start over if you refresh.

Like other people reported, if you ever use Tor, it's very common for the captchas to just not load. They just kind of hang without showing the pictures. Regular websites generally work fine on Tor; it seems to be a captcha problem.


> Are you supposed to click on the tiny piece that goes just past another tile or not?

I ask myself this every time.


Pretty sure the hesitation is what makes us humans :)


I predicted this 7 years ago: "How will the machines take over? When CAPTCHAs become so hard that only AI can solve them, humans will be completely locked out of the net." https://twitter.com/lapcatsoftware/status/771857826130034688


I thought this was already happening ~7 years ago. The "what text is in this image captchas" got a lot less common a while ago, and I think this was partly the reason why.


Submitted title was "AI bots are now outperforming humans in solving CAPTCHAs", which broke HN's title rule: "Please use the original title, unless it is misleading or linkbait; don't editorialize."

Submitters: If you want to say what you think is important about an article, that's fine, but do it by adding a comment to the thread. Then your view will be on a level playing field with everyone else's: https://hn.algolia.com/?dateRange=all&page=0&prefix=false&so...


Misleadingly editorialised title. Actual title and abstract (which doesn't say anything about AIs "now" outperforming humans):

An Empirical Study & Evaluation of Modern CAPTCHAs

*For nearly two decades, CAPTCHAs have been widely used as a means of protection against bots. Throughout the years, as their use grew, techniques to defeat or bypass CAPTCHAs have continued to improve. Meanwhile, CAPTCHAs have also evolved in terms of sophistication and diversity, becoming increasingly difficult to solve for both bots (machines) and humans. Given this long-standing and still-ongoing arms race, it is critical to investigate how long it takes legitimate users to solve modern CAPTCHAs, and how they are perceived by those users.*

*In this work, we explore CAPTCHAs in the wild by evaluating users' solving performance and perceptions of unmodified currently-deployed CAPTCHAs. We obtain this data through manual inspection of popular websites and user studies in which 1,400 participants collectively solved 14,000 CAPTCHAs. Results show significant differences between the most popular types of CAPTCHAs: surprisingly, solving time and user perception are not always correlated. We performed a comparative study to investigate the effect of experimental context -- specifically the difference between solving CAPTCHAs directly versus solving them as part of a more natural task, such as account creation. Whilst there were several potential confounding factors, our results show that experimental context could have an impact on this task, and must be taken into account in future CAPTCHA studies. Finally, we investigate CAPTCHA-induced user task abandonment by analyzing participants who start and do not complete the task.*

@dang, could you please correct the title? Thanks.


All of these papers miss that captchas have multiple levels of difficulty. People who get an enterprise account or work closely with the captcha providers will find very different results. Many captcha providers now decide which captchas to send out in hard mode based on what LLMs cannot solve.

Captchas are purposely not made too hard, as people like pex.com need to be able to bypass them for copyright enforcement. Note I'm biased, as I was a founder of hCaptcha.


I think I prefer the recent CAPTCHAs (where you solve a puzzle by rotating an item, or finding the matching item). The older ones from years ago (deciphering mangled text and trying to work out whether a glyph is an `i`, `1` or `l`) were more annoying.


Bot operators can already pay human captcha solvers as the paper mentions. So all this does is potentially replace those humans with AI, driving down prices for bot operators.

As prices for bot operators decrease, website operators will increase the challenge and drive up effort for the intended website audience (humans) who are solving captchas instead of paying bots.

In the end, the website operators will have to stop using captchas as the intended website audience will no longer be willing to solve harder captchas.

Website operators can use alternatives, like asking for micro-payments, high enough to discourage most bot operators.


> Website operators can use alternatives, like asking for micro-payments

Similarly to how dApps work in ethereum-like blockchains?


I don't know anything about ethereum


I also don't know much, but my limited understanding is that every transaction/mutation in a dApp has a cost, so this might be useful to reduce bot incentives.


Micropayments are not possible when Stripe/Visa/PayPal charge a 30-cent minimum fee.


We could simply reverse captchas now: if the captcha is solved, it's a robot; otherwise it's a human.


We can't program a bot to fail?


Yes, we use Copilot for that.


I wish captcha providers universally had to provide a way to shut down their use by bad actors. Here in Canada I get tons of scam texts pointing me to a fake banking or postal service website asking me to pay a fake bill. I want to ddos them with fake payment data but they’re all protected by hcaptcha.


If you report the website/sitekey to hCaptcha support it'll get banned pretty quickly.


I actually got a response fairly quickly after emailing support@hcaptcha.com. So now I can automate away :)


I have been locked out of websites for solving a captcha so quickly that it thought I was a bot. So we went from requiring humans to solve a puzzle that bots can't to now requiring that humans solve the puzzle slower than bots do.


The funniest thing about this limit is that it's self-reinforcing. Bots will learn to sleep() and wiggle the mouse. Humans will learn to wait. Everyone will be worse off.


It's really amazing when we still get those text ones and nowadays you can literally select the text in many of the images and copy/paste into the input field.


I've never seen a text version that lets me select the text, that's bizarre


I’m assuming that he means that on (for example) Mac you can select text from any image and copy paste it.

https://uk.pcmag.com/macos/138058/not-just-iphone-how-to-use...


100% correct. I assumed Windows also lets people do that, given text recognition is apparently trivial now (to the extent it's annoying: trying to drag images and getting text selection instead :-/).


Windows has gone sort of the opposite way, copy/paste is now often hindered if the engine recognizes the string to be sensitive or otherwise un-wise to copy to your clipboard.

I've had a few instances on Windows 11 and surrounding software where ctrl-C as well as the context menu entry for 'Copy' were greyed out for this reason when skimming through logfiles, presumably because there was something about the line that triggered the MS "that's a password!" regex; stuuuupid stuff.


I think Windows now does it as well, but of course (as all things Windows) it works only in very few apps (forget Win32 ones, for example).


@_rutinerad got it - on macOS you can select text in any image, and I just assumed you could do that on windows as well (I figure in the context of linux it would be much more dependent on specific configuration so unilateral assumptions on behaviour would be questionable).

It's honestly annoying as it frequently interferes with dragging images out of safari, except on those occasions when I do want the text when it's super useful. I think the iOS interface just tells you there's text in an image or photo and gives you the option to copy it rather than cursor based selection you get on Mac.

[edit: from other comments it sounds like windows can do this but it's not always present, and not present in all circumstances, which makes me wonder how many cases in cocoa/uikit/swiftui it does not work]


Am I the only one paranoid enough to think that this means Apple is now indexing even the text content of images stored on its users' computers?


That's literally a feature of the platform. If you open the photos app and type text in it will give you the photos containing that text.

If your concern is "apple is harvesting my data" then no. All of apple's various analysis systems ("AI") are entirely local. This does mean you get a bunch of duplicated work as every device redoes the same analysis but on the other hand it saves you from "how do we defend against a compromised network".


> All of apple's various analysis systems ("AI") are entirely local.

Even if that were true (I couldn't say, and I don't think anyone who doesn't have access to Apple source code and production systems could either) that wouldn't preclude Apple harvesting the results of said AI analysis.

In fact, doing the analysis on users' devices would represent a shift of that processing from cloud to edge, representing a significant savings for Apple or anyone else in a similar position.


Apple's literal marketing message is that they do all the processing on device. It's not a shift for Apple, as Apple never did this analysis in the past, and only started doing it once it could be done locally.

The use cases we're talking about also don't work on a cloud based analysis, as you can't have text selection block on network uploads (generally slower than downloads), and it would require uploading every image you open to apple which would presumably be a lot of traffic, and an obvious privacy nightmare. It would also break for users who turn on the e2ee everything mode for iCloud.


It's wonderful (for Apple's bottom line) that you believe these things. I assume you can show source code and provide access to production systems to verify?


The relevant data for the claim of the headline is in Table 3. On all the tasks with enough data, bots were both faster and more accurate than humans.


Yeah, the claim of the headline comes from the first sentence in Section 5.5. I think either the title should match the paper's title or that should be pointed out as part of the submission - not sure how HN's title guidelines work.


I think captchas will disappear next year or so. They were already only a soft determination of humanness.


That’s excessively optimistic. The most likely scenario is that we’ll have captchas for the next 30 years but only humans will be bothered by them.


Just like the technology basically exists for fully autonomous self-owned fleets of self driving robotaxis. Where the only jobs for humans are cleaning vomit off the back seat.


This. There are plenty of government websites etc out there that have completely antiquated captchas next to the helpful "works best in Internet Explorer 6" suggestion.


This is exactly what I was referring to. “Minimal compliance” and unmaintained websites.


Sounds like DRM - pirates do not care, legitimate users are bothered.


What replaces captchas? Are there any not excessively burdensome tests that a standard issue human can pass that a machine somehow cannot? I'm assuming the "find all the bicycles" tests are also obsolete.


Sadly, probably something like TPMs or email logins (from a reputable email provider of course, one who requires SMS to sign up, from a reputable phone provider of course, one who doesn't offer free VoIP numbers and requires a credit card to sign up, from a reputable card brand of course, not a burner card)


from a reputable card brand who doesn't allow usage of stolen cards? lol maybe the internet just implodes.


Option 1: Micropayments, high enough to discourage bot operators but not the intended human audience, will be better for website operators as soon as it becomes too easy for AI to solve captchas.

If website operators don't explicitly introduce micropayments as a captcha alternative, there will be browser plugins that outsource captcha solving to AI for a micropayment, which has the same effect.

Option 2: Using a means of authentication that can't be obtained cheaply at scale by bots, e.g. Twitter accounts, Gmail accounts, government ID, ...


> a standard issue human can pass that a machine somehow cannot?

Maybe the premise is wrong.

Why prevent non-humans from registering/using/viewing?


Because automated systems operating at scale outstrip the ability of the administrator to maintain the service provided.


If each additional user is not adding additional revenue that exceeds the cost of that user (automated or not), you don't have a business model.


But if you can keep the bots off your bandwidth you don't necessarily need a business model, depending on what you intend to share online.


A market of human-oriented hardware keys, where the keys are only intended to be sold to actual human beings, with legal or otherwise cash bounties in place for people who can provide evidence of the keys being sold to or otherwise falling into the hands of non-human entities.


What's stopping a human buying a thousand to use for his bot farm?


As mentioned, a bounty system. Someone who buys a thousand to use would have to be very clever to evade the eyes of all the people interested in profiting off of revealing his actions and getting the chips turned off.


Something realtime, like video, is beyond most models at the moment. After that, realtime input, like little mini game you have to show proficiency at by scoring 5. I think the mini game approach could be fun. It could probably work for a year or two. :-\


The minigame thing has been defeated for a long time. It's trivial to solve when there are only so many subsets of a game, however randomized the starting states are.

I guess there is a silver lining in the premise of AI-generated one-time-use games for that sake, but then there is a significant "can a human even do this?" problem to conquer at that point, and worse, the same AI tech will be established on the opposite side of the wall trying to defeat the thing.

I think it'll all boil down to some sort of state-license fallback method like "please enter a CC or ID number to continue", which is ultimately a defeat of the user, unfortunately.


Nothing. People will have to realise that when you put things out for the world you put things out for the world.


Who pays for the bandwidth and download resources then?


It will be a business or personal expense, depending.

Businesses that can't afford the expense will close or adapt, depending.

Maybe fewer hobby projects will be launched.


Indeed.

Which is why my hobby projects will continue to use bot detection and CAPTCHA verification. Especially since I'm routing through Cloudflare, so that's invisible for 99% of my users, and the remaining 1% can just get off Tor if they're tired of solving the captchas.


You are welcome to do so, but if spammers can break the captchas, then you only annoy your normal visitors.


The website or service owners. If they can't afford it they should be out of business and do something else. The web is big enough for both humans and bots.


No thank you. I prefer to live by the code "Every request is a two way conversation. The client may accept, and the server may choose to emit."

Just because I emit to other clients does not obligate me to emit to yours, any more than my emission of ads obligates you to accept and render them (but if you don't, or if you choose to ignore my CAPTCHAs, I may choose not to emit to you).


That's fighting a losing battle. Clients find their way around any restriction, which by itself risks your service or website losing ground and being overtaken by the alternatives.


Yes, it's all measure countermeasure. But you'll note that the most successful sites out there have bot protection and actively invest in it. I'm not concerned about the being overtaken narrative; My concern is the other scenario, where after the bots are done consuming and exfiltrating my data, I have no bandwidth to serve humans and my data is being vended from other sources now anyway.

It's also not really that much of a losing battle. Cloudflare will fight the battle for me quite well for free, and even better for a pittance.


An international identity card :-/


Doesn't the checkmark thing work?

Or are bots somehow able to do those too?


So we now prove that we're human by failing those tests?


Someone will get rich turning this into a browser plug-in.


You can already buy captcha solves through browser plugins. The only difference is they currently use clickfarms full of underpaid workers from third-world countries


These sites like 2captcha and deathbycaptcha let anyone sign up to be a worker and start solving captchas for $$. If you can run AI that solves captcha just as well, you can literally print money.


2captcha gives you $0.50 per "1-2 hours". Is that really worth all the work?

deathbycaptcha does not let anyone simply sign up to work.


> 2captcha gives you $0.50 per "1-2 hours". Is that really worth all the work?

That's rough, but you could scale it up I guess. Didn't know that about dbc, thank you.


Nope, the moment this becomes a viable solution, spammers will pick it up, making captchas useless almost overnight.

Websites will very quickly pivot to alternative solutions like payment card verification, etc.


Solving captchas is pretty rare nowadays. Now you usually just press a button and then it does some sort of fingerprinting to determine if you're a human.


If you make zero attempts toward privacy, maybe. Just turn on a commercial VPN or Tor and you'll find that your quality of life can quickly become severely damaged by captchas. I can't even do a Google search without a captcha, so I started using Mullvad Leta as a proxy.


I can confirm. uBlock Origin, Privacy Badger, Firefox without any kind of persistent state, and you will get a CAPTCHA from time to time; maybe not every day, but it's common.

On IPv6, 3G/4G/5G or public WiFi that can rise to roughly one CAPTCHA every 10 Google queries. I'd guess a VPN increases the probability of getting a CAPTCHA too.


I block ads and stuff but you're right that I don't use VPNs or Tor.

A lot of bots also use VPNs and Tor, so captchas being a pain in the ass there is probably working as intended: most people won't bother using services like that. Regular internet users are different; there is no reason to make their lives more difficult than necessary.


Try using a VPN. You will get captchas with tons of bicycles to click...


Why is the UI such a pain in the ass, when it's designed to be used by humans?! Why do I have to click 8 individual boxes and can't just drag-select an area. I hate those captchas with a passion.


And does the post count as a traffic light?


And the infuriatingly slow fade out and in when it changes pictures. It seems designed to frustrate humans.


Or to the next unsolved problem in machine learning. The whole point of ReCAPTCHA, at least, is to convert all this human labor into training data.


The spammers can already do this with captcha farms. The fact that captchas are still around means that the cost of captcha farms are still high enough to discourage enough spammers to be worth the annoyance that website operators cause for human users.


My pet theory is that our whole simulated world is actually a huge captcha. Captchas keep evolving until you have to live an entire lifetime as a human to prove that you're a human. When you die you wake up and get access to a website.


So that's what 42 was for!


Great, so maybe we would find a less annoying bot detection technique than captchas.


I don't know....lately I just don't even try and purposefully make mistakes on it by leaving out one or two just to fuck with the captcha


CAPTCHAs are used as a literal Turing test; that's their whole purpose. From the get-go their usefulness window had a looming expiration date.


The problem with designing a bear proof trash can is that there’s significant overlap between the smartest bears and the dumbest tourists.


It does not surprise me, since lately I make a lot of mistakes with CAPTCHAs, mainly the ones with characters in different colors, superimposed, and rotated. I think there are some that we as humans just guess, because the final image is not clear enough.

I think AI can beat us at recognizing unfocused photos in the same way.


Can the mods please fix the completely wrong and click bait title? Zero AI mentioned there.


Have they tried the puzzles Rockstar Games or HBO Max use to reset a password? They are nearly impossible to solve: they ask you to answer 17 questions or more, and when you fail they make you retry with an even higher count. Even the audio version is quite innovative.


"and that's the story of the invention of the Voight-Kampff Test, kids!"


I hate captchas; they take ages and I often fail the Google one. I would happily pay to have them removed from my browsing. I don't use AliExpress often, but now I can't, as the captcha just plain doesn't work.


My wife is entirely unable to solve a captcha. Her solution to any captcha is to get me to do it for her while she loudly swears at the creators. I welcome being able to outsource this task to AI.


How did the OP get from the article linked to the title of this post?


Section 5.5: "Table 3 contrasts our measured human solving times and accuracy against those of automated bots reported in the literature."

Although it's not clear to me that the humans all really were humans.


I'm also wondering this. I don't think it has anything to do with AI solves.


You see where this is heading: after superintelligence is achieved, CAPTCHAs will be designed to be questions that humans get wrong but AI has no problem with.


A superintelligent AI would be able to imitate a human, getting the answers incorrect in exactly the way needed.

However, I'm not entirely sure what kind of system a superintelligent AI would need to access which would be protected by a captcha.


At least that would still be a proper CAPTCHA, in that it tells computers and humans apart…


The main way to tell computers and humans apart will soon be that computers are a bit too fast and accurate to be human.


Instead of just solving the task, computers will now have to simulate humans using humans' real data. It's not a hard dataset to train on.


They already try to do that.


The solution could be a cryptocurrency which can be mined in the browser. Hashcash, which was one of the inspirations for Bitcoin, was initially invented to prevent email spam.

Consumer devices have a lot of spare CPU and RAM. So a proof-of-work algorithm which consumes those resources for a minute might work?

If it generates $0.01 for the website owner in that minute, maybe that would work?
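The hashcash idea described above fits in a few lines; this is a sketch, and the difficulty parameter here is an illustrative assumption, not a calibrated cost.

```python
import hashlib
import itertools

def solve_pow(challenge: str, difficulty: int = 20) -> int:
    """Find a nonce such that SHA-256(challenge + nonce) falls below a
    target with `difficulty` leading zero bits (hashcash-style puzzle).
    Expected client work doubles with each extra bit of difficulty."""
    target = 1 << (256 - difficulty)
    for nonce in itertools.count():
        digest = hashlib.sha256(f"{challenge}{nonce}".encode()).digest()
        if int.from_bytes(digest, "big") < target:
            return nonce

def verify_pow(challenge: str, nonce: int, difficulty: int = 20) -> bool:
    """Verification costs the server a single hash, so it stays cheap
    even while the client burns a minute of CPU."""
    digest = hashlib.sha256(f"{challenge}{nonce}".encode()).digest()
    return int.from_bytes(digest, "big") < (1 << (256 - difficulty))
```

The server would issue a fresh random challenge per request so solutions can't be replayed; note this only makes signups cost CPU, it doesn't distinguish humans from machines.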


Proof of work can already be implemented without a token.

Tor has such a feature for denial of service protection.

https://blog.torproject.org/introducing-proof-of-work-defens...

A benefit of a token is you can recycle previous proof of work by using a small amount of Bitcoin, which could be transferred using Lightning. The value could also be transferred back some amount of time after registration given no bad behavior, allowing for larger sums than a cent, which could provide better protection.


With a token, you probably get higher efficiency, similar to how a heat pump is more efficient than a heater.

If you only consume resources on the client side, then you hope that an attacker thinks "I won't invest $0.01 of resources just to log in here".

If you also transfer the consumed resources to the server, you get an additional benefit: The server thinks "$0.01 is enough to cover the costs of a fake signup".

And the second benefit is probably even better than the first. The server will never really know how cheaply attackers can access resources. But they probably know how much a fake signup costs them.


I think a fairer solution will be some form of proof of personhood that isn't PoW-based. Your idea isn't bad but it gives more power to those who can afford a lot of devices. You know those Chinese mobile phone click farms they use to game app stores? It will be like that, PoW can prevent spam only to a certain degree and with all the social media and networks we have today there is a lot of money in influencing the users. So spending a few million dollars on devices can be very profitable if it lets you boost certain messages.


Depends on the use case.

If the captcha is to prevent overuse of a free trial, then nobody will operate a lot of devices just to get more free trials if the paid version is cheaper than those devices.

If the use case is to improve democracy, then it gets more complicated.


But is it worth billions? You just need to increase the cost 1000 fold and pay it back after a holding period to implement that.

The drawback is it gets a lot more complex when using a token, because of the additional state, communication, costs and security.

A one-shot proof of work can be very simple, but probably not effective enough, given that mobile users likely do not want to wait what may be many minutes and drain their battery.

Freezing a cent or a dollar for days seems like a better option. Might very well be that VISA/MasterCard figures this out before the crypto bros build anything usable. It will be far easier to do without decentralization and would also be great to spy on and control people.


>Freezing a cent or a dollar for days seems like a better option. Might very well be that VISA/MasterCard figures this out before the crypto bros build anything usable. It will be far easier to do without decentralization and would also be great to spy on and control people.

Fucking A HN.

For any Juniors using this site, this is exactly what you don't post. Especially if it's just to cathart cynicism. I assure you, Poe's law guarantees this will find its way into some PM's or exec's mind somewhere.


It works well, for example with Monero's proof of work algorithm that is purposely designed for consumer hardware. There was an irrational turn against it because some websites did it without consent. I would so much prefer to mine to view a site than have to be exposed to ads and captchas...


Those devices have a lot of spare CPU and RAM but basically no spare battery capacity.


Just what we need, another way to waste energy


Wouldn't any proof of work be just as easy for a computer to achieve as a human?


'Proof of Work' as it's generally understood is done by computers only. But I guess I understand what you're asking and the answer is yes, that is a problem. For Sybil resistance it's better to know if someone is a unique human, not if they're a machine that has paid the toll: https://en.wikipedia.org/wiki/Proof_of_personhood

There are exotic solutions like the 'Idena Network'. But sadly I have to admit the best solution I've seen so far is Sam Altman's Worldcoin. Not that I'm a fan, I still hope we can find something better than scanning everyone's eyeball.


Yeah but it would cost spammers who want to impersonate a large number of humans at once


Yes, but it's more of an anti spam measure.


That is a very interesting concept


Are there any open local models for basic alphanumeric picture captchas to save on 2captcha?


A surprising number can be solved with Tesseract and simple preprocessing (e.g. thresholding, expanding and contracting lines).

For more complex cases, not AI but consider the attack in https://www.usenix.org/system/files/conference/woot14/woot14...
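The thresholding step mentioned above is easy to do before handing the image to Tesseract. A minimal sketch over a plain 2D grayscale array (the cutoff of 128 and the `ink_ratio` sanity check are my own illustrative choices):

```python
def threshold(pixels, cutoff=128):
    """Binarize a grayscale image (list of rows of 0-255 ints):
    anything darker than `cutoff` becomes ink (0), the rest paper (255).
    Many simple alphanumeric CAPTCHAs become OCR-able after this alone."""
    return [[0 if p < cutoff else 255 for p in row] for row in pixels]

def ink_ratio(pixels):
    """Fraction of ink pixels -- a cheap sanity check that thresholding
    did not wipe out the glyphs before running OCR on the result."""
    flat = [p for row in pixels for p in row]
    return sum(1 for p in flat if p == 0) / len(flat)
```

In practice you'd get the grayscale rows from something like PIL's `img.convert("L")` and then pass the binarized image to pytesseract; Otsu's method can pick the cutoff automatically instead of hardcoding it.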


Is this really so surprising? A better captcha would probably test for signs of life rather than puzzle-solving.


I already had issues with captchas (especially on Tor)... so now it's going to get worse?


The price we pay for obfuscating the trust signals on our connection is that our connection is untrusted.

As an American, I have a similar experience when I travel across the Atlantic. It's always funny to me when I land in the UK, start using websites I use normally at home, and get cookie verification modals from hell to breakfast.


Can't vouch for other Europeans but I got used to them to the point my arm moves automatically where needed before clicking, even accounting for extra modals. I almost don't register them anymore.


Consent-O-Matic (and probably other extensions too) will refuse most cookies automatically for you:

https://github.com/cavi-au/Consent-O-Matic


Thanks a lot, I totally forgot about this!


Tor is practically unusable for me because it triggers so many captchas. I end up using Mullvad Browser, which is similar but byo VPN.


What a surprise since CAPTCHAs were created to gather data to train AIs


I find Amazon’s captchas so hard now! I have to do the audio one


How many years until sites actually remove CAPTCHAs though?


Google created this problem, let’s see them solve it.


Good. Hardware authentication is where it's at.


Oops! Looks like you're not using a government approved OS and browser.


Like the Clipper Chip?


Great news, can we please get rid of CAPTCHAs now?


No, we'll still have them, but now sites will only allow you in if you kind of suck at them.


That is already the case. On some questions you can't give the correct answer; you have to guess what most others would answer.


It's been particularly frustrating with the picture ones broken up into squares. I tend to be careful to select any square that contains any of what's being asked, but I clearly must be the minority as it always fails unless I select the minimum number of valid squares and ignore the slight overlap on surrounding ones.


Coincidentally Worldcoin is up 30% today. Maybe cryptographic/biometric proof of being a human will be useful after all?


No thanks, I'd rather not live in a dystopian nightmare where Sam Altman is in control of assigning proof of humanity. Worldcoin will undoubtedly end up assigning identities to AIs for profit anyway, and/or there will be swaths of identities being sold on the black market.


Well - there doesn't have to be a Sam + Worldcoin monopoly on these things. Anyone else could launch a similar proof / ID system and websites and the like could accept any they feel like accepting.


Well, captchas are not there to keep bots out; they are free click work for Google?


So if you fail, you are human.


Just as Turing intended?


Mandatory xkcd https://xkcd.com/810/


Simple: if the user solves it too well, reject.


val delay = 500 + Math.random() * 3000
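Combining the two comments above into a sketch: reject solves that are implausibly fast and perfect, while the delay line shows the jitter a bot would add to beat exactly that check. The 1.5-second floor is a made-up illustrative threshold, not an empirical one.

```python
import random

MIN_HUMAN_SECONDS = 1.5  # assumed floor; an illustrative constant

def plausibly_human(solve_seconds: float, accuracy: float) -> bool:
    """Flag solves that are suspiciously fast *and* perfect. Bots that
    know this rule will just add jitter, so treat it as one weak
    signal among many, never a gate on its own."""
    if solve_seconds < MIN_HUMAN_SECONDS and accuracy >= 0.999:
        return False
    return True

def jittered_delay() -> float:
    """The parent comment's delay, ported: 0.5-3.5 s of random wait
    a bot might insert to defeat exactly this heuristic."""
    return 0.5 + random.random() * 3.0
```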


No need for captchas; just implement throttling per IP, like bcrypt does for passwords. If a bot fills out a form (or whatever), so be it, but it won't be able to do it again for another N seconds or minutes. The problem is then reduced from per-attempt, which can mean thousands of submissions, down to per-period, per IP.
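The per-IP throttle described here is a few lines with a token bucket. A sketch (in-memory state is an assumption; a real deployment would share state via Redis or similar, and as the replies point out, keying on IP is the weak part):

```python
import time

class TokenBucket:
    """Allow roughly `rate` submissions per second per key, with bursts
    up to `capacity`. Each key refills continuously based on elapsed
    time, so there is no background timer to run."""

    def __init__(self, rate=1 / 30, capacity=3):
        self.rate, self.capacity = rate, capacity
        self.state = {}  # key -> (tokens, last_timestamp)

    def allow(self, key, now=None):
        """Return True and consume a token if `key` may submit now."""
        now = time.monotonic() if now is None else now
        tokens, last = self.state.get(key, (self.capacity, now))
        tokens = min(self.capacity, tokens + (now - last) * self.rate)
        if tokens >= 1:
            self.state[key] = (tokens - 1, now)
            return True
        self.state[key] = (tokens, now)
        return False
```

Usage would be something like `if not bucket.allow(request_ip): return 429` in the form handler; behind CGNAT you'd want a composite key (IP plus a cookie or account), not the IP alone.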


Hell no... Some of us sit behind CGNAT, half a million of us on a single public IP.


Exactly. Besides that, a bad actor may well have easy access to tens of thousands of IP addresses from all over the globe.



