this post was submitted on 26 Aug 2024
883 points (98.7% liked)

Programmer Humor

19187 readers
1113 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

founded 1 year ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 185 points 3 weeks ago (7 children)

There's a reason captchas have moved mostly image identification systems. These text-based captchas have all been defeated for years.

[–] [email protected] 95 points 3 weeks ago

Yeah because whomever "owns" the data needs humans to train their bots, not because the image based bot detection is better than other methods.

[–] [email protected] 86 points 3 weeks ago

The images are not actually the captcha. They've used other methods and tools to verify your authenticity, then they force you to help train their image recognition AI under the guise of it being the actual captcha. Its Distributed Forced Labor, and Google has been using captchas to do this for decades. Remeber the picture-of-two-words captcha? One word was always squiggly and the other was not. The squiggly word was the real captcha, the other word was from a scanned book and you were helping to train their OCR algorithms.

[–] [email protected] 39 points 3 weeks ago (1 children)

I remember that Jdownloader could crack some CAPTCHAs back in the 00's.

[–] [email protected] 55 points 3 weeks ago (2 children)

There used to be hoardes of sites offering free downloads, quizzes, porn etc etc. You would have to solve a captcha to get through, but they were 'stuck' in an infinite loop. I always believed it was being used by spammers/hackers to bypass actual captcha elsewhere on the web. Its kinda genius, offloading the work to randoms looking for free stuff..

[–] [email protected] 14 points 3 weeks ago (1 children)

I also remember services you could pay to get your captcha solved via a browser extension. You could also register as a captcha solver there to earn a few bucks stupidly solving captchas. Although I'm not sure if they were actually legit.

[–] [email protected] 2 points 3 weeks ago

They are legit, I use them for my bots

[–] [email protected] 6 points 3 weeks ago

I remember back in the day this automated downloader program.. the links had a limit of one download at a time and you had to solve a captcha to start each download.

So the downloader had built in "solve other's captcha" system, where you could build up credit.

So when you had say 20 links to download you spent some minutes solving other's captchas and get some credit, then the program would use that crowdsourcing to solve yours as they popped up.

[–] [email protected] 24 points 3 weeks ago (1 children)

The image ones basically as well

[–] [email protected] 6 points 3 weeks ago (1 children)

Yeah, at this point, most forms of image identification catches have also been defeated, not quite 100% success yet, but they're getting there

[–] [email protected] 10 points 3 weeks ago

I mean the google one is literally training a Algorithm to identify the images so the shit defeats itself.

[–] [email protected] 22 points 3 weeks ago (1 children)

Funnily enough, the reason they switched to those was to use the data to train machine learning (AI) models, just like Google's recaptcha was originally pictures of words from old, scanned books so they could transcribe all of them "for free" and train their transcription algorithms.

[–] [email protected] 15 points 3 weeks ago

Man I miss the times when Google used to trick us into helping make knowledge more easily accessible to everyone. Now we just train fucking AI for luxury cars.

[–] [email protected] 9 points 3 weeks ago (2 children)

It's a bit weird how that actually works though...

"Which of these pictures are traffic lights?"

I'd hope with all the self-driving-(ish) cars coming out, any AI like that should be able to identify a traffic light, right?

[–] [email protected] 30 points 3 weeks ago

When you "solve" a captcha like that, you're just helping train the AI you're talking about.

The stuff that determines whether you're a not or not is based on browser information, how you interact with the page, etc.

[–] [email protected] 12 points 3 weeks ago

That's exactly what you're doing, training the AIs to identify that.

[–] [email protected] 4 points 3 weeks ago* (last edited 3 weeks ago)

If they add audio captchas for the visual impaired then those image captchas can be circumvented. There is a Tampermonkey script on GitHub that can defeat Recaptcha by solving the audio captcha.