digdilem

joined 1 year ago
[–] [email protected] 10 points 2 months ago (6 children)

And hopefully will continue to be asked, because one day it may not be poor OPSEC.

[–] [email protected] 3 points 2 months ago (1 children)

The BBC still uses it to break news, I'm saddened to say.

[–] [email protected] 12 points 2 months ago

In my experience, /most/ people don't care and further, they don't want to care.

Even those that do care have to exist on a sliding scale of compromise in order to function.

[–] [email protected] 20 points 2 months ago

since the plain text isnt stored

I'm not sure I'd accept a bet on that assumption.

[–] [email protected] 9 points 2 months ago* (last edited 2 months ago)

In my experience, the AI bots are absolutely not honoring robots.txt - and there are literally hundreds of unique ones. Everyone and their dog has unleashed AI/LLM harvesters over the past year without much thought to the impact to low bandwidth sites.

Many of them aren't even identifying themselves as AI bots, but faking human user-agents.

[–] [email protected] 23 points 2 months ago

robots.txt does not work. I don't think it ever has - it's an honour system with no penalty for ignoring it.

I have a few low traffic sites hosted at home, and when a crawler takes an interest they can totally flood my connection. I'm using cloudflare and being incredibly aggressive with my filtering but so many bots are ignoring robots.txt as well as lying about who they are with humanesque UAs that it's having a real impact on my ability to provide the sites for humans.

Over the past year it's got around ten times worse. I woke up this morning to find my connection at a crawl and on checking the logs, AmazonBot has been hitting one site 12000 times an hour, and that's one of the more well-behaved bots. But there's thousands and thousands of them.

[–] [email protected] 2 points 2 months ago* (last edited 2 months ago) (2 children)

If cookie prompts annoy you (and why wouldn't they? Complicated and time wasting prompts caused by terrible and compromised legislation that's led to far more intrusion instead of enforcing use of browser settings) and you don't care about cookies, then the browser extension "I don't care about cookies" suppresses the vast majority.

[–] [email protected] 3 points 2 months ago

Or at least, those influencing in favour of Trump and general chaos.

[–] [email protected] 7 points 3 months ago

But UK laws do, which share a lot of commonality - like the GDPR

[–] [email protected] 15 points 3 months ago

I think this type of scheme is illegal under the GDPR, which is in effect in the UK just as it is in the EU.

It's been a while since I worked with the GDPR, but from memory the wording is such that:

The data holder needs to allow people to opt out of data collection. The subject can request to be forgotten. The data holder explicitly cannot charge for this.

But changes move slow, and The Mirror is probably banking on nobody caring enough to complain, and Trading Standards being too underfunded and swamped with other work to investigate otherwise (which they are). If they're challenged, they'll just change tack, go "oops" and are unlikely to hit big fines unless they dig in.

Cookie laws are a horrible mess and always have done - the resulting consent banners are far more intrusive than anyone wanted.

[–] [email protected] 15 points 3 months ago (12 children)

Now we're getting there!

[–] [email protected] 27 points 3 months ago (1 children)

By its own shareholders?

Are they just trying to get some money out before class actions from its customers decimate the company?

view more: ‹ prev next ›