47
Training Generative AI Models on Copyrighted Works Is Fair Use - Change My Mind
(mastodon.lawprofs.org)
This is a most excellent place for technology news and articles.
Every web request costs someone money. If you aren't paying them you are being provided a service. They've given you knowledge/ material in their possession free of charge. You are taking advantage of that good will by using the content for purposes not intended. That is a moral failing.
To be clear the ownership of the material is not important, just the access is immoral, as the harm is already done.
Ill add the caveat that it can be moral if they've specifically told you you can via the websites robot.txt file which websites of consequence all have. But the assumption has to be they don't intend this because that is how consent works.
this is a very common human activity
You asked if it's moral, this is irrelevant
I did not
The original post in this chain talked about ethics, I was continuing that conversation.
In terms of free use, I feel the collection/aggregation of the data is a work in itself. You are taking a greater portion than the author specified you can take. Courts have ruled this does not constitute free use when people used yahoo's market data. How is it any different now when people are using orders of magnitudes more data.