*there is no way for the Web site to know that it is sending the data to my prog...

graycat · on Aug 19, 2013

> your User-Agent string

Sure, my software sends a nice, simple, vanilla pure, good looking string for the user agent string.

I agree with you about essentially all the details of this specific case.

As seemingly hinted in the OP, my concern is with the more general situation -- could a Web site use lawyers, C&D letters, and IP addresses to make big legal problems for Internet users who download an unusually large number of Web pages? I hope not.

Then there's the suggestion that for a user to get a new IP address is somehow nefarious -- it's not. And there's calling getting Web pages screen scraping as if it is different, unusual, and nefarious -- it's not. Then there's the suggestion that what the user did that was bad was getting the data when the real problem was that the user republished the copyrighted data.

pdonis · on Aug 19, 2013

I don't think* this case gives any basis for a site to take legal action against someone just based on downloading a large number of web pages or accessing the site with different IP addresses. There has to be quite a bit more than that. I don't think the headline of the article really gets across all of the factors that had to be present for this ruling to go the way it did (but the body of the article does a better job of that).

* - of course, IANAL.

graycat · on Aug 20, 2013

> be faked

Be careful: The purpose of the agent string is to tell the server how to treat the client. That is, different Web browsers do different things with the same HTML, JS, CSS, etc. So, the agent string tells the Web site how the browser wants to be treated.

In my little program to get Web pages, I just tell the Web server how I want my program treated -- like a certain Mozilla browser. This is not "faking" anything. It would do no good to tell the Web server that I wrote my own Web browser because the Web server would know nothing about my browser and, thus, have no way to respond to it in any special way. So, I just tell the Web server to treat me like Mozilla.

Faking is not really the point.

We've got evil on the brain here.

I wrote my own Web browser. So what?

pdonis · on Aug 20, 2013

Faking is not really the point.

No, but giving reasonably accurate information about what kind of user agent is being used is. If you write your own browser, yes, you're probably better off telling a website that it's, say, Firefox than telling it it's "Joe's Really Cool Browser v1.0". But if you're writing a program whose purpose is not to display pages to the user, but to do something else, your program shouldn't be telling web servers that it's a program whose purpose is to display pages to the user.