Hacker News new | past | comments | ask | show | jobs | submit login
Craigslist Suing Padmapper (gigaom.com)
317 points by staunch on July 24, 2012 | hide | past | favorite | 449 comments



Craig Newmark said on Quora that the reason they didn't like people building stuff on top of Craigslist was simply the extra bandwidth they consumed:

http://www.quora.com/Why-hasnt-anyone-built-any-products-on-...

But now that 3Taps has found an ingenious way to get at the data with zero extra bandwidth cost to Craigslist (by retrieving it from the Google cache rather than CL itself), it's clear that what Craigslist really dislikes is competition.

While Craigslist is probably within their legal rights here, this case shows that for all their talk about their benevolent aims, Craigslist is no different from other companies.


When I think of Padmapper, I think of the naughtiness:

"Though the most successful founders are usually good people, they tend to have a piratical gleam in their eye. They're not Goody Two-Shoes type good. Morally, they care about getting the big questions right, but not about observing proprieties. That's why I'd use the word naughty rather than evil. They delight in breaking rules, but not rules that matter. This quality may be redundant though; it may be implied by imagination.

Sam Altman of Loopt is one of the most successful alumni, so we asked him what question we could put on the Y Combinator application that would help us discover more people like him. He said to ask about a time when they'd hacked something to their advantage—hacked in the sense of beating the system, not breaking into computers. It has become one of the questions we pay most attention to when judging applications."


Naughtiness was using CL to seed their site in the first place. After the Cease and Desist, using 3Taps as a source CL data was simply walking into a lawsuit.

Generally, I don't think you want your naughtiness to tie you up in a legal battle.


Case will never go to trial. Maybe a "legal battle" was the only option.

Can CL "win"? Can they stop everyone else, PadMapper is but the first of many, no doubt, from re-displaying facts (classified ads) in different formats?


If CL doesn't want this to happen, all they need to do is add <meta name="robots" content="noindex"> and we're done.


Craigslist is probably within their legal rights here

I would really call this into question for the copyright claims. The key claim is copyright infringement, and the listings on Craigslist are almost certainly unprotected, much like telephone directory listings. They are statements of fact rather than creative works.

Craigslist may attempt to claim copyright over reproduction of their database as a whole (compilation), but the Supreme Court has ruled that in order to be eligible the compilation must be "original in its selection, coordination, and arrangement", and Craigslist can hardly claim that.

The breach of contract claims may be stronger though. The pages on the Google cache presumably still contain the ToS, and if they apply (who knows?) then PadMapper's use of Craigslist would likely constitute a breach. A ruling on this would be very interesting.


Another thought is that padmapper is not /copying/ the data - They are summarizing it.

So there could be a entire listing for a SF apartment with lots of details and padmapper would not copy any of it- instead they would just make notes - 3 bedroom, location, price, pictures and then link back to CL if someone clicks the listing on PM


>The breach of contract claims may be stronger though. The pages on the Google cache presumably still contain the ToS

Would this work? Can a phonebook just put a section at the beginning of this page that by reading this book you are agreeing to the ToS which state you can't copy it?

I know most ToS contain a section about by using this service you are agreeing to the ToS.

But I think there is a good argument that Padmapper isn't actual using the service since they aren't getting the information from Craigslist's servers.

Am I using a service, and thus bound by contract if I look at a screengrab of a website on a third party website (that contains a ToS).

I've said before though that Craigslist could invent fictitious entries, and sue Padmapper for copying those creative works, just like mapmakers do with fake towns.


Would it work? Well, web pages are cached by ISPs all the time. The nature of the internet is that data is buffered all over the place. The argument seems to be that because the data has been at rest for a few days, the ToS no longer applies - this seems like a fairly weak argument, but who knows.

As for phone books, a shrink-wrap license on a CD-ROM phone book which prohibited copying was upheld by the US courts (ProCD), so that copying was a breach of contract even though the underlying data was unprotected.

> PadMapper isn't actually 'using' the service.

Using is a wonderfully subjective word, and I'd expect accesing, and making use of to be acceptable synonyms.

> I've said before though that Craigslist could invent fictitious entries, and sue Padmapper for copying those creative works, just like mapmakers do with fake towns.

These are known as "trap streets" and US federal court has ruled that they are not protectable. Map makers are able to sue because the rest of their map is protectable; the trap streets simply catch the infringer red-handed.


what Craigslist really dislikes is competition

Hm, would you not say that what Craigslist really dislikes is their competition piggy-backing off their data? Craigslist seemed quite happy with the existing situation until Padmapper launched their own listing service.

Though I agree that it's not in the spirit of a .org, if such concept exists.


No.

It's not their data. It's ours, as the users. Craigslist doesn't own anything I upload to them.

The reason PadMapper exists is because Craigslist refuses to not suck.


"You automatically grant and assign to CL, and you represent and warrant that you have the right to grant and assign to CL, a perpetual, irrevocable, unlimited, fully paid, fully sub-licensable (through multiple tiers), worldwide license to copy, perform, display, distribute, prepare derivative works from (including, without limitation, incorporating into other works) and otherwise use any content that you post. You also expressly grant and assign to CL all rights and causes of action to prohibit and enforce against any unauthorized copying, performance, display, distribution, use or exploitation of, or creation of derivative works from, any content that you post (including but not limited to any unauthorized downloading, extraction, harvesting, collection or aggregation of content that you post)."

http://www.craigslist.org/about/terms.of.use


Hadn't actually dug into the ToS, but the bit:

> You also expressly grant and assign to CL all rights and causes of action to prohibit and enforce against any unauthorized copying, performance, display, distribution...

Seems to be vital to this case.


Rightshaven (copyright troll) was granted the same right by the copyright holders they represent, yet the judge ruled they didn't have standing to sue on the copyright holder's behalf.


The Righthaven case is slightly different, though I hope the logic still applies. (I'm not a lawyer, but I read the Righthaven opinion[1] when the Padmapper/3Taps workaround was originally discussed.) The difference is that Righthaven was granted merely the right to sue on behalf of the original copyright holder, but none of the exclusive rights that copyrights actually bestow upon their owners. It could be interpreted that posters grant Craigslist some of their exclusive rights ("copy, perform, display," etc.), and suing to protect those rights may be legally kosher. I know of no such precedent.

My personal interpretation/hope is that the right to sue for copyright infringement is nontransferable, which would give Craigslist no standing to sue. Individual posters could sue, however.

[1] https://www.eff.org/files/filenode/righthaven_v_dem/order6-1... (See also: http://en.wikipedia.org/wiki/Righthaven_LLC_v._Democratic_Un...)


CL revised their TOU recently (within the past year, possibly more recently than that). It wasn't a minor edit, it was a complete rewrite.

I suspect this may have been one of the introduced terms.


Yes, the key point would be unauthorized, CL is going to have to prove that the PadMapper's usage was unauthorized (by the owner of the data)


That's a given. The only way they'd have authorization by the owners of the data would be to e-mail the poster of every CL listing and ask for it. We know they don't do that. The key point is whether there's copyright infringement at all, not whether it was authorized.


Then Craigslist should ban Google from indexing their pages


And you automatically grant Craigslist your first born child and all future earnings and your left index finger. Laywers love to fill these things with unenforceable outlandish crap. The courts are more discerning.


Err, that's actually a very reasonable Terms of Service.

It says that you own everything you post, but you are granting them the rights to use it. The only unusual bit is that you are additionally granting them the right to go after people who scrape your content on your behalf.


That's not that unusual for sites that contain user-generated content. If someone's hosting a blog full of scraped YouTube videos, YouTube can go after them without contacting the owner of each individual video.


The grant is not exclusive and the authority to decide what is an is not authorized is not claimed by CL. If it is unauthorized they claim rights and causes. You can't have a non-exclusive grant without this distinction.


The ToS may not mention exclusivity, but the source of data (for Padmapper) is Craigslist. Thus, the clause of "unauthorized copying" applies.


Exactly right. And if you want your data to be reposted on another site, then you should repost it.

Craigslist is delivering exactly what it its users signed up for (no more no less). People posting ads on Craigslist do not necessarily want or intend for it to be reposted on other sites.


"People posting ads on Craigslist do not necessarily want or intend for it to be reposted on other sites."

That argument appears to be invalidated by Craiglist's own terms of use, which say that when you upload a listing to Craigslist they can syndicate it wherever they want (http://news.ycombinator.com/item?id=4287519).


Well, that's a fair point, but I don't think that right matters much unless and until they exercise it. A better argument (against myself) is that they're apparently willing to sell some type of access to your posts for use on mobile apps.

Still, I think there's something admirable in the simplicity and transparency of interacting with Craigslist. What you see is pretty much exactly what you get.


The key point is that the TOS says that you grant Craigslist the right to redistribute your listing where ever they see fit, it doesn't say some third party entity has the right to do that.


As far as I'm aware, if you post information in a place where it will be publicly accessible, then you are tacitly agreeing that third parties will be able to access it and use it as they please. That's not a right that needs to be given to these third parties. It exists from the get-go, and must be explicitly taken away by something like copyright.


So I can legally start amazon-copied-reviews.com and scrape every product review from amazon.com with my own referrer links to Walmart without fear of repercussion? Sweet.


Yes it does: "fully sub-licensable (through multiple tiers)"


Licensing implies getting CL's authorisation


And CL refusing in this case is in the user's benefit, how?


> And if you want your data to be reposted on another site, then you should repost it.

I had to giggle a little bit at this in the grand scheme of the internet, sorry.

When I posted something on Facebook Marketplace, I started getting emails and comments from other "market" sites that Facebook had cross-posted my listing to.

While it was annoying to not know this up front, the fact that it was more visible and getting more bites because of it only helped me make the sale quicker. If a service wants to piggyback off of another to make my postings more buoyant, as a user and seller, I don't have a problem with it.


Personally, I'd prefer if Facebook acted a little more like Craigslist with my data, not the other way around.


Interestingly, in both cases, the economic incentives for operators seem to be basically counter to likely user intent and expectations.


In response to your "only helped": on one of the other Padmapper stories here on HN, someone wrote that after their item sold and they cancelled the Craigslist ad, they continued to get contacted about the item because third parties had scraped the ad and did not stop displaying their copy when the ad disappeared from Craigslist.


This happened to me on the Facebook ad too, but I preferred it it over the lack of any response at all - it proved the services were used. Easy to ignore them or send a generic reply.


Padmapper's isn't "reposting" any given listing -- it displays an abbreviated digest of the listing in its search results, and then if the user clicks, it takes them to the original listing.

If this is illegal or otherwise objectionable without an explicit agreement from each Craigslist poster, I'm not really sure how a search engine or even descriptive hyperlinking is kosher without explicit agreement from each website indexed or referred to.


Uh oh... Bye bye Google and all search engines.


Google obeys robots.txt file. Padmapper does not. That's a critical difference.


CL's robot.txt file does not 'disallow' crawling the real estate section. So Padmapper was likely as compliant as Google.

http://www.craigslist.org/robots.txt


No, this is just semantics.

The point is Google respects publishers' desire not to be indexed. Padmapper does not. Robots.txt is simply a common method for conveying that message. It's not like Padmapper could argue they didn't know CL was unhappy; they got a certified letter!

This is just semantics. Google respects publishers who do not want their sites listed.


Good luck explaining that to a judge. Would you gamble a company based on that distinction?


True, but that's irrelevant to the comment I was replying to. We are talking about user expectations with regard to content they posted being made available on other sites.


And as a user you can put your information up on PadMapper if you like. If you don't, then it isn't up to PadMapper to get it for you.


Craig Newmark now says (6/25/12):

"folks, please remember, #craigslist community feedback massively against the use of their stuff for the profit of others."

https://twitter.com/craignewmark/status/217323728260972544


Why would apt owners NOT want their listings cross posted and distributed to the largest possible audience? I call BS.


Craig is doing things right: he is not greedy and he doing it with vision of general purpose. If craiglist is run by a person who investors like to choose as a founder they will be sold for 10M a long time ago.

The problem with a lot of startups in SV is that their only objective is to make money. Nothing else. Craigslist is very very rare exception.

Now lets downvoting starts...


Flogging yourself doesn't make you righteous. Doing good for the world does.

Craigslist wastes several human lifetimes worth of time every month through maintaining a monopoly product with a shitty UI and refusing to let anyone innovate on top of it. They hold back progress and they are evil. They are the IE6 of classified ads.


> They hold back progress and they are evil. They are the IE6 of classified ads.

That's a good one. I'm going to start using it. (Unless you make copyright objections.)


Not sure I would call that ingenious. But your point is sorely need of being made more often.

How many other websites use arguments like "bandwidth" to falsely portray competitors who access their publicly shared data as somehow in the wrong?

Many. Some here on HN. No need to name names.

No doubt even Google would complain about people "scraping" search results.

To me, it is a joke. Because the people who complain use automation to access, retrieve, organise and serve information and thereby establish their business. Only then to try to forbid others from using automation to do the same.

And all the while, it's NOT THEIR INFORMATION. This is not Craigslist's data. It's users' data.

It belongs to users, who are today's "publishers" and possess all those good ole publisher's rights. (Though they may naively license them out.)


The users chose to give Craiglist permission to use their data. They did not choose to give PadMapper their data, or else they would have posted their listing to PadMapper.

They did give Craiglist permission to prevent others from using their data without permission. In this context, scraping any version of Craiglist's site (whether CL itself or a third party cache) falls within Craiglist's rights under the license they were given, and within the user's expectations of what Craiglist will do with their data.


Weak argument. Did they give Google "permission" to access the data (and store and republish it)?

Google is allowed in robots.txt. But that is not exactly what I would call an agreement.

The simple fact is this info is on the public web which, by its nature, copies and transfers data. That's what the web does. You upload something and it goes "viral". You have principles like the "Streisand effect" to contend with.

This goes back a long way. No doubt judges remember. The Ken Starr report on Ms. Lewinsky. Some random classified ad. Like it or not, information gets desseminated.

If you want to protect and restrict access to data, then you do not upload it to the public web. You put it behind access controls, e.g., a password. This is common sense.

If anyone has a claim here, it's users who do not want their ads on PadMapper (if there are any). CL has no standing and their motives are both pathetic and transparent.


Craigslist is a business. What makes it right for another company to profit from the contents they generated through the platform that they build?

Yes, they got an early advantage into the market and has the critical mass that many company can seem to compete with but why take that away from them because they refuse to update/add new features.

Instead of piggybacking on them and relying on their data to earn money, shouldn't Padmapper focus on building their own content. Isn't that where innovation comes from? Beating an existing company by creating a better platform?


Craigslist is a business. And a monopoly, like Microsoft in the 90s. Didn't they have the right to decide which browsers are packaged with their OS? Don't they even have the right to decide which browsers run on their OS?


Exactly, they've pretty much run every newspaper classified out of business. Consumers have no choice but to do business with craigslist if they want to list something in a classified.


That's why website owners should focus on creating another classified website that can compete with Craigslist instead of relying on their contents. That's where innovation starts.

And its not about a matter of choice the users have. There are many competitors in this market, and yes Craigslist dominates every one of them because they had an early advantage on the internet. Small sites can't just leech off the contents on their website and slap ads on it to make money.

Lets say Craigslist was a print company that produce and distribute classified as. Will it be right for a small company to steal their content and slap their ads on it and distribute it themselves?

Your argument won't work here, because there are many major newspaper that do 1000x in revenue and distribution than independent newspapers.


People could always post to padmapper...directly. Or sell on Ebay...Or post it to the various subreddits...Or to local newspaper's community boards...

There is nothing forcing people to use Craiglist. They are not a monopoly, nor do they act like one.


Just because they have competitors doesn't make those competitors viable. Apple and Linux still existed when Microsoft was prosecuted by the DOJ.

The vast majority of people searching classified ads are searching craigslist, therefore if you're trying to list something in a classified ad, you're forced to use craigslist.

Sure you could use another service, but Netscape could have also just sold browsers only to Linux customers. It's all about the numbers.


But Microsoft largely escaped the DOJ antitrust case because of competition with Apple. (MS invested in Apple to retain a competitor.)

People aren't really forced to use craigslist. That the vast majority of people choose to search CL isn't good enough. Another company could spend whatever it takes to get people to search their classifieds instead. As long as CL can't or doesn't block that (in contrast to stuff MS was doing that started the DOJ case against them), the competition is viable.


I'm sure Craig was speaking truthfully when he made his comment. But he is but one piece of a whole, and that whole includes folks who count the pennies. And business person associated with Craigslist would know that the only value Craigslist has is its listings. Giving those away to be re-used by another web site would never fly with such a person. I don't doubt for a minute when you tell someone is taking money out of your pocket that you need to fund your operations (I know, I know its more nuanced than that but its the gist of the argument) well you an roll over or you can fight.

If they go to court and get an opinion it should be really helpful as a guide for other entrepreneurs who see re-processing the information on the web in new ways as the foundation for their business.


If they go to court and get an opinion it should be really helpful as a guide for other entrepreneurs who see re-processing the information on the web in new ways as the foundation for their business.

You mean, like Blekko?

The simple, inconvenient truth is most folks who are making money from the web, like search engines, are not content creators (nor content owners), they are content publishers... who publish for free. "Are you a non-technical person who wants to get something onto the web? No problem. We'll help you with that, for free. Just give us some personal info about you so we can solicit money from advertisers."

(Placement, e.g., paid placement, where the eyeballs are more likely to see something, for a fee, is another matter.)


In the case of search engines there is already a lot of case law from people suing Google of course. As with most things its a spectrum. Using a search engine as an example (and disclaimer I work for Blekko, a search engine) a search engine crawls the web, then it computes a number of parameters about the page its crawled (what its about, how many people link to it, who does it link to, Etc.) and creates a new piece of information called a 'rank'. Then when a search query comes in the query is used to create a way of recognizing a 'target' page and then the rank is used (and in our case slashtags too) to decide what pages you might be looking for. The results, also include snippets from the page to help the user evaluate whether or not the page is the one they want.

Now that use has generally not been highly contested, people want their pages to be found and so they tolerate search engines searching them. They can be explicit in what pages they want searched and which they don't using robots.txt. So that relationship is pretty well understood. People who ban Blekko (and presumably anyone else) from their robots.txt file are not crawled by us, we recognize and honor that it is there choice if they want to be in our index or not. On the copyright issue however it has been pretty clearly established that 'page rank', like someone's review rating on a movie or an application, constitutes an original work of the creator. There is a lot of experience with things like book reviews where the review, using snippets to illustrate the review, and a rating, are both fair use and the original work of the reviewer.

Google however got in trouble with their news aggregation service. And the bulk of much of the arguments there, were that the snippets were so complete on the news page as to exceed 'fair use' exemptions, and that by aggregating these pages they were 'stealing' traffic that might otherwise go to the news site. The results on those cases were mixed, with some newspapers being removed from Google's index, and others not. Generally everyone that was removed has since been replaced (at the request of the news source) because Google does drive more traffic to a web site than any other web service. So in this case while Google was found to violate the copyright of these news organizations by indexing their newspapers without their consent, the papers later found it in their best interest to give their consent.

Craigslist and Amazon and Ebay are a third kind of question. They are a collection of 'facts' (as many have pointed out) which are derived by a process (placing ads). And in the 'old' world the courts have generally sided with the person who had paid the economic cost for creating those collections. And as PadMapper and others before them have shown, is that there is a great temptation to use those same facts and re-package them into a new collection. This pretty naturally sets up a commercial tension between the original collector and the new user of those same facts. That seems to open another front in copyright litigation and policy. So if this court gets an opinion published it cannot help but be influential as there don't seem to be very many in this space. That could be because judges think the right answer is 'obvious' but I seriously doubt that to be the case.


I have heard people saying "we love competition". I wonder if they really mean it. Not saying one cant't love competition, just that the phrase is used so commonly that it's hard to digest.

I, for one, hate my competitors. Basic insticts perhaps?

Is not disliking your competitors something that is practiced by majority? OR is it at least very common? Common enough to point at a company that doesn't follow it?


It depends; If you are trying to start a new industry, you'd like your competitors more, because they validate the market. My university lecturer told me in the 90's (or 80's) he was starting a bottled water company; Back then no one drinks bottled water. Every month or so the half dozen bottle water company startups in Australia have a meet up; and they were all friends.

Hmm, sounds familiar...


Paul -

It's easy to interpret your post to mean that stealing/reusing data just because you can is fair competitive practice. Please do clarify because your words mean a lot to people and your post might be interpreted that way.

Hilarious to expose their sanctimonious nonsense though.


FWIW, the legal basis of suing because of bandwidth and server load is called a trespass to chattels

http://en.wikipedia.org/wiki/Trespass_to_chattels



Can you comment on the status of Priceonomics (YC) in this context? They're building a rich interface on top of CL listings data, presumably using 3Taps to get the volume of data without drawing attention.

Is the endgame for these companies to replace the content source, or to hope (or fight for) a legal precedent to 'open' CL data for third party usage?


Just like they try to keep the facade up that CL barely makes money. They are swimming in so much $ it's insane.


For the immense value they provide to the hundreds of millions of users, having a 100-200M revenue is not "swimming in [insane] $".

They're profitable alright, but let's not kid ourselves that they have by choice left a LOT of money on the table, which is why all these value-added services are trying to take a share of the CL-pie.


They have an estimated 70-80% margins... maybe I should have said drowning in money!


Proof?


It's hard to say from an outsider's perspective whether or not they are 'swimming in cash.'

I suppose we do have some estimates for their revenue, but not how much profit they actually make.

http://www.businessinsider.com/2011-digital-100#10-craigslis...

http://articles.businessinsider.com/2011-10-07/tech/30253426...


source?


The AIM Group estimated $115M revenue for craigslist in 2011, after $141M in 2010. I don't know anything about the AIM group, but the report is referenced in the NYT (presumably they vetted the source, but...). http://aimgroup.com/2011/10/06/craigslist-revenue-falls-off-...


The tone in this article is unbelievable.

>The case raises questions over whether Craigslist is stifling innovation or simply protecting its data

There is no question. Craigslist isn't stopping Padmapper from being built, its stopping Padmapper from using Craigslist data. The continuous attempts to reframe Craigslist's actions as an attempt to stifle innovation seem almost surreal. I just can't understand how a community of professionals could support Padmapper in this.


The posts on CL are no more "Craigslist data" than videos on YouTube are "Google data". The data is submitted by the users for the express purpose of being seen by other people. Does anyone seriously argue that users listing flats on CL do not want their listing found by other avenues? It is clear that Padmapper is providing a service beneficial to both suppliers and consumers of rental properties, and CL is using the police power of the state to prevent that benefit. Why they're doing this is still a mystery to me.


The users of CL never agreed to let you copy their postings to whatever website you feel like, also it is against their TOS to do so:

Any copying, aggregation, display, distribution, performance or derivative use of craigslist or any content posted on craigslist whether done directly or through intermediaries (including but not limited to by means of spiders, robots, crawlers, scrapers, framing, iframes or RSS feeds) is prohibited. As a limited exception, general purpose Internet search engines and noncommercial public archives will be entitled to access craigslist without individual written agreements executed with CL that specifically authorize an exception to this prohibition if, in all cases and individual instances: (a) they provide a direct hyperlink to the relevant craigslist website, service, forum or content; (b) they access craigslist from a stable IP address using an easily identifiable agent; and (c) they comply with CL's robots.txt file; provided however, that CL may terminate this limited exception as to any search engine or public archive (or any person or entity relying on this provision to access craigslist without their own written agreement executed with CL), at any time and in its sole discretion, upon written notice, including, without limitation, by email notice.


The notion that I am bound by whatever legalese someone happens to put on their publicly accessible website is absurd in the extreme; I am not the one checking the box to submit a post. CL is perfectly able to restrict read-access to those who agree to the terms, e.g., by requiring registration. That they choose not to is their choice, and the state should not be used as a substitute bludgeon whenever they dislike the unintended consequences.

Furthermore, the publicly-accessible data isn't being copied, it's being consumed. I may consume the data to find an apartment. Someone might consume the data to calculate the average market prices in a city. Padmapper consumes the data to generate locations on a map.


This makes no sense. You agree to the TOS when you post information to CL, not when you are just browsing for listings; that submitted information is what is bound by the TOS above. Also if you aren't the one checking the agree box, who is?

From what you've written above it sounds like you believe any contract between two parties is absurd in the extreme.


> The notion that I am bound by whatever legalese someone happens to put on their publicly accessible website is absurd in the extreme.

Well, if nothing else, the submitting users' copyright applies. CL has a license to use it however they like, you don't.


I would bet that 99% of CL users not only never read this TOS, but they also do not care if padmapper copies their listings.


I've never read my credit card TOS, so I don't have to pay my bills right?



What does that have to do anything? The point is that users do not care if other companies copy their listing data (and aren't even aware of such a restriction).


How have you determined that every users listing on Padmapper doesn't care it has been scraped from CL and relisted? Did they contact every user and get their permission first?


It's not necessary for them to do so. It's incumbent upon CL to prove the infringement, not the other way around.


Under the HN logic prevailing in this discussion, that would appear to be correct...

But like padmapper, you would be completely screwed if it turned into a lawsuit.


Exactly, ignorance of the law is never an excuse! THen they make more laws


Imagine that Padmapper were released as an alternative web browser. As an end user, I have the right to consume CL's content. Do they have a right to tell me which web browser I can and cannot use?

Since CL offers their data freely to "the public", by what theory can they prevent the public from using their own chosen client browser to view it? Why should there be a distinction between software installed in my computer and a cloud-hosted application like Padmapper?

I believe that Padmapper, as an agent acting on behalf of its users, has every right to reformat the information originating on CL as long as it does not purposefully seek to cause confusion about the origin of the data (which they do not as they link to the source).


No they don't. They just have to check the stupid checbox that doesn't do anything because otherwise they can't post advertisment they wrote. I'd say cheking this checkbox is coerced. Also you may check it witout reading what's written next to it.


Posting to Craigslist isn't a Constitutional right. Don't agree with the terms? Don't post there. No one's coercing.


They have a public interface. I'm using it without agreeing to anything. They can't make me agree by presenting me with checkbox. They can't prevent me from using their interface if I don't agree. On the other hand I can't make them actually publish the information I entered or keep them to their word on anything they declare. It's not what lawers like but that's how internet works.


> They can't make me agree by presenting me with checkbox. They can't prevent me from using their interface if I don't agree.

False. From the EFF: https://www.eff.org/wp/clicks-bind-ways-users-agree-online-t...

> Given the emphasis placed on a user’s assent, courts favor finding a binding agreement where the user engages in affirmative conduct acknowledging the terms of a TOS. For instance, a genuine clickwrap agreement, in which a service provider places a TOS just adjacent to or below a click-button (or check-box), has been held to be sufficient to indicate the user agreed to the listed terms. In these cases, requiring the user to click “I Agree,” after calling attention to the terms and affording the user an opportunity to review them, demonstrates the user agreed to the terms. However, courts generally do not require that you actually have read the terms, but just that you had reasonable notice and an opportunity to read them.


You talk about law. American law to be precise. I talk about physics. They can't even prove it was I that checked the checkbox.


Try using that argument in a court and they'll laugh.


Finding myself in american court in the second scenario in the list of preferable scenarios right after finding myself in third world country prison. So if I'm there I consider myself already gone regardless of any argumentation.

In other courts on the other hand in cases about copyright violations even if accuser provided IP address, logs clearly indicating defendants computer the case was dismissed because he still failed to indicate that it was in fact the defendant that downloaded and/or served the copyrighted file in question.

I'd say same way there's no possibility anyone could prove that it was I who checked the checkbox.

But please ignore me. I'm just venting.


Let me know if you rent your next place without it.


I currently own, but my last place was rented in 2010 without Craigslist. It's entirely possible.


Possible != practical in all cases.


There's no Constitutional right to a practical apartment hunt, either.


There's also no constitutional right to uninstalling a browser and installing your own browser. That doesn't mean something isn't going wrong with the public's interests.

When a group can interfere with how you hunt for a place to live something is up. Where you live is pretty fundamental.


> When a group can interfere with how you hunt for a place to live something is up.

What entitled bullshit. People/companies do that all the time. Padmapper could've run out of money and shut themselves down - would you pursue a court injunction on the basis that they shouldn't be able to interfere with how you hunt for an apartment? How about if they changed the UI in a way you didn't like?


> What entitled bullshit.

Wanting the market to work isn't entitled BS. It's good civic thinking. Craigslist holding onto its incumbency, counter to the interests of the public is entitled BS.

> Padmapper could've run out of money and shut themselves down - would you pursue a court injunction on the basis that they shouldn't be able to interfere with how you hunt for an apartment? How about if they changed the UI in a way you didn't like?

The first would've been the market working as it should. Also, if Padmapper messes up its UI and goes out of business, then the market works as it should.

Craigslist holding onto its monopoly position is a broken market.


> Craigslist holding onto its monopoly position is a broken market.

This sounds more like whining than anything based in reality. Craigslist has a monopoly on apartment rent listings?

http://www.apartments.com/

http://www.apartmentguide.com/

http://www.zillow.com

http://www.apartmentfinder.com/

http://www.forrent.com/

http://www.apartmentsearch.com/

http://www.rent.com/

http://www.apartmentratings.com/

http://www.rentals.com/Apartments/


Yes. In as much as Microsoft had "competitors" but had as close to a monopoly position in its market to warrant action.

Pedant posturing aside, to those who have "skin in the game," namely those renting and renting-out property and paying real money for leases, Craigslist is the 800 lb gorilla in most markets.


Even if you go with the "it's not their data" argument Craigslist would seem to own the specific compilation of the data. I'd apply the same copyright principle as phone books or encyclopedias.

http://www.bitlaw.com/copyright/database.html

Craigslist invested in the infrastructure to enter, store, and display this information. They should be able to set their terms as to how it is used in aggregate.


You should read that link:

the U.S. Supreme Court ruled that a compilation work such as a database must contain a minimum level of creativity in order to be protectable under the Copyright Act.

The Supreme Court (...) held that Rural's white pages are not entitled to copyright protection, since the white pages did not meet the statutory requirement for originality under 17 U.S.C. §102(a).

Cragslist is completely automated. There's absolutely no creativity involved.

Craigslist invested in the infrastructure to enter, store, and display this information. They should be able to set their terms as to how it is used in aggregate.

Unfortunately for them, "sweat of the brow" doesn't afford copyright protection in the US.


>Cragslist is completely automated. There's absolutely no creativity involved.

Whoa whoa whoa, the coding of the site didn't involve a species of creativity?

If you mean that the content of a posting didn't involve creativity on Craigslist's part, you'd be on firmer ground, but even there they made design decisions (however questionable...) about how to present it.


I said that in the context of the previous post, you can't read it in isolation.

The design decision are irrelevant, since Padmapper isn't copying those.


You're right, sweat of the brow is irrelevant, which is why it is irrelevant that their process is now automated. The copyrightable aspects are in the collection, selection, and presentation of data, which is not required by law to be performed manually.


It's relevant because it requires creativity; it's not automatically ruled out, but it makes it harder to claim.

In any case, Craigslist is completely unoriginal in its selection and arrangement of the data; the selection is "whatever people submit" and the arrangement is LIFO. I find it very hard to believe they'll be awarded copyright over it.


>I'd apply the same copyright principle as phone books

So would I—phone books are not subject to compilation copyright, because there's no editorial choice going on. That was settled in the _Feist_ case.


I don't really have a horse in the PadMapper/Craigslist debate but your analogy to YouTube is apt. I have a bunch of videos uploaded to YouTube, and I've permitted embedding because I know that even if a video is embedded somewhere I can still remove the video from my YouTube Channel Manager.

When I upload a video to YouTube, I do expect to retain control of it solely through my interactions with YouTube, and expect that when I take it down it gets taken down across all Google properties. I would be pretty angry if I found that my videos were on another site which had scraped YouTube's data and now was hosting my video without any ability for me to remove it.


I agree! On top of that, the ability to free-ride off of other's data without permission could be seen as an attempt to stiffle innovation as well. Why should I take the time to build something when (a) others can come along and use my data for free and take away my market share or (b) I can just wait a year and scrape the data off of somebody else and not do as much work.

It seems about time Craigslist re-evaluates creating an API. They have such vast amount of local data that others could use that they could easily charge for use and make some money there.


why isn't craigslist doing that then instead of suing someone who found a way to attract users and media attention?


They attracted media attention? Well, why didn't you say so in the first place?


Because we're also a community of apartment seekers, and Padmapper makes Craigslist's data many times more useful. It's fine if Craigslist wants us to use their mapping service, but it doesn't exist yet. It's a purely selfish argument, but shutting down Padmapper does nothing but make our lives more difficult.


That is hardly shutting down Padmapper. It is just a roadblock and they would have to find another source of data. It has been clear for many years that Craigslist doesn't want their site scraped, so building a business that is so dependent that it would close if they lost that source the business would be ruined is extremely short sighted.

Edit: finished my thought that I left out mid-sentence.


The vast majority of useful data (for the areas that I have looked for apartments, at least) comes from Craigslist. When Padmapper can't use that data, I can't use Padmapper. I'm not talking about the legal, or even ethical aspects of the situation, just the practical ones.


No, it is effectively shutting down Padmapper. Apartment listings are on Craigslist.

As I have said elsewhere, I think Craigslist are within their rights here, but the situation still sucks. People want to browse apartment listings on a map, and Craigslist won't let us.


It seems that Craigslist sucks for people trying to find a property.

Does it suck for people on the other end - people with a property that they'd like to rent to others?

Why hasn't some other solution come onto the market to fix the problems of craigslist?


Padmapper was that solution.

Are you asking why nobody has made a website that competes directly with Craiglist, but with a better interface? Padmapper allows you to post listing directly, but it's very difficult to get buyers or sellers to bother with a different marketplace when all the other buyers and sellers are already using Craigslist. You can't beat Craigslist on price, and many non-technical people are already comfortable with the existing system. Convincing them to switch to a different marketplace with fewer potential customers and a new interface is very difficult, no matter how great your features are.


Nope, doesn't suck for sellers, at least in seller's markets (SF), they don't have to do anything other than list, and renters will flood in.

Due to various factors (demographics, etc.), sellers also tend to be not the most tech savvy people. I can't believe that it is not standard procedure to post a video walk-through these days. A lot of listings lack even a photo.


>"Why hasn't some other solution come onto the market to fix the problems of craigslist?"

Monopoly power.


> its stopping Padmapper from using Craigslist data

Is it Craigslist's data? Does Craigslist own the posts that caused women to be murdered and stolen property to be sold? If they own that data, then they must be culpable (to some degree) for these bad acts.

If they own it, they are now more than just a service provider (IMO) and can no longer be protected by the DCA. https://en.wikipedia.org/wiki/Communications_Decency_Act


As I see it, they don't own the information, they own the database.

For example, they do not own the fact that you are selling a toothbrush. The do own the records on their servers which catalog the fact that you are selling a toothbrush.

Why? Because they are the ones who went to the trouble of gathering and storing those records.

This is somewhat akin to recording a verbal note from a customer in a book, and storing that book in a giant library. Sure, the user owns the information, but that doesn't mean you are obligated to let every Tom, Dick, and Harry come in and use your library how they see fit- it's your library, even though you do not hold the copyright to all the information stored there.


But if you let anybody use your library, and you don't own the copyright to the content, why are you allowed to stop people from snapping photos of the books and posting them on the web? It's one thing if the copyright holders asked you to, but in this case they want as much distribution as they can get.


Craigslist clearly states that they don't own the data. But just because they don't own it doesn't make it public domain - the ownership belongs to the person who created the listing. To be totally free to repost the data you would need to get explicit permission from each owner.


No. I could write something illegal (libelous, inciting a crime, etc) and assign copyright to you without your consent.


Ok, but then I wouldn't later claim ownership of it as Craigslist has (implying they have consented).


Craigslit does not claim ownership. Craiglist claims it has a license. They are completely different things. Craiglist is enforcing its license to the post. Ownership of the post itself is, and always has been, with the original poster.


It's not Craigslist data, it's the users'. They just have a license.

/pedant


Users are free to post their listings to both Padmapper and Craigslist. Padmapper is not therefore automatically entitled to repurpose Craigslist as a kind of ad-hoc aggregator API for user listings.


I know, I wasn't claiming otherwise, I just meant what I wrote.


Great. Have the users provided a license to PadMapper? Where is this license agreement? How is it made?


They provide the license to everyone to spread the word that they have apartment for sale by default. It's more resonable to assume that than anything else.


Google also indexes this data. Craigslist obviously doesn't own this data as they have DMCA practices in place. They can't claim DMCA and also say that the data is their data.


Read Craigslist's TOS. The copyright remains with the listing author, but they sign over rights to enforce that copyright to Craigslist.


That is a power that, in the past, Craigslist has used for good. In this case, it's a power being used just for the benefit of Craigslist to protect its network effect incumbency, to the detriment of users.


Doesn't that run afoul of the same problem as Righthaven? There is no bare "right to sue" that can be transferred without transferring the copyright.


Of course you're correct, but surely (under the active hypothetical of CL having a license to the users' data) it would fall to the users to sue PadMapper (which they almost surely won't, because they're happy having more interest in their rental).


Stop assuming things that I didn't claim. I meant what I wrote, nothing else.


I interpret this different. I certainly think that Craigslist is stifling innovation (perhaps purposefully, perhaps not), but simply stifling innovation is not illegal.


Craiglist is being known for not "innovating", at least when it comes to their platform, design etc. Heck, Craig himself said he tries to implement changes as little and as rarely as possible, because he himself does not truly know what makes Craiglist to constantly "tick" year after year.

Im sure that's why their try this lowball trick to defend themselves and show it to judge "hey we want to innovate, and Craiglist does not want to".

But sure its not their business what CL does...


Stifling or not stifling, it's always a huge risk to build your business around someone else's technology, especially one that has a track record of not cooperating with third parties.


> I just can't understand how a community of professionals could support Padmapper in this.

Well, for one thing: different ideas about what constitutes fair game for property/control claims.

Not everything can be copyrighted or patented, and specifically, it's really not clear that a rental/for sale/wanted listing is actually a copyrightable work.

If that weren't enough, though, there's also not really a clear difference between what Padmapper does and a search engine -- it doesn't "steal" listings and put them wholesale on their site without attribution, it provides a geographic search that yields a limited digest and then points users to the original source.


I'm also pretty confident that if it was the opposite (PadMapper suing Craigslist for using PadMapper data), then most people would be cheering on PadMapper.


Craigslist makes use of both Google Maps and Yahoo Mapquest to present locations of individual properties through links on the CL site.

Note that these are free links, not paid-for API calls.

Who's free-riding whom?


Craiglist provides links to Google and MQ, which is the basis of how the Internet and WWW works. Craiglist supplies the actual address data.

It would be different if Craiglist embedded the maps from GM or MQ, but in such case CL would probably do so with permission from those companies (possibly even--gasp--paying for the right to embed maps).


The point I was making is that CL are increasing the utility of their service by using Google/Yahoo to provide specific maps of property locations. The distinction is that the relevant data curation (property locations) is done by CL, not the mapping providers.

I also noted that API usage would have been paid (or at least licensed), so in that sense, CL are (legitimately) freeloading on Google and Yahoo. While the argument could be made that CL are sending traffic to these sites, that same argument would apply to Padmapper.


On a purely philosophical level, the data 'belongs' to the people posting it.


Actually, that's on a legal copyright data -- the expression of the information belongs to its author.

The facts themselves aren't subject to copyright.

Access through the CL site is governed by various computer use statutes. Though aggregation-via-proxy as PM are doing through Google cache raises some interesting issues.


"(Craigslist) said it offered a license that would have allowed PadMapper to use its data on mobile applications but that the competitor did not accept the terms."

But on the other hand, "users and developers are exasperated with Craigslist’s insistence on preserving an outdated interface and design."

I don't know the legal merits of either party's position, but from an ethical perspective, I tend to side with Craigslist here. Dissatisfaction with a commercial site's UI is not just cause for using their data without permission, particularly if they had made an effort to offer a licensing agreement, whatever the terms might have been.


I see this from the opposite viewpoint, that PadMapper is operating as a search engine designed to help you find the right Craigslist posts for your apartment hunt. Claiming that PadMapper's success is from Craigslist failing to 'provide a good UI' is like suggesting that Google News' success is because newspaper companies failed to provide good article SEO. Instead, it's a different modality of content discovery, and it ultimately redirects you to the original source.

In my (personal) opinion, I would argue that the content providers or aggregators and search engines benefit synergistically. CL and Belgian newspapers appear to disagree.


I don't think this has any legal bearing for anyone, but it's worth noting that Google respects publishers who do not wish to be indexed.


Here's the abstract for a nice review of search engine law: http://works.bepress.com/james_grimmelmann/13/ If I remember correctly, indexing a site that asks to not be indexed might be illegal as an illegal tresspass, but it is not settled law. The argument is that you are stealing resources (computer time) from the site owner.


>The argument is that you are stealing resources (computer time) from the site owner.

That's why 3taps is getting the data from google's cache without touching craigslist servers.


How the heck are they scraping Google without being banned or rate limited?


There are a lot of companies that are scraping Google quite successfully. Many of these are for 'rank checking' services that provide ranking data for certain keywords over time; these are heavily used by SEO and marketing agencies.

The two that jump to mind are Authority Labs and SEOmoz.

I guess: a shed load of proxies. :)


Amazon/ other clouds out there. Just auto provision your instances (lots of them), scrap, sleep, wake, scrap, sleep...


It's not impossible. You just tell google not to cache.


And makes it impossible to block them as a side effect.


IANAL but AFAIK it is only a civil matter (i.e. not illegal) since it is a usually prosecuted as a tort of trespass to chattels. For such a case to succeed the prosecution needs to show that the actions of the defendant deprived them of use of the good they were trespassing on. i.e. they need to cause enough of a burden on the servers that the claimant or their customers could not use the service.


What I'd care more about is whether the people posting the ads wish to be indexed; I'd be surprised if many don't. (I don't know about the legal bearing either.)


Perhaps craigslist could put up a checkbox (like they do with the never-checked "It's okay to contact me about products ...") that says something like:

"I'm okay with people finding this listing through another service."

Perhaps because they don't want people to get to their listings that way even if they want to, and thus don't want your opinion? (Edit: and perhaps they really aren't as interested as they claim in making it easier for buyers and sellers to find each other?)


So eventually this comes down to CL trying to protect it's monopoly as the listing site of the internet


For that analogy to hold, PadMapper would have to be pulling data from a large number of sources. The value that Google News provides is that it aggregates a very large number of sources. PadMapper does not.


In my city, Craigslist is hardly used for rental listings. Everybody uses kijiji.com, which is available on Padmapper. There are alternative data sources, and they are actually used. Just not where you are.


Padmapper collates data from multiple sources: Craigslist, Rent.com, Apartments.com, and "Others" (not sure which those would be).


It's pretty clear PadMapper was BUILT off of Craigslist that CL's poor UI is isn't entire reason for being, and it never would have been successful without CL. It's disingenuous to ignore that by listing "rent.com" as an alternate source.


> PadMapper does not.

It does.


It does not. One main source and a handful of secondary sources is not at all comparable to a true aggregator like Google News.

Look, it's obvious that even PadMapper believes Craigslist is THE data source they must have.

Google News could drop any given data source and not blink. PadMapper simply is not in the same position.


But without the Craigslist data it has no value.


Total nonsense. I found my current place with them, without craigslist.


But that is where the source of the problem is - CL has the largest user base for listing and classifieds on the internet and since they have the monopoly on this they have never felt the need to improve the experience on their site. Now that padmapper has started getting large number of users CL feels threatened and is suing them to shut them down.


Does PadMapper respect the robots.txt file?


PadMapper uses 3Taps.com to get the data. 3Taps gets Craigslist data through Google. Google's bot respects robots.txt.

So, yes.

[edit for clarity, and for misspelling clarity]


Well... "sort of", then. PadMapper respects the robots.txt for GoogleBot, not for PadMapper.


Presumably 3Taps violates Google's ToS to get the data from them ?


I would really love to know that kind of shady proxy bot army they have implemented in order to scrape Google on such a scale.


This type of comment is common. Not only on HN but on forums in general.

While I agree the web is full of lowlifes engaged in web development, many of them in porn or some other area that appeals to base instincts, I find this comment perplexing. Because it is so subjective, yet it tries to seem objective by focusing on some random criteria.

Google employs a "bot army" to scrape the entire web. So what?

If the comment was something like "I don't like Company X." Or even "I don't like Company X because...", it would make sense to me.

But that is not how this common type of comment goes. Instead it suggests that bot=evil, i.e. any sort of automation or any sort of data collection by anyone other than [your favorite company] is "shady".

That's crazy. IMO.

It's what a company does with the data that matters.

Anyway, I'm not keen on 3Taps because they are not provinding bulk data, only API's that require "developer keys". Why?

Either you are going to democratise data, or you are just another schemer trying to find ways to collect infromation about people, in this case people using "your API".

I don't want API's I want the data. I can make my own interfaces thank you.


I thought it was weird that CL didn't add a Disallow line for padmapper to robots.txt from the start (just from a PR perspective).

But robots.txt has no special legal authority, it's just a convention used to communicate a publisher's intent. I'm pretty sure the C&D letter made it 100% clear that CL did not want Padmapper crawling their site or using their data.


Padmapper doesn't crawl Craigslist. That's not how it happens.


...any more. Now they are using a third party, but at the time Craigslist sent the C&D they were scraping the site directly.


I know, but I thought the existence of robots.txt was why Google is allowed to crawl sites. If a site disagrees with the crawling they can add a robots.txt entry and Google will honor it. It at least shows that you are giving the publisher an option.


I don't like Craigslist because of lock in. Sellers must sell there because they are the biggest market, buyers must buy there because they are the biggest market. Everyone is locked in.

Lock-in is super common(and often unintentional or unavoidable), but it goes against the ideal of equal opportunity and competitive markets that rewards companies for a continuous commitment to quality and innovation.

I feel that craigslist has done nothing wrong except for standing in the doorway when other people who want to innovate and improve things are trying to get through (like PadMapper).


The reason why Craigslist is valuable is because of it's large network. It is obvious that smaller sites like PadMapper want to slowly try and chip away at Craiglist content (see PadLister) after they hopefully gain enough traction to supplant them. Padmapper / Padlister are fine to start their own sites and have users come and post their listings there, but not to copy content from Craigslist after they specifically asked them to stop.


> It is obvious that smaller sites like PadMapper want to slowly try and chip away at Craiglist content

It's not Craigslist's content. It's our content, the general public's. Craigslist is just a caretaker. Right now, it's a caretaker acting in its own interests against the public's.


The general public gave it to Craigslist under readily available terms. The general public is free to also give it to your service, but that's up to them.


This is disingenuous. Craigslist isn't doing the public's will. It's just taking advantage of public apathy. (Much like Microsoft did with the browser wars.)


Yes you did post some facts about your listing to CL, it is also well within Craigslists right to allow others to copy it or not. Just because you supplied the data to CL doesn't mean you can dictate to them how they should run their business.


A copyright clause designed to allow Craigslist to stop abuse shouldn't be used to squash competitors. That's not dictating "how they should run their business." That's telling them to stop being selfish.


How are they trying to squash competition? Padmapper isn't free to have users submit their listings to their own website?


> How are they trying to squash competition?

Again, disingenuous. CL has the network effect. That's like Microsoft saying, Users are free to install their own browsers. Their current actions are only to preserve that, not to squash abuses. (Which is what they usually do with the power of their TOU.)


So Apple should stop selling iPhones because it has a dominant position in the marketplace because it is the device most preferred by buyers?


Apple's network effect or market position clearly doesn't make other phones and devices a non-starter. Craiglist's network effect and market position clearly makes other housing listing sites a non-starter.

Big difference.


It also does not mean that CL has any obligation to provide that data to another for free. If you want the data in multiple locations, post it to those other locations.


The purpose of CL is to provide data for free. Why should they care how that data is packaged, so long as this packaging is not abusive, but actually beneficial?

The real reason: Craigslist wants to hold onto their monopoly position.


True, but it's also very easy as a buyer (or seller) to use more than one site. Nothing about Craigslist is exclusive and in that sense the lock-in is very weak.

A Craigslist competitor doesn't need a huge market share to be viable. For me as a seller, it just has to expand my audience of buyers enough to justify the small amount of time it takes to post and manage a second listing.


The license offered by craigslist was for a mobile application only. There was nothing offered for websites. You can see this as either, 'license offered with terms' or 'no license offered at all'


How do people not see this for exactly what it is: The same situation as we are finding with cable providers not wanting to be dumb pipes. The only difference is people don't pay money to CL. They just give CL all their information, and CL decides what to do with it.

Craigslist has no divine right to its users' information; it just happens to be the only viable option. If Craigslist didn't exist, someone else would, and would probably do it better.

If Craigslist wants to continue existing, they should take advantage of the fact that they are the go-to for online classifieds, create a modest subscription-based API and let the information flow.


The problem is that Craigslist has a monopoly. They use that monopoly to shut down competitors, resulting in millions of hours of lost time for users every month.


My logical brain says that suing Padmapper is perfectly reasonable. My emotional brain is crying at the idea of using raw Craigslist ads to find an apartment the next time I move to a new city.

I move to a new city every year, usually on short notice, and I often go into the process of apartment hunting with no knowledge of local neighborhoods, the public transportation system, or general geography. Without Padmapper, the process would be unbearable.


Yeah, I'm in a similar position. Every time I have to use Craigslist I come much closer to hating the team behind it. I mean, I know it's probably not rational, but I think they are being shitty people for locking up so much useful data behind a site that is designed to be miserable to use. It's pretty evil when you get down to it. It's obvious they don't care about users.


What's stopping someone from making a craiglist like website with an awesome UI?


It's hard to compete with a site that is both free and "everyone" uses already. Both of those are very strong barriers to entry.


Very true.


The challenge in building a brand and an audience that Craigslist has assembled.

If you build it, doesn't automatically mean that they will come.


network effects


That's ridiculous. Craigslist lets you refine your search down to the neighborhood. It's not that hard to use. People find apartments on there all the time.


Depends on which Craigslist you use -- the neighborhood listings aren't particularly granular, and neighborhoods themselves aren't fixed, so one person's Glen Park (for instance) could be another's Outer Mission. Using a map is an uncontestedly superior interface for organizing geographic data. People find apartments on Craiglist despite the interface.

This is not to address the issue of ownership of listing data, or the case law, which latter isn't settled anyway.


It may not that hard to use, but Padmapper is still a far superior experience (eg email alerts on apartments in an area which meet your criteria)


This isn't surprising of course, it is their data after all, but it brings up the 'easy problem' / 'hard problem' (E/H-P) conundrum quite nicely.

The conundrum is that it is 'easy' to take an existing, quality data source, and re-skin it for broader market appeal. This is something that Craigslist should be doing but perhaps through poor management are not. It is 'hard' to take a conceptual model of skinning and building a quality data source behind it.

As I said elsewhere, the PadMapper guys might have taken this to Craigslist and said "Look what we can do with your data, lets make happy music together." and then debated the terms. Or they could take their UX to the venture capital world and say "Look at this cool product we could build if we had access to a database with Craig list's quality" and debate cap tables and dilution (assuming they got to that stage of a term sheet).

But they chose the third route, "Lets see how long we can get away with this..." and the timer on that just ran out.

Sadly, this third path makes the first two paths much harder. Craigslist already sees them as the 'enemy' and VCs will see them as a team that makes poor choices. Both views make it harder (but certainly not impossible) to consummate a deal. However, from the blog and from previous postings here it seems they made those choices with their eyes open.


>This isn't surprising of course, it is their data after all,

But that's the thing. It isn't their data. They specifically say so in their TOS. The copyright belongs to the user.

You can't say the listing belongs to the user (to protect you from liability) on one hand, and then say the data belongs to craigslist on the other.

They could always do what map companies have done forever to prevent copyright. Facts can't be copyrighted so map makers insert fictional cities.

If someone copies the map they are copying the fictional city and thus violating their copyright.

Craigslist could add a fictional listing here and there that they do own the copyright to.


I hear you, but I don't know if the courts will agree with your interpretation. The reason for that is that the courts will look at the consequence of having that interpretation widely accepted and the impact on the current state of affairs. There are many many places where this has been litigated and the courts always come down on the side of the data aggregator owning rights to the data aggregated.

If PadMapper wanted to legally exploit the fact that the users own the listings, not Craigslist, they would have to establish a relationship with the user and get the information directly from them. They could contact the listing owners and suggest they list on PadListing. They can 'spiff' people who do so (meaning give them some benefit if they list on PadListing and the person discovers and rents through PadMapper). If they can prove that the listing owner asked them to list their property, Craigslist can't sue.

Alternatively PadMapper "need only"[1] create their own apartment listing service to have their own database where people listing apartments go to them directly. And that is a much harder thing to do than something which scrapes the listings from Craigslist and drops them on to a Google Map.

As a proof-of-concept, PadMapper is excellent. As a product, it has a data contamination issue.

[1] The scare quotes are there to acknowledge that its a challenge to get people to move from the known, to the new. I am trying to move people from Google to a new search service, its hard, its slow, but its the only way to do this and avoid this sort of litigation.


>If they can prove that the listing owner asked them to list their property, Craigslist can't sue.

Look at the Rightshaven case. The judge ruled that Rightshaven didn't have standing to sue on behalf just because the copyright holder granted it license to.

If that theory holds up, Craigslist wouldn't have standing to sue on behalf of the copyright holders.

Additionally copyright would only apply if the listings are considered creative works. Most of them are clearly simple statements of fact that wouldn't merit copyright protection in the first place.


As others have pointed out Craigslist can claim copyright to the collection. The legal theory I would expect them to use is that PadMapper wouldn't know about these listings if they didn't access Craigslist's collection, therefore their use of the collection violated Craigslist's copyright.

I did a quick Blekko for the case with the Yellow Pages that was litigated this way (but alas did not find it) where a AT&T sued the maker of a competitive Yellow Pages over using the collection of businesses in their book. The defendant argument was similar to yours, that they could have walked down the street and collected the information so the information wasn't copyrightable, but the judge ruled in favor of AT&T because it was clear the defendant could have done that but they didn't do that. There was some errors and omissions that mirrored the Yellow pages and 'proved' the defendant took their data from the Yellow Pages rather than collect it themselves.

For PadMapper to escape liability they have to be able to prove they came to know about these listings in some other way than through Craigslist's collection of them. They argued in their blog that 3Taps did that for them because they got them from 3Taps they aren't liable for what ever 3Taps is doing. And my take on it is that given the case law it will be a very hard thing to prove. If Eric Goldman is reading he could probably whip out a definitive argument here.

If it goes to trial I'll definitely follow the case to see how it plays out.


See this http://www.copyright.gov/reports/dbase.html

In Feist Publications v. Rural Telephone Service Co, a phone directory was ruled to be protected by copyright only if the selection and arrangement of facts was an original creative act (listing numbers alphabetically was not).

I'm aware of a case after Feist where a yellow pages for chinese immigrants was copyrightable because of the creativity involved in selection of the facts and the arrangement into categories. But even then the facts themselves are not copyrightable.

Craigslist's selection is nonexistent, you send it they publish it. And the arrangement of the subset that padmapper is using is solely by geographic location and time. In addition padmapper is not copying the arrangement.


That is not what the Righthaven court said.

Righthaven did not have a copyright, it had a "right to sue" on behalf of the original copyright holder's copyright. The court deemed that insufficient to give Righthaven grounds to enforce the copyright, because Righthaven did not have any copyright or license therein. A right to sue is not considered a "copy right" because it involves no right to copy the material (i.e., by distribution or reproduction).

Craiglist does have a license to copyrighted content. It can actually "copy" the content. Ergo, it has the right to enforce its license against non-licensed users.


The judge in the Righthaven case said this...

"Because the SAA (lawsuit contract) prevents Righthaven from obtaining any of the exclusive rights necessary to maintain standing in a copyright infringement action, the court finds that Righthaven lacks standing in this case,"

Craigslist ToS doesn't grant them exclusive rights, thus by that judge's definition they don't have standing to sue.


>Craigslist ToS doesn't grant them exclusive rights, thus by that judge's definition they don't have standing to sue. That is not what the judge is saying. By "exclusive" rights he does not mean wholly exclusive in the colloquial sense (i.e.,, sole person with such rights); the judge meant "exclusive" in the legal sense that Righthaven could exclude non-licensees from using such rights.


> I am trying to move people from Google to a new search service

Care to share? I've been trying to move away from google, but DuckDuckGo's search results are so often massively inferior. Other suggestions would be nice to have!


I have no idea how this is going to playout but if there's a copyright argument to be made, it won't be on the copyright of each listing, but the copyright on the compilation of the listings.

You can't say the listing belongs to the user on one hand, and then say the data belongs to craigslist on the other.

You can in fact make this argument in the case of compilations in copyright law - think a compilation of short stories or one of those "Songs of the 90s albums" - the initial authors still own their work, but the entity which creates the compilation still owns the right to the entire compiled work.

(I intentionally edited your statement - don't want to bring liability into this, I think its a separate issue and a bit of a red herring w/r/t this discussion).


I don't think it's as clearcut as saying its a compilation.

Here is the relevant wikipedia section on compilation copyright

>Copyright Act allows for the protection of "compilations," provided there is a "creative" or "original" act involved in such a compilation, such as in the selection (deciding which things to include or exclude), and arrangement (how they are shown and in what order). The protection is limited only to the selection and arrangement, not to the facts themselves, which may be freely copied.

>The Supreme Court decision in Feist v. Rural..rejected what was known as the "sweat of the brow" doctrine, in ruling that no matter how much work was necessary to create a compilation, a non-selective collection of facts ordered in a non-creative way is not subject to copyright protection.

I think there is a good argument that there is no creativity on Craigslist's part in selecting postings, since they aren't selecting them--users are uploading them. If true selection wouldn't be covered.

The arrangement might be (if it can be said to be creative), but I'm assuming padmapper isn't copying their arrangement.


The individual posts belong to the users, that is true. However one post alone is worthless. What's valuable is Craigslist's collection of posts with lots of relevant data (location, description, price, etc). This collection belongs to Craigslist because their servers provide the environment for it, plus they wrote the architecture and back-end to support it.


See my answer to mikeryan's post above for information on why that is not necessarily the case. An individual phone number is worthless as well, the collection is valuable. But case law says a phonebook isn't covered by copyright.


I see two problems with your argument:

1) You are comparing physical media to a web service

2) You are confusing content with access to content

Since you mentioned Phonebook, let's take yellowpages.com This site has much of the same information that a phonebook does. I would argue that it would be illegal for company B to scrape yellowpages.com for this information in order to make money without express permission from yellowpages.com. However, if Company B found another way to access the same information (ex: scanning physical phonebooks or asking people to sign up to their site) then I'd say they're within the law.

Craigslist is 100% within their rights to control who accesses their service and how. If CL users want to register for Padmapper and post the same ads on both services then that's their prerogative.


>if Company B found another way to access the same information (ex: scanning physical phonebooks or asking people to sign up to their site) then I'd say they're within the law.

Company B has found another way. They are getting the data from Google's cache of the craigslist.

Padmapper is not touching Craigslist's servers at all.


The users never agreed to their data being displayed on any other sites besides CL. If they wanted that to happen they would personally place their listings on other websites, or sign up for some type of 'post your classified add to 100s of sites from us' service, which CL clearly is not.


That doesn't necessarily mean that Craigslist has standing to sue on it's users behalf however. Also most of those listings don't merit copyright protection, because they are statements of fact not creative works.


The user owns the post and grants CL the right to display it, and the TOS also grants CL the right to prevent others from displaying it without CL's permission.


If the user isn't posting a creative work just a statement of fact, it doesn't what the user says--it's not protected by copyright.

>the TOS also grants CL the right to prevent others from displaying it without CL's permission.

Just because the ToS says it doesn't mean it will work. Recently a judge said that a copyright troll called Rightshaven didn't have standing to sue on behalf of copyright holders for works that they licensed.


Assemblages of fact are protected by copyright tpateck has commented in the previous threads on Padmapper about this. This is why a Farmers Almanac is under copyright and I cannot wholesale copy a phonebook or Encyclopedia Britannica without infringing their rights.

Regarding Righthaven, the original media company never gave Righthaven control of the copyright of their data, just the right to sue. This is why it was struck down.

As for Righthaven's lack of standing to sue over Review-Journal content, Hunt wrote the recently unsealed lawsuit contract between Righthaven and Stephens Media -- called the Strategic Alliance Agreement (SAA) -- clearly leaves Stephens Media in control of the copyrights and gives Righthaven only the right to sue.

In order to file lawsuits, copyright plaintiffs have to have actual control of the copyrights, not just the right to sue, Hunt found.

http://www.vegasinc.com/news/2011/jun/14/judge-rules-rightha...


See Feist v. Rural

Phone books are not protected by copyright unless there is something original about their selection or organization.

Assemblages of fact aren't protected, only the selection and arrangement of those facts. In addition the selection and arrangement has to be "creative."

The selection is definitely not a creative act on craigslist's part b/c they don't select anything, users post the information.

Craigslist posting selection is nonexistent. Therefore they are only left with "creative" arrangement for protection.

You could argue that the arrangement is a creative act. I don't think the arrangement counts because they are only using a subset of craigslist and that is merely arranged by geographical location, definitely not an original "creative" arrangement, but it doesn't matter because Padmapper isn't copying the arrangement.

See this for more information. http://www.copyright.gov/reports/dbase.html

>Regarding Righthaven, the original media company never gave Righthaven control of the copyright of their data, just the right to sue. This is why it was struck down.

The judge in the Righthaven case said this...

"Because the SAA (lawsuit contract) prevents Righthaven from obtaining any of the exclusive rights necessary to maintain standing in a copyright infringement action, the court finds that Righthaven lacks standing in this case,"

Craigslist ToS doesn't grant them exclusive rights, thus by that judge's definition they don't have standing to sue.


With regard to the re-skin there was quite some external interest in this around 2009. Wired Magazine asked several leading designers to present their version of Craigslist redux.

http://www.wired.com/entertainment/theweb/magazine/17-09/ff_...

An article in the same interest, "Why Craigslist Is Such a Mess" ( http://www.wired.com/entertainment/theweb/magazine/17-09/ff_... ).


Your kidding, right? Until today, I had never heard of Padmapper, but I have hated (like everyone else) CLs interface.

CL just put them on the map in a big way as being "Craigslist with a good UI" -- so good, they had to sue. Stupid, stupid, stupid.


I find this sort of thing very frustrating to read:

"The search tool is antiquated, the images are poor or nonexistent, locations of listings are hardly dependable, and you can forget about an integrated way to save anything for later reference. There is a litany of shortcomings that come with the Craigslist apartment search; they are many and they are painful..."

If you don't like the site, don't like the UI, and can't stand the UX, then don't use the site. Nobody is forcing you to search for an apartment on CL. Nobody is forcing owners to list apartments on CL.

And so now we get to the meat ...

"...Your site is chock-full of data I need..."

Ahh...data you NEED.

So here's the thing. If you truly need the data, then you need Craigslist and you have an obligation to use it the way they (and only they) want you to use it. If you don't like the way they want you to use it, there are several choices:

* Go work for them and convince them to improve it. * Build a better mousetrap.

The former probably won't succeed, so the latter seems to be the way to go.

Look, there's nothing preventing landlords from listing their apartments on Craigslist AND your new Craigslist replacement/improvement. They're not locked in. If they were, that would be a whole different discussion - then we can talk about anti-competitive, innovation-stifling behavior. But that's not the case - the only reason people list apartments on CL is because it works. Despite the bad UI. Hence the reason CL hasn't changed it.

So it's the job of some enterprising entrepreneur not only ro build a better mousetrap, but convince people to use it. Until that happens, as much as I too dislike the Craigslist UI, I can't say I support PadMapper on this one.


If you don't like the site, don't like the UI, and can't stand the UX, then don't use the site. Nobody is forcing you to search for an apartment on CL. Nobody is forcing owners to list apartments on CL.

Apartments aren't a fungible item. If I'm looking for an apartment in a certain area and 75% of the apartments in that area are only being listed on Craigslist, then how can I realistically "choose" to use another avenue for apartment listings?

Look, there's nothing preventing landlords from listing their apartments on Craigslist AND your new Craigslist replacement/improvement.

There are huge barriers preventing landlords from listing their apartments outside of craigslist. Nothing is as popular, so why waste the time? They don't have hours to scour the internet looking for alternatives, nor do they probably want to have to manage listings at many difference sites.


There are huge barriers preventing landlords from listing their apartments outside of craigslist. Nothing is as popular, so why waste the time?

I think you have a flawed idea of what constitutes "huge barriers". The only thing stopping them from listing elsewhere is a cost-benefit analysis? I weep.


I think you are looking at this backwards. How many listings are there on Craigslist for a given city? 100, 200, 500, 1000? (I know city dependent)

There is absolutely no reason that PadMapper cannot call them on the phone, and ask them to list with PadMapper. They can make it trivially easy supporting email, phone, or fax listings. They can sell them on using PadMapper.

Many apartments are owned by Real Estate Investment Trusts (REITs) there are dozens (not hundreds) of those. You can sell the REIT on the concept of easy listings that are so much better/cleaner/easier than Craigslist.

Craigslist was created when near every apartment was listed in the Classified Ads in the print newspaper. That wasn't "lock in" it was an opportunity, they sold these folks on lower costs (since Classifieds were a money fund for newspapers) Landlords hated the extortionate prices that the newspapers charged but they didn't have an alternative, Craigslist gave them that alternative, they moved.

So the 'answer' here is to actually build a classifieds business around rental space. That takes more than slinging some node.js and scraping other sites. Granted its 'easier' for a technical person to do it that way, but its not a 'sustainable' way of doing it.


That wasn't "lock in" it was an opportunity, they sold these folks on lower costs (since Classifieds were a money fund for newspapers) Landlords hated the extortionate prices that the newspapers charged but they didn't have an alternative, Craigslist gave them that alternative, they moved.

But you are using an example in which there was a clear downside to using the existing listing model. The price. If you are a landlord and you have no problem renting out apartments in a reasonable amount of time on Craigslist, and it's free, what exactly can another site offer that is "better"?

They are already getting their apartments rented, there is minimal overhead to using Craigslist. You can't compete with Craigslist on price, unless you are actually giving landlords money for listing on your site. And trying to say "It's easier for users to find your properties." doesn't help if they aren't having a problem with renting out their properties.


> If you truly need the data, then you need Craigslist and you have an obligation to use it the way they (and only they) want you to use it.

This is what we call a monopoly. Many people (apartment hunters in this context) need to use Craigslist because there are no viable alternatives. Just like you have to pay your local utility for water or power -- there are alternatives, but not viable ones.

And a two-sided monopoly is a very real monopoly.

The difference is that other monopolies get regulated; Craigslist is not. Given their current behavior, I'd strongly support a law that explicitly denies craigslist any exclusive right to their listing data.


> If you don't like the site, don't like the UI, and can't stand the UX, then don't use the site. Nobody is forcing you to search for an apartment on CL. Nobody is forcing owners to list apartments on CL.

Yes they are being "forced" to list on CL, because as much as you hate the UX, UI, etc., that's where the buyers are. And as much as the buyers hate the site, that's where the sellers are. I hate CL, but I was forced to use it because lock-in makes it impossible for anyone else to compete.


> Look, there's nothing preventing landlords from listing their apartments on Craigslist AND your new Craigslist replacement/improvement. They're not locked in.

That's disingenuous. Craigslist is pretty much the only game in town in many places. Let's say you need to find a person with a specific need, willing to pay $2000 a month. If you don't find this person, you have to pay the $2000 a month. How do you feel about not using Craigslist now?


Too much obsession over UI.

What people really "need" is the raw data.

If they want to make some UI that they like, then they can do it. If they want to offer this to others, they can do it.

If they want to load the data into some SQL database, they can do it.

If they want split the data into some other format using csplit and load into some other faster database, they can do it.

If they want to extract a portion of a raw file and just use agrep on that, they can do it.

The point is that UI is a personal decision.

Because some people do not like CL's bare bones UI doesn't give them the right to do anything. Because some people don't like bloated and clumsy web interfaces and prefer text commands doesn't give them the right to do anything either.

But people can't be stopped from making personal decisions about how to process data. Public data.

It's funny how some websites think they "own" data that is given to them. Do they "need" this data? Yes, they do.


Here's a nugget from the article which I didn't hear anything about in the previous Padmapper postings and chat here on HN:

"The company said it offered a license that would have allowed PadMapper to use its data on mobile applications but that the competitor did not accept the terms."


This kind of market segmentation through restrictive content licenses is something I'd expect from Hollywood, not Craigslist. It's extremely distasteful and they should feel bad. There's simply no good reason to allow mobile apps but prohibit web apps. Would Craigslist offer a license to a Firefox OS app? How about a Chrome OS app?


It was in PadMapper's original blog post about the C&D letter


That is meaningless. The terms may well have been ridiculous (only covering mobile is a good start at ridiculous).


I wonder what would happen if their mobile app was simply a browser shell pointing to a mobile-optimized website. That mobile-optimized website could be accessible from a desktop, though unadvertised.


That caught me as soon as I read it too. I didn't recall that part mentioned earlier.

From the original: > They allow mobile apps to display their listings if you buy a license from them, but not websites.


How many people use PadMapper on a mobile device, vs. the desktop site?


Their iPad app is one of my guilty pleasures. (ditto HotPads and Redfin)


that is actually the point at which i lost sympathy for craigslist. they can license the data or not, but if they do they should not get to demand where and how it is used


I can't say that I'm that surprised; Padmapper wrote a whole blog post about how they were adding Craigslist support back to their site after being sent a C&D by Craigslist.

It's a shame for the guy behind Padmapper, but hopefully there'll be a lesson learnt about knowing which fights are worth picking versus knowing when to move on.

Somehow Google managed to overcome Yahoo and Altavista without scraping their data. And look at something like GumTree in the UK, which has always been miles ahead of Craigslist - no first mover advantage taken up there. Plus, GumTree's design has been equally woeful over the years.


Speaking of the UK, Foxtons also does a great job providing two interfaces for searching properties (list vs. map view) as well as organizing photos by room with specified dimensions. I wish their design was pretty much applied to Craiglist for property finding in the U.S.


"You're not wrong, [Craigslist], you're just an asshole."

I don't know who's legally right, but I've got no sympathy for Craigslist. They've got a monopoly on the data (whether right or wrong) and refuse to either build a decent UI for consumers or let other people do so.

That sounds bad, but I think you can't know how infuriating it really is until you've tried to find housing in a town where CL is the only real option and where demand far exceeds supply (where you spend many, many hours on the effort because of intense demand-side competition).


Just because you don't like the interface, does not give you license to steal their data. I think HN is fairly ugly, but that doesn't mean I have a right to scrape the site in order to reproduce a slightly better looking, for-profit clone. At least not without PG's express permission.

Yahoo search would be a hell of a lot better if they could just scrape results from Google, but thankfully they lack the misguided sense of entitlement that Padmapper seems to suffer from.


You actually do under the HN terms of use, which are permissive in the licenses they grant regarding HN posts. For example, see HN Magazine and various other HN-based sites and services.

But as you note, the important thing here is the "license." Craigslist offers paid licenses to app develoeprs, which PadMapper chose not to pay for.


The license doesn't cover websites, only mobile applications.


> The lawsuit raises questions about who should be able to use the vast amount of user-created data hosted on a site like Craigslist.

No, it really doesn't. This isn't going to be a long, exciting case challenging the underpinnings of copyright law and server terms of services. Padmapper isn't that company.


Agreed - there have been an awful lot of armchair lawyers asserting that there's more to this case than there really is.


There really is a lot to this case and a lot of precedent to be set, I wont deny that, but PadMapper doesn't have the resources to fight this battle.


3taps, however, is.


Previous discussion: http://news.ycombinator.com/item?id=4218714

Eric, everybody saw this coming.


Commenters here make this sound like an open and shut case in favor of Craigslist. IANAL, so could someone explain why Google's index of material covered by copyright is ok and PadMapper's is not?


[Law student here, but I don't know the full details of the claim CL is making]

The case doesn't actually seem open and shut at all. CL seems to have a "moral high ground" in that they provide the original service and the platform for users to post information, but the legal ground is more shaky. It is relatively settled in copyright law (which appears to be the main thrust of CL's argument) that effort alone does not award copyright protection[1] (or indeed, any property rights in general[2]).

For example, phone listings are not copyrightable, nor is news. The general question is whether there was creative expression (which can include 'creative' presentation or organization). If I were Padmapper, I would argue that classified listings are just a collection of facts, in which case they are not copyrightable. Padmapper does not actually use the listings directly, but extracts the information and puts it in a different format (their interface). On this point I think their case is relatively strong.

Where CL might be able to make a better case is their TOU. The TOU forms a contract between CL and their users. In almost all such prominent cases, this type of contract has been found to be enforceable[3]. If this line of reasoning is followed, then Padmapper will probably lose on this point because they agreed not to "copy, aggregate, distribute", etc.[4]

If this is a contractual claim the damages would be what CL could have expected if Padmapper had not 'breached' the contract - in this case I'm not sure that amount is particularly large, because Padmapper redirects to CL's site for the actual listings.

TLDR: copyright claim seems weak, but the contractual claim might succeed. The trademark stuff seems like a red herring and is CL throwing stuff at the wall to see what sticks.

[EDIT: see below, I overlooked the fact that Padmapper is no longer using CL data directly, so they might not be bound by the TOU at all. Same goes for 3Taps since they use Google's cache. Looking stronger for Padmapper]

[1] Feist v. Rural, http://en.wikipedia.org/wiki/Feist_v._Rural

[2] INS v. AP, http://en.wikipedia.org/wiki/International_News_Service_v._A...

[3] http://en.wikipedia.org/wiki/Clickwrap

[4] CL Terms of Use (see section 3), http://www.craigslist.org/about/terms.of.use


Is it clear that Padmapper is bound by the TOU of CL? Padmapper probably has a contractual relationship with 3Taps, and Google probably has a contractual relationship with CL. I'm not sure 3Taps has entered into a contractual relationship with Google by scraping their cache, but even if they did, what is the legal agreement between CL and Padmapper?


Hmm that is a good question, which I didn't consider. What I said would have applied to the original method that Padmapper used to scrape data directly from Craigslist but probably not under the new arrangement through 3Taps. Probably nobody is bound by the TOU except Google, and unless Google has a collateral contract (TOU) with 3Taps and/or Padmapper there probably isn't a claim at all. Good catch.


Since Padmapper is going through a third party - did they actually agree to the TOU?


In the meantime, CL might be able to get an injunction to stop Padmapper from using their listings, and that itself might be what they're after until a settlement is reached.


Thanks, very interesting!


Because if you ask Google to remove you from their index, they will.

Rupert Murdoch has threatened this in the past. Each time he's backed down, because he knows that being removed from Google's index is akin to being removed from the Internet.


Which shows just how much Google profits from their virtual monopoly. If content companies knew that Google would make billions of dollars selling advertisement based on the content that they generated, it would be very hard for Google to pull the massive Internet scrapping that they managed to do from day one. It is interesting that people understand the illegitimacy of taking data from a single content source (Craigslist), but fail to see how unethical is to take data from everyone and sell advertisements based on that.


Interestingly, Craigslist is being consistent when it comes to not allowing scrapers, including Google.

http://www.craigslist.org/robots.txt

edit: oops! they're not actually blocking google from the apartment listings. thanks smackfu.


Aren't they only disallowing particular pages there? Like apartments are at *.craigslist.org/apa/number.html which doesn't seem blocked by that robots.txt.


Craigslist doesn't needs to be removed from the search index. They just need to prevent caching of posts, which is what 3Taps was abusing. Noarchive in robots.txt ought to take care of that.


To your knowledge, is it against the law to not follow a site's robots.txt? I'm not aware of one, but I am curious.


I don't know if it's illegal, but Google would follow the site's robots.txt.



I can imagine some clever hacks where Padmapper uses Google targeted searches and Google's cache exclusively to get Craigslist data. Since Craigslist would never ask Google to remove them from their index, I wonder if Craigslist would still go after those who get their data as proxied by Google.


> Padmapper uses...Google's cache exclusively to get Craigslist data.

That's what Padmapper is doing right now. They're being sued for it.


I thought Craigslist is suing 3taps because they scrape their site directly?


Most likely because Google aren't using their service to compete with, e.g., Craigslist. I also imagine that if you ask Google (especially with a C&D), they'll be more than happy to remove you from their index.


I agree that this isn't a simple open and shut case. But, I doubt that PadMapper will take things the distance before rolling over.

It's not a new thing for a start-up to use Craigslist to gain an advantage (i.e. AirBnB), but they all seem to eventually be cut-off.


I was always wondering that. In essence, both cases appropriate CL's data for the users benefit, and both tend to send traffic back. I'm currently in a similar situation, where I have to ask myself how much i can scrape/aggregate/visualize 3rd party data that is useful to an user while stil providing value to the 3rd party.

I believe ultimately, it is at the 3rd partys (in this case Craigslist) discretion who is allowed to appropriate their data and who not.


I don't know if it's open and shut either - but not because of the search analogy. That's covered in the CL ToS, sections 3 and 5. (http://www.craigslist.org/about/terms.of.use)

My problem is with section 3. CONTENT AND CONDUCT, part a, where they first state "CL does not control, is not responsible for and makes no representations or warranties with respect to any user content." Then it goes on to state that posters assign a bevy of licensing rights to CL for the use of the content.

How can they claim copyright to content and at the same time aver that they don't control it?


This is where the lawsuit gets _legally_ interesting. Craigslist can't really claim copyright over the listings themselves (see post above), but they can certainly restrict user rights through their terms of use (which is a contract). Generally, contracts can be used to reduce a set of rights a person has against another (e.g. waivers), but it cannot grant rights that they did not have in the first place.

So the first part of that statement is akin to waiver, so that someone can't sue CL for what the listings say. The second part is a claim on copyright and licensing, which they may or may not have.


Because you specifically grant access to your content using the robots.txt. If you don't want to have your site indexed, then you change your robots.txt. Or like it was said before, you ask Google to remove you from the index.


CL allows Google (as a general-purpose search engine) in its TOU. It doesn't allow Padmapper/3taps. And in this context the TOU are enforceable. I wrote about this a few weeks ago here: https://plus.google.com/112961607570158342254/posts/HcCuHVG7...


Who didn't see this coming? I almost feel bad for Padmapper, but not really, because they should have known better.


If anyone is the party we should feel bad for in this case it's 3Taps. They seemed to be flying under the radar until Padmapper started using them.


They're running a service called 'craiggers' that scrapes Craigslist listings and sells an API. It's clear trademark infringement. Why would we feel bad for them?


Alright, perhaps one can argue the name is too close.

But craiggers provides a very valuable service, namely showing pictures of items as you search. It made furniture searching so much easier.

I have zero sympathy for craigslist. They didn't get into their position through "years of hard work". They won the lottery (someone had to win it). They have done zero innovation (other than maybe anti-spam) for years and prevent others from innovating on top. Society wins if they are forced to open their data.


Maybe they had a rough childhood.


"Flying under the radar hoping we don't get caught and sued for something we know is wrong" should not be a part of any reasonable business plan.


But many startups do not have reasonable business plans. They have crazy business plans that everyone knows will fail, except sometimes they don't. Many startups have begun knowing they were going to be sued, knowing they were going to have legal and publicity problems. That's the nature of the disruptive business that everyone wants to be in these days.


>Many startups have begun knowing they were going to be sued

Name one startup that began knowing they were going to be sued and went on to a successful exit.

Parenthetically, the original Napster, the one that began knowing it would be sued, was sold about 3 years after its founding for less than $2.5 million as part of bankruptcy proceedings -- and I doubt that the owners took any significant dividends out of its before its sale: http://en.wikipedia.org/wiki/Napster#Current_status


I never made the argument that these companies were looking for successful exits. I'm not sure how you read that into my post. I were merely saying there are some companies that are started even when the founder knows there is a very low chance of defending the company legally. This article is one such point. One that comes to mind where the founder was actively looking to be sued was the OnLive Windows competitor.

Some do it to prove a point, some do it to prove that the market exists.


Grooveshark will probably have started out knowing they'd run in to trouble. Jury's still out on how that's going to end though.


OK... but I still don't feel bad for crazy, unreasonable business plans that fail.


It worked for AirBnB.


When you can't beat them, sue them into the ground.

Craigslist is one of the poorest user experiences around and succeeds only because people insist on using it. The alternatives somehow manage to be even more spectacularly useless by over-designing their apps and cluttering them up with junk.

Padmapper is one of the few that does what they're supposed to do, and it's not even an optimal implementation of this sort of thing.


> Craigslist is one of the poorest user experiences around and succeeds only because people insist on using it.

So, to paraphrase, it only succeeds because something about it attracts customers. As opposed to other sites, which succeed by fairy dust and rainbows.

If padmapper has a great user experience, that's wonderful -- They should be able to get customers to post their listings on it, and generate data without relying on Craigslist. Or, if they rely on Craigslist, and what the article said is true, they could have negociated a license to the data, instead of scraping it.


It only succeeds because it has a historical legacy that's proving difficult to displace. This is the same thing driving eBay.

It's a difficult nut to crack. Kajiji seems to be gaining some traction, but it's trading one set of UX nightmares for another.


It's http://kijiji.com (looks to be ebayized now). Although in Canada, http://kijiji.ca is more popular than Craigslist.


Sourceforge looked impossible to displace and yet GitHub happened. I can't wait for the same thing to happen to Craigslist.


Why was Sourceforge so tough to displace, aside from no one having the will to do it?

Craigslist controls a two-sided market. If I build a new site, I can't get buyers without there already being sellers. And vice-versa. That's an incredibly difficult business problem.

Github seems to have had it easier. I can just host my new project on github and be done with it (my website links to it after all). I'm sure sourceforge had some value just by being the go-to place, but how strong was the network effect from a developer perspective?


To deconstruct the SourceForge decline, which felt as painfully slow as the glacially slow erosion of IE 6 market share, it's important to recognize it in context.

SourceForge and its related properties were the backbone of the early web, supporting a number of important efforts to which people felt a strong allegiance. It was like a benevolent force at the time.

SourceForge had, at the time, a fairly formal process for registration. They considered themselves more like a library where getting shelf space was a privilege not doled out lightly. This is not unlike how getting into the Yahoo! directory required a lot of begging and pleading.

While this meant that most of the projects hosted by it had a lot of merit, those lesser efforts were left out in the cold. They failed to switch to a more casual model as the "Web 2.0" philosophy started being the dominant mind-set, where expectations shifted dramatically from carefully curated content to emergent user-driven communities.

Also worth noting, GitHub's pace of innovation is so far beyond nearly anything else in the industry that it was only a matter of time before they became the superior platform in terms of technology.

Additionally they were able to ride the surge of popularity that git was gaining, something that SourceForge didn't support at the time, and persuaded a number of high-profile projects to move to them. The real coup was Ruby on Rails, which once hosted there, solidified their position.

Any Craiglist displacer would need to swing a few important, strategic deals to cement it in the minds of people as a reasonable alternative. The rest would be a case of just driving harder than Craiglist is willing to keep up with.


It is a difficult nut to crack. That doesn't give padmapper a license to scrape craigslist.


Kajiji is owned by eBay.


I would've guessed Yahoo! by the amount of visual clutter they've created.


CL won because it got the first mover advantage, and now benefits greatly from network effects. It could be much improved over what it is, but it has little incentive to change now.


Nah, they were pretty steadfast about offering only a mobile license and not being willing to talk about anything.


So because the private licensing terms offered were not optimal for PadMapper, it follows that PadMapper should just ignore their public licensing altogether?


You seem to be straining with "not optimal" phrasing here. It was only for mobile apps. PadMapper is a website. Non-starter.


When you can't beat them, steal their work product.

Padmapper is (indirectly) screen-scraping Craigslist's data in an attempt to unseat Craigslist.


If these were articles written by professional writers, you might have a case. Instead these are posts written by many different individuals trying to advertise their properties.

Craigslist didn't write a single one of those posts. They try and assert control over this content through their terms of service:

"You automatically grant and assign to CL, and you represent and warrant that you have the right to grant and assign to CL, a perpetual, irrevocable, unlimited, fully paid, fully sub-licensable (through multiple tiers), worldwide license to copy, perform, display, distribute, prepare derivative works from (including, without limitation, incorporating into other works) and otherwise use any content that you post. You also expressly grant and assign to CL all rights and causes of action to prohibit and enforce against any unauthorized copying, performance, display, distribution, use or exploitation of, or creation of derivative works from, any content that you post (including but not limited to any unauthorized downloading, extraction, harvesting, collection or aggregation of content that you post)."

The last part of that may not be enforceable. They're asserting that they are singularly responsible for "authorizing" reproduction despite not having copyright for the material in question.


Craiglist doesn't need to write those posts. Under their TOS, Craiglist has a license to the posts (which are copyrighted by their owners ). This license gives Craiglist the authority to enforce copyright against persons who interfere with Craigslists' license in the posts. Thus, the last sentence of their TOS is perfectly legal and enforceable. The license would not give Craiglist a license to do anything to other services on which the original posters also post their listings to. But that is not the case here.


From what I've read, only the actual copyright holder has the right to sue others for copyright infringement. See the Righthaven case from last year.

http://arstechnica.com/tech-policy/2011/09/righthaven-copyri...


They are plenty of cases where copyright enforcement is voluntarily handed over to a third-party: for example, Hollywood to the MPAA, or the BusyBox developers to the SFLC. Does anyone know what the difference is?


Righthaven is different. Righthaven did not have a copyright use license, so it did not have the right to enforce the copyright. (A copyright enforcement "license" is a non-license, legally, since it does not convey any actual copy "right" such as distribution.)

Craiglist does have a copyright use license, and thus has the right to enforce the license it has been granted.


> in an attempt to unseat Craigslist.

Unseat Craigslist as the undisputed champion of what, exactly? Cartographical apartment hunting?


They are the champion of free listings, and built a huge user base around their dead simple platform.


No doubt about that, I just don't know if padmapper is really trying to be the next Craigslist. Sure, you can list your pad, but you can't buy a bike or get a job (hand or otherwise) on padmapper, afaik.


I'd describe them more as the tyrant of free listings.


Not true. You wouldn't link back to Craigslist if you wanted to unseat them.


"Craigslist is one of the poorest user experiences around and succeeds only because people insist on using it."

If a lot of people INSIST on using it, then that means it has good UX. Complaints with their UX should be qualified.


People insist on using it not because it has a good user experience, but because of content. It has a terrible user experience. Don't confuse the two things.

For instance, many payroll companies have truly awful sites, yet people use them since they have no other choice.

As a specific example, searching for an apartment should not mean reading every single listing in order to discern where the property is, what features are available, and what conditions apply. That people post the same listing repeatedly until it is rented is not helping matters.

eBay, which is arguably still quite primitive, has much better filtering options, alerts, and a single listing that persists until the auction is completed. Craiglist is a series of random, repetitive posts with only a minimum of categorization.

Padmapper provided a superior way to navigate listing data and present it in a much more meaningful presentation. Being able to "favorite" listings helps when filtering.


I'd be interested to see interaction studies in the non-tech community about Craigslist, because I haven't seen a person in non-tech find difficulty in posting on Craigslist.

Also, don't confuse my opinions about Craigslist's design/UX with an opinion on the case. I've used Padmapper extensively. While I believe that Craigslist has a right to do what they please with their content, I don't see what Padmapper was doing as wholly legally wrong.


Posting is not the UI problem. Browsing posts is the problem.


Craiglist has a great UX for a classifieds board: location > type of listing > listings by date.

It may look butt ugly, but it is as close to a functionally efficient classifieds board as you can get. Craigslist isn't trying to be anything more than that.


For many classified ads, this is true. However, for most people "location" being defined on a city-wide (or for some rural areas) state-wide basis is nowhere near an effective location filter for apartment/housing. Most people I know have a fairly short list of neighborhoods/locations that they want to live, and it's a huge pain to find those postings in craigslist's interface.

This doesn't mean that PadMapper is right, from a legal standpoint, but I think it's fair to say that craigslist's UX is terrible, for apartment/house searches (where small variations in location matter a LOT) at least.


I am sure Craigslist has many more users than Padmapper. It would he hard to argue that Craigslist can't "beat" them.


>"Craigslist is one of the poorest user experiences around and succeeds only because people insist on using it"

Please use the proper legal word - "Monopoly". I wonder how the Department of Justice decides who to prosecute?


Heartbreaking.

Padmapper is the best apartment rental interface out there. The author has been single handedly working on this for 3-4 years and constantly improving his system.

Legally he might not have been in the right but we should be helping him out not uselessly debating and casting judgement.

If theres anyone who exmplifies the true spirit of Hacker News it would be the author of PadMapper.


> Legally he might not have been in the right but we should be helping him out not uselessly debating and casting judgement.

If he's not "legally not in the right," why should I be helping him (doesn't that imply casting a judgement - that PadMapper is elevated above CraigsList because it's "the best apartment rental interface our there")?

What if the tables were turned and CraigsList was violating PadMapper's TOS? Would you be willing to help CraigsList (they serve more than only apartment listings)?

Should somebody be rewarded, even if (legally) not in the right, just because he/she is a hard worker/smarty pants/exemplifies the True Hacker Spirit?

I often wonder if there is some sort of unwritten clause that You-Must-Pull-For-The-Underdog to join Hacker News? (In the name of Divine Disruption, of course.)


> "If he's not "legally not in the right," why should I be helping him (doesn't that imply casting a judgement - that PadMapper is elevated above CraigsList because it's "the best apartment rental interface our there")?"

Because legal right and moral right sometimes intersect, but not nearly always.

Because once in a blue moon somebody comes along and creates something with such enormous public good that it forces us to reconsider the context of both legal and moral righteousness.


That's all well and good, and I understand the point, but it does't appear to apply to PadMapper.

Personally, while I'm not a big fan of Craiglist's UI, it's stretching it to think that PadMapper would be classified as "such enormous public good" simply because a small subset of total Craigslist (and Internet) users (most likely in a small subset of American cities) are fond of it.


Padmapper ought to do a better job of leveraging this publicity. Apartment season is in full swing in college towns across the country and they need to push really hard to get the word out about Padlister: in particular, the fact that it will cross-post to Craigslist automatically (so you don't lose any viewers). This doesn't matter for realtors - they already have systems to autospam Craiglist. But it does matter to self-listers. They could get good traction targeting roommate listers and seekers as a way to build organic listings and eyeballs. Create a viral campaign. I would share it, and I know plenty of other people who would because Padmapper has made such a difference.


http://www.quora.com/Why-hasnt-anyone-built-any-products-on-...

In particular, read the comments by Craig himself. Funny how these things change.


Snip from Craig - "Actually, we take issue with only services which consume a lot of bandwidth, it's that simple."

I guess that wasn't true.


What's it like to work as Craigslist and read this stuff? How could you be excited to come to work every day knowing that network effects have carved a deep moat around a site that people describe as "an exercise in torture"?


"Yay I have job security" most likely.


This is absolute bullshit. Craigslist is located in San Francisco, and they should be intimately aware of how fucking difficult it is to find an apartment in this city with Craigslist's garbage interface.

Are there any Craigslist employees reading this? Would any of you like a full time job sorting through the cesspool that is your apartment listings page? Oh, you wouldn't? I don't blame you, because that's my job right now, and PadMapper is the only thing helping me preserve what's left of my sanity.

Do a better job than them or shut the fuck up. Stop hurting people by trying to shut PadMapper down.


A key part of building a business around stealing IP is getting acquired by a big firm that can fight legal battles. YouTube executed well on this plan; I'm curious how Padmapper planned for this obvious development.


Or alternatively, steal IP while nobody is looking and then launch an IPO that will pay for the lawyers: that is what Google itself did.


Padmapper doesn't steal IP.


It's pretty clear that Padmapper is making use of the craigslist posts without the rights to do so, which is defined by Merriam-Webster as "stealing".

  STEALING
  transitive verb
  1a : to take or appropriate without right or leave 
  and with intent to keep or make use of wrongfully
http://www.merriam-webster.com/dictionary/stealing

If you're going to downvote me, that's fine, but if you do, please show how copyright infringement does not fit within the definition of stealing. I'd like to settle this semantic debate once and for all.


Padmapper doesn't infringe copyrights either.

edit: Ok, I won't be so cryptic. Padmapper doesn't copy the posts on Craigslist, they link to them. That's not a semantic quibble. All search engines do so. If I blogged that I was renting my apartment and linked to the Craigslist post describing it, I would be doing exactly what Padmapper is doing.


I think there is a significant difference between copying data, and just giving pointers to CL's data.

However,

If I blogged that I was renting my apartment and linked to the Craigslist post describing it, I would be doing exactly what Padmapper is doing.

This doesn't line up in a few ways. Padmapper is a third-party, not the original poster. There is also a distinction between doing it as a one-off and doing it in an automated fashion.

Padmapper needs to make it trivial for people to "upload" their listings from CL to PM. It could be a single bookmarklet.

Now, suppose that Alice tells Padmapper about Bob's apartment for rent on CL. That's getting even weirder, but PM might be okay if they follow DMCA safe harbour procedures.


If linking without permission is problematic, Hacker News is in real trouble. So is just about every blog and news site on the web.


Right. I've already agreed that there is a significant difference between displaying copies of CL listings and just having pointers to them.

It doesn't seem, however, that the link is the crux of CL's complaint. Rather it's the fact that PM has copied CL's data in order to generate a mapping of physical locations to CL listings.


You can't copyright data, only its expression. That a certain apartment is available for rent is a fact. The text that appears on the CL website is one expression of that fact. The map that appears on Padmapper is another expression of the same fact.


Because it's copyright infringement, that's what it is, no one disagrees on that. Using "steal" tries to load copyright violations with a specific meaning, and that is not agreed on. It frames the debate, and people take issue with that as it creates a more emotional and less objective discussion.

Also: http://lesswrong.com/lw/nz/arguing_by_definition/


Nope, not copyright infringement either.


"Find which dictionary phrases it the way I like" is a pretty silly game, especially when they're so vague about the definitions they use anyway.


There ought to be a rule that when someone reaches for the dictionary in a debate they've already lost. I keep seeing this on HN - a dictionary is probably the worst place go find out about what a word really means.


I hear people say "They stole my idea!" or if they hear a familiar melody, "They stole that from The Doors!"

So the dictionary agrees with the common use of steal in the realm of intellectual property. It's really the proponents of copyright violation who have redefined stealing to exclude copyrighted digital media.


Maybe, if we get a rule that people aren't allowed to say no, they didn't _steal_ your work, because you still have it.


It's all framing, I'd be happy if we stopped arguing over the word "steal" all the damned time instead of addressing the finer points of copyright and its role in our society.

The people who will whip out dictionaries over "steal" are crudely trying to inject an emotional response into the debate and trying to override rational discourse with rhetoric.

The people who are constantly whipping out "you still have it" are naively pretending that, just because it doesn't fit within all the criteria of physical theft, it also means that the other moral concerns associated with it are equally moot.

tl;dr: Both sides of the "stealing!" debate are being incredibly disingenuous.


Are you referring to stealing in terms of ethics, or theft in terms of the legal system? Ethics are subjective, so I don't really want to go down that road. In terms of US law, however, the Supreme Court case Dowling v. United States found that "interference with copyright does not easily equate with theft, conversion, or fraud." (src: https://en.wikipedia.org/wiki/Copyright_infringement#Theft )


Does craigslist actually own the copyrights to user generate text and images?


Of course. From their TOS: "You automatically grant and assign to CL, and you represent and warrant that you have the right to grant and assign to CL, a perpetual, irrevocable, unlimited, fully paid, fully sub-licensable (through multiple tiers), worldwide license to copy, perform, display, distribute, prepare derivative works from (including, without limitation, incorporating into other works) and otherwise use any content that you post."


That just means that the users of Craiglist grant them an unlimited license. The users still own the copyright, though.


Where does Padmapper's license to the listing data originate? When do the posters of listings grant Padmapper a license of any kind?


You don't need a license to link to something.


The listing data that is presented and rendered on Padmapper's site isn't linked (e.g. the price & # of bedrooms). That is, I don't have to click a link to go to the original site to view the content.

Padmapper does link to images, but Padmapper would be useless if it didn't also re-publish the content jacked from Craigslist.


Copyright doesn't apply to data. Only the text posted to CL is protected, and Padmapper isn't using that.


I'm not a lawyer. Perhaps you are, and you are certain that the facts of the complaint will be grounds for summary dismissal. My limited legal knowledge leads me to believe this will not happen, which would mean that Padmapper is going to bleed money, time, and energy until the case is resolved.

That was my main point; it's fairly useless for non-attorneys to argue legal cases on HN (my apologies if you're a lawyer and I'm just wasting your time).


No, you're right. I'm not a lawyer either, and Padmapper is definitely in a tough position. I'm mostly arguing against the characterization of Padmapper as "stealing content." Most people here seem to think that Padmapper is in the wrong and it's only a question of whether they can get away with it.

AFAICT, Padmapper is doing something very much in line with what journalists and search engines have traditionally done. That point of view may not prevail in court, but I think it's completely defensible on ethical grounds.


Having a broad license is not the same as owning the copyright (especially a non-exclusive license).


s/stealing IP/behaving in a manner that may be or may not be stealing IP but probably looks enough like stealing IP to a judge to pass legal muster and require an expensive legal defense/g

The point is that simply defending against the lawsuit could kill Padmapper. Lots of time, money, and energy go into lawsuits. Conveniently, those are three things startups don't typically have in excess. The outcome doesn't matter for Padmapper if Padmapper dies during the defense.


Here are a couple of legal questions (not taking sides, just honest questions):

Does the data belong to Craigslist or to those posting on Craigslist?

If so, is this in their TOS? I don't remember ever clicking a checkbox that says "we own your listing" whenever I have sold stuff on Craigslist or posted jobs/gigs. Maybe real estate is different.

Why would I want to give Craigslist ownership of my listing? If I were posting a property for rent I would probably want other services to pick-up the listing.

Similarly, why do they own the Copyright on something I have written?

Again, I don't remember ever giving that away either.


Last time this was discussed, people posted excerpts from Craigslist's terms, in which the user grants Craigslist the right to (i) Use their post, (ii) defend against others copying it.

So, I haven't verified it, but the idea seems to be the user owns their post, and could post it elsewhere if they wanted - but Craigslist has the right to stop other sites copying posts direct from Craigslist.


There was a recent case where the judge ruled that a company that didn't own a copyright, but was assigned a license in order to sue infringers on the owners behalf, didn't in fact have standing to sue.


I'm fairly sure that craigslist does NOT own the data, the person posting the listing does. The article mentions this.

That's what makes this a little less open and shut, at least to me.


Why is craigslist going after people who bring them traffic? Is it because they don't bring in much money themselves so they have to sell licenses? I've never really thought about craigslists business model. I assume it's almost entirely ad based / premium placement of your ad on their site based.


From a Wired article, circa 2009 (so this may be out of date):

"Craigslist is not only gigantic in scale and totally resistant to business cooperation, it is also mostly free. The only things that cost money to post on the site are job ads in some cities ($25 to $75), apartment listings by brokers in New York ($10), and—in a special case born of recent legal trouble—advertisements in categories commonly used by prostitutes, because authorities encourage vendors to maintain a record that would aid investigators. There is no banner advertising. They won't let you join them, and at this price you can't beat them either."

http://www.wired.com/entertainment/theweb/magazine/17-09/ff_...

Usually, when companies get mad about scraping, it's because it's either bypassing the ads that make them money, or in rare cases bypassing a premium API. Here, PadMapper is doing neither.

In fact, if Craiglist is so committed to not updating their ugly website, why don't they just license out a public API, killing two birds (new monetization strategy and having a better interface for free) with one stone?


The first step to beating Craigslist at apartment listings is to get everyone to use a different front-end. (ie. Padmapper) Then you offer a way to create listings entirely within that front-end and bypass Craigslist entirely. CL becomes little more than a backend database, and eventually goes out of business as no one needs to use it anymore.


I'm guessing that PadMapper said no to paying for API access, for some reason.

Though to be honest that makes no sense, because that's exactly what they're getting from the third party site, and they're paying for that...


AFAIK, there is only a license available for mobile apps. Not websites.


Yeah, it doesn't seem like a smart business decision to me. It would just make sense to pursue this new revenue stream, and license out an API. This response just seems like more of an emotional response than a business strategy.

Edit: Not sure why this comment is getting down-voted, but if I offended some one that was not my intention.


The same reason facebook bought instagram: craigslist doesn't want any other web site's userbase to become the kernel for a replacement listing service.

Once you have a web site that attracts buyers, you can replace craigslist data with your own listing service and stop paying for their API.

They clearly don't fear this kind of competition in the mobile space, hence offering licenses for mobile-only API use.


Job listings in select cities cost money, I believe that's their only source of income.


Craiglist doesn't get traffic from PadMapper. PadMapper gets nearly all of its traffic from Craiglist. Craiglist can charge for mobile licenses because people want the traffic that Craiglist can bring them.


PadMapper gets no traffic from Craigslist. Craiglist gets traffic from PadMapper.

PadMapper is basically a pointer to a listing on Craigslist (when Craigslist listings are the reference), where you can search the pointers with filtering that you can't on Craigslist. In order to get to PadMapper, you have to visit the site directly.


No, it's the other way around. I view real Craigslist pages but use Padmapper to get me there.


I'm rather tired of all the sanctimonious, edgier-than-thou commenters and bloggers who keep saying CL has a "shitty UI." And that't not just because I like it (it's clean, simple, and easy to use). I totally agree that it could be improved, and maps do make it easier, but the CL interface never breaks or gets in the way the way an actual shitty UI does.

Also, I think a lot of complainers are not aware of what it takes to roll out UI design changes to hundreds of millions of users. Frankly, I'm not very knowlegeable myself, but I can stop and think about it for a second and imagine there are huge logistical issues (performance, browsers, etc.) and huge psychological issues (non-technical users, familiarity, etc.)

And then there's the whole question of whether CL's UI should be improved. There are clear benefits to having a simple UI that does not do everything for the user. When I used it to find housing near SF, I copy-pasted addresses to map them, and I did a little extra work, but I never found that CL was actually in my way. For neighborhoods and recommendations, I asked real people. When I looked for a car, I made a little spreadsheet to give weighted scores to the features I wanted--I did not go around whining that CL didn't do everything for me.

My impression is that CL wants to be a basic classified service. For whatever reason they don't want to morph into a specialized listing service, and I can understand and respect that. After all, the CL interface is not that much different from HN's.

PS: I've had this item open in my browser for maybe an hour or two--why does the Add Comment button turn into a "dead link" OMG such a shitty UI, if they won't fix it, I am justified in rehosting all the content on a better site.


The millions of people who use padmapper every month seem to think it is a preferable experience to going on Craigslist, trying to figure out if Mountain View is a "Peninsula" or "South Bay" city, and then discovering the wonderful diversity of places that humans will call "Mountain View" one listing at a time.


There's no doubting the legality of Padmapper scraping Craigslist data, so I think now would be a good time for him to innovate and really push his other product, PadLister. If Eric were to write another blog post, I'd make PadLister the focus of it and start building a new community there.


I know Calacanis is a polarizing figure in the community, but I've always agreed with him that you shouldn't piggyback your company's success on top of a single service.

edit: especially one that doesn't allow you to use their data in the first place.


I wonder why craigslist hasn't shut down http://housingmaps.com yet. They appear to be doing the exact same thing as padmapper and mapskrieg, albeit with a much smaller feature set.


I feel like there is a win win solution for everyone here.

How about Padmapper develop a plugin for chrome/firefox that once you go to a craigslist appartment list page, it does what it does now: loops through them all and displays them on a google map. Padmapper could even develop on API on their end which handled all the hard work of actually finding the correct address of the posts and sends it back to the plugin in a nice format and all it has to do is plot in on a map that the plugin places on the craigslist page.

This sounds legal to me, as Padmapper isn't actually storing anything, and it puts a UI on top of the apartment search page.


That sounds similar to RECAP, which is a free interface to the for-money PACER database of legal filings.

RECAP is on a better legal footing because the filings are public domain instead of owned by someone.


How does this differ from Readability or Instapaper stripping advertisements away and serving users pure content in their respective interfaces? I'm not sure what they're doing is illegal.


Readability is just client-side code. Think of it as just a different implementation of a browser that ignores some markup and renders some markup in a non-standard way.


Ha. They need to sue Google and other search engines that provide caches. Good luck with that.

Second, it's not infromation that's protectable by copyright (at least not in the US). It's like the telephone book as others have said.

There's heaps of websites that mistakenly believe they have some copyrights to assert against users that harvest data. But it's not just websites. A longstanding, classic example of this sort of silly argument is WHOIS. The registries can't stand the thought of others getting the data. They claim bandwidth is an issue. Yet they won't provide (mirrored) bulk data, which would easily resolve any such bandwidth claims. They purport to "license" the right to do things with the data, yet they have no rights to grant to begin with. It is public data.

Then there's the "spam issue". As if email is somehow different from direct mail, and from telemarketing. Why don't we outlaw those practices?

Anyhow, again, it's public data: addresses and phone numbers. We call it the telephone book. If, e.g., a brokerage wants to have its summer interns do cold calling to solicit new institutional investors, the interns do not need a "license to use the telephone book". If someone listed in the phone directory does not want to be called, there's no law that says they have to be listed; they can use a nonlisted phone number.

People who list with CL presumably want their information to be found and to be presented in an appealing way. Maybe not exclusively on CL. Maybe CL should ask them what they would prefer.


I hope that PadMapper does not back down. Craigslist is making their data publicly displayed. Consumers consume it - be it people or machines. It will be interesting how this plays out. Screen scraping is analogous to the analog loophole: https://en.wikipedia.org/wiki/Analog_loophole.


Why did Craigslist decided to sue padmapper but let housingmaps.com live for many years is puzzling to me. Anyone has an idea why?


I think Padmapper in particular is in a unique position where while they started with CL listings, it's proven itself an invaluable enough service to enough of us to the point where I'd use it exclusively to list my apartment over Craigslist. I'd even pay month-to-month for exclusive or time-sensitive listings given the way I use it.

Whether or not Padmapper is in the right here, CL's long-standing stance in not evolving their UI or creating a decent API is going to bite them in the ass. Helping people sort your content is now more important for them than the idea that change is bad or time-consuming. As many people have said in the past, it wouldn't take much to really bring CL up-to-speed, but if they don't want to do it themselves, they should at least make it an option for others to pay a fee to use the data.

Edit: Didn't realize http://www.padlister.com/ already existed. Score.


Why doesn't Craigslist want PadMapper using their information? (I know that's irrelevant to the lawsuit)

The more people that use PadMapper and find CraigsList entries on there, the more people that are using CraigsList.

Why wouldn't they want that? If I were a real estate listing company, I'd want my data to be found on as many real estate search apps as possible.


One reason is that once they're big enough, they can just flip the bird at craigslist and use their own data.


I think a lot of the work to put a quality UI on craigslist ignores something very important about user psychology - seeing your data that you willingly gave to one organization publicly available in the hands of another one is generally perceived as creepy. Apartment listings may be less sensitive than other kinds of data, but they're still personal, and some users may not want their listing showing up in different venues even if others will take whatever publicity they get.

Most programmers and designers have made peace with the idea that what's public on the internet is public for the whole internet, but I think most people perceive this kind of information leakage as a lack of control. I think that Craigslist, more than most companies, has a real understanding of this, and a lot of their restrictions that may seem arbitrary to entrepreneurs come out of it.


Google indexes craigslist (obviously)

https://www.google.com/search?btnG=1&pws=0&q=iphone+...

They even have cached thumbnails and you can likely use their custom search API to get results, snippets, etc.

Is what 3taps does that different?


"unlawfully and unabashedly mass-harvesting and redistributing postings entrusted by craigslist users to their local craigslist sites"

Boo on Craigslist! They claim that they're protecting their users, but what harm is padmapper causing to people listing properties on Craigslist? Getting them more potential renters?


Craigslist has the better of the arguments; both Padmapper and 3taps are violating (and probably bound by) the terms of use agreement: https://plus.google.com/112961607570158342254/posts/HcCuHVG7...


My view on this is much as it was before:

- The Craigslist apartments listing interface really, really sucks.

- Padmapper's search interface is much, much better.

- CL almost certainly have legal recourse to control use of their data and trademarks, or at least make things miserable for PadMapper should they choose to do so. Which would put a crimp on PM's future growth prospects as a lawsuit magnet.

- CL undoubtedly add value to their listings by filtering (through community input) for misclassified and spam listings.

- I really wish the two organizations could reach an equitable compromise which preserves the PM interface concept. Licensed use. Buyout. Whatever.

- Barring that, CL are ripe for disruption. My read is that PM have enough mojo at this point to make it on their own by licensing listings from other sources and soliciting them on their own.


In my mind this article, and moreover the lawsuit itself, may be a glimpse into what might be a revolutionary revamp of Craigslist and it's community.

Its no secret that the CL community have frequently requested an update to the UI as well as many additional features to be added in order to make using the site much more friendly; although with no changes being implemented at this point. I look at this litigation as a foreshadowing of what may be in store for the CL community.

With that said, shutting off access of those parties utilizing their services without proper consent may be the first in a strategic battle plan for the CL team in order to ensure they have the most optimal acceptance of their revamp.

Anyone else see it this way, or am I indulging wishful thinking?


If you look at the case of Righthaven (copyright troll), a federal judge ruled that even though the copyright owner granted them a license to sue infringers to stop copying, Righthaven didn't have standing to sue because they didn't actually own the copyright.


I'm getting sick of the general attitude here. A majority of people seem to think that it's OK to copy user posting data from CL, because they are a monopoly. IE is cited as an example of a monopoly, thus anything is OK to take it down.

That attitude is horribly wrong. Agreed, CL is a monopoly. Agreed, they are holding back innovation. But, you don't disrupt a business (and build yours) by copying their data, especially illegally. Although, the legality of issue at hand is up in the air right now, the general attitude is a bit appalling.

What's required is a way for users to post their listings simultaneously at other properties. The content of the postings belongs to the user and they should be free to re-post it anywhere.


Has there ever been any consideration at some kind of partnership? I know there may be some bad blood at this point, but I don't see how it wouldn't benefit both Padmapper and Craigslist to potentially work together to create an alternate Craigslist(using the same data). The contracts would have to be thorough about profit sharing, but it seems at least possible.

If people had a secondary Craigslist portal they could use to access the same data, while avoiding the shitty interface, more and more might migrate over to the new 'design'. The current Craislist design has been dated for some time, and I just can't see a startup or even a big player not moving into their space with a better design and interface.


Eric from Padmapper has tried to get an audience with them, but both Craig and Jim wouldn't take a meeting. They sent a license, but that was only for mobile apps. Most people use padmapper though the website.


A few of humble opinions:

Padmapper is probably getting enough publicity out of this to get over the critical mass problem and will be able to rely on their own listings.

Craigslist content is user generated, and the copyrights remain property of the users. While facts aren't copyrightable anything that padmapper displays that isn't just a fact, photos or a realtor's description for example, is a copyright violation.

This is why sites like stack overflow are awesome, because they licence "their" content, aka your work cc-wiki and claim no ownership.

The bummer here is not that craigslist is suing over unauthorized data access, it's that your work is too restrictively licenced on their site to be of maximum benefit to you.


this reminds me a lot of Feist V Rural [1]; I just hope it turns out the same way.

[1]: http://en.wikipedia.org/wiki/Feist_v._Rural


I'm very interested in the outcome of this as I am building a service similar to Padmapper in the sense that it uses data from other sources to make a better user experience for people, and I'd love to include Craigslist (Would be a big supplier of data for my users), but I don't want to deal with these types of things so I have been leaving it out.


I think the key thing here is that you have to talk to your data sources. You have to use their data source the way they want you to, or come to some sort of agreement (licensing or somesuch), else you are moving into the same legal territory Padmapper was in and risk getting sued.


Do as they did: build it and wait until the C&D. Get a massive amount of free PR and hopefully you can go from there.


The USER intent is to have their listing seen by as many people as possible. If the courts decide that the listing site can block access to such public listings they have been entrusted with, I don't know how the law needs to change, but the current legal situation is bad for the general public, bad for commerce, and bad for progress.


At least they're not claiming a violation of the CFAA (which I think would actually be a more straightforward argument).


Care you explain? I don't know what the CFAA is.


http://en.wikipedia.org/wiki/Computer_Fraud_and_Abuse_Act

The relevant part of the act would be: "Intentionally accessing a computer without authorization to obtain ... information from any protected computer". A "protected computer" is a computer "which is used in or affecting interstate or foreign commerce or communication", which fits Craisglist pretty well.

Violation of the clause I quoted is a criminal offense with potential jail time.


That clause could be construed in any number of ways. Anyone accessing a web page could be liable if they have not 'obtained authorization' from the site owner.


I think they tried to circumvent that using 3tap or whatever the site was that was scraping Google's cache.


Compare & contrast the CL lawyers' behavior with the Jack Daniels trademark lawyers a few days ago.


Craigslist makes its data publicly available. How can it be illegal to access that data? It may be against Craigslist's Terms of Service, but that means it's up to Craigslist to figure out how to prevent scraping. It does not mean the law needs to be involved.


This is the beginning of the end for CL. You can just sense by actions such as this what their survival strategy is. While legally they may win this battle, in the long run, I think CL is headed to deadpool in the next 5 years.


Does anybody know if Craigslist is cracking down on all the Craigslist iPhone apps?


I'm guessing they're not cracking down on the ones that opted to contractually license the CL data.


they won't be if they've paid to use the craigslist mobile api.


They have an api for mobile apps. If you haven't purchased it, doesn't that tell you all you need to know?


I love Craigslist, and have used it for years successfully. It is the type of tool I love using the web for - simple, functional, and document based.

All of this anti-Craigslist sentiment makes me sad.


Reminds me of Craig Newmark's lawsuit against the networks a few years back. In that case he wanted to assert the right to use DVRs to skip commercials.


Well, I can't say this was a surprise, but it is disappointing. I hope Padmapper has the wherewithal to fight the suit. Bon courage, Eric.


Craig is paranoid about changing the UI of craigslist. So while I don't feel they're unjust in suing, he is indeed stifling innovation.


Why wouldn't he offer an alternative UI for craigslist while leaving the original running?


If Craigslist spent as much on UI or API designers as they did on lawyers, we'd all be better off.


If what 3Taps is doing is illegal, then what Google is doing is illegal. It's a search engine.


If you send google a c and d, they will comply.

Pad mapper was sent a c&d and decided to not comply. Hence the case.

Edit: and as has been pointed out, google isn't competing with Craig's list.

Padmapper's gameplan is to disintermediate Craigslist away.


If we are lucky, they will settle for PadMapper's assets - and use them.


> Users and developers are exasperated with Craigslist’s insistence on preserving an outdated interface and design.

Well then go use / collect your own apt listings, lobby congress to pass compulsory use licensing for all entities that collect data, or STFU.


How Craigslist can evolve:

Become a for-pay API.

Thoughts?


It is up to Craigslist, and Craigslist only. I prefer not to speculate.

They already offer an API for mobile.


Craiglist doesn't display ads


Craigslist is ads.


The suit is understandable, but I wish craigslist would get over themselves already and allow users to look at listings on a map. That's all anyone really wants here, and this is getting a bit ridiculous.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: