Search engines and SEO spam (twitter.com/paulg)
592 points by iamjbn on Jan 3, 2022 | 534 comments



This was in response to mwseibel's thread, which had a big discussion yesterday:

Google no longer producing high quality search results in significant categories - https://news.ycombinator.com/item?id=29772136 - Jan 2022 (1167 comments, spread over multiple pages - note the "X more comments" links at the bottom)


To some extent, I worry that the problem with search engines is that there isn't any data worth returning. Yesterday's thread talked a lot about reviews. Writing a review is hard work that requires deep domain expertise, experience with similar products, and months of testing. If you want a review for something that came out today, there is no way that work could have been done, so there simply isn't anything to find. Instead you'll get a list of "Best TVs 2021" or whatever, with some blurb and an affiliate link, not an actual review. That's what people can make for free with a day's notice, so if you write a search engine that discards those sites, that's fine, you'll just return "no results" for every interesting query.

I guess what I'm saying is that if you want better reviews, you probably want to start writing reviews and figuring out how to sell them for money. Many have tried, few have succeeded. But there probably isn't some Javascript that will fix this problem.


I think one of the fundamental things that made search work well 1-2 decades ago was that web sites would link to each other, and those links vaguely correlated with reputation. There were link spammers, but there was some decent organic content as well.

What's happened since then is that almost all of the normal "people linking to things they like" activity has gone behind walled gardens (chiefly Facebook), and the vast majority of what remains on the open web is SEO spammers.


Why have blogs and articles stopped linking to things? I'm reading a restaurant review site, and they won't link to the restaurant. The chef's name is a link to a list of all articles tagged with the chef's name, rather than a Wikipedia link or something useful that could tell me who that person is.


The average website's goal is now to keep you on it as long as possible. According to some metrics folks, the longer you stay on a website, the more money you spend there. Linking to another website destroys that metric.

Also, if you are going to make a purchase somewhere, any website will try to get a cut of the money you spend by using referral links to the product. So small websites that do not offer this service will not get linked as much.

On a meta level, the point is that links and connections between items are information. Information is money. And as soon as that became evident, links and connections became scarcer.


> According to some metric folks, the longer you stay on a website the more money you spend there.

That is really sad. Metrics folks inventing metrics for the sake of metrics, which correlate only dubiously with the company's profitability.


Yup, and developers have been allowing the marketing and product teams to break the back button as well, opening every external link in a new window so users have to keep something open to their site. You always had middle-click to do this, but now it's being forced on users.


I just noticed even the goobers at GitHub break the back button when you click a project link too. I don't know why people champion this brand when they have dark patterns and shoehorn 'social' functions into the proprietary platform.


This is a prisoner's dilemma of sorts, and the whole free web is losing.


Because, years ago, linking to lower-reputation sites would drain your PageRank.

So everyone worried about SEO became afraid to link to anything except:

1) Their own website

2) High-reputation sites like the NYTimes, etc.

It's sad. Makes it harder to navigate the web.


Bang on. Saying that "there isn't anything out there anymore" is missing the point: Google's algorithms created this situation, intentionally or not. Before Google, people linked to what they wanted and communities would naturally cluster around topics of interest. Google came in and made reputation into a currency which effectively destroyed all these communities through incentivizing selfishness.


"When a measure becomes a target, it ceases to be a good measure"

-- Goodhart's Law.

Google's algorithms didn't create this situation; people chasing high Google rankings did. Had Google used completely different algorithms yet become equally dominant, people still would have poured their hearts and souls into getting higher rankings.

Basically, an application of the tragedy of the commons. Or: "why we can't have nice things".


But that's taking for granted that Google would have become dominant. Perhaps if they hadn't chosen the algorithm they did then they wouldn't have been as overwhelmingly successful. Instead, I could imagine a world in which there are multiple search engines and none of them are all that good. In fact, that's the world I remember from before Google existed. Search was bad but communities were strong and life was good.

Then Google came along and we all found it a lot more convenient than the bad search engines we were used to. And of course, we all know where that led. In some sense, Google built an 8-lane superhighway and bypassed all the small towns.

We all traded away paradise in exchange for convenience. Now we have neither.


On the glass-half-full side of this: we're getting those communities again! Here on HN, on reddit, for certain topics on various social media (there are pearls there too), on Mastodon, various blog authors, Ars Technica, Quanta, etc. [1]

It's just fragmented - i.e., catering to a specific group. Because if it isn't, it's awesome for 5 minutes and then monetization rot sets in.

[1] None of these work for everyone; conversely, all of these are seen as great things by some and have people who prefer that one thing over others for its quality.


The trouble is, you are no longer "surfing" the Web; you are digging through your RSS feeds, links to interesting sites, fediverse subscriptions, etc. That's not good UX, period.


>Google's algorithms didn't create this situation; people chasing high Google rankings did.

You're technically right. You'd be more right if you said people chased the highest spots on search engines for the widest breadth of queries.

If there were an implicit alphabetical ordering of search results, I guarantee you'd end up with a bias toward A's, Z's, or whatever, in people trying to get top spots.


>Google's algorithms didn't create this situation; people chasing high Google rankings did.

But Google lowkey incentivized such behaviour by not being open and transparent about how exactly their algorithms work.


That would have allowed people to artificially chase rankings even faster and more efficiently. It makes the problem worse, not better.


How is transparency worse than the smoke screen we have today? For example, healthy and good websites could rank according to good content, good optimization, variety of multimedia content, decent design and UI, etc. You can't have too much of those good qualities; that would be like writing a book that's too good or making a product that's too good.


Because the rank algorithms are subjective heuristics, not absolute metrics. All rank algorithms always have been. It started with the link metrics, then people started gaming that. It's been a signal/noise war ever since.

It's also dangerous to ask for the exact criteria because they are ever changing. Google et al don't want to be prescriptive about what a good site is, they want to recognize what a good one is. You make a good one, they'll figure out how to recognize it.

They can't sit down and publish "The Definitive Guide to a Good Website". That's just not their role and it will be out of date before it's published.


I understand that Google cannot prescribe and direct what websites should look like, but more transparency on their part wouldn't hurt.


A big problem is that a lot of community content went behind Facebook. Instead of creating webpages or forums, people started using Facebook pages and Facebook groups. This is the main reason I have been anti-Facebook for over a decade: not because of privacy, as for many people, but because I saw that Facebook would put the web behind its closed doors. Even today, some of the best reviews of any product or service are usually in enthusiast community forums. But a lot of that activity has gone behind the closed doors of Facebook and now Reddit. Most of the currently thriving forums are those that predate Facebook.


Surely there is just a different algo that could bring about better communities?


Different, but not better.

The incentives to game the algo remain. People adapt to the environment.


That's why mechanism design [1] exists as a field of study. The whole idea of that field is to provide the proper incentives to steer the participants towards your objective. Yes, considering they will try to "game" the system however they can.

I'm pretty sure Google could do strictly better (i.e., better by all reasonable measures) than they do now if they focused on the users' experience instead of revenue for a couple of terms.

[1] https://en.wikipedia.org/wiki/Mechanism_design


> The incentives to game the algo remain. People adapt to the environment.

Perhaps it could work if the search engine changed its algorithm all the time.


You don’t think Google does this already?


Paul Graham says Google doesn't want to follow anybody down that road (human intervention in search). But ISTM the problem is that even though they don't, they could just throw a giant pile of money at it if they needed to crush a competitor. Also, VCs will refuse to invest in anybody doing it, because Google.


PaulG doesn’t know what he’s talking about.


I disagree, he's a pretty good guide to how VCs think, though not always in the ways he wants.


Only if implemented by the monopolist.

People's best chance is to stop using Google and push for it to be broken up.


I wonder if punishing presence of advertisements would filter out most pages that are SEO'd to the max and instead promote "labors of love" type pages.
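
A minimal sketch of what that heuristic could look like, assuming a crawler already has each page's raw HTML; the ad-network domain list and weights here are purely illustrative, not anyone's real ranking signal:

    import re

    # Hypothetical list of ad-network domains; purely illustrative.
    AD_DOMAINS = ["doubleclick.net", "googlesyndication.com", "adnxs.com", "taboola.com"]

    def ad_penalty(html: str) -> float:
        """Penalty in [0, 1] proportional to how many ad-network references appear."""
        hits = sum(len(re.findall(re.escape(d), html, re.IGNORECASE)) for d in AD_DOMAINS)
        return min(1.0, hits / 10.0)

    def adjusted_score(base_relevance: float, html: str) -> float:
        """Downrank a page in proportion to its detected ad presence."""
        return base_relevance * (1.0 - ad_penalty(html))

    # A page stuffed with ad scripts loses most of its relevance score.
    spammy = '<script src="https://x.googlesyndication.com/x.js"></script>' * 8
    print(adjusted_score(0.9, spammy), adjusted_score(0.9, "<p>a labor of love</p>"))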


This is an interesting idea because it would create a type of non- or anti-commercial SEO that could counteract the commercial one. However, Google would never do it because they sell most of the ads that would be (not) hosted on these sites.


Google owns the largest online advertising network though, so that’s definitely not where their bread is buttered.


Wouldn't it be reasonable for Google to show how their ranking algorithms work, so that all webmasters and content creators know how to behave on the web? Now we have a black box that's causing confusion and misdirecting websites and web users.


I don't think so. If the problem is people gaming the system, making it easier to game isn't going to improve the situation. It's not going to put good content creators on a level playing field with spammers, because good content creators simply don't care as much as spammers about search engine gaming.


But people are already gaming the system, and on any search from product reviews to code snippets, I see SEO-optimized spam populating the top results. Good content creators don't have the time or the technical inclination to reverse-engineer the ranking algorithm (or to brute-force it by creating tens of thousands of sites and seeing what sticks in a giant multivariate test).

Knowing the actual rules might give them a fighting chance, since the bad guys already know these rules anyway.


> It's sad. Makes it harder to navigate the web.

Some would even say it killed the web by centralizing all the content in the hands of a few [0].

Which is the direct consequence of everybody optimizing to better show up on Google/Facebook/Amazon/Microsoft and ultimately even migrating all their hosting to these companies.

[0] https://staltz.com/the-web-began-dying-in-2014-heres-how.htm...


There aren't as many blogs now as there used to be.


That will most likely get worse yet. Younger people no longer produce public text to the extent they did prior to the smartphone-heavy era. The supply of that blog-style content will continue to dwindle as the producers age out. I'm sure there's a stability point it may reach, of course, because some tiny percentage of people will always want to write long-form.

Younger people TikTok, they Instagram, they chat in private conversations with each other, they occasionally post short messages in walled gardens like Facebook, they YouTube, they listen to music, they watch Netflix & Co. That's what they do. They do not persistently write LiveJournals, Tumblrs, blogs. That pre-video/audio-focused era is over and it's not coming back (even if there's occasionally a bubbling up of hipster fakery centered around how cool it is to write text).


I heard an interesting theory the other day: blog viability declined because Google killed Reader. Which indirectly ends up poisoning Google's biggest well, since blogs are an important source of relevant cross-domain links.

I'm somewhat skeptical, it seems a little too poetic to blame Google's ultimate downfall on a decision that was notably hated at the time. But it's plausible. If you want it to be a conspiracy theory, you can posit that killing off independent blogs was the intent, to convince bloggers to migrate to Google Plus.


Google used to prioritize blogs and original content like forum posts in search results, but they don't anymore.


What do they prioritize now, "reputable" news organizations enrolled in Trusted News Initiative?


Blog spam and pages filled with AdSense ads.


I find that claim surprising considering how many more people there are simply using the internet at all.

Fewer unique blog domains due to “blogging” sites that aggregate users? Sounds plausible. Fewer people blogging overall? I’m not convinced yet.


I'd believe it. As an IT consultant, I interact with a lot of people who are semi-techs themselves: mostly small business owners who are used to wearing a lot of hats, and also the type to have been motivated to run their own personal blogs about diving/photography/conlangs/quilting/gardening/whatever their personal hobbies are.

Ten years ago, the majority(!) had at least something up and running, where they would post essays, thoughts, whatever came to mind.

Nowadays? All gone. All! When asked why, the answer is almost always some mix of ever-increasing negative feedback and harassment from randos, and aggressive automated spamming of their forums. Loss of pseudo-anonymity plays a large role as well. Many have deleted years' worth of work, simply because they are afraid of someone trawling through their posts to find something to harass them with.

I was never a blogger myself, but I am sad about the change. There was a lot of good stuff out there for a while, and sometimes it just plain made me happy to read someone joyfully nerding out on a favorite subject of theirs.


I think a lot of people are still writing this kind of content, but you have to look elsewhere for it: Reddit, Facebook, Twitter, to name the obvious ones. It's also harder to find, but you can find all kinds of personal content written in comments and posts on these sites.


I realize that this is a hard thing to 'prove', but I am personally certain that the amount and quality of such things has dropped significantly from a decade ago.

Not to zero. You can still find things tucked away in a post on reddit or the like. Almost never, as far as I have experienced, on Facebook or its ilk, as the affordances are different. I genuinely think there has been a loss.


It used to have positive utility: you were getting acquainted with people you would literally have had nil chance of meeting before.

Now?

Nope. Putting anything out there is basically just doing the rest of the world's Open Source Intel for them. Maybe it isn't the Net that changed. It's just there's way more sharks out there that can't just leave well enough alone.


I frequently append site:reddit.com to searches for a niche search term these days. I think a lot of people who would have blogged or commented on blogs are posting there instead.


I wonder if they'll do a walled garden after their IPO. I've always found the site pretty useless, outside the 'old.reddit.com' version. On the bright side, maybe this will open up space for one of the federated clones to grow.


I think the bigger issue now is that more content is inside social media "silos" like twitter, instagram or youtube. I don't have the numbers though.


Why is this a problem? Can't Google index social media silos?


Which ones? They can index their own, but for the others, only the public stuff. Facebook has a lot of things set private so nobody can see them except your friends. (They are by no means perfect, but a lot of things are private and only seen by friends. Most of it isn't of interest to a search engine anyway, but comments of the form "I love X product" could, in a perfect world, be indexed as a sign of what people find good.)


> I find that claim surprising considering how many more people there are simply using the internet at all.

Most of these many more people are mobile users, for whom creating long-style text content can be quite bothersome.

What ain't bothersome with a smartphone is taking pictures and videos to slap filters over them; alas, that's why we are where we are, with TikTok, Instagram and Twitter dominating large parts of the web.

It's even noticeable in a lot of text-based online discussions outside of these communities; the average length of forum posts feels like it's gotten way shorter over the decades. People have less attention for reading anything that looks longer than a few sentences, often declaring it a "wall of text" based on the quantity of text alone.

Imho it's a big part of what drives misinformation; doing any kind of online research on a small phone screen is extremely bothersome compared to the workspace an actual computer/laptop gives, particularly with multiple monitors.

There's also the difference in attention; When I sit down at my laptop/desktop, I actively decide to spend and focus my attention on that task and device.

While smartphone usage is mostly dominated by short bursts of "can't do anything else right now", I don't choose to take out my phone and surf the web; it's something I do when I'm stuck in some place with nothing else to do and no access to an actual computer.

But for the majority of web-users [0], that smartphone access to the web is all they know, which then ends up heavily shaping the ways they consume and contribute to it.

[0] https://techjury.net/blog/what-percentage-of-internet-traffi...


> TikTok, Instagram and Twitter dominating large parts of the web

For my part, I'm glad these fora aren't indexed well; I don't want my search results dominated by single-sentence posts and photos. In particular, I don't have accounts on any of these services.

I'd be happy if search engines would decline to index sites behind paywalls. Links to Medium, Substack and Washpo are very common, and if the first thing I see is a popup demand for payment, that browser-tab gets closed.


I wonder if it would be possible to have a big filter button, “commercial” or “non-profit” or something along those lines, so you get results that are or are not deemed commercial.

Don’t know how hard it would be to tell which is which. Maybe non-commercial: doesn’t run ads, doesn’t sell a product or service, and provides information only.


I wonder if the majority are moving to vlogging instead?


The original Google algorithm was a clever hack, but it relied on the web being hypertext, with links being used contextually.

The algorithm made linking valuable. So instead of writing hypertext, people tried to create isolated sites and boost their rank. Remember rel="nofollow"?

Eventually bigger sites took over the small ones.
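
To make the "linking was valuable" point concrete, here's a toy power-iteration PageRank over a made-up four-page link graph: every outgoing link donates a share of the linker's rank, which is exactly the flow that rel="nofollow" cuts off by excluding a link from the graph. A rough sketch, not Google's actual implementation:

    def pagerank(links: dict[str, list[str]], d: float = 0.85, iters: int = 50) -> dict[str, float]:
        """Toy power iteration: each page splits its rank among its outlinks."""
        pages = list(links)
        n = len(pages)
        rank = {p: 1.0 / n for p in pages}
        for _ in range(iters):
            new = {p: (1.0 - d) / n for p in pages}
            for p, outs in links.items():
                if outs:
                    share = d * rank[p] / len(outs)
                    for q in outs:
                        new[q] += share
                else:  # dangling page: spread its rank evenly
                    for q in pages:
                        new[q] += d * rank[p] / n
            rank = new
        return rank

    # "blog" gets linked by everyone, so it accumulates the most rank.
    print(pagerank({"a": ["blog"], "b": ["blog"], "c": ["blog", "a"], "blog": ["a"]}))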


I've noticed this with online newspapers too, to the extent that they'll report about a website or product and not include a URL to it.


I agree very much with this. It seems that between the walled gardens and also people being so reluctant to have “their” audience leave their site/page/etc the discoverability of the web has dropped dramatically.


That's an interesting observation. IMO, we stopped linking to good content because Google was good at finding it. Now Google is suffering, and we need to go back to doing more links.


Yup, early Google relied on a lot of unpaid, unseen human intervention and choices. I ran some weblink and curatorial sites during the search wars, and PageRank could only work because there were people behind the sites choosing links based on their usefulness to their audience.


I wish FB would be more open, but since they have all this walled-garden info, are they well placed to start a competing search engine? It would be interesting if their activity data could help filter out SEO hackers.


FB search seems to have gotten worse and worse. Unless I can remember the specific Group where I saw something, it's very unlikely that I can find it again. And they know which posts I've been highly engaged on...


This explains why running a search engine on the original Google PageRank algorithm would not work as well today as it did back then.

But Google doesn’t run on PageRank anymore. PageRank is merely one of hundreds of signals they use to sort results.


Does this mean that Facebook is the only company well poised to take on google search?


The main drivers of SEO spam, and of online scams in general, are countries that have little to no opportunity for economic growth. There are literally millions of Internet-savvy people who would be able to survive on what we would consider barely anything, profit-wise, in AdSense revenue, which also usually pays out in US dollars. In this currently terrible global economy, desperation turns the most intelligent minds bound to poverty into bootleg SEO engineers, online catfishers, scammers, and ransomware creators, and God bless their creativity...

Companies rarely create income opportunities or crowdsource people in foreign countries for the common (more positive) good; the people who would benefit instead turn into spammers and scammers, and that's what creates an endless army of people who constantly destroy online communities like SoundCloud, Facebook, Twitter, and TikTok with stolen content, trend scams, fake news, and spam messages.

Google search has been invalidating and subverting its most accurate search results based on abstract SEO rules for quite some time now, likely so that they could implant paid ads into content first, because that makes them the most profit. Doing that has destroyed their reliability and reputation as a search-service leader, and they're never going to admit it, but payola is the undertone that is ruining their search results... There is a certain type of corruption that occurs when a company turns away from upholding customer service and value towards a monopolistic "profit-first economic stranglehold" business model... That strategy never ultimately works out well for companies OR users in the long run. The next leader will likely be a search engine that avoids the same pitfalls, until it too becomes a profit-driven monopoly.

There is no algorithm that will usefully and fairly counter spam born of desperation. Companies need to realize that creating opportunities for people to operate equally on their platforms is the best move; otherwise, spam will drive any community of rule-abiding users away, or into madness.


>The main driver of SEO spam, and online scams in general are countries that have little to no opportunity for economic growth.

Not quite right, because cybercrime (hacking, cracking, spamming, etc.) originated in the US, not in Eastern Europe, Russia, and the third-world countries that dominate the hacking and spamming scene today. The main motivation of cybercriminals is quick money and the ease of getting away with it, since you are not committing a crime physically but digitally/electronically.


It is quite right. Those are the main drivers, and it's due to a lack of economic opportunities.

Hacking heavily originated in the US because the US practically built the entire modern tech universe from the ground up. The US was far out in front when it came to utilizing the Internet and the Web, so of course unethical people in the US pioneered various types of online crime, the US was the early adopter.

If you're an elite engineer in the US, you can make millions of dollars doing legal work for big tech. That helps in a big way to drain the labor pool for criminal activity online. You generally can't do that today in the countries that dominate SEO spam, online scams, etc. In those countries, elite engineers suffer terrible wages doing legal work compared to what they should be able to earn for their abilities; commonly they can earn a lot more doing illegal work instead. It's a very potent lure.

You're an elite engineer in Russia, top ~1%-3% globally. What do you do? Earn several thousand dollars per month doing legit software development in Russia (with either zero or little consequential equity compensation); flee Russia for a more affluent market; or do illegal work where the rewards can be dramatically greater. It would be difficult to resist if you were unable or unwilling to leave Russia.


Furthermore, the penalties for cybercrime, and the authorities' ability to track footprints, are much more developed and accessible in the US and UK for citizens who hack and abuse US & UK systems, which makes enforcement against US and some EU hackers much more harsh/severe, less complex, and more likely to end in apprehension... Many users in economically disadvantaged countries use older devices: PCs running older software, routers that don't allow MAC-level reporting, and well-past-EOL cell phones that don't leave the kinds of footprints modern devices do (on top of the well-documented, now-legacy security measures they can take).


>You're an elite engineer in Russia, top ~1%-3% globally. What do you do? Earn several thousand dollars per month doing legit software development in Russia

Become a software entrepreneur?

And many international software companies have software development teams and presence in Russia.


In places with a less-established legal system it's harder to make money by above-board entrepreneurship and keep it instead of handing it over to local strongmen (two colorful examples that have stayed in my memory and unlike many others have become public and have also been described in non-Russian media - https://www.independent.co.uk/news/world/europe/valery-pshen... and https://abcnews.go.com/International/wireStory/us-embassy-ru... , but of course those are the exceptions because the usual result is complying with threats and handing over your business or most of it). But it's not really about Russia, it's a general issue with parallels in other countries as well. And of course, there's the issue of the local market; the financial advantages for a skilled tech person going towards entrepreneurship legitimately are less attractive in most places compared to USA; heck, even EU potential tech entrepreneurs often just go 'across the pond' to start their business.

If you can't get a work visa to a first-world country, you have fewer options than someone already living there; and the salaries offered by first-world "international software companies" in their remote subsidiaries tend to be 'according to local market rates' (the same "several thousand dollars per month" mentioned by the parent poster is a decent rate) and thus not as competitive with "black entrepreneurship", which pays according to global standards.


> Become software entrepreneur?

Exactly. Hacking for hire, making cheats, botnets, SEO farms, selling exploits and hacked social media accounts; practically anything you can think of that US software engineers can't be bothered with, as they already earn a healthy salary. That is entrepreneurism.


It's hardly only shady stuff; Kaspersky, ABBYY FineReader, VKontakte, Telegram are Russian software products that come to mind.

Russia also has its own SaaS enterprise sector with companies like SKB Kontur or Diasoft.

Just like the US has to this day warez and cracking groups, where it's for the longest time mostly been about scene prestige, and not making the big bucks.


I wasn't speaking about that kind of entrepreneurism, but about making legal software and legal web services that solve problems and are useful. So many Russian hackers got arrested when they travelled somewhere outside Russia, and now they are serving 10- or 20-year sentences in US jails.


> So many Russian hackers got arrested when they travelled somewhere outside Russia

How many? 20? 30? 50? IMHO the cases are rare (and get widely publicized whenever they happen, creating disproportionate visibility); you get a couple of captures per year, but the number is just a tiny fraction of the actual participants, more an exception than the rule.


My assumption is beyond 20. The US is only after big-time cybercriminals[0]; smaller ones get away.

[0] https://www.fbi.gov/wanted/cyber


> making legal software and legal web services

SEO is perfectly legal. So is spamming, regrettably.

I think Goo no longer cares about the quality of search results; they have other business priorities, so SEO works. Spam is another thing again; we still, after 30 years, don't have an agreed definition of spam. We still don't have a flawless spam filter - far from it. So it astonishes me how much email spam I get promoting SEO services for my non-existent website.


>SEO is perfectly legal. So is spamming, regrettably.

SEO is legal, but spam is not, at least not in the US and many other jurisdictions.

>I think Goo no longer cares about the quality of search results; they have other business priorities, so SEO works.

Google cares about spam, but there is so much data and information on the web that it is impossible to figure out what is spam and what is not. Another big problem is fake data and information, which is also very hard to detect. Generally, Google prefers popularity over quality, because it is easier to detect what is popular than what is of good quality.

>Spam is another thing again; we still, after 30 years, don't have an agreed definition of spam. We still don't have a flawless spam filter - far from it.

The definition of spam is an unsolicited message. So if I get pharma emails in my inbox that I didn't ask for, that's considered spam. SMS messages are another example: if I get an SMS promoting free coupons that I didn't ask for, then it is spam.

Considering how much spam Google has seen in the last 20 years, both on the web and in Gmail, they should have some decent machine learning/AI algorithms that could flag spam pretty efficiently.


> Definition of spam is unsolicited message

That may be your definition. As I observed, not everyone agrees with it. Most definitions of spam include the word "bulk", for example.


Bulk unsolicited messages?


Reviews are a special category. They suffer from a number of issues:

1. You need to have enthusiastic reviewers (people who care enough about a product category to review them semi-throughly.)

2. Proper reviews can take time and may need domain knowledge.

3. Competition. When there were one or two people doing reviews on some category of products, maybe the economics worked out. Once you have hundreds or thousands competing with you, the time demand may be overwhelming and not worth it.

4. If you are a trusted reviewer or site, you will get economic pressure to review a particular thing or brand you may not like very much but the money may be good. So you will begin to experience conflicts of interest.

5. If reviews are just a hobby and not a way to make money, eventually you will slow down or move on, opening a hole that gets filled up by spammers.

6. Some things are timeless (a pipe wrench, let's say) and some are seasonal (consumer electronics, toys, etc.). The former deserves a thorough review; the latter doesn't deserve as much, but may get the bulk of interest due to seasonal demand. Does it really matter if the latter's latest iteration has 2% more battery life to discuss?

I'm sure there is a lot I didn't think of. But it's a doomed category, unless people are willing to pay for professional reviews (Consumer Reports types and other independents).


> If reviews are just a hobby and not a way to make money, eventually you will slow down or move on, opening a hole that gets filled up by spammers.

In my experience, the best reviewers are hobbyists. The thing is, it's not reviews that are their hobby. Rather, they review the products that go along with their hobby.

So, for my hobbies (espresso and aquariums), there are tons of easily accessible reviews of all kinds of aquarium gear and coffee machines, grinders, etc. On the other hand, nobody does plumbing or HVAC as a hobby (that I know of), so it's very difficult to find high-quality reviews of water softeners or furnaces. It takes a very special, rare sort of person who would install these things just to review them. The closest thing I could find was this video [1] on a DIY water filtration system by an RV/off-the-grid type hobbyist (from what I can tell).

[1] https://www.youtube.com/watch?v=WCC4TOYYGF8


People like to talk about their work too. There are plenty of those sorts of reviews out there, mostly on Reddit, because, as others have mentioned, organic search results are completely gamed.


> Reviews are a special category. It suffers from a couple of issues

Review sites suffer from a singular problem: they are overwhelmingly SEO-spam content farms. People find some product niche and pay some Fiverr/whatever people to write literally fake reviews of products. Because they're pulling all the SEO tricks and are in a niche category, they shoot to the top of search results for that niche.

Their reviews sound realistic and viable but they're pure fantasy. The writers never touch the products being reviewed. Many times they'll pull details from Amazon listings (including factual errors) and even other "review" sites.

Once they get established in their niche they'll accept paid placement from product manufacturers without marking it as such. A single scammer might own dozens of these sites, even supposedly competing ones.


I pay for Consumer Reports. I'd encourage more people to do so, too. I don't trust it completely, but it's a good companion to manual searches on Reddit/HN/car forums, etc.


Someone pointed out yesterday on that other search thread that [most?] libraries provide free access to Consumer Reports through a membership. I just looked at the San Francisco Public Library and it does indeed give me access to the magazine and a searchable database.


One way out is to grow a community of enthusiastic reviewers. LibraryThing succeeded for book reviews. LibraryThing book reviews are better because LibraryThing caters to bookworms.


Goodreads also has a community of enthusiastic reviewers, but because Amazon owns it I'm afraid they don't have much incentive to improve the site or change anything.


While I agree with everything you said in general, Ken Rockwell seems to buck the trend for photography gear, especially lenses.


> If you want a review for something that came out today, there is no way that work could have been done, so there simply isn't anything to find.

That's not strictly true, given that reviewers are often sent pre-release versions of things in order to do that work before release day.


Not sure why you're being downvoted, as you're correct. However, there seems to be a trend where reviewers are only given pre-release versions if they practically always give favourable reviews to the products, especially if they're provided the product for free. There doesn't have to be an express relationship or contract between a reviewer and a company either. It's the reverse of how Bill Gates has apparently given $200 million+ to different news channels/media organizations, which are then less likely to freely share negative news about him or his organizations. This makes me think, similarly to how stock sales by CEOs (etc.) must be pre-planned to avoid shenanigans like market manipulation, that anyone giving large sums of money to any media/journalism organization should have to divide the amount up over 20-40+ years, so that the organization at least has a runway and isn't dependent on larger "dopamine hits" at shorter intervals.


Yeah, that's been a problem with reviews for a long time. In fact it's what Consumer Reports used initially to differentiate themselves: their "thing" was that they only reviewed products bought anonymously at retail (no free samples or manufacturer-provided review items) and didn't accept any advertising from manufacturers either.

Sites that receive free review samples and are supported by affiliate links are kind of the exact opposite model.


It does in a funny way provide something of a metric for how willing the site is to be critical. Several video game reviewers I follow have stopped receiving product from some studios, which I think is a badge of honest review. Although it's not something you'd know easily so it doesn't help much in terms of finding good reviewers.


I trust DC Rainmaker's reviews of fitness tech products because he always returns products back to the manufacturers after writing reviews. So there's no conflict of interest based on free products.

https://www.dcrainmaker.com/product-reviews


If companies don't like his reviews, they'll stop sending review units. That hits both in the pocketbook and the race to be one of the earlier reviewers of a new product. Reduced conflict, perhaps, but not none.


If companies don't send him review units then he just buys them retail. He has already done this for many products.


Yes, I'm aware. That's less money in his pocket, and less ability to have the review be available on or before the product launch. There's still some conflict of interest, even if it's lessened.

Only purchasing review units at retail would remove this conflict.


Incorrect. He can then return them.


Depends; if someone is popular, you can't afford not to have them review your things. At a certain point, a bad review will still generate more money than no review at all. Few reach that level though; most reviewers don't have that much of a following.


This presupposes that companies think their products are bad. If you have (what you believe to be) a good product, you definitely want DC Rainmaker to review it. I think this is a reasonably general point across industries - companies want to get their products into the hands of the most reputable reviewers.


In DC Rainmaker's case it is probably the opposite. A fitness product not reviewed by him is a bad signal.


Any serious publisher has that policy. Here's Wirecutter's (NYT) take on it: https://www.nytimes.com/wirecutter/blog/yes-i-work-at-wirecu...


Search engines are pretty good at solving the problem they were designed to solve, which is "finding pages which contain all the query words". But they are pretty bad at solving the much harder problem of rating the trustworthiness & authenticity, intentions of the owner, monetization of the site, etc.

One possible solution to this could be:

- Let the community vote on the most trusted sources

- Include results from enthusiasts that have little incentive to write biased reviews (Reddit, HN, expert forums)

- Look at the ownership of the site and how transparent they are about it

- Regularly reassess these criteria

This wouldn't scale for a generic search engine, but I'm working on a service that does this for many product verticals/niches.
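
A toy sketch of how those criteria might combine into a single score; the weights and field names here are made up for illustration, not how any real engine (or my service) necessarily does it:

    from dataclasses import dataclass

    @dataclass
    class Source:
        community_votes: float         # 0..1, normalized community trust votes
        enthusiast_origin: bool        # from a low-incentive forum (Reddit, HN, ...)
        ownership_transparency: float  # 0..1, how clearly ownership is disclosed

    def trust_score(s: Source) -> float:
        """Blend the signals; 'regularly reassess' just means recomputing this."""
        score = 0.5 * s.community_votes + 0.3 * s.ownership_transparency
        if s.enthusiast_origin:
            score += 0.2
        return score

    print(trust_score(Source(0.8, True, 0.6)))  # 0.78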


Agreed here, but in your second bullet, people have great incentive to write good quality reviews on Reddit, HN, expert forums... karma/recognition etc. It just so happens that these "forums" have built in voting systems that they spend time preventing from being gamed so the search engine doesn't have to.

Not sure if this is a good model for a search engine, but it does work to a small degree in those forums.


> people have great incentive to write good quality reviews on Reddit, HN, expert forums... karma/recognition

Internet points are a terrible reason to write anything. They're completely meaningless. We should all judge comments on their own merit and not because the author has a lot of karma. Apart from mine, obvs.


This resonates with my experience. A couple of years ago I invested more time than I'm proud of into buying the right Bluetooth headset for me. I found a site with pretty detailed reviews and tested their reviews by standing in stores and trying dozens of headsets out. I also bought 3 headsets on Amazon and later sent all of them back. My impression was that the reviews on this particular site were 100% unbiased, whereas all the other reviews I read just wanted to sell whatever product was in focus.

I wonder how a search engine could distinguish between "honest & professional" and "fake & amateur" headset reviews without having a head and two ears?


This resonates with me a lot. A few months back I upgraded my desktop's insides. New motherboard, CPU, graphics card, etc. That was the first time in about seven years I'd gone looking for reviews of that sort of stuff.

I remember doing the exact same thing in the past and being overwhelmed with information. The detail and data in reviews would take a long time to collate and make sense of. But this time even the big-name sites seem much shallower: fewer models reviewed, less testing and benchmarking, more regurgitated press releases and other news.

Last time it took me a while to sort out all of the information; this time, all my time was spent trying to find any that wasn't 100% fluff.


To address your issue, we can simply accept that certain information cannot ever be available day and date. The best results I've ever gotten from Google are by doing

> site:reddit.com <QUERY>

If I want to know about headphones or TVs, I'll find better answers in the sidebar than anywhere else on the Internet. But I will not be able to find those quality answers until a reasonable period of time has passed, in which products can be tried and reviewed by real people.

The issue is the immediacy. We want answers now, but we won't ever have them until later. This requires a cultural shift in consumerism that companies will not like: "Wait and see."

The same problem happens for people pre-ordering video games that end up releasing in a state of complete trash. You can only get reliable information once the early adopters have tested it. If you are an early adopter, you are shit out of luck, but you are doing the rest of us a service that we appreciate greatly.


I do the same reddit.com trick, but for TVs and headphones (and anything else they cover), I can wholeheartedly recommend rtings.com.

They're in Montreal. They buy their products retail. They get their funding through subscriptions and affiliate links. Overall ratings are formulaic based on measurements applicable to the category. The formulas are available on the site. So are all the test designs.

They're surprisingly thorough too--when I was shopping for some over-ear headphones I really appreciated that they measured the clamping force (when you wear glasses, something that clamps onto the arms too hard gets really uncomfortable pretty quickly) and breathability (temperature differential between your ears and ambient when wearing the headphones). These are pretty important for all-day comfort but don't really factor in most of the time.

Their methodology is all available:

Breathability: https://www.rtings.com/headphones/tests/design/breathability

Clamping Force: https://www.rtings.com/headphones/tests/design/comfort#compa...


Maybe there's a hardware engineer out there with a decade of experience shipping and reviewing TVs, publishing his thoughts on his blog. He's heard about the latest and greatest, and he's offering his expectations based on the promotional material, his friends at the company, history from the brand, whatever. Maybe, if he's built a reputation for good reviews, he's got a big audience. Big audience? TV brands give him an early review model.

Modern Google actually makes the content problem worse. When our notional TV blogger is starting out in our world, he publishes two or three essays, nobody reads them, he stops putting in so much effort, posts occasionally, and dwindles off. In a world with a perfect search engine, his early essays get some attention, encouraging him to post more; a feedback loop starts, and before you know it he's a full-time TV reviewer.


> If you want a review for something that came out today, there is no way that work could have been done, so there simply isn't anything to find.

I think in practice this is largely untrue: with technology products, video games, movies, and just about anything else I can think of, most well-known reviewers are given early access to the product so that reviews can come out on or before day 1 of general availability. That said, this does create a dearth of 100% trustworthy reviews on day 1, since companies are naturally disincentivized from giving early access to reviewers who they know are going to write a negative review.


There are still good reviews. For TVs, RTINGS produces high-quality reviews (although they're not listed super straightforwardly). For computer internals, AnandTech does even better. You don't have to talk about the absolute latest product out that same day for you to have quality reviews of other options in the meantime.

Everyone just makes blogspam because it's far less work than actually buying products, developing expertise, testing them, and writing out a whole thorough review. Google's algorithms just can't tell a quality review from a surface-level, uneducated take.


It's a double-edged sword. Reviews take effort, so you want to make them easier for customers to write. But making them easier for your users also makes them easier for those trying to game the system. This is why Amazon's product reviews are useless, as is pretty much any other community-based review system.

But on top of that, you have the problem of whether or not someone is really qualified to write a review. So Joe User thinks product X is good; what is his metric for good? It reminds me of an LTT review of the Amazon TV from a few months ago. They gave it an awful rating but noted that the reviews on the product page were generally very positive. Their reasoning was that the people buying and reviewing these TVs didn't have a good comparison point for what a good TV actually is; they were probably comparing it to a much older and less advanced product, not to a contemporary one.

So then you think the answer must be get reviews from industry related media. But then you fall into the classic problems of unethical journalists or simply ones that are out of touch.


It's not a question of whether someone is qualified or not; everybody is more than qualified to write about their own feelings toward a product they bought or a service they used. In fact, no one is more qualified to talk about their own feelings and experience. How useful that review is to you is a combination of the writing quality and depth, and how similar the reviewer's experience and preferences are to your own.

Professional critics usually try to distinguish themselves by producing well-written, in-depth reviews, written not from their own perspective but from that of a hypothetical everyman who, ideally, is similar enough to a critical mass of their audience.

So it always interests me when people complain about popular gaming review sites being out of touch because almost always it's the reader that's out of touch but doesn't realize their bubble. It's not an absolute rule but I'm in enough niche hobbies to realize that my desires for products are way out of whack.


There are a lot of good-quality reviews on YT on the launch date of pretty much anything these days.

It’s not a problem of doing the review, it’s that there’s not much of a market for written reviews, most people would rather watch a video instead.


Interesting. I never watch video reviews. They're painfully slow and impossible to search.


> It’s not a problem of doing the review, it’s that there’s not much of a market for written reviews, most people would rather watch a video instead.

I'd say it's more that YouTube offers a clearer path to content monetization than text does. YT is a much more lucrative platform for the same level of effort as SEO for their text blogs.


There's not much money in written reviews, and people can't find them amongst the automatically written SEO/affiliate crap.


How do you compare 3 or 4 videos before watching them? Watching video reviews of the reviewers?


Like most things, it’s a reputation ladder.

There’s top channels like LTT and if what you are looking for is out of their niche, you look for the biggest channels in that niche and go mostly by association (who they have made collaborations with,..).

EDIT: of course the big win of video reviews is that you can see the thing working.


I mentioned subreddit searching in another post - this is a good way to find reviews. I often have other forums I will Google-search inside for opinions on something. It's more effort than a review site, but it is due diligence. Since there is money in it for someone lying to you, it's unfortunately a "why we can't have nice things" scenario. It is why you can't ask the car dealer what the best car is for you. If you want a good understanding of something, you need to dig. If I see a site with Amazon links, I close it in < 500ms.


Developing common standards/protocols for everything required for a quality review, versus a "candy" or shallow hype review, would be a good place to start: making it part of the culture that everyone educated knows about and follows, so that people only go to or support reviewers who list which testing protocols they follow.

Industry has already done this with the "food pyramid": influencing and capturing governments to base the food pyramid more on economic reasons and much less on science, with the government putting it out and distributing it into schooling at different levels, giving it an unearned and undeserved authority which people then blindly trust/follow, not understanding when systems and their output or oversight have been captured. This is why the pandemic bringing the classroom home via Zoom, so parents could see/hear the learning material, has outraged many parents. Examples I've heard include white children being taught to feel guilty about their 'white privilege', or parents being upset that their children are taught at a very young age that they can decide what gender they are; I'm not stating what I believe here, just giving examples I've heard of.

This capturing of the government is why I think the government should ultimately be developing and maintaining such platforms, as a matter of law, requiring individuals and organizations to add and update their data in real time (a simple example being restaurants: their menus' ingredients, their opening hours). To de-risk the government gaining unnatural power as "the single arbiter" of truth, and to de-risk capture, the government could instead fund multiple independent organizations at the federal level, with States deciding which ones they follow, if necessary; that is part of why States exist, to de-risk the potential capture of the Federal umbrella. However, the system is currently in an imbalanced, broken, captured state: the duopoly has evolved to be more extreme, led or formed by the establishment; the voting system is broken in arguably most countries of the world; and mainstream media is captured by for-profit industrial complexes that fund MSM through ad revenues, which further mould our culture, narratives/talking points, and beliefs, whether truthful or not. Without fixing these, the other platforms/systems excelling won't be possible.


Regarding finding reviews, I think we have to look at the problem in a different way.

Instead of having one search engine which returns the same results for everyone (modulo interests, etc., as Google does), we could have trust networks. E.g. you trust a few people, and those people trust other people. From this network you could build something like PageRank, which computes a kind of transitive closure of trust for one given person. That would then determine the search ordering for that person.
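
A rough sketch of that idea as personalized PageRank over a trust graph; the names and restart weight are made up, but restarting the random walk at "me" is what biases the scores toward my own web of trust, so two users get different orderings:

    def trust_scores(trust: dict[str, list[str]], me: str,
                     restart: float = 0.15, iters: int = 50) -> dict[str, float]:
        """Personalized PageRank: walk the trust edges, restart at `me`."""
        people = list(trust)
        score = {p: 0.0 for p in people}
        score[me] = 1.0
        for _ in range(iters):
            new = {p: (restart if p == me else 0.0) for p in people}
            for p, trusted in trust.items():
                if trusted:
                    share = (1.0 - restart) * score[p] / len(trusted)
                    for q in trusted:
                        new[q] += share
                else:  # someone who trusts nobody returns their mass to `me`
                    new[me] += (1.0 - restart) * score[p]
            score = new
        return score

    # carol ranks high for alice (trusted via bob); dave's self-trust loop decays to nothing.
    print(trust_scores({"alice": ["bob"], "bob": ["carol"], "carol": [], "dave": ["dave"]}, "alice"))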


Well said - it is among my biggest annoyances with the web. Reviews are almost always packaged into best-of or top-X lists. The quality of the Wirecutter is gradually trending down but it is still the website I use to find the "best" of something. I don't have to waste time sorting through hundreds of list-spam sites.


I always liked the Wirecutter for just kind of cutting through the crap and saying “this is the one”. I wonder if we need some sort of thing for reviews where humans filter the sites down to those that are credible.

It’s a bit funny, because this was sort of done by Jason Calacanis’ Mahalo back in the day - but maybe he was just ahead of the SEO curve.


Jason Calacanis was never ahead of anything, except a few more recent idiots.

"Obvious, unactionable, or wrong" is a term of art created specifically to talk about Calacanis' advice.

"Mahalo", a Hawaiian term loosely translated as "who is this freak and why does he think our language is there to promote his shitty startup?" was the answer to the single most terrible thing VCs could identify at the time: that someone would create Wikipedia without a plan to monetize every aspect of it. And yet Calacanis did a worse job than the "original fool", Jimmy Wales, whose similar attempt at least didn't end up as a parked domain of SEO spammers.


Companies like Sweetwater do this right. They have “sales engineers” that help you find what you’re looking for over the phone or text message or email. It probably doesn’t scale but as a customer, I don’t care as it saves me so much research time and I consistently get what I’m looking for.


"SEO" spam is "Google SEO" problem. So SE ranking Optimization is not (yet) so much a problem for other Crawler/index SEs (Bing, Mojeek, Gigablast). You might say that Amazon (in eCommerce) and TRIP (in Travel) have cracked the problem of combining good/deep Content/Reviews and Category expertise with Search.

We regularly see partnership opportunities with customers interested in our API [0]. I presume Bing sees the same, though their terms are more fixed and require you to share more data. There are definitely big opportunities in other categories, which are often squandered through the naive, if understandable, choice of a scrape-and-index route.

[0] https://www.mojeek.com/support/api/


> Why not try writing a search engine specifically for some category dominated by SEO spam?

Back in the olden days, there were lots of organizations that collated high quality content from the best writers. They nurtured expert writers and paid them well. They fact-checked the content and employed diligent editors and proofreaders so it was accurate and well-written. Over the years, they'd build a reputation for reliability and trustworthiness that kept people coming back for more. If you wanted to learn about fitness, or cars, or cooking, or science, you'd find a reputable author and publisher and buy their magazines or books.

But then, in the early 2000s, the geniuses from SV "disrupted" the publishing industry and its financial model. They brought us a much better way to find content, the search engine. Because they were so much better than the old-fashioned publishers, search engines gobbled up the advertising money and became the dominant gateway to content. Publishers had to abandon expensive high-quality writing because rankings and eyeballs now mattered more than quality and trustworthiness. Instead of investing in writers, they invested in marketers and SEO specialists.

The result: worthless content, writers banging out garbage for peanuts, and useless search engines.

Two decades later, looking at the barren wasteland they had created, the SV geniuses thought: I know what we need, more search engines, but smaller ones that collate high-quality content from the best writers. There must be money in that, right?


It's been really sad to grow up and watch the cool techie optimism of the 2000s internet get sucked dry by profit motives and left to rot. The change has occurred pretty much entirely within my adult lifetime (I'm only 27, and I still remember when Google was the cool new thing on the internet).

It went from "search engines and the web will usher in a new era of wisdom and democracy" to "useful content is dying at the hands of monetization schemes, and also the internet will be the death of liberal democracy, woe unto us all" in about 15 years.


I tried creating a search engine for recipes. It works well and people like it, but the struggle is no one remembers that it exists and Google is just their default for search.

So from an individual developer perspective, it's very hard to get people to change their habits. And Google/duck/Bing is the one stop shop.

It's still out there, but I haven't worked on it much lately. I always think that if I had some good advertisers, a better UI, and a salary coming in, maybe it could take over some of Google's usage!


> I tried creating a search engine for recipes. It works well and people like it, but the struggle is no one remembers that it exists and Google is just their default for search.

Link please



Just an idea, but what about making it easier for folks to remember to use your search somehow‽

I like and use Duck Duck Go‘s !bangs [1] all the time; maybe try to add your site [2] with a memorable name. May I suggest !garlic?

[1] - https://duckduckgo.com/bang [2] - https://duckduckgo.com/newbang


Just submitted it, let's see what happens!


This is really cool! Bookmarked!

One piece of feedback: it seems to fuzz the term somewhat liberally, without telling me or offering a toggle in the UI. I searched for "Natto" and got some great results at the top (Spicy Kimchi Natto!). However, a few results down, the recipes start including "Nattu", which seems to be an Indian chicken dish, and then quickly have nothing to do with the term at all, giving me things like Tafelspitz.


Yeah, that's the typo detection in search. The key thing is that "Natto" is prioritized, so that part is working correctly!

Tweaking search can be a full-time job in itself. I'll write it down and maybe look into whether I can tweak the search so that exact matches suppress the alternatives.
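To sketch what I mean (a toy example using Python's difflib, not the site's actual code; the titles are made up): only fall back to fuzzy matching when the query has no exact hits.

    import difflib

    def search(query, titles, cutoff=0.7):
        q = query.lower()
        # exact substring hits win: no typo-style alternatives shown
        exact = [t for t in titles if q in t.lower()]
        if exact:
            return exact
        # no exact hits: fall back to titles containing a close word
        return [t for t in titles
                if difflib.get_close_matches(q, t.lower().split(), cutoff=cutoff)]

    titles = ["Spicy Kimchi Natto", "Nattu Chicken Fry", "Tafelspitz"]
    print(search("Natto", titles))   # ['Spicy Kimchi Natto'] only
    print(search("Nattoo", titles))  # typo: both Natto and Nattu titles return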


This is great. I've been juicing a lot lately, and every site you go to is full of ads and useless content, and you have to click `Jump To Recipe`.

One thing that would be helpful is to add in the ingredient amounts.


Yeah, that was part of the long term monetization plan. Paid for user accounts ($3ish/month) that allow people to clip recipes, view previews, discuss, modify, and share.

Never quite got around to it unfortunately, as I wanted more users before new features.


Have bookmarked. I shall try to remember to use it.


Always nice to see other sites using Svelte!


(1) What is it?

(2) Do you have a bang on DuckDuckGo? I'm pretty aggressive with bangs, and I suspect a lot of DDG users end up being aggressive with them as well.


Linked above. I didn't know you could just submit a random site to DDG to be included in bangs.


I think the name may be part of the problem there. Most things people want recipes for probably don't contain or go with garlic. E.g. if I wanted to make a cake, garlic is one of the last things I'd think of.


Something else that has largely disappeared is the organization of content. Now a lot of content is just thrown into a big pile, and the user is left to go fishing on their own with search engines, whose ability to search seems to be declining (e.g., Google often no longer seems to honor mandatory include/exclude search operators). Generally speaking, the result seems to be decreasing order and increasing chaos.

Of course, the massive volume of content creates a fundamental problem, but user curation and categorization on sites like YouTube would be possible, were Google to provide the software support for people to do it. Whether this and similar decisions are deliberate or accidental is likely one of those things we will never know.


> Google often seems to no longer support mandatory include/exclude search parameters

I've noticed this, and it's frustrating. I have assumed it's intentional. I am left to guess as to what a change in this behavior would accomplish.


> Because they were so much better than the old-fashioned publishers, search engines gobbled up the advertising money

No, what happened is that the publishing industry lost their monopoly. They could no longer extract monopoly rents from advertisers.

One problem with search engines is affiliate marketing. If Google de-indexed the junk affiliate sites, the web would be much less polluted with affiliate spam.


> No, what happened is that the publishing industry lost their monopoly. They could no longer extract monopoly rents from advertisers.

Publishing was not a monopoly. Google/FB are a duopoly. If publishers capture rent from advertisers, they plow it back into content, aka the thing consumers actually want. If Google/FB capture rent, they don’t provide a living wage to content creators and plow the money into buying other startups and whatever “metaverse” is.


An "industry" can, almost by definition, not be a monopoly.


It can if there are only a few big players and they're all in cahoots.


Popular HN post circa 2015: “Content Marketing Handbook“

Popular HN post in 2022: “Search engines and SEO spam”.

Inevitable popular HN post 2025: “How to avoid getting flagged when content marketing“.


I cannot find the link now, but there is a great one-page chart that shows all the categories of Craigslist that have been cloned/displaced by start-ups. Does anyone have a link to that report?


I think you're unfairly putting blame on Silicon Valley. Publishers were only able to produce high-quality content because, with no conversion metrics, advertisers were willing to overpay for placement. Tech undermined publishers' revenue, but what it revealed was that people don't actually want high-quality journalism, they want entertainment, and they're definitely not going to pay a premium for it. This was hidden behind publishers' business model.


> Publishers were only able to produce high-quality content because, with no conversion metrics, advertisers were willing to overpay for placement.

This implies that big-budget advertisers (the CPGs, like Coke and P&G) are buying Google/FB because they have better conversion metrics. That isn't true today; only SMBs and gaming companies care about conversion metrics. There are interns in LA/NY probably collectively spending millions on FB for P&G and only reporting the number of likes back to their bosses. Google and FB have never meaningfully delivered on conversions past anything like app downloads.

Tech undermined publishers' revenue because the internet cratered distribution costs. Advertising revenues for big media crashed because the eyeballs moved away, not because it was any less efficient.


> the CPGs, like Coke and P&G

Did any of these heavily buy newspaper ads before 2000? Definitely TV, possibly magazine, but newspaper? I just don't remember seeing ads for Tide in newspapers.


I thought for a moment about your reply. Do you remember the Saturday/Sunday coupon sections? Those are essentially adverts, and they're 100% consumables -- branded health, food, and cleaning products.


That's a fair point; I forgot about those. That said, the earlier comment was on conversion metrics. Unlike brand advertising, brands track conversion metrics on coupons closely.

I stand by my point that tech just exposed a bad business model. Newspapers were only viable because they were the ~only game in town.

Now what actually killed newspapers wasn't search, it was Craigslist killing the classifieds...at least according to a study Google paid for.


I guess there's also the rise of influencers in the mix here. The commoditization of publishing means content creators can more easily work independently.


Those guys covered literally nothing compared to what I can get recommendations for with “product type Reddit”. No thanks.


You may not be aware, but the written word can be used for more than product reviews.


Oh, they were just usually wrong on everything else. Fortunately, these days we have individuals debunking the nonsense. Back then, people just uncritically believed total horseshit.

The invariant has always been: find people who make falsifiable predictions and improve. Back then the pool was small and you had no choice. Now, fortunately we have a choice.


What does SV stand for?


Greed, mostly.


Silicon Valley, i.e., the California tech scene.


Silicon Valley


I've been troubled by the just plain awful results delivered by Google search over the last few years. I think these are genuinely hard problems, and ones Google is not incentivized to solve. Google wants you to click on ads at the end of the day, full stop.

Oftentimes I find myself searching for "best ($product|$thing_to_do)", which I think many other people do as well because we all want the best. Other times I'm looking for a music or book recommendation with some depth. This of course nearly always leads to SEO'd trash. There is no relevance, nor is there trust. So I, like others, use keywords like "reddit" or "forum" to get to real humans whom I trust and whose intention is not to sell via affiliate links.

These issues often lead to the need to find trust in real, human-centered recommendations that stem from real human interests and needs. I've never found an algorithmic solution to this problem. This is why I think college radio stations, or those south of the dial, end up being so, so much better. And why beer recommendations from your local brew-shop owner are better than anything you can find on the net.

I think building hand-curated search verticals would be very interesting to see. But I also think we need to build more communities which allow recommendations to be shared without an incentive to get hits via search, that aren't paid for by large corporations, and where community impact/quality _is_ incentivized. I do worry that those days may be gone and there just may not be enough folks (not in tech) willing to spend so much time online contributing to niche communities. A lot of folks spend much of their time in walled gardens like Facebook, Instagram or Twitter, so it'll be challenging to be sure.


> Oftentimes I find myself searching for "best ($product|$thing_to_do)", which I think many other people do as well because we all want the best.

I do too. I'm wondering why Google didn't invest some effort into "best X" searches. I bet they could extract such information from the web and correlate various sources. They already answer all sorts of semantic knowledge questions.


> So I, like others, use keywords like "reddit" or "forum" to get to real humans whom I trust and whose intention is not to sell via affiliate links.

And therein lies the problem. Reddit makes very little money. Forums probably make negative money nowadays. Google has decided to demonetize the organic internet and subsidizes SEO crap and AMP or whatever dumb thing their signals consider valuable. We get what we incentivize, and right now the incentives in almost all of tech are pretty atrocious.


Did forums ever make money?


Yes, I think, until they were demonetized.


What do you mean by demonetizing forums?


> I think building hand-curated search verticals would be very interesting to see.

That was my inspiration behind a side project I made a few years ago — a decentralized, hand-curated "search engine" [0]. It never got beyond the side-project stage, but I see promise in this for the future. Eventually we'll figure out that moderated, crowd-sourced curation is better than the best machine learning. The filtering capabilities have to be pretty sophisticated to make it work, though.

[0] https://github.com/emwalker/digraph


In some ways, paid search disincentivizes Google from delivering quality organic results.

The larger the gap between paid results vs organic results, the more users click the paid results.

Not sure how to solve this problem.


But paid results rarely, if ever, answer the search query better in any shape or form.

So this would end up with displeased users and bounce-backs.


> This may not just be a problem with Google but possibly also the recipe for beating Google. A startup usually has to start with a niche market. Why not try writing a search engine specifically for some category dominated by SEO spam?

> You might need to do a lot of manual spam fighting initially. That could be both the thing-that-doesn't-scale, and the thing that differentiates you by being alien to Google's DNA. (They must hate manual interventions; so inelegant).

Is he describing...Yahoo circa 1994? A manually curated directory service.


Some of the examples used in the Twitter thread Paul was referring to would be better served by a manually curated directory service with a possible addition of a search engine only surfacing content from the sites in the directory.

For health information and recipes in particular, there are only a handful of really high-quality sites that have quality content for 95% of the information most people need. I bet if you wanted to increase the coverage to 99%, that list would expand to fewer than a thousand sites. At those numbers, manually curating the information would be easily achievable.

Getting people to use your top-notch Google replacement instead of Google, however, is the hard problem.


Isn't that what Google Programmable Search is?

https://cse.google.com/cse?cx=dc408db269da4e769 (try searching for something you want a review of)

Make a search, whitelist the domains. Every time you run into a good review site, add it to the searchable list.
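For anyone who wants to script it, the same engine is also queryable through the Custom Search JSON API. A rough sketch (YOUR_API_KEY and YOUR_CX are placeholders for your own credentials and engine ID):

    import json
    import urllib.parse
    import urllib.request

    def cse_search(query, api_key="YOUR_API_KEY", cx="YOUR_CX"):
        # only the domains whitelisted in the engine's settings are searched
        params = urllib.parse.urlencode({"key": api_key, "cx": cx, "q": query})
        url = "https://www.googleapis.com/customsearch/v1?" + params
        with urllib.request.urlopen(url) as resp:
            data = json.load(resp)
        return [(item["title"], item["link"]) for item in data.get("items", [])]

    for title, link in cse_search("best air purifier review"):
        print(title, "->", link)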


That's all fine and dandy, but the goal isn't just to make some good sites a bit easier to find; it's to keep the top of your search results from being interspersed with, or superseded by, SEO spam. Unless I misunderstood your suggestion.


It's really hard to get people to use something other than Google. If you were to launch such a product, it would have to be so much better that people recommend it organically to other people.


That was what Google was to Yahoo/Altavista back in the day: a 10x improvement. Reading this thread, people feel enough pain to do all sorts of hacky stuff - appending 'reddit' or 'forum' to queries, blacklisting spam domains, switching search engines depending on topic. If G keeps declining and a new product does things better, the penny will drop and people will swap.

Seibel and PG see blood in the water, no doubt; they see G's market share and want to fund companies to take some of it.


It should be able to run on top of Google as a browser extension that inserts itself only when the topic allows.


And that makes me think that StumbleUpon had a similar curation ability, in that the value qualifier is how often [hopefully] real people interact with content - tracked across those using SU who agreed to allow tracking; I can't remember if sharing that was optional or not.

Gaming the system would then have to come through onboarding fake users that pretend to be real ones, mimicking real user behaviour to send that signal into the system; I'm not sure if SU ever ran into that problem, or whether it actively tried to identify and remove fake or suspicious signals from its output.

I feel a much better system is easily within reach; it's simply a matter of getting the right structure, the right foundation, and then it will quickly take off due to the quality difference. I've already figured out a design pattern that Twitter and Facebook have indoctrinated us with, making us think it is normal - keeping us blind to a more natural way of organizing and communicating, one that isn't conducive to control or ad revenue - and so extending my future plans to include a better search-directory system would fit snugly into my efforts.


SU was a great way to surface random interesting stuff. I bet most blog entries today could be picked from Twitter, even if they are unlinked.


I always used DMOZ more than Yahoo! Directory. It looks like [dmoz](https://en.wikipedia.org/wiki/DMOZ) became https://curlie.org/ which is still active.



He's right as often as he's wrong.


What has he been wrong about?


Okay. I was gonna respond with something snarky about what a crappy mod he was, or how he was a Saint and you don't deserve to worship at his functional feet. But I'll tell you what he was wrong about: He was, as a leader and a human and a mod, petty. He pied pipered himself into a sweet spot and no one would deny he's a good coder, but there the ego took off and forever left behind a skidmark. The cool exterior, the sense of self-importance, the punching down, above all the love of spreading one's revelatory wisdom to the poor little guy; you can love that sort of thing too much, and he did. Perhaps you weren't here or didn't interact with him directly. In my view he became dismissive and derogatory toward people who worshiped him (like you) once he acquired a small degree of fame.


What in the world are you babbling about? Who here does worship who?


I'm going to guess Paul Graham, who wrote the tweet. One of the founders of YCombinator and HN (just in case people who read this don't know).


I'm starting to think Yahoo circa 1994 might be better than Google today.


I wouldn't just complain about Google. Google search results mostly reflect a deeper problem with the web today. I do miss the simplicity of the 2000s.


The funny thing is that if the people who worked on spam at Google were free to talk about it, I'm sure it would become evident that they know more about spam and anti-spam efforts than anybody else in existence. It's a ridiculously hard problem, especially when people are targeting you directly. But they aren't free to talk about it, because if they did it would just give more assistance to the spammers, and make the problem worse.

I'm not saying that curated search results for particular verticals is a terrible idea (though I'm sure like anything the devil is in the details), but on the whole Google search is very, very good considering the constant assault they are under from spammers (which most other search engines are not, at least directly).


The problem isn't that Google doesn't employ these people or invest in their activities.

It's that Google has destroyed their own search results in order to continue to expand their revenue opportunities.

If Google:

- Enabled downvoting on results, like YT videos. (Has its own spam problems, just like YT)

- Allowed you to block certain domains from your search results, like YT videos. (If they added some kind of "coordinated network detection" and down-ranked domains coordinating with ones you've blocked, that'd be pretty cool).

- Allowed you to create your own custom search engines, like "Programmable Search Engine".

That would be incredibly valuable. They already have most of the tech, and they could even build a subscription service around custom search engines if they wanted; plenty of people would pay for something like that.

Anyhow, buried in there is your startup idea. Remember: your startup doesn't have to generate the same revenue or profit as the incumbent on day one to be successful.
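Until then, the domain-blocking idea is easy enough to approximate client-side, as post-processing of the results you get back. A toy sketch (the blocklist and URLs are just examples, not a real product):

    from urllib.parse import urlparse

    BLOCKED = {"pinterest.com", "gitmemory.com"}

    def filter_results(urls, blocked=BLOCKED):
        # drop any result whose host is a blocked domain or a subdomain of one
        kept = []
        for url in urls:
            host = urlparse(url).hostname or ""
            if not any(host == d or host.endswith("." + d) for d in blocked):
                kept.append(url)
        return kept

    results = [
        "https://www.pinterest.com/pin/12345/",
        "https://developer.mozilla.org/en-US/docs/Web/JavaScript",
    ]
    print(filter_results(results))  # only the MDN link survives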


The biggest perverse incentive for Google is that better search results can mean fewer clicks on ads (clicking an ad because the results are crap, or going through more pages of results, means more ad impressions). Clicks are revenue, which is much easier to optimise for.

Internal Search owners can push for better algorithms, but what if an algorithm causes revenue to fall? Are there forces strong enough within the organisation to ensure that search quality prevails?

If this is the case, the problem is existential. It can only be arrested at the very top.

https://en.wikipedia.org/wiki/Perverse_incentive


> The biggest perverse incentive for Google is that better search results can mean fewer clicks on ads

This gets close to the real root of the issue -- attention is monetizable independently of the quality of content. There would be much less incentive to create SEO spam if search engines negatively weighted pages with ads and affiliate links, and if manufacturers were barred (e.g. by the FTC) from owning or imitating reviewers.


> The biggest perverse incentive for Google is that better search results can mean fewer clicks on ads

This is also something that Google can control if competitors come along.

i.e. If a reasonable competitor comes along that is willing to sacrifice ad revenue for better search quality than Google, Google can just adjust its search quality upwards to knock them out (and then adjust it back once the competitive threat is gone).

Perverse incentives at Google are all over the place. Searching for the delivery business "Just Eat" in the UK, for instance, returns an ad for their competitor Deliveroo above the legitimate organic result for me, and I can see that Just Eat are having to pay for their own brand name just to compete. IMO this sort of behaviour is anti-competitive, borderline extortion considering Google is the de facto way of searching for a business, and shocking from a search-quality perspective (the wrong result is intentionally shown at the top because someone paid more money).


If I had to pay $10/month for good search results, I absolutely would. I think most people would. Get rid of the ads and spam, and you have a service worth a premium. The solution is to make it user-centric instead of advertiser(spammer)-centric.


Some kind of browser or extension that re-ranks and filters search results on the web.


The custom search engine is harder than you'd think.

Google's search algorithm is tuned up for searching the whole web. It turns out the heuristics you need are very different depending on the size of the collection.

When Gerard Salton was doing IR experiments with punched cards, he was working with collections of as few as 70 documents, and in that case you are going to be very concerned about recall, not precision. Maybe there is 1 relevant document, and if you miss it, you failed.

If you had 70 billion documents, you might have 10,000 relevant documents, and if you lost 60% of them you'd still have 4,000. The end user gets more results than they can sift through.
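A back-of-the-envelope version of that tradeoff (just restating the numbers above as a toy calculation):

    def relevant_retrieved(n_relevant, recall):
        # expected number of relevant documents the engine returns
        return n_relevant * recall

    # Salton-scale collection: ~1 relevant doc, 40% recall -> probably missed
    print(relevant_retrieved(1, 0.4))        # 0.4

    # web-scale collection: 10,000 relevant docs, same 40% recall -> plenty
    print(relevant_retrieved(10_000, 0.4))   # 4000.0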

Thus I always groan when I see a site using "Google Site Search", because the relevance is usually worse than you'd get with the alternatives.

Connected with that is the tuning work: Google has sufficient data to tune up a big model for everybody but true personalized search eludes them because they don't have enough data from you to tune up a model for you.


I agree with you that "true personalized search eludes them because they don't have enough data from you to tune up a model for you". That's what Larry Page said as well: "Google doesn't know what you know." His ultimate goal is an answer machine powered by AI, but that's not happening anytime soon. I think the internet search engines we are using today are primitive compared to what we will have in the future.


The problem with all of this is it would help us greatly, but it would be useless to the 99% that the internet is increasingly being designed for. Modern UI trends are becoming obsessed with removing as many options and features as possible so the dumbest humans bordering on smartest vegetables can still use the service.


And customization breaks caching.


It does not, if there are common interests and characteristics among users. Say I'm a young African-American girl who wants to learn how to code, and I query "how can an African-American girl learn coding?"; Google shows me Black Girls Code, a non-profit organization that focuses on providing technology education for African-American girls. Given that Google knows I'm an African-American girl who wants to learn coding, how many other African-American girls want the same? Probably many. So caching doesn't break customization and personalization, as long as Google knows my characteristics and interests and those of people similar to me.


It doesn’t have to, actually; there are some pretty advanced caching mechanisms that let you combine cached elements together. For the web, at least, you could do this back in the day with Server Side Includes, and a place I worked at used it to cache logged-in content.
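A minimal sketch of the idea (my own illustration, not any particular framework): cache the expensive shared fragment once, and render only the small personalized fragment per request.

    from functools import lru_cache

    @lru_cache(maxsize=None)
    def shared_fragment(page_id):
        # expensive to render, but identical for every user -> cached once
        return "<article>content for %s</article>" % page_id

    def personalized_fragment(user):
        # cheap per-user part, rendered fresh on every request
        return "<nav>logged in as %s</nav>" % user

    def render_page(page_id, user):
        return personalized_fragment(user) + shared_fragment(page_id)

    print(render_page("home", "alice"))
    print(render_page("home", "bob"))  # shared part comes from the cache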


> That would be incredibly valuable. They already have most of the tech. They could even create a subscription service around custom search engines if they really wanted. Plenty of people would find something like that incredibly valuable.

Why would they do this? Google's customers are the advertisers, not the end-users. And no one is going to pay for a search engine, it's been tried and has failed.


> And no one is going to pay for a search engine, it's been tried and has failed.

Always curious about claims like this. I certainly would pay for it, and it sounds like many other people here would as well. I'm curious whether the constraint is that there aren't enough people to actually pay for the investment the service requires, or that there aren't enough people willing to pay to meet the standard VC notion of success. We seem to have a problem with building and supplying services for niche (read: "not expressible as an integer percent of the world's population") customer bases, and I'm never sure if that's a business problem or a cultural problem.


The people most able to pay for a service like this are the people advertisers most want, because they’re the people with enough discretionary budget to spend on things like better Google search results. Allowing someone to buy their way out also reduces your attractiveness to your advertising clients.


I think you have to look at it more like Amazon Prime.

Nobody is going to pay for /just/ a search engine. But they might pay for, say, a /better/ search engine, plus additional features around gmail/gcal/gdrive.

Think of it more as subscribing "to google" and less as subscribing to "google search".

Regardless, the point isn't to "fix" google. It's to highlight a possible path for a new market entrant.

... If an existing player wanted to make a move here, I would say that both Mozilla and Apple are well positioned to add "personalized search" to a subscription service. Same with Microsoft. DDG could also make moves here if they expanded beyond search.


You only need ~1/10000th of Google's revenue to be a financially successful startup. 1/1000th and you'll have a great business, and at 1/100th you'll be somewhere between a unicorn and a decacorn.


Sure, but you'd need a better search engine if people are going to pay for it.

A company with an objectively superior search engine could make even more money with ads, so now you’re back to the beginning.


I don’t know if that follows. Google has been maximizing revenue at the expense of the search result quality.

An “objectively superior” search engine, from an end user’s perspective, might have to make engineering choices that come at the expense of ad revenue.

But we’re all just talking hypotheticals; it’d be cool to see someone launch a startup to get some answers.


"only"


If you think about it, Google provides advertisers a customized search engine to find customers. So it is not you searching the web; it is the web's advertisers searching for leads.


There is a smaller niche market. SEMrush, a tool used in the digital marketing industry, is now public with a multi-billion-dollar market cap. It originally started as a search engine. When they didn't gain traction, they used the tech to monitor Google and interface it for customers tracking their performance in search results (and much more).


Can I ask what you mean by public?

It's not open source as far as I know, and there's only the free trial way to try it.


Probably that they completed an IPO (initial _public_ offering).


How do you fight brigading, the organization of groups elsewhere to vote collectively on something? E.g., white supremacist groups get together and vote down everything by people of color, and vote up their pages about how great they are.


Randomly select which votes are actually recorded. Then add metavoting that votes on the votes, again with random sampling. At Google's scale, with sufficiently random sampling, you'd be extremely hard pressed to successfully brigade or spam the voting.

Google could easily use its current fingerprinting to constrain (to an extent) multiple votes. Even knowing only a portion of the population will participate in the voting, they can use a Wilson confidence interval [0] or similar to properly weight votes.

Random sampling works here since you're not guaranteed one vote per user per page, and the outcome is binomial: seen-and-downvoted or seen-and-not-downvoted.

[0] https://www.mikulskibartosz.name/wilson-score-in-python-exam...
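For the curious, the lower bound of the Wilson interval is straightforward to compute; a sketch (the sample numbers are made up):

    import math

    def wilson_lower_bound(downvotes, views, z=1.96):
        # lower bound of the Wilson score interval for the downvote
        # rate, at ~95% confidence by default
        if views == 0:
            return 0.0
        p = downvotes / views
        denom = 1 + z * z / views
        centre = p + z * z / (2 * views)
        margin = z * math.sqrt((p * (1 - p) + z * z / (4 * views)) / views)
        return (centre - margin) / denom

    # 8 downvotes from 20 sampled views outranks 5 from 200: the smaller
    # sample is weighted by its uncertainty, not just its raw rate
    print(wilson_lower_bound(8, 20))   # ~0.22
    print(wilson_lower_bound(5, 200))  # ~0.01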


Easy: voting blocs. You assign yourself to the results of people who vote similarly to you, and additionally there'd be local and regional blocs. I can't think of a reason the naive approach, where everyone sees everything everyone else is doing, would work in the long run. That's Twitter, and it's garbage.


This is a great point. I would think Google could rank users from low quality to high quality based on the quality of the websites they recommend or downvote. Tricky business and could be difficult to control, but it's basically the same thing they currently do for websites, extended to humans.


How does Google already handle this exact problem on YT?


They don't. There's a lot of pretty obvious manipulation that goes on in YouTube recommendations and search results.


> - Enabled downvoting on results, like YT videos. (Has its own spam problems, just like YT)

They're going in the OPPOSITE DIRECTION from this!! They recently removed all downvote visibility on YouTube videos, so now downvotes only feed into their algorithm. So in the last line of defense against me ending up watching a shitty video, one of the most valuable tools has been removed by my betters. It's preposterous that people think Google is doing a good job. They're actively getting worse, and ignoring everyone saying so.


They're doing a great job. I'm so happy dislike visibility has been removed. It removes the effectiveness of coordinated pile-on harassment, which many YouTubers have fallen victim to.


Except the ratio is visible to the video creator, so they know nonetheless. And there's the possibility of disabling the like/dislike for each video.

All the big channels I follow are mad at this change, and there's a coordinated effort to bring it back.


> Enabled downvoting on results, like YT videos.

You mean the dislike counter they just disabled to force people to sit through more low-quality content and pre-roll ads, to claim an increase in platform engagement and viewership?

The only thing that matters is revenue, and Google reported increases in acquisition costs in prior revenue reports. Expect the engagement and viewership data points to be highlighted in the earnings announcement, and a record quarter for YT coming out of the change.


> Allowed you to block certain domains from your search results

I would love for Google to build this in. Until they do, there is a WebExtension that does this: https://addons.mozilla.org/en-US/firefox/addon/hohser/ ("Block or Highlight Search Engine Results"). I use it to block stuff like W3Schools so when I search for something, MDN is always #1. Saves me a lot of time having to add "MDN" to the end of every query.


Those shitty SEO spam sites exist only to serve ads, and Google has a monopoly on internet ads. So there is no real incentive for them to solve the problem.


Google has a 28.9% share, Facebook 25.2%, and Amazon 10% and growing fast. Not a monopoly. And the incentive is there: if search results are consistently bad, people will stop searching as much, and revenue and market share decline.


Google had the same incentives in 2011, 2012 when they built and released Panda and Penguin.


Real review sites serve ads too. I don't think Google has any incentive to make things worse, and they still want people to google reviews instead of just asking friends or people on reddit for reviews.


Some kinds of "spam" can improve search results.

Things have changed in the past few years, now that Google has developed advanced transformer models, but for a long time Google's question-answering facility amounted to: "let spammers make 10^8 pages where the title is the question and the answer is in the page".

The trouble is that there's a fine line between "the answer is in the page" and word salad.


> - Enabled downvoting on results, like YT videos. (Has its own spam problems, just like YT)

Not convinced this would help. The spammers would just hire people to dislike competitors.

> - Allowed you to block certain domains from your search results

This I would use. Never show me results from Collider, WatchMojo, Ranker...

> - Allowed you to create your own custom search engines, like "Programmable Search Engine".

I think this would lead to people writing highly polarized engines - the Red Pill engine, for example - and we'd have a new problem: the proliferation of popular, highly biased results. Of course, that's not to say Google's results aren't already biased, but they are at least trying to cover everyone.


> Enabled downvoting on results, like YT videos. (Has its own spam problems, just like YT)

Are there any search engines that do this? It's a great, simple idea.


Not really that simple; I see a lot of potential for abuse - using bots and brigading to mass-downvote your competitors or political opponents.

A couple of positions up or down in Google results for somewhat popular and valuable keywords can mean a difference of thousands of dollars per day in ad or affiliate revenue. I suspect it would get pretty wild if Google launched something like this. There already are black-hat SEO methods and services, but something so simple and direct would turn it up to 11.


> I see a lot of potential for abuse

They already have the tech to fight this on YT. In theory, they are supposed to be doing the same thing to detect inauthentic behavior in ad placement and click abuse.


Downvotes could apply just to future recommendations for search results you see, and not apply to advertisements.


> Allowed you to block certain domains from your search results

Blocking Pinterest would be a dream come true.


This, 100%. In travel, we see Google constantly tweaking its algorithms, and compared to Bing, Google surfaces a ton more small, well-written travel blogs [1].

Not only that: Paul and Michael have seen plenty of startups, and at least in recent memory, the number of vertical search and consumer startups that Y Combinator has funded hasn't been that high.

As a consumer startup, I know this issue firsthand. Paul and Michael assume that if you build a better product, users will come! That's simply not true these days.

Instead, you need to:

- Build a better product

- Option 1: Figure out a channel with enough growth on an existing platform. This likely means you're doing SEO for your new search engine

- Option 2: Get your customer lifetime value high enough that you can pay for ads. This is tough, since it's a bit of a chicken-and-egg problem: most search engines are monetized with ads

As the founder of Wanderlog (YC W19; https://wanderlog.com), a consumer vacation-planning app [1], I definitely remember the idealistic days when I thought the best consumer product would win on its own! But growth doesn't just come, and the same can be said of vertical-specific search engines.

[1] Try searching "[your city] itinerary" on Google vs. Bing: it's much more likely you'll find a small blog rather than Lonely Planet or the local travel bureau as the top result


Hi! I used Wanderlog to plan a recent month-long group trip, which was definitely the most complex vacation I've had to plan. For context I am very active when traveling (e.g. multiple activities each day); so not sure how my experiences map to others.

The best part of it was (going to a foreign country) being able to find and identify all the attractions relative to each other, so I could go to cluster A on Monday, cluster B on Tuesday, etc.

The hardest part of it (and why I needed a separate Google Sheet anyway) was--once I had figured out the opening hours of different locations and the hard-to-book activities with limited reservations--the ease of moving things around more fluidly, e.g. cluster B on Monday, cluster A on Tuesday, and having a more information-dense view so I could see larger portions of the itinerary at once.

It would be cool to have an "input everything" --> "input time restrictions / unmovable things" --> "output planned activity clusters" type of workflow.


[1]: both signed in, but with the profile image removed

Bing: https://i.judge.sh/ShareX/2022/01/www.bing.com_search_q%3Dat...

Google: https://i.judge.sh/ShareX/2022/01/www.google.com_search_q%3D...

Interestingly Google didn't have a top-result ad and the google.com/travel carousel is 4th from the bottom.

For the actual results, both thefearlessforeigner.com and paigemindsthegap.com seem to be actual travel blogs (the pictures didn't appear in a reverse image search, so they are probably organic), but they're clearly geared towards being a 'faq' for visiting the city and have affiliate links where appropriate. Bing went straight for discoveratlanta.com, and frommers.com is well-thought-out but not a personal travel blog.


>> - Option 2: Get your customer lifetime value high enough so you can pay for ads. This is tough, since it's a bit of a chicken and the egg problem since most search engines are monetized with ads

nonoonononooonono. No. Don't monetize anything for the first 10 years. That's the only way it can work. Then you can monetize it, buy an island, and not give a shit if you destroy what you created.

Oh, but don't worry. You'll have investors.


Also, I'd be very surprised if they didn't have tens of thousands of workers aiding in spam review already.

The hard part in all of this isn't finding and stopping spam - it's defining what spam is. Are all the pie recipes with a 2,000-word essay about grandma at the top "spam"? They still have the recipe, and Google Home devices pick up the recipe instructions just fine so people end up not reading the essay, but many people would still consider it spam since it adds such an obstacle to getting the information you want. The same goes for CNET articles like "Best smart home devices to buy in 2022" - it's a reputable brand with a list of smart home devices, but it's hardly a review and exists to funnel people to Amazon affiliate links.


> The hard part in all of this isn't finding and stopping spam - it's defining what spam is.

This is one area where Google could use personalised results to provide a better experience for the user. Let me decide what spam is for me. Let me mark results as good or bad, so that the algorithm knows what kind of pages should be prioritised or filtered out the next time. Google SearchWiki was a step towards this but they killed it off.
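To illustrate (a toy sketch of my own, not how SearchWiki actually worked): keep a per-user score for each domain from the good/bad marks, and re-order results with it.

    from collections import defaultdict
    from urllib.parse import urlparse

    user_scores = defaultdict(int)  # domain -> my personal score

    def mark(url, good):
        user_scores[urlparse(url).hostname] += 1 if good else -1

    def rerank(urls):
        # stable sort: my preferred domains float up, marked-bad ones sink
        return sorted(urls, key=lambda u: -user_scores[urlparse(u).hostname])

    mark("https://www.seriouseats.com/some-recipe", good=True)
    mark("https://spamfarm.example/10-best", good=False)
    print(rerank([
        "https://spamfarm.example/10-best",
        "https://www.seriouseats.com/some-recipe",
    ]))  # the Serious Eats link comes first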


Is conservative-leaning info spam or not spam? What about liberal-leaning info?

We have seen what this leads to inside the social networks as well as YouTube, and at a macro scale I think we might want to have a shared concept of what constitutes a good search result for a given query.

At the micro scale, it can seem more optimal to get exactly the type of result you want, but if we take an absurd example like an apple pie recipe, shouldn't we all have a shared understanding of what types of ingredients make an apple pie?

The shared understanding, I believe, is core to communication. If we each have our own specific idea of apple pie, then who is actually right about what an apple pie really is? What happens when your search results insist that an apple pie doesn't actually have apples in it, but pears instead?


Let's have niches where the content is hand curated by human beings instead of pure statistics by machines.

Hmm, why stop there? Let's actually make the users do the curating, and even the content creation, by rewarding them with social validation. Let’s have hard-working moderators who work on the community full time.

Then we could just build a search engine over it. We could call it Reddit. Or HackerNews.

Maybe the users aren't all as good as professionals at curating the information. So let's hire professionally trained curators, pay them well, and call them newspapers. Then we can come in, disrupt them, and replace them with an algorithmic marketplace that eventually becomes infested with clickbait.


AFAIK the 2,000-word essays in recipes are Google's fault - it prioritizes pages with a lot of content, so you have to add that junk to the top in order to rank highly. While I'm sure there's more going on behind the scenes than I'm aware of, it does seem like the rules could be altered on a category-specific basis where a lot of text isn't necessarily a positive.


This reminds me of the page inflation that struck tech books during the late 1990s / early aughts. The Marketing Wisdom was that fat books sold (or took up more shelf space), so texts got padded with weak writing, gratuitous puffery, and other elements which (much like the recipe essays) simply got in the way of delivering actual informative content.

(The fact that many of these books were rushed out with very poor quality control also didn't help.)


Recipe intro text is useful for contextualizing the recipe and for copyright purposes. In the RSS days, it was a way to get readers to click through so the author got the ad views. Also, people who write recipes like to write about food.


I'd say it can be useful, but that's not often the case (especially not to the tune of 1000+ words).


This one is hard, because it does actually seem to be the case that the cruft around the recipe is valuable if the content is right. Most recipe-blog filler is garbage, but if you look at YouTube, it's clear that creators who add extra flair around the recipe are a powerful force.


"Prioritizes" is correct, but in some ways it's not the best descriptor.

Google's algos, while advanced, still rely a ton on text to tell what a page is about. They need it.

If they relied only on other factors (title, links, website, etc.), they would end up with worse results for users. I'm sure they've tested it.

Google's core algo in a lot of ways is much simpler than people think (in other ways of course it's very complex).


While I think the essays are excessive, I appreciate that some of them document that the blogger actually made the recipe, with progress pictures. With the more basic recipe websites, I wonder if anyone's actually made the dish before, or if the recipe is from some scraped database of unknown origin and quality.


Years ago I wanted to pursue micro-blogging, but this "feature" of Google search stopped me.

What's the point of writing succinct, to-the-point mini articles about problems and solutions if nobody finds them on Google?


This is largely because micro-blogging means less content per page: you could write five 300-word blog posts instead of one 1,500-word post.

I've been blogging for the last 10+ years, many of them spent as a freelancer working with startups/brands/editorials. Everyone is after "word count", and I absolutely hate it.

Whenever I work on articles for my own blog, I just don't consider word count at all. I think if your content is great and informative, readership will follow naturally.


This is a very interesting approach. Do you collect traffic data on your blog?


I collect post views, but not with Google Analytics or anything like that. I built a pretty substantial developer blog (tips, resources, etc.) back in 2014. I think it peaked at around 350,000 monthly visitors after 12 months.

Later on, I sold it because I needed the money, not so much because I didn't want to keep working on it. Unfortunately, the new owners didn't have any idea how to maintain a "healthy" content blog, and it has plummeted to around 30,000 monthly visitors. All the content they're publishing now is thin, headline-clickbait bullshit.

I even gave them free advice on how to fix it, but I think a lot of people just don't care and will mindlessly pump out as many pieces of content as possible. Such blogs can be identified from a mile away.

And therein lies the problem with Google SEO at the moment. Even I, someone who has done SEO work for more than a decade, can see that results are getting worse. In some niches, the same crappy articles that dominated 6-7 years ago are still dominant today.

I guess we're stuck in time, or so Google thinks.


Could it also be due to a reduction in public interest in blogs over the past few years? Most stuff is now published in the form of vlogs instead of blogs. I do miss the good old blog era, though, and I wish there were still high-quality blogs around.


It's two-fold. Google prioritizing pages with a lot of content is one thing, but longer content also means more space for ads, more scroll events to trigger ads, etc.

Incidentally, prioritizing long content seems odd to me; in my experience the best pages are short and get right to the point, at least in the context of something like a recipe or other how-to resources.


Yeah, the newest nuisance seems to be sites that clone GitHub Issues and StackOverflow with a crappier interface. Somehow they rank higher than the original sources. I'd call it spam, but it's definitely not traditional spam.


I'm not going to say solving spam programmatically is easy, but the gitmemory garbage site (for one example) has been around long enough that there's no excuse for not downranking or removing it. How hard could it possibly be for humans to spot these few sites and nuke 'em? I'm sure Google engineers see them all the time.


And the strange Wikipedia mirrors that are shown in Google Verbatim searches instead of the original. If I disable Verbatim, they disappear and I get regular Wikipedia instead.


Somehow you got downvoted by their creators here :)


> and Google Home devices pick up the recipe instructions just fine so people end up not reading it

I think this isn't entirely related, but that's perhaps the beginning of a bias: assuming everyone experiences technology the same way as it marches on. I've yet to encounter a Google Home in the wild; I imagine far more people are consuming recipes on phones, tablets and PCs.


Compounding the problem, the 2,000-word essay is sometimes really useful if it's describing a technique used in the recipe (cf. Stella Parks' recipe for homemade bagels on Serious Eats: https://www.seriouseats.com/homemade-bagels-recipe). But somehow only spammy blogs with plagiarized recipes, AI-generated "essays," and affiliate links for every ingredient and tool make it onto the first page of results on Google (or DDG, for that matter).

At some point, Google must have moved away from using site-level reputation in search rankings, as I almost never see recipes from reputable sources like King Arthur Baking, Serious Eats, or Food52 in the first page of results.


Your point is good, but I'm not sure I'd say "very good" given how easily the same SEO spam domains stay at the top of search results for ages simply by scraping someone else's content. What I'd be most interested in knowing is how their success metrics are defined - for example, how much of a problem does Google's management consider it if someone searches, finds the answer they were looking for on someone's Stack Overflow rip-off, and stops searching? I could easily believe that a significant amount of what we're seeing is that they're focused on some kind of user-frustration metric which doesn't account for things like damage to other businesses.


Yes, I've noticed this particularly with technical results. A lot of sites seem to have scraped StackOverflow and GitHub issues, put a crappy ad-loaded interface around them, and somehow outrank the original SO/GitHub content.

It's like the bad old days of ExpertsExchange, which somehow was never delisted by Google for its shady SEO tactics.


You just have to look at Google's profit motive here. Their motive isn't to provide quality search results; it's to show users ads, either in the search results themselves or on the destination sites via their ad network. The SEO spam sites aren't a bug; they are a feature of Google's profit algorithm. Google's search quality will never improve so long as their motivation is to show you ads. Why should it? Competition may help here, either from an outsider like the OP suggests, or by breaking Google up with antitrust enforcement, or both (my preference).

As a user, your best personal and ethical move is to install an ad-blocker, to make ad-based business models less viable, which will help promote business models that don't abuse the customer.


The core problem, I guess, is that search engines view all their results as ads. That’s why they got into the ad business in the first place.


>" The core problem, I guess, is that search engines view all their results as ads. That’s why they got into the ad business in the first place. "

This seems a bit overly cynical. Some search engines only served ads, but they're long gone. The survivors are those who dedicated themselves to finding links which were responsive to people's search intent. They seem to have gotten into ads because it was the best business model in this market.


> It's like the bad-old-days of ExpertsExchange, which somehow was never delisted by Google for its shady SEO tactics.

This is really what made me suspect that Google is teetering on the edge of the MBA death spiral: these problems run for years when they'd be easy to block, which suggests that whatever metric gets you a bonus or promotion doesn't account for long-term threats to the core business, even when they sell a lot of ads short-term.


> A lot of sites seem to have scraped StackOverflow and GitHub issues, put a crappy ad-loaded interface around them, and somehow out-rank the original SO/GitHub content.

Some even made slideshows of SO screen captures and put them on YouTube, with a fake video or spoken intro to make it seem like actual content will be discussed... A number of shameless people will go to any length to grab bits of money anywhere and anyhow, and I've hit those links a couple of times.


They outrank the original content because Google is corrupt.


I said this in a similar thread yesterday, but I think this is an unsolvable problem because much of the content either no longer exists in website form or is old.

To put it simply, the new generation of the people who used to make reliable niche websites - the ones that not just answered your questions but also helped you learn a topic - has moved to YouTube instead.

Google search is hollowing out as a result: the meat is going, leaving SEO'd fluff that kinda answers the question - but ONLY the direct question being asked, with none of the wider expertise that more educated writers used to bring to what you were searching for.

Of course, Google owns YouTube as well... so perhaps they just see it as an inevitable transition.


Just a note on that: YouTube search is finally getting better. Yesterday I noticed it was able to find keywords in the middle of a lecture that appeared nowhere in the title or comments. I always wonder about their AI transcription service; it's gotten so good. If they're storing all that audio as text, I guess their search is going to get excellent?


Is that...essentially a Proof-of-Work system...


The problem, IMO, might be the monoculture we have around search. Because Google is so big, it's enough for spammers to target it alone and they get the vast majority of search visibility. If we had better, more diverse competition, spamming would become a tradeoff: competing engines would presumably use diverse criteria, so you would probably not be the top result on _all_ of them. SEO spam needs upkeep and attention to the latest algos, else it decays. Competing algos would yield better results for everyone. Maybe Google is just ripe for a shakeup.


Doesn't your model predict that Bing would have substantially less SEO-gamed results?

(Disclosure: I work at Google, but not on search)


Well... yes, it should. But no, it seems it does not. I thought about this when typing it but said it anyway, maybe because I think there is still something worthwhile there.

I still think the model could work if the algorithm were sufficiently different from Google's. Ideally, people would go "I did not find anything I cared about on Google; I know, I'll use Bing!" - but nobody does this, because the results are consistently worse.

Don't get me wrong, I like G as a company; I think they do worthwhile things! But they have let things slip and need competition in this field - I mean real competition - then maybe they would actually address these issues.

Maybe the issue is at the incentive level as well. More searches mean more eyeballs and more money for Google. If someone searches once and is done, that is less interaction! I hope they don't work like this, but it's possible.

And another possible problem is the opposite: maybe Google is optimizing search for what it thinks people want, but using the wrong metric. Or it gives people what they want but not what they need.


This x100000.

There is no scenario - none - where thousands of engineers at Google working on search wake up in the morning and say, "We sure have made it good enough with respect to spam. I think I'll have another Danish."


I agree with this and the grandparent comment wholeheartedly. That said, there's a kind of institutional blindness that can build up in companies—especially ones that dominate their sector. It may have roots in intransigent upper management, ossified and inflexible process, wide-scale burnout, a culture of passing the buck, or any number of other pathologies.

I don't claim that Google has any of these, and I certainly have no insight into their search group. But I've personally been at powerful companies with best-of-the-best talent that were blind to the decay in their own living room, so I would caution against immediately dismissing PG's take.


Especially since "made a change that improved search result relevance by X%" is an extremely compelling story for promotions. If indeed there is a launch-driven culture for promos at Google then there'd be extra incentive for new mechanisms to reduce low quality search results.


When the cafes were open, you can bet they said, "I'll have another Danish, and then get back to work on this problem that never seems to go away."


I sure wish I had problems that were totally unsolvable, they are so easy to measure progress on. /sarcasm

I think it’s more likely that they are just building hundreds of tiny tweak experiments, and it’s someone else who decides what to build and whether it even worked. Search quality is such a meta-problem that there's no real hope of working on it in anything beyond a piecemeal, trial-and-error fashion on their dataset.


What is a danish?


A breakfast pastry something like a donut.

https://en.wikipedia.org/wiki/Danish_pastry


The curated search results business model doesn't work. Google gives "aggregators" and other search engines the death sentence for organic search traffic from economically meaningful queries, so you'd get no free traffic. This is one of the major antitrust complaints against Google in the EU. Since you get no organic search traffic, you need to build a brand using advertising, and once you start down that road you need to monetize the first click which compromises the quality of your site.


> This is one of the major antitrust complaints against Google in the EU.

The complaints I've read are from exactly the kind of generated content farms people are complaining about in this thread.


I'm sure they know all about it, but are prevented from doing anything by the business model. Pinterest has been spamming up my search results for years. Maybe other people find it helpful, but I do not. It's obvious I am never going to get value from Pinterest. Let me click a button to add it to my block list. One single click would have given me years of massively improved results.

The fact that this feature does not exist shows that there is something deep within Google's core that is preventing them from addressing SEO spam, just like there is something deep within Airbnb that makes it difficult to filter out Airbnbs with problem reviews.

Google has been coasting for a good long time and now major players are realizing they are wide open for disruption.


> But they aren't free to talk about it, because if they did it would just give more assistance to the spammers, and make the problem worse.

The reality is more that some Google engineer will come up with an algorithm change that makes the result 40% better, but it will come at the expense of making that search 3ms slower so the change won't get merged. Or it will make the results worse for some niche set of queries that the business team really cares about, so again it won't get merged.

There are lots of consumers who would gladly pay $1 a month or whatever in order to use a couple extra milliseconds of compute power per search in exchange for drastically better results, so there is lots of room for a startup to compete.


> There are lots of consumers who would gladly pay $1 a month or whatever in order to use a couple extra milliseconds of compute power per search in exchange for drastically better results

Google has a paid-for Search API, so they could do that if they chose to pursue it. And then they could let Google One users opt-in to the same thing via ordinary Search. I'm not sure whether Bing has anything equivalent.


I think the problem is just that the solution isn't in Google's wheelhouse: There is no algorithmic ranking system that can't be gamed. Human moderation and curation is the only way to provide true quality, and Google is allergic to solutions that don't automate and scale.

I think a really good search engine would still algorithmically search its index, but the content library should be human-curated, with a goal of ingesting content via author, not via platform. Once a given author was human-approved as a quality source of information, content they produce could be automatically ingested going forwards, and conditionally re-reviewed by a human if there were reports the quality had decreased.


This was Yahoo in the late '90s and early 2000s. They had a human-curated directory search where one could look up something like "kayaking" and find a bunch of sites on kayaking. Then if you wanted to search by keyword, it was outsourced to AltaVista and later Google. AltaVista results were terrible, almost nothing more than a keyword match (i.e. the word you were searching for appeared on the page). Google got much better at general search, and the rest was history.

I think the death of the directory search dramatically dropped the number of self-curated, informative sites from a domain expert that were common in the early internet. Now instead of making a website, many people are on content silos like Reddit/FB


I do still think we could adapt this model on top of content silos... assuming we can index them! Consider that rather than ingesting all of Reddit, one could ingest new posts only from particular users who write quality posts there.

Assuming a method also existed for an author to authenticate themselves with the search engine, one could also enable an author to help identify their content across multiple platforms, as well as suggest other quality authors to consider.


> The funny thing is that if the people who worked on spam at Google were free to talk about it, I'm sure it would become evident that they know more about spam and anti-spam efforts than anybody else in existence.

That may be true, but I think one of the good points made on the OP is that it might actually be cultural constraints that keep them from solving the problem:

https://twitter.com/paulg/status/1477761335412809729:

> You might need to do a lot of manual spam fighting initially. That could be both the thing-that-doesn't-scale, and the thing that differentiates you by being alien to Google's DNA. (They must hate manual interventions; so inelegant).

Google has some very smart and knowledgeable people, but the things they do have to fit into certain boxes, which means there are some problems they just can't fix, e.g.

* Everything has to be automated at scale, which leads to consistent poor user experience (unappealable account closures initiated by inscrutable algorithms, SEO spam).

* You get promoted by building new products, not maintaining existing ones, which leads to self-defeating churn outside of core areas (e.g. abandoning Google Talk and squandering their position in the messenger market).

* etc.


I understood PG's point differently. My understanding is that he is suggesting an angle of attack in which carefully crafted manual reviews (that do not scale) can be used to bootstrap a product that does scale thanks to something else (e.g. collaborative filtering). All of this being in a niche domain where you can drive a wedge into the mediocre performance of Google (online shopping probably being the worst possible choice, but there are many others).


But why is Google even dealing with spam? What if they (or someone else) curated top websites for a given category? For instance, when I search for a programming-related term, I already know that I want to see the answer on either Stack Overflow or one of a few reference documentation sites. It is possible that some other site could have the answer instead, but in practice the random sites that often show up at the top of the results are usually SEO spam. A search engine that figured out or let you select the semantic space you are in and then promoted known websites - maybe ones you curate yourself! - would be a big improvement.

Of course you can always hardcode the site you want in the Google search results but this is hacky and not very expressive.
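
To make that concrete, here's a minimal sketch of per-topic domain curation as a re-ranking layer; the topics, domain lists, and function names are all invented for illustration, not any real search API:

    # Toy re-ranker: stable-sort results so user-curated domains
    # for the active topic float to the top. All data is hypothetical.
    from urllib.parse import urlparse

    CURATED = {
        "programming": {"stackoverflow.com", "docs.python.org"},
        "cooking": {"allrecipes.com", "seriouseats.com"},
    }

    def domain(url):
        host = urlparse(url).hostname or ""
        return host[4:] if host.startswith("www.") else host

    def rerank(urls, topic):
        trusted = CURATED.get(topic, set())
        # False sorts before True, so trusted domains come first;
        # sorted() is stable, so original order is kept otherwise.
        return sorted(urls, key=lambda u: domain(u) not in trusted)

    hits = ["https://seo-farm.example/python-lists",
            "https://stackoverflow.com/q/123"]
    print(rerank(hits, "programming")[0])  # the Stack Overflow result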


Legitimate sites could help a lot by adding machine-readable descriptions of their content, per the schema.org spec. The richness of these descriptions means that this is effectively a "hard", non-forgeable claim to being a worthwhile, non-spam source (quite unlike the old META tags that got abused to death pre-Google). Of course spam sites could simply lie in their schema.org tags, but the lies are easy to spot (with combined machine- and human-review) and then they just get banned. It makes it a lot harder (and hopefully infeasible) to SEO-spam by just copying random content.
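
As a rough sketch of the machine-review side, here's a toy JSON-LD extractor with one trivial consistency check; the HTML, the check, and the names are invented, and a real pipeline would cross-check the structured claims against the page text itself:

    # Pull schema.org JSON-LD blocks out of a page and sanity-check them.
    import json, re

    HTML = """<html><head>
    <script type="application/ld+json">
    {"@context": "https://schema.org", "@type": "Review",
     "itemReviewed": {"@type": "Product", "name": "Miter Saw 3000"},
     "author": {"@type": "Person", "name": "Jane Doe"}}
    </script></head><body>...</body></html>"""

    def extract_jsonld(html):
        pattern = r'<script type="application/ld\+json">(.*?)</script>'
        return [json.loads(m) for m in re.findall(pattern, html, re.DOTALL)]

    for block in extract_jsonld(HTML):
        # Trivial check: Review markup with no named author looks suspicious.
        author = block.get("author", {}).get("name")
        if block.get("@type") == "Review" and not author:
            print("flag for human review: anonymous Review markup")
        else:
            print("found", block.get("@type"), "by", author)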


A lot of what counts as spam these days isn't something like "I search for bicycle reviews and get penis enlargement pills", it's more like "I search for bicycle reviews and get some blog who searched Amazon for the 5 most popular bikes and posted links to them with a little blurb and called it a 'Review'".

These sorts of things are easy to spot, but only if you actually have a basic amount of familiarity with the topic. It's hard to spot with "AI" or super-cheap labor.


You talk about this "constant assault from spammers" like it's not Google's fault and it's an intractable problem. That is not a correct characterization. There is plenty of low-hanging fruit that could easily be detected and deranked, for instance scraped Stack Overflow spam. But Google chooses not to deprioritize these results. The reason they don't is that they make money on ad clicks, which many responses have already elaborated on.


The search results have markedly worsened in the last 5 years. Why could they keep up with SEO spam until 5 years ago, but now they can't? Their revenue has been growing dramatically, so they could proportionally increase the allocation. It's probably because the focus of their HR/changing workforce is now elsewhere: maybe fighting "disinformation", both COVID and political. Those efforts were non-existent 5 years ago.


I think it is also no longer in their interest. If you look at their mobile results now, there are sometimes no web page results at all, just ads and their automatically extracted data. So it is now in their interest for search to be bad for non-advertisers. Eventually people will consider those results junk and just use the Google-extracted data and the people who paid to go up.


Most comments focus on the technical side of things, whereas I'm sure there are also legal restrictions involved in this. If Google delists a website on the grounds that it's a copycat of stack overflow, or because they have low quality content according to Google's taste, there might be lawsuits filed against Google, claiming that the company is discriminating.


In which countr[y|ies] does Google not have the discretion to decide that certain sites / pages / etc. "belong considerably further down" in Google's search results pages? Seems to me that sorting the search results to #1, #2, #3, etc. is pretty well baked into their basic product.


I think the issue is that these crappy results are kind of good for revenue. It’s not just organic results impacted but all the affiliate ads.

Google is smart so I assume they crunched the numbers and figured out they make more money from people filtering through crappy results that include viewing and clicking ads than by surfacing good content.

I think Google is optimizing for ad revenue, not for good search.


I agree it’s a hard problem. I don’t agree it’s “really really good”. I regularly encounter obviously scammy websites. With Google’s JS execution capabilities I’d assume they can detect that. I’m talking about the VPN-install pop-ups and so on. Right now there’s a whole bunch of github.io-hosted sites doing that. It’s not even porn. It’s home decoration stuff.


> I'm sure it would become evident that they know more about spam and anti-spam efforts than anybody else in existence

Really?

I can point you to Hard Problems that have been solved better at little startups than at Google - or, indeed, at any other bigco. That's why acquisitions happen.

Why does Google having 1000 engineers working on a problem automatically mean they are the smartest?


Spam, yes, but Google has also made meaningful shifts that are clearly directed from the top-down. It's much harder now (imo) to get specific results, they've overall started looping SERPs into broad answers.

This is def a user-engagement strategy -- but it has cons as well.

Some of the complaints in the thread were spam related; others were something deeper.


I don’t doubt it is hard, but I’m pretty much forced to sign into Google now, so just let me rate results and ban domains again, etc. You will solve the SEO problem really quickly and start giving me results I want.


Highly OT, but if a technical person (not at a managerial level) involved in tackling spam at Google were to leave the team, are they allowed to work on the similar problem space at a different company?


I agree. You can say a lot of bad things about Google, but they definitely have some of the smartest and highest paid engineers working on their search. Plus there are already a lot of people trying to compete with Google and so far, no one seems to provide consistently better results.

The only advantage a startup might have is that they could do completely new concepts, such as specifying what area you search in, allow you to modify their classification of your query and/or moderating sites you include - which is probably necessary anyway, since you'll hardly have the budget to fully index the web. I'm not saying it's impossible, but it's not going to be easy at all.

And after all of that, you still need a way to make some money.


Really? I don't think the bureaucratic bloat at Google cares, and the original authors of the search engine in its current incarnation are probably long gone. It is in maintenance mode and I don't think they dare touch too much.

It takes time and effort to build up a spam site's ranking but it is trivial to blacklist those who get to the top.


A lot of people forget that one of the inputs to the Google ranking algorithm is input from human quality raters, who work off of an extensive 172-page guide that Google publishes and updates for anyone to read: https://static.googleusercontent.com/media/guidelines.raterh...


Apparently the "human quality raters" never found the sites reported in this thread.


How hard is spam, really, if you're Google? Here's what I would do as a heuristic (uh, not evil?): We know everything about you and everywhere you've visited and everyone you've talked to in the last 60 days. We know all their phone numbers and email addresses. We even know the phone number of the girl you met at the bar, who didn't give you her number. So if any of those people email you, we'll categorize that as "not spam". Also, if it's your boss or a coworker, "not spam". If it's a major company that's existed for more than ten years, not spam. Everyone else, spam. Done.

This is hyperbolic, right? But they can solve spam in a split second, if they just admit they're watching you all the time.

[edit] /s thx for reading to the end, folks.


    Why not try writing a search engine specifically
    for some category dominated by SEO spam?
I like to compare search engine results and wrote this tool to make it easy:

https://www.gnod.com/search

There are in fact many vertical search engines. You can click on "more engines" to see the whole list.


Okay that's sort of what the !bang in DDG is for, and why it's a meta-engine. What's the blue sky ideal for a real, no-bullshit, everything search engine that doesn't fall prey to the constant flood of garbage?

I have an exterminator who comes to my house every couple months, and sets up traps here, poison there. I don't have any rats in my house. I do see rats running across the yard sometimes. The exterminator explains it like this: Rat pressure. The rats overpopulate and there's "pressure" (like, uh, "memory pressure", which is also a fluid concept) so they try to get into your house more, through smaller holes, as a function of how much outside drama is going on, how many they are and how overpopulated, how scarce their food supply is, how cold it is outside, and whatever else drives rats into your house. (I love the dude who's my exterminator).

Anyway, this is the same problem every search engine faces. The more surface area they expose, the more pressure they have building, the more ways people have to fake out their systems.

We have to go back to the 1990s Yahoo! model. Curated content. A list of websites that are reputable. 1990s Yahoo is the future.


Creating a "trustless" search crawler, where anybody can participate, and then applying an algorithm to determine trust or value feels like it'd be a never-ending arms race - that'd require AI and extensive/expensive resources that is likely better invested in developing real trust networks and curation; curators are corruptible and regulatory capture of policy is possible if the organization is infiltrated or poorly overseen.

Carte blanche opening your system up to anyone to inject data seems like the wrong foot to start off on. Better to curate a moderator: someone I personally know and feel good about, trust to whatever level, and hire - ideally someone you respect and who respects you. Pay them well, and at scale it will be able to pay for itself. This did just bring to mind big pharma and the structure of pharmaceutical trials, and how that system can be/is/has been captured - so perhaps the pressures when dealing with multi-billion-dollar market categories will always lead to shenanigans if you centralize too much, without de-risking and broader resource distribution via sales/profits to more parties than the "5-star" rated products.

In a thread on HN, I think it was yesterday, a few people posted about review sites where some product reviews are free - but others you have to pay for. A system to facilitate such organizations could allow a highly competitive environment, where organizations develop a brand - build trust for their brand as being competent and thorough - so that over the lifetime of a reader/customer, perhaps they'll spend $1,000 buying reviews (say 333 big purchases over 40 years that you're willing to pay $3 a hit for) to make sure they're buying the right thing. Mind you, there will be organizations that could be captured to, say, promote one conglomerate of products over another, perhaps even regionally, but I'm beginning to think it's a necessary layer to combat the shit show that is Amazon (et al) reviews. Ideally these systems - how the reviews present the information, and how thorough the technical depth, breadth, and testing are - will help educate those who dive into using them, which will sharpen readers while keeping reviewers on their toes, arguably strengthening their organizations and competency as well.


> Creating a "trustless" search crawler, where anybody can participate, and then applying an algorithm to determine trust or value feels like it'd be a never-ending arms race - that'd require AI and extensive/expensive resources

Not necessarily: https://yacy.net


Legitimate question: if this is a real business model (and I believe it could be), then why the fuck does Yahoo.com look like a dead clickbait aggregator instead of, y'know, what it used to look like? i.e. FINANCE [Stocks] [etc.] ENTERTAINMENT [Movies] [TV]. Where's that site?


Visionary leadership left the company? And arguably it lost its soul and excitement. When Marissa Mayer was brought in as CEO and announced they'd bought Tumblr for $1.1 billion in cash, I genuinely thought she could actually turn Yahoo! around - that she understood platforms and holistic systems; perhaps she did, but her hands were tied, and then they made terrible decisions like banning porn on Tumblr - so bureaucracy, politics, and arguably the ad industrial complex and "mainstream" pressures (perhaps like the billing/financial system being used as a tool to suppress freedom/sexuality/porn, because suppressing it politically hasn't been successful) were pressures she or the Board of Directors couldn't counter.

Then Google came along and was a better search engine, for a time, and was a traffic leak for Yahoo! - and then Google has now devolved. I also thought Google had a good shot at competing with Facebook, but whoever's pulling the strings there, judging by the launches of their various platforms, doesn't seem to understand it can take 5-10 years after the MVP of a product is launched for it to mature - for whatever reason their executives or managers haven't been comfortable pulling the trigger, arguably because anyone with that entrepreneurial spirit just takes their idea, gets funding, and owns a large portion of whatever they've done; but then you can never develop a full-breadth, holistic ecosystem that can grow into every crevice, as broadly or as nuanced as possible - so they're stuck being Search, Gmail, Calendar, etc.

I'm quite certain I've figured out the foundational MVP facilitating an "infinitely" growing system, one that would allow 3rd parties to integrate; however I have severe chronic pain that messes up my executive function, so it's difficult for me to actually self-direct and execute - I'm stuck mostly in a low-activity, stream-of-consciousness, go-with-the-flow life of routine - otherwise I would try to launch my plans, which I've done plenty of UX/UI for, as that is simple enough that it somehow bypasses higher executive function (moving a pixel and then responding to the visual feel of it isn't complex) - but organizing to turn that into adequate specs, to get solid estimates or fixed-price quotes for work, is extremely difficult for me.

On January 11th I do have a surgery that may or may not reduce my pain by 50%+, may or may not improve my executive function... I've even attempted to write draft "Show HN:" posts to explain what I am doing, the starting feature sets, the reasoning behind the design decisions I've made - but it just gets too complicated too quickly for me mentally to organize further or polish it. I think I have the perfect domain name for it too: ENGN (engine). What makes me smile every time I notice it in my layout/mockup is that the search input box says "Search ENGN". My username on HN is actually an older incarnation of a plan I had, loceng being a short form of "local engine" - and ENGN coming from engine, a name I brainstormed after Tumblr sold for $1.1 billion, when I realized I'd eventually want to try my "local engine" idea but that the name was too long for a brand. Fortunately engn.com was for sale at the time - I can't remember if it was $2,000 USD or $4,000 USD - either way, not a bad price for a 4-letter .com that's pronounceable to something with meaning.

I've wanted to write a book too - on health, health systems, and on these systems we're talking about here. I'm 38 now and I taught myself to program when I was 11, learned SEO at 15, evolved to design as I'm more creative and programming became mind-numbing to me, and eventually thought I'd need (or want) VC money - so I started engaging on Fred Wilson of USV.com's blog, AVC.com - so I have plenty of self-taught experience. The problem is, even going back to my shorter or longer writings, or comments of mine on HN or elsewhere, it's nearly impossible for me to do the organization of it all - to compile parts, etc.

Maybe this surgery goes well and I can begin to do more, or maybe it doesn't; I've tried to hire people or get help over the years but 1) no one has been willing to engage enough as I'd need due to my executive dysfunction, and maybe that's a moot point as 2) it's extremely difficult for me to even manage someone or an ongoing project - whereas I could explain things and direct if people are initiating, if others are directing the conversation, then I could respond - but otherwise I can't do normal oversight and management any longer.

The most accurate odds I can give that this surgery will help (piriformis syndrome: my sciatic nerve goes through the piriformis muscle rather than around it, so there's constant compression that's worsened with use/engagement of the muscle) is 50/50. There's a high probability that this surgery isn't related to the primary source of my pain, which is from LASIK eye surgery I did 7 years ago - I got arguably the worst of the worst symptoms: central sensitization and hyperalgesia, a hypersensitivity to pain where all sensations, pain especially, are amplified to what seems the maximum possible; and why I must highly limit my activity level, as any little stress on the body, likely even normal natural muscle use which causes micro-tears, compounds the problem and takes many days of very low activity to return to a still difficult, dysfunctional baseline. But perhaps the sciatic nerve having been compressed for most of my life was a level of pain/sensitization my mind and nervous system could handle - and the damage to the cornea that happens in 100% of LASIK surgeries was what finally broke the camel's back.

If this surgery doesn't go well, doesn't help - which it took me 1.5 years to even find a surgeon who does this type of specialized surgery - then I'm afraid I may end my life because this pain, the lack of productivity, of being stuck, of quite little social interaction overall - HN is likely the most stimulated my mind gets, only possible to write this fluidly when it's been at least 3 days of eating a very low inflammatory diet and very low activity level - and only if I've been mostly inactive, primarily sitting, since waking and getting out of bed - to not trigger any pain in my body. Anyway, it gets boring, repetitive.

I've thought of trying to find an Elixir/Phoenix/React/etc developer or agency on Upwork before surgery and try to struggle to get them on at least developing the initial foundation of ENGN, but aside from the struggles I listed above that I'll encounter, it also will cost additional money - and I've not worked in 5 years, I've spent $250,000+ on stem cell treatments to heal old high school football injuries, that I didn't even know I mostly had and only weren't tolerable after LASIK made my nervous system super hypersensitive - and to pay for this surgery my mother is taking $27,000+ USD out of her retirement; I'm in Ontario, Canada, but the healthcare system has been practically useless to me. Even if the foundation for ENGN would cost just $5,000 to $10,000 to get the ball rolling in terms of starting to get users to signup and bringing in revenues - it's more money, but even thinking about that additional stress would put on my mother, then adds to my already overwhelmed nervous system - so there's plenty of resistance there to overcome on its own. There's also always the potential I'd somehow hire a bad contractor or agency, bad in one or many ways, and then the MVP wouldn't get finished - primarily because of my own incompetency-dysfunction, and then I will just be reminded, again, of how stuck my life is and how it barely moves forward - personally or professionally.

I'm living a version of the Groundhog Day movie that keeps repeating itself, except where I'm in pain, and so far where I can tell people my story and ask as many people as possible to help and nothing happens. It's why last thing I try is this surgery, though I am supposed to do another PICL stem cell treatment - where they treat tissues inside of my neck - the first one did reduce my neck pain and migraine some - for whiplash related issues, in part from football - because they only treat one side of the tissues, not all of the tissues, the first treatment - and so the second treatment they target the remaining high yield tissues. But I'm certain I'll know after the surgery if there's any improvement or hope that my life can start to become different, and even though there is a stromal stem cell treatment that was developed at University of Pittsburgh - that had very successful human clinical trials, that were fast tracked under compassionate grounds in India, to heal/regenerate deeper corneal tissue for severe scarring and chemical burns - the ETA for it being FDA approved was 5 years to be clinically available in the US, perhaps less time before available in India, but I'll have nothing else significant treatment wise for further pain reduction to look forward to in the near term after getting this surgery - and so why I'm afraid I won't be around much longer if it doesn't help much.


Hey man, try some breathing exercises and cold showers if you can - I watched this documentary https://www.youtube.com/watch?v=8cvhwquPqJ0

and then the original vice documentary: https://www.youtube.com/watch?v=VaMjhwFE1Zw

and then tried the breathing: https://www.youtube.com/watch?v=tybOi4hjZFQ and cold showers and they've been helping a lot with inflammation!

There are tons of benefits of cold showers, so if you can start doing that, that might already help a lot! Here's a bunch of results you could check out: https://www.youtube.com/results?search_query=cold+showers+be...


Another frustrating thing for me: well-intentioned people like yourself will chime in and offer advice or suggestions for things they've found highly helpful - and instead of being able to compile/organize a list of things I've tried and their outcomes, explaining why they didn't help, why I didn't continue them, why there wasn't a net benefit, I get reminded that writing up a complete list is an extremely difficult task for me. I'm only replying to you this morning because I couldn't actually begin responding yesterday, even though I saw your reply, because 1) I had already opened my right eye, which allowed the post-LASIK eye pain and symptoms to trigger fully (central sensitization/hyperalgesia), and 2) I had begun some activity yesterday - namely driving a short distance to my mother's for a late lunch and then taking my puppy to an off-leash dog park for an hour - that relatively small amount of movement being enough to compound/amplify the seemingly exponentially growing pain/sensitization from the eye pain.

Central sensitization and hyperalgesia are really a different kind of beast when it comes to pain; people, including most doctors, really seem to have a hard time understanding it. That added stress to my nervous system from opening my right eye and triggering/ramping up the sensitization/pain from just low activity and careful movement was enough to lock up my thinking, and arguably my emotions - more specifically, the locking/the eye pain and increased pain from movement add much more friction to fluid thinking. One relatable thought experiment for a "normal"/unaffected person may be to imagine the feeling when you get something in your eye and desperately react to get it out because it's so painful. Now imagine that foreign-object feeling is immense because it's permanent and broad - "a lot of objects" in your eye(s), because your cornea and nerves were sliced across 90%+ of the cornea (and so are abnormally/constantly signaling as such) - and imagine how your brain/nervous system tries to cope with such an overwhelming/overriding system constantly firing to draw your full attention to what your eye is telling your body is an active, present-moment object in your eye - potentially as if you'd just walked into a sharp object and your eye is signalling for you to literally freeze still, because it thinks you've just done something to critically wound your eye and moving another millimetre would threaten your survival (as the evolutionary strength of the reaction has come to dictate).

Well, the answer is some people after LASIK get this severe reaction; their nervous system gets overwhelmed - arguably the more naturally sensitive, creative, healthy, and grounded people will have stronger/more detrimental symptoms/reactions to the eye damage; central sensitization and hyperalgesia that the LASIK industry for years completely swept under the rug as even being possible, only recently admitting it is a potential "side" effect. In fact they purposefully mislabeled what should be called corneal neuralgia syndrome as "dry eye syndrome", to mislead people away from learning that the "dry eye" part is actually on a spectrum of symptoms caused by the damaged cornea/nerves, damage that happens in 100% of their surgeries; research not run by the LASIK industry has shown up to 40% of people have permanent problems after LASIK, and in 2011 one of the expert FDA advisors, who had voted yes to approving LASIK, published a letter to the FDA asking them to immediately recall LASIK because there was data they were ignoring that they should never have been ignoring, and that it should never have been approved in the first place. Part of the medical industrial complex, where arguably regulatory and institutional capture has occurred in the name of profits.

So again, I'm writing this out first thing in the morning with my right eye still shut, my eyes having had a reduced level of pain while I slept, allowing my nervous system to calm down some, along with recovering from the movement yesterday that compounds with the eye pain and worsens the mental/executive symptoms/dysfunction. There are a few tangents I know I lost track of while writing above; I'm not going to be able to go back to insert them, so if what I wrote above doesn't flow well or seems to be missing something, that's why.

To answer you specifically - I've done breath work and cold showers in the past, daily to multiple times daily for weeks, and there was no net benefit and arguably it was added stress to my nervous system that's already overwhelmed from active overriding [eye] pain that can't be reduced. That same eye pain and the friction it added once I'd opened my right eye, and in part I had used a lot of mental energy already writing longer comments on HN - that mental energy otherwise goes to trying to maintain focus/distraction on anything to try to keep me from getting completely sucked in/lost to the pain, the friction that will block mental (including emotional abilities/processing), lead to that small inconsequential reminder or level of frustration triggered from a well-intentioned person offering advice - where I was too blocked, too much friction/resistance from the pain at that point that it took me at least 20 minutes of going back and forth to reading parts of what you wrote then having to leave because of frustration/irritability being triggered because of the very slight pressure/stress that was added to my already overwhelmed nervous system; I'm not sure I'm describing this well here but it's best I can do at the moment.

That's why logically, emotionally I'm already certain it's the compassionate thing to do and I'll forgive myself for not being able to handle the burden nor for being able to handle knowing the burden/consequences I'd be leaving behind, logically I'm only doing the piriformis syndrome surgery as the last thing I try - so if it doesn't have a dramatic reduction in pain and cause a noticeable reduction in my executive dysfunction (my argument or hypothesis is the eye pain, a/the major source of pain could be compounding with another potentially major source of pain [piriformis syndrome, similar to sciatica], so eye pain could be say 33%, the PS could be say 33%, and the compounding could potentially cause runaway/cascading/feedback loop pain of 33%+ - for potentially a 66%+ reduction in pain/sensitization) then I'm gone. Of course I have arguments and proof points from my prior experiences with healing injuries with stem cell treatments as to why the surgery may or may not help, so the outcome is completely uncertain if it will help enough, if my executive function will improve at all, if my quality of life can improve at all, and so the most accurate odds I can give it is a 50/50 shot.

I've also done plenty of water fasting, and I did a carnivore/high-fat red-meat-only diet for 8 months - now I try to stick to just organic red meat, kale, and raspberries. I've also done many Ayahuasca ceremonies, MDMA therapeutic sessions, massage, acupuncture, etc. There are pros and cons to all of it; some I've been able to maintain because it's at least neutral - other things like acupuncture are intolerable, for example: after sessions to clear my Wood meridian energy line [3 end points at the right eye] my body is completely calm, but the pain becomes completely localized at my right eye, and for the following 6-8 hours I feel an intense burning sensation there - meanwhile feeling the pain and stress referring from my eye and building up into my body until there's an equilibrium of sorts, where the pain/hypersensitivity level in my body matches the level of pain in my eyes.

My nervous system is very healthy; it's the pain that's overwhelming, short-circuiting or disrupting different processes. I can't do anything more to reduce the eye pain because the stromal stem cell treatment is up to 5 years away from being clinically available, everything being delayed due to the pandemic as well, so my last hope is the piriformis syndrome surgery - for which I have no real or solid reference for how much it might be sensitizing my nervous system. All I know is that "sciatica" symptoms bothered me enough 15 years ago for me to first try to investigate it, but that my nervous system could handle that pain, along with the vast majority of high school football related injuries that I wasn't even aware I had - and it was only after LASIK that my nervous system, and therefore my mind, was disrupted.


Lot to digest.

Listen, first of all, do not consider ending your life. Seriously, you're way too smart for that. I'm sure you've got it worse, but I've had enormous sciatic problems in my life, I've had 3 herniated discs; they're behaving for the moment after massive doses of cortisone and without any painkillers, but I know what it feels like to cough them all out of my back at the same time. Not to be able to put a foot in front of another or turn your neck for weeks. (I'm a huuuuge fan of intramuscular cortisone injections, though. Like 5 or 6 large cortisone over a week, with some B-12. Every couple years. Not in the spine... fuck that. Alternating butt cheeks. You won't feel any benefit until the third day at least. If you can convince a doctor to give you that for a week, you will be fucking superman. They won't do this in America unless you know a doctor personally, but they'll do it in Mexico or Spain. I had it the last time my discs went out and it's been 6 years and the inflammation has not come back. They thought it would).

Anyway, before you off yourself, do try a fuckton of intramuscular steroids. The fifth day I levitated off a bed in the hospital; I hadn't walked in a week; I felt so good I went to a club; I got drunk and spent the night on a beach drinking and making out with an 18 year old model from Denmark. Seriously. There were wild cats walking around; it was winter on the Spanish coast. If you do one thing before you die, go get five cortisone shots in your ass, in a week.

I also got the hiccups for 24 hours and couldn't sleep, but that's neither here nor there. And I got temporary blindness in my left eye from fluid behind the retina, caused probably by too much testosterone. But. Goddamn it, I'm ok. You can be okay.

Enough about that.

About Yahoo and Google. That entrepreneurial spirit is, in my experience, way too often just about getting the funding and fucking off. We all know why these companies go downhill, but somehow it's always such a shock when they actually deteriorate in front of our eyes, huh? Google's search results, for instance. I would have expected their core business to stay more or less fine, not collapse a couple years after all the competition was eliminated.

It would be fine if they didn't grow into every crevice. Get search right, that's all we ask. I don't want Google to be my chat room or my shopping site. Why do they need to? Search is huge. They own 90% of the market.

>> but organizing to turn that into adequate specs to get solid estimates or fixed price quotes for work is extremely difficult to me.

That's always the worst. The business side. I've always just built things and hoped for the best. It sounds like you've got something interesting going there, although I have no damn clue what you're building, that's an exciting feeling. ENGN is killer. If you own ENGN.com, hell, money well spent.

I don't understand what you mean about "executive function", since you obviously have the capacity to write well-crafted email and think pretty clearly; perhaps I lack the executive function to discern your lack of executive function (I'm a brutally self-punishing alcoholic, but otherwise a damn good programmer)

Anyway I don't know if you're trying to ask for pointers to workers for this concept, I'm probably not it; I'm $200/hr and I'm already covered for the next year. This, however, should be your symphony. And I think you know how to do it.


The fact that you included Reddit, SO, Google Scholar etc is awesome (I thought it was only for main search engines). Thanks for sharing. Bookmarked.


I have a version of fixing this that I would personally enjoy a lot. Leave google alone, let it crawl the web, prioritize what it wants to via algorithms. But, give me a version of that which ONLY surfaces results from discussion forums (including SO, Reddit, HN, etc). For most of the stuff where I am actively searching and not just looking stuff up, discussion forums of motivated, self-selected contributors have the stuff I need with the context I need. It used to be that blogs had answers, but that media has been categorically ruined by SEO.

Now, one of the deficiencies here has been examples. Try this: "best miter saw". You will not find any websites that actually discuss the answer to this question, despite it being a product category with a lot of price variability and performance tradeoffs (weight, capacity, power, cord vs cordless, accuracy).

Nearly any product reviews for large purchases follow the same pattern unless consumer reports has decided to dig deep (e.g. washing machines).

How about guitar strings? Sandpaper? Printers? google's algorithm has allowed profit motivated websites to displace the commons to too great an extent.


How can they tell it’s a discussion forum? Does the scraped search spam that copies stack overflow content look enough like a discussion it fools their heuristics? Is it a manual process (in which case you can bet it “doesn’t scale” and won’t be built) this is the problem they face. Literally nothing is simple given the size of their dataset, the scope of their user base, and the adversarial nature of the very world around them generating new data they must work with in order to do the job.

There’s definitely an element of “we got our profit so fuck it” with respect to the search engine advertising business and Google’s incentives to make search quality better, but that doesn’t change the difficulty of the underlying problem. If I wanted to pay Google $5 per search for super high quality results, even $50, they can’t just make the product better to get my money. They are fighting an ongoing war against adversarial SEO, which prevents this from ever being better than a stalemate at best - or, more likely due to economics, the slow slide into declining quality we see, the SEO side having more money with which to pay for engineering brainpower.


You may be interested in searX - it refers to data sources as Engines; you have the ability to run your own instance (or use a public shared one) and only enable engines you want results from (reddit, stackoverflow etc.). Build your own meta-engine recipe, basically.

https://searx.space to learn / get started. Find one and visit it, click Preferences upper right then follow your schnoz.


The results include all the SEO spam that infected Google, ie. SO clones. How is it better?


The GP commented about using curated sources. One can disable all those (google, bing, etc.) and choose to only enable results from reddit, wikipedia and so forth in searX, which directly queries based on a config inside the project.


My current solution for this is to just tack `site:reddit.com` onto the beginning of Google searches. A Google search for `site:reddit.com best miter saw` has a lot of relevant results.

Marketers/SEO people are starting to infiltrate this as well, but since they can't control and SEO the content on Reddit nearly as much, this still works pretty well for now.
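
If you want to automate that across a handful of forums, here's a tiny sketch (the site list and helper name are just examples; Google has historically supported `OR` between `site:` operators, though that behavior isn't guaranteed):

    # Build a forum-only Google query instead of typing site: by hand.
    from urllib.parse import quote_plus

    FORUMS = ["reddit.com", "news.ycombinator.com", "practicalmachinist.com"]

    def forum_query(terms):
        sites = " OR ".join(f"site:{s}" for s in FORUMS)
        return "https://www.google.com/search?q=" + quote_plus(f"({sites}) {terms}")

    print(forum_query("best miter saw"))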


Of course! This is a tip I got from HN years ago. It gets old to add Reddit, PracticalMachinist, Fine Woodworking, etc. I really want a proxy for user-generated content where nobody got paid to write it.

Fun story: I knew a person who worked for a home building website part time. She got paid to write stories on home renovations. She had never done _anything_ she wrote about. Mostly, she gathered up other blogspam and recycled and rewrote it without citation. Sometimes she went to forums, sometimes Reddit, sometimes YouTube. But the universal part of it was that she had to produce two pieces of content per week, endlessly. Just for a local LA builder. Most of the content wasn't "wrong", but it also wasn't exactly incisive and didn't include any details that would have been useful. Instead it was just filler. The worst part is that it consistently improved that company's search ranking.

Content farming needs to die.


https://search.marginalia.nu has a very interesting approach to this:

Use tracking especially, and JS generally, as a weight in ranking, so sites that contain much of either need to be exceptionally high quality to float to the top.

This means sites with limited ads and tracking, typically enthusiast driven pages float to the top.

Now, whenever someone discusses a novel way to combat webspam, someone will immediately counter: if this becomes popular, SEO hackers will immediately start doing this.

Well - if reducing page size and removing tracking becomes a leading SEO trick I can deal with a bit of SEO hacking :-)

Yet for some reason I feel Google won't start using this very simple metric :-)
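
For illustration only (made-up weights and tracker list, and certainly not Marginalia's actual code), the core of such a metric can be tiny:

    # Toy quality penalty: the more scripts and known tracker hosts a
    # page embeds, the more relevance it needs in order to rank.
    import re

    TRACKER_HOSTS = ("googletagmanager.com", "doubleclick.net", "facebook.net")

    def quality_penalty(html):
        scripts = len(re.findall(r"<script\b", html, re.IGNORECASE))
        trackers = sum(html.count(h) for h in TRACKER_HOSTS)
        return 1.0 + 0.1 * scripts + 0.5 * trackers

    def final_score(relevance, html):
        return relevance / quality_penalty(html)

    lean = "<html><p>Hand-written kayaking guide</p></html>"
    heavy = "<script src='https://googletagmanager.com/gtm.js'></script>" * 5
    print(final_score(1.0, lean), final_score(1.0, heavy))  # 1.0 vs 0.25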


Another big factor in what I do is prioritizing the opinions of certain indieweb sites when ranking domains, basically a segment of the graph consisting of humans with a particular dislike for seo spam. This makes ranking manipulation much less effective.
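
Roughly speaking, the textbook version of this family of ideas is seed-personalized PageRank (as in TrustRank): random jumps land only on hand-picked, spam-averse seed domains, so trust propagates outward along links. A toy sketch with an invented link graph:

    # Seed-personalized PageRank over a tiny, made-up domain graph.
    LINKS = {
        "indieblog.example": ["hobbywiki.example", "seofarm.example"],
        "hobbywiki.example": ["indieblog.example"],
        "seofarm.example": ["seofarm2.example"],
        "seofarm2.example": ["seofarm.example"],
    }
    SEEDS = {"indieblog.example"}

    def trust_rank(links, seeds, damping=0.85, iters=50):
        nodes = list(links)
        rank = {n: 1.0 / len(nodes) for n in nodes}
        for _ in range(iters):
            # Teleport mass goes only to seeds, not uniformly to all nodes.
            nxt = {n: (1 - damping) / len(seeds) if n in seeds else 0.0
                   for n in nodes}
            for src, outs in links.items():
                for dst in outs:
                    nxt[dst] += damping * rank[src] / len(outs)
            rank = nxt
        return rank

    for site, score in sorted(trust_rank(LINKS, SEEDS).items(),
                              key=lambda kv: -kv[1]):
        print(f"{site:20s} {score:.3f}")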


Good to see you in this thread! I just added Marginalia to the recommended search engines in a new search tool I'm building, to get programming answers faster. The search assistant builds queries for specific sites with "site:targetsite.com , programming question" (that comma is not a typo). When doing a query like that I get no results but these warnings:

/!\ The term "," contains characters that are not currently supported

/!\ Try rephrasing the query, changing the word order or using synonyms to get different results. Tips.

sample: https://search.marginalia.nu/search?query=site%3Astackoverfl...

Please make your engine ignore the comma, it shouldn't affect the search.

Either ignore the site:... expression or filter sites accordingly.

Thanks a lot for creating Marginalia!


Ignoring comma seems doable, I'll have it fixed in a few days, currently away from my work computer.

site:-queries are supported, but only at the first domain level. (e.g. site:marginalia.nu; not site:search.marginalia.nu). I might tune it so that it strips subdomains automatically, that is pretty trivial.


I've made the changes to the syntax (as well as fixed a bug that broke site:-queries), although I think it's probably not a great search engine for searching big mainstream websites like SO, the crawler is designed to de-prioritize them over blogs and what have you. Disregard what I said in the other comment about the domain names, use the full domain name.

e.g. this works fine: https://search.marginalia.nu/search?query=site%3Amemex.margi...

If you just do a site:-query without any search terms, you'll get a dump outlining the crawling status of the site. Like this:

https://search.marginalia.nu/search?query=site%3Astackoverfl...

You can see it's crawled 80 pages but not gone much farther since SO ranked so low.


I think the complaints about SEO spam are valid - but - I think msweibel and pg misdiagnose the challenge. The challenge is that you are dealing with an adversarial system: the better your search engine is and the more widely it is used, the more valuable it becomes for your adversaries to find ways to game your rankings.

Any new niche search engine will go through a small window of time where they have the luxury that none of the sites they are indexing are spending all their effort trying to reverse engineer their signals and optimize against them. I'm incredibly skeptical that they can remain useful once all the SEO efforts of various marketers start to be turned against them.


The main motivation to SEO crappy content seems to be ads and affiliate links. What if you take that motivation away by deprioritizing sites that contain ads or affiliate links? Of course Google would never do this, but someone else could.
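
A sketch of what the detection side could look like; the URL patterns below are common affiliate-link shapes (e.g. Amazon Associates' `tag=` parameter), not an exhaustive or authoritative list:

    # Crude affiliate-link counter for down-weighting monetized pages.
    import re

    AFFILIATE_PATTERNS = [
        r"amazon\.[a-z.]+/[^\"']*[?&]tag=",  # Amazon Associates tag
        r"[?&](aff|affiliate|ref)_?id=",     # generic affiliate ids
    ]

    def affiliate_link_count(html):
        return sum(len(re.findall(p, html)) for p in AFFILIATE_PATTERNS)

    page = '<a href="https://amazon.com/dp/B01?tag=spamblog-20">Best TV</a>'
    print(affiliate_link_count(page))  # 1 -> candidate for down-ranking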


> The main motivation to SEO crappy content seems to be ads and affiliate links.

Maybe the main observed motivation. But I'd argue that a lot of that is just a fraudulently-profitable front to much more devious problems.


This doesn't work because all of the "good" content has ads and affiliate links too.


Just give users the ability to blacklist domains when searching; pretty soon you'll have a decent list of what users consider worthless.

And Pinterest would die.


Google used to have such a feature.

It would be nice if Google would ask you the simple question: "Did you find what you're looking for?" Instead they rely on the assumption that users only stop looking when they've found what they're looking for.

These days, there's a reasonably high chance that I quit looking because I gave up in futility--not because I found what I was looking for.

It's also the case that there's no way to train Google not to omit search terms or generalize them to the point of uselessness.

I really wish abusive SEO were the only problem but it's far from the case. Search results being crappy is a cumulative effect. You could solve SEO spam and I'll still not be able to find a USB SuperSpeed cable because it gets generalized to "usb cable" and there are a gazillion more charging cables than there are SuperSpeed cables.

Used to be that you could quote things to indicate that you really meant it. That's fuzzy now too. Every time we figure out how to circumvent the bad results, features are removed.


uBlock Origin static filters to the rescue!

Block results from specific domains on Google or DDG:

    google.*##.g:has(a[href*="thetopsites.com"])
    duckduckgo.*##.results > div:has(a[href*="thetopsites.com"])
And it's even possible to target element content with regex with the `:has-text(/regex/)` selector.

    google.*##.g:has(*:has-text(/bye topic of noninterest/i))
    duckduckgo.*##.results > div:has(*:has-text(/bye topic of noninterest/i))
Bonus content: Ever tried getting rid of Medium's obnoxious cookie notification? Just nuke it from orbit:

    *##body>div:has(div:has-text(/To make Medium work.*Privacy Policy.*Cookie Policy/i))


>And Pinterest would die.

They're friends with the king, so don't hold your breath.


Google knows how to surface relevant results and they choose not to because they aren't optimizing for relevant results, they're optimizing for revenue or profit within some constraints (don't lose too many users, privacy, avoid actually terrible or completely irrelevant results).

All the various suggestions in this thread plus far more complex and insightful solutions are known to Google. Most of it boils down to using automated user feedback to improve or measure search result relevancy.

Google doesn't need to solicit user upvotes / downvotes to improve rankings. They can monitor user clicks on results in addition to analytics on the sites the users visit to determine which sites are relevant to which searches.

Google doesn't optimize for search relevancy.
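
For example (invented numbers and names; detecting "good abandonment" in real systems is far subtler), a dwell-time satisfaction signal is only a few lines:

    # Estimate per-result satisfaction from clicks: a click that
    # "sticks" (long dwell) counts as satisfied; a quick bounce doesn't.
    from collections import defaultdict

    clicks = [  # (query, url, dwell_seconds) - an invented log
        ("best miter saw", "https://forum.example/saw-thread", 240),
        ("best miter saw", "https://spamblog.example/top-10-saws", 5),
        ("best miter saw", "https://spamblog.example/top-10-saws", 3),
    ]

    def satisfaction(log, min_dwell=30):
        stats = defaultdict(lambda: [0, 0])  # (query, url) -> [satisfied, total]
        for query, url, dwell in log:
            stats[(query, url)][1] += 1
            stats[(query, url)][0] += dwell >= min_dwell
        return {k: sat / total for k, (sat, total) in stats.items()}

    for (query, url), score in satisfaction(clicks).items():
        print(f"{score:.0%}  {url}")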


Likewise YouTube doesn't optimize for relevant results, only engagement, to maximize ad exposure. The side effect is that polarizing content gets returned more than relevant content (polarizing content being more engaging than relevant content, apparently).


This is already happening for a bunch of verticals:

Travel - Expedia, Hotels.com, Kayak, etc.

Consumer Goods - Amazon, WalMart, EBay, Etsy, etc.

Automobile Purchase - Cars.com, Autotrader, etc.

Career/Job - Indeed, LinkedIn, etc.

As Google continues to lose search volume in these big revenue categories, fighting spam is going to get much more difficult, since they are left working to sort out long-tail spam. Way harder.


Yeah the comments in this thread are baffling (or they didn't read the 100 character tweet lol). The tweet is just describing domain specific database-like websites. Have people not heard of allrecipes.com? Yummly? Or one of the other thousands of recipe db sites? No blogspam, just structured recipe search. You can even search by ingredient!

Mayo Clinic, Harvard Health, and PubMed do a great job with health info. IMDb for movies, Goodreads for books, *gearlab.com for reviews, booking.com for accommodations.

I think the biggest threat to Google isn't a better general search engine, it's user behavior switching to more domain-specific websites as the top of the funnel. E.g. people going directly to Amazon to search for products instead of first searching Google.

To some extent, Google has figured this out, which is why they now have a dedicated flight search, hotel search, product search (Google shopping still exists and it's pretty good!), etc.


Wow so that's completely opposite of my feelings on this topic. I would never, ever use Expedia for travel search over Google flights/hotels. Google travel is the meta-search engine for this vertical. Expedia etc. are all-in on spam and scams, trying at every opportunity to take an extra dollar from you.

The same with Amazon. You'd think that with all the armchair search quality experts cropping up lately there might be more vocal complaints about the fact that Amazon's own search can't find basic consumer products sold by Amazon itself. If I want to find stuff on Amazon, I search Google for it.


Except almost all of these aren't search engines, but their own walled gardens. You can't search for an item on Ebay and find a link to the product's Amazon page, for example.


I'm almost certain that pg gave "compete with Google by competing in some niche" advice 10+ years ago.

In any case, I'm not sure that competing in search is a very attractive notion. AdWords is the only meaningfully profitable search business. Even if you steal 10% of Google's market, that absolutely doesn't translate into 10% of the revenue.

That said, recipes. Someone make a search engine where the top results don't start with 500 words on the history & etymology of butter, because that's what Google wants.


It's usually more than 500 words lol


The quote-Tweeted thread mentioned recipes as one of the things that has been SEObliterated. It's a great example of the problem, and also a great example of the problems any solution will encounter.

Recipes have become a bellwether Internet problem. In the past, your great-grandmother had a card file with a bunch of 3x5 index cards with the ingredients and instructions on how to make everything, and they pretty much all fit on one side. There was a great deal of domain knowledge required (e.g. "whip to stiff peaks"), but these things reveled in their terseness.

Internet recipes all begin with 9 paragraphs of the author's first time encountering the dish in a Moroccan bazaar in 1997, and the life story of the chef. There are two embedded 10-minute videos of the lifecycle of the vanilla bean. And then you get to the ingredient list. Then two more 10-minute videos, then instructions.

The drive to make recipes full-contact Internet content has changed what it means to be a recipe. This is similar to how cooking shows evolved from Julia Child working on a sound stage to a carnival-barker presentation with vivid personalities dominating the scene.

I'm not sure there is any technological solution to a problem that has fundamentally changed what it means to be a recipe, short of establishing a new informational silo in the form of a new Web site devoted to recipes only. You could encourage an RSS-like format for recipes, but that requires buy-in from places that profit from the new evolution. This new status quo may be good or bad--you can make the argument either way--but it is what it is. A cultural change is required more than tweaking algorithms.

(Unless tweaking algorithms can be foundational to cultural change, in which case we really, really, really need to take a hard look at the corporate behemoths and their algorithms, and sooner better than later.)
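
For what it's worth, a machine-readable recipe format already exists (schema.org/Recipe, which many sites embed for Google's recipe cards); the missing piece is cultural - sites leading with it instead of the life story. A toy sketch of rendering such a structured record back into index-card terseness (the record itself is invented):

    # Collapse a structured recipe back into 3x5-card form.
    recipe = {
        "name": "Meringue",
        "ingredients": ["4 egg whites", "1 cup sugar", "1 tsp vanilla"],
        "steps": ["Whip whites to stiff peaks.",
                  "Fold in sugar gradually.",
                  "Bake at 225F for 90 min."],
    }

    def index_card(r):
        lines = [r["name"].upper(), ", ".join(r["ingredients"])]
        lines += [f"{i}. {s}" for i, s in enumerate(r["steps"], 1)]
        return "\n".join(lines)

    print(index_card(recipe))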


The recipe problem is mostly because actual recipes are not copyrightable. See e.g. https://www.plagiarismtoday.com/2015/03/24/recipes-copyright...


Neither are ideas. Hence article spinning and book summaries.

We need a semantic dupe filter: If it doesn’t add new facts or new ideas, treat it as an identical copy.
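
A minimal sketch of that filter using word shingles and Jaccard similarity; production systems use MinHash/SimHash or embeddings to make this scale, but the principle is the same (the 0.6 threshold is invented):

    # If two pages share almost all their 3-word shingles, the later
    # one adds nothing new -> treat it as a copy and derank it.
    def shingles(text, k=3):
        words = text.lower().split()
        return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}

    def jaccard(a, b):
        sa, sb = shingles(a), shingles(b)
        return len(sa & sb) / len(sa | sb) if sa | sb else 0.0

    original = "Whip the egg whites to stiff peaks then fold in the sugar"
    spun = "Whip the egg whites to stiff peaks then fold in the sugar slowly"
    if jaccard(original, spun) > 0.6:
        print("near-duplicate: derank the copy")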


>I'm not sure there is any technological solution

The technological solution would be to stop rewarding them for these monstrosities. One of the main motivators for turning a short recipe into a 19-page essay about the chef's life is that more words = better ranking.


But the technology is solely controlled by a single multinational advertising corporation. The motivations are controlled by that same advertising corporation.

Which is the same as there being no technological solutions.


And funny enough, it's obvious that Google's engineers know this because they're adding more small, self-hosted featured results to the top of the page all the time.


And the end game is that ads pay for it. Nuke ads or downrank them and the incentive goes away, making space for enthusiast non-profit-driven websites.


I think a lot of this is due to Google both owning search and the ads on the websites (AdSense). There’s an incentive for them to prioritize click farms (and other sites filled with their ads). I think in general there may be a correlation between the number of ads on a site and its usefulness to me, which is inverse to its usefulness to google.

I’m curious what would happen if those products were split up into 2 separate companies.


I also can't help but wonder this. I'm certain people at Google Search want to provide the best quality search results and do this with integrity. But at some point in the business hierarchy you are at a level where people set objectives for both these departments (search & ads) and are trying to optimise for things like total revenue/profit.


Yes. In fact if you wanted to cut out the SEO spam then delisting anything with AdSense would probably be a good start for a competitor.


Good idea. You could start with fitness. Lots of high-quality information out there that’s entirely, 100% inaccessible via google.

Over COVID, I did the whole fitness thing from a few different angles (overhauled diet, trained for a marathon, now lifting weights a lot). I found I could only find good info by going directly to a trusted source - literally, typing http://www. like I’m in the 90s or something. This is the exact issue a search engine should solve, but Google doesn’t.


Is your trusted source the same as my trusted source which is the same as my neighbor’s trusted source which is the same as my Australian cousin’s trusted source?

If not, at least one of us is going to hate this new search engine.


This is a problem I'm working on.

What sorts of unique things did you do that Google failed at? Maybe you read through discussion sites or got tips from books or something like that


Social media and trial and error. I knew a bit, and asked some friends for advice. I used that to find people who said stuff I thought made sense on Quora, Tik Tok, and Instagram (ie they agreed with what I knew to be true and false, so I could assume the other stuff they said was likely to be true as well). I tried what they said, found what worked, and went all in when I saw results.

Importantly, this was bottom up: it was largely recommendation engines suggesting people I then filtered through for what I wanted (running, bodybuilding) vs didn’t (traditional weight loss). I couldn’t specify what I wanted, or it would be garbage SEO spam.


That's a new angle to me. I rely on pseudonymous content like Reddit and HN. It does make sense to look for people or groups focused on a larger topic like fitness.

That helps a lot! Thank you.


Could you share your trusted sources? Human search engine :P



JPG on TikTok, and Geoffrey Verity Schofield on Instagram/Quora/buy his book.


He’s just describing Webrings, except in a reactive tense (“filter out spam sites”) rather than a proactive tense (“associate your site with other worthwhile sites”). Google’s ranking algorithm only works when someone is proactively curating, and only SEO spammers do so these days. Reactive curation is not a viable way to manage information.

The simplest way to compete with Google is to create a DIY Webrings site that disallows harvesting of data by Google. Charge curators to create a webring, and let curators select three hashtags and a description that represent their list of fifty or fewer sites. Use the revenue to pay a human to curate the list of hashtags, and let users tip a webring curator in gratitude with an Apple Pay button.

This is how to make a million dollars, Pinboard-style, out of the ashes of the original curated Yahoo idea and the information structures of hashtagging. It doesn’t work if you allow free-for-all infinite-sized lists, it doesn’t work if you allow free-for-all hashtags, but with clear limits and moderation of tags (instead of webrings), it would thrive. By moderating tags, users can keep the webring they paid for, and SEO rings will stick out for having no shared network with any other rings, which allows for easier detection and culling of malicious non-participatory actors. Plus, with the curation networks in place, it becomes possible to bubble up rings that have unusual content for positive human moderation activity.

I tried to find some good podcast lists yesterday and each site I visited had a really interesting cross-section, but there were so many duplicates. I wish the ring site existed, so that it could remember what it had shown me already, and I could say “show me rings that intersect with this podcast and have something new I haven’t seen before”.

That’s where the theory of pagerank and the practice of curation and the capabilities of search align, and given that moderation of hashtags scales very cheaply, is a billion dollar opportunity that Google and Amazon cannot compete with if handled properly. It’s not about trying to get a cut of every visit’s revenue potential. It’s about giving human beings a directory that respects their time and remembers what they’ve seen.


If I understand correctly, you're saying you'd create an exclusive webring, but the rule to joining is that you have to Disallow: Google, Bing, etc. in your robots.txt file. That sounds outrageous, but speaking as a content creator, I wouldn't be giving up much. My blog gets about 3% of its traffic from search engines. I have no idea who these visitors are or what they searched for, since browsers no longer send referral links. If a webring offered me the benefit of positive regularly engaged community, then having my blog part ways with search engines would be a no brainer. That is after all the Facebook model, except tailored for the open web. Believe me when I say that we bloggers are waiting to be rescued.


No, you can join the webring and be still indexed by Google, but the sitelist of the webring cannot be. That’s all. It prevents Google from leeching off the human curation and not paying a fair market value for it. Given that Google makes billions of dollars a year on pagerank data, webring curators got screwed over pretty hard already once by Google twenty years ago, so no reason to allow it again.


So the webring would use rel="nofollow" hyperlinks? Blog authors might see that as an insult.


I think pg is missing something important here. The reason Google was able to beat Yahoo, Altavista, Ask, etc. was not just because they had a better formula — it was also because they started in the era where 'search' was still seen as secondary to 'portals' by the big guys. Had these companies known how important search is to the internet back then, they would've copied Google's secret sauce and crushed it long before it could suck up their traffic.

This isn't going to happen again. Google isn't going to sit around twiddling its thumbs while a competitor develops a better algorithm.

You have to attack the problem from a different angle entirely (make something that looks nothing like a search engine), I don't think a niche market is going to be enough.

Perhaps you just want to make something that scares Google into acquiring you, rather than actually bettering the situation. If that's the case, I implore you to think of better ways to spend your life.


Altavista was the only search engine in the same league as Google. So Google hired the guy who built it. Altavista infrastructure couldn't scale beyond a single server, because that's how DEC was, so it was a smart move for him.


> I don't think a niche market is going to be enough.

To displace a giant gobbling 180 billion dollars a year? Yeah, no kidding. But nobody is asking for that. They are just asking for decent search results.


For medical search the answer is pubmed. Not only is the collection of documents clean (of low-grade scammers, pharma companies have to pay big $ to play) but the NIH has done a large amount of search quality and ontology work -- the system knows "Tylenol" is synonymous with "Paracetamol", "Acetaminophen", etc.


> the system knows "Tylenol" is synonymous with "Paracetamol", "Acetaminophen", etc.

This is exactly the kind of thing that Google cannot fathom doing manually. As if entering facts into a computer were morally wrong somehow. They'd much rather launch the equivalent of a shell script that harnesses face-melting amounts of computational power, processing literally trillions of webpages in bulk, signal and noise together, junk, spam, and misdirection alike, to learn bad associations and then serve them up with no human review and then put the full force of their reputation behind a results page that apparently people never check and certainly can't correct because of the inscrutability of a machine-learned model that has few to no levers to adjust.


Google has long lied about what they do.

I had a chance to debrief people who had left their relevance team and they told me things that were outright contradictory to what rank-and-file Google employees have told me. (What they told me did make sense in terms of my experience as an IR system developer, SEO publisher, etc.)

Microsoft bought a company called PowerSet that had extracted a large database of entities and relationships from Wikipedia and used the technology to make the "Bing" search engine.

Earlier Microsoft engines were a joke, but Bing was so good that Google saw it as a threat so they bought Freebase to get a similar kind of database, then they killed it to incorporate it into the "Google Knowledge Graph".

For all of their hating on semantics, note that they hired R. V. Guha as their chief scientist, who worked with Doug Lenat on the notorious

https://en.wikipedia.org/wiki/Cyc


Isn’t that kind of easy to do? I mean, you can do that kind of thing as a one-man-show on consumer hardware using GloVe or fastText.
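Something like this, with gensim and pretrained GloVe vectors, does get surprisingly far (assuming the terms are in the vocabulary):

    # Nearest neighbours in word-embedding space surface synonyms and
    # related terms, e.g. "tylenol" -> "acetaminophen", "paracetamol".
    import gensim.downloader as api

    vectors = api.load("glove-wiki-gigaword-100")  # pretrained GloVe
    for word, score in vectors.most_similar("tylenol", topn=5):
        print(f"{word}\t{score:.3f}")

The catch is that embedding neighbours also include related-but-wrong terms (other painkillers, say), which is exactly why a curated ontology like PubMed's still beats the one-man-show version.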


A search engine that only indexed Reddit, Stack Exchange, Wikipedia, and a small number of other "good" sites would get 80% of the way there.

No, DDG bang operators don't let you do this. I want a SERP, not a shortcut to a single site's on-site search.


For business news, I do this with https://yup.is


https://twitter.com/mwseibel/status/1477707884632834049

> I’m pretty sure the engineers responsible for Google Search aren’t happy about the quality of results either. I’m wondering if this isn’t really a tech problem but the influence of some suit responsible for quarterly ad revenue increases.

Please no more of this. Two men, Page and Brin, together have basically unfettered control over Google.* If Google does something bad then, unless it's genuinely something small enough that those two could not be expected to hear about it, it's happening with—at the very least—their acquiescence. And low overall search quality is not something that some "suit" is successfully hiding from Good Czar Larry. They could fire the "suit", or command him or her to make other decisions. This is—again, at the very least—something that they have chosen not to do. The responsibility lies with them.

* There is the risk of lawsuits from the minority shareholders, I assume. But IIUC this is not realistically that big a restraint on what shareholders with a majority of votes can do. However IANAL.


What I am looking for is control over the results. Personalized blacklists and lists of sites to be (de)prioritized and also the ability to subscribe to community curated versions of the same.

And to be clear I want to be able to control these myself, not algorithm trying to guess my preferences. No guessing, just do what I tell you to.

Multiple search profiles with different priorities would be nice too.

I would like the search algorithm to be transparent, I should be able to tell why I got a certain result and how I can avoid such results in the future.


You can use uBlock Origin to blacklist domains from your search results. I do this for example with codegrepper and other sites that just copy-paste Stack Overflow comments in a less readable format.
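For reference, filters along these lines in uBlock Origin's "My filters" pane do the trick; the ".g" result-container class is an assumption that changes whenever Google updates its markup, and the domains are just examples:

    ! Hide Google results that link to unwanted domains (illustrative)
    google.*##.g:has(a[href*="codegrepper.com"])
    google.*##.g:has(a[href*="pinterest."])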


Thanks. I've been using other browser extensions for this, doing it from ublock origin is a good idea, will simplify things.


FYI - Google hires 10,000+ search result raters [1], who are contractors, to evaluate search result quality.

In an ideal world, you build a thing, and it's done. It runs automatically and prints out money.

In reality, you still need human labor for manual tasks, even in the tech industry.

[1] https://www.searchenginejournal.com/google-eat/quality-rater...


Thanks for posting this. An acquaintance of mine did this job 6 years ago and I wasn't sure if it still existed.

Crowd-sourced humans are making Google appear more intelligent than it actually is. I always envisioned that spam efforts would just immediately set off an alarm that would be handled by a bot to blacklist you without a human even knowing your site existed, but there still seem to be at least a few ways to game Google's search ratings.


Just a minute ago, I made a small typo in a non-obscure programming-related search term.

    Showing results for searchterm
    No results found for searchterm
Followed by an unending list of random celebrities I don't know nor care about, businesses I've never been to selling items I have absolutely no use for, and random foreign news articles.

Failure to recognise the typo is unexpected but forgivable. But then, rather than helping me with my search, they attempt to distract and lead me away from it - using triggers that you'd think they should have known wouldn't work.

I really don't understand how this is even possible, and it's not a rare occurrence.


Was this linked for the irony of everyone spamming replies advertising their startups which don't solve the problem but kinda-sorta do, resulting in something hard to read and understand?


Interestingly, Google Maps doesn't suffer as much from the issues with Google Search. Maybe because it has those community-driven curation features that PG is talking about? Google Maps is fantastic at finding places to go to (and getting you there).

Also, why hasn't Apple built a search engine yet? It baffles me that they chose to go head-to-head with Google on Maps, yet outsourced their search engine. I would've liked it the other way around: Google Maps and Apple Search.


It’s much harder to get SEO-spam-style content into Maps, given the geographic region limits involved. But it happens. My favourite example is searching for suppliers of something basic, say structural aluminium extrusions: big and heavy, so you ideally don’t want to ship it far, which makes it an ideal thing to search for a local supplier of. In Australia it’s basically a given that I’ll get results for my city, because suppliers list it as a delivery area. But when you actually try to find them on the map as a pin, nope: it’s either not there, or it’s just a sales office rather than a warehouse or workshop. They have tricked the system into listing them as local to my area in a way that pollutes my Maps search results.

But this only works for certain industries. It’s much less common to see this kind of tactic if you’re searching for, say, a coffee shop, because the sheer number of local results lets Google be “hyper local” with those kinds of results.


Google Maps is spammed by fake locksmiths and other trades that make it look like they're local but all route to the same boiler room (probably right next to the tech-support or IRS scammers), from where they dispatch a crooked and most likely unlicensed tradesman who will do a poor job and overcharge you (destroying the lock so they can sell you an overpriced replacement instead of picking it, etc).

For licensed trades the solution is to go to your official trade licensing body (for the UK it's the NICEIC for electricians and the Gas Safe Register for gas/HVAC technicians), for unlicensed ones it's more difficult. There are "review" sites that claim to provide good results but their business incentives & vulnerability to spam/fake reviews are unknown.


Apple makes $10+ billion a year from Google by setting it as the default search engine on the iPhone.

I agree they should make their own search engine. But currently they’re being paid a ton of money not to.


I'm seeing a lot of comments along the lines of, "Google shows ads on the SEO-gamed sites that show up in results, so their incentive is to give spammy results". But wouldn't this predict that results would be much better on Bing and other search engines that don't have much presence in the "put ads on random sites" market?

(Disclosure: I work on ads at Google, speaking only for myself)


> But wouldn't this predict that results would be much better on Bing and other search engines that don't have much presence in the "put ads on random sites" market?

Honestly what I think is every search engine sucks these days, and Google manages to suck a little bit less.

The reason is how easy it has become to publish low-quality content. It's rare to find high-quality content, and the issue with search engines is that they don't surface it. It isn't recommended by default. It's hidden.

What happened is that the recommendation system is broken! If there weren't neural networks making the decisions, it would be a different issue. But with modern search engines deploying recommendation systems, it's a rich-get-richer scheme: you can't recommend new or fresh but high-quality content because it was never visited. When the entire backend relies on user data, the system is being fed crappy data, because most users don't care and those who do are few in number.

As long as the system is making revenue, it will stay this way. Most people never care at all and would never be bothered, because they only have simple queries. If you deviate from the norm, Google's search results are pretty bad, like every other search engine's.


10 years ago, the original engineer of Google's search engine told me what he now wanted was asynchronous, human-powered search with curated results, e.g. a Google-like interface, but queries cost $5 and take 15 minutes.

Money's no object for him, so he wanted to outsource the filtering, ranking, and interpreting of results. Would be even more useful today (albeit a tiny TAM.)


I'm seeing something similar to this when it comes to lead generation for sales teams. The market right now is full of tools where you search for companies/prospects that you want to target, and you end up with a list of outdated or incorrect data.

One of the solutions is to hire people remotely that will build a list from searching and verifying that the information is correct. There are a ton of scaling issues with a person building the lists, and you will still run into errors, but it has been the best way to verify and have the most up-to-date data.

A problem I noticed is that people are so used to getting tons of results from Google or another database site that they expect the same with human-powered results. So I think it takes a lot of expectation setting, and also working with the TAM that understands the impact a curated list provides over an outdated database. Both have their costs, but for higher impact, human curation will always win over automation. (I have an example of 3 different lead-gen sites having a different city for 1 company; the AI on the websites mixed up the location, while a human would be able to catch that.)


I would have told him not to quit his day job. That sounds ridiculous.


I don't agree, plus he's a billionaire.


Why’s he not building it now, then? He has the means.


Building costs more than money.


True. I’m wondering though if he hasn’t given up on the idea. It’s been ten years. Maybe he doesn’t believe anymore that a human could do better, even if given 15 minutes of time?

If such a demand exists, should we not be seeing an active “secondary marketplace” for people offering to do 15-minute human meta search & research tasks?


I said "wanted it" not "wanted to build/finance it." Doubt he's given up wanting it.

A human can always do better: take Google's results, then remove SEO spam/duplicates, extract more relevant snippets, combine results from multiple nearby queries, etc.

Demand exists, but someone has to build it. And it's unclear how big the market is.


> the original engineer of Google's search engine

you mean Larry Page?


No, the guy who re-wrote Larry's research code into Python and put it in production.


> You might need to do a lot of manual spam fighting initially

This is why I am very hyped about Brave Search's Goggles feature: it will let you share exclude/include site lists to use with the search engine. Hopefully it will empower these niche communities to curate lists of non-spam sites (like the ad blockers do with ads today).
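For a flavour of what a goggle might look like, here is a sketch based on Brave's published quickstart; the exact directive syntax is an assumption on my part and may have changed:

    ! name: Less recipe spam
    ! description: Boost a trusted cooking site, drop a known content farm.
    $boost=2,site=seriouseats.com
    $discard,site=spammy-recipes.example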


I believe that the only moat protecting $100B of AdWords revenue is the quality of the Google Search results. There is no meaningful switching cost to using a new search engine, and the spend inertia in ad spend is not very significant (i.e. any online marketing manager will happily spend 5% of their budget in a different search engine adwords-like program if they get better ROI, there is no incentive the be a "Google Ads only shop").

On the other hand, Google needs to maintain the ballistic trajectory of its revenue growth. So how can they fix search quality when they've minmax'd themselves into this situation in the first place? If they were to make the ads background yellow again, that would have negative short term effects that I doubt any career exec can stomach.


No.

The other moats are lock-in to advertising networks, website metrics, and effective control over Web standards through the Chrome browser.

An alternative search platform might provide better search. It would be fighting Google on at least three other fronts. It might have some success, but it would be challenging. (As history largely demonstrates.)

Even a rival tech monopolist, Microsoft, barely holds even with its own search offering (I use that indirectly via DDG), and scrapped its own web-browser development.


I mean, Google is big and it has that advantage like any big enterprise, but the search engine market is very permeable compared to what people traditionally refer to as a moat (say, trying to compete with YouTube as a video hosting platform or with Salesforce as a CRM).

If you have a good search engine, people will flock to it, and search ads will be valuable. That's it. That's how Google became Google.

The fact that Microsoft couldn't do it honestly doesn't mean much. Microsoft also couldn't do a phone OS, a portable music player, and many other things. They have a complex web of conflicting interests that $SEARCH_ENGINE_STARTUP does not.


Just for the record, I'm in at least mild agreement that search is looking increasingly vulnerable. Google are falling down here.

It's just that "search" is really a web of interrelated services, capabilities, and revenue streams, and they tend to reinforce each other strongly. I'd like to see the monopoly disrupted.[1] But I don't think it's just a matter of "build a better mousetrap^Wsearch engine." Attack one corner, and Google will snipe at you from the others.

And with the AdWords cash cow, they've got an immense revenue stream.

________________________________

Notes:

1. Well, mostly. Google's acquired so goddamned much personal data that the premise is frankly kind of terrifying as well --- a weakened Google with neither the revenues nor talent to defend that pile.... And I'd really like to see the toppling occur without simply raising a new monopoly in its place.


“There is no meaningful switching cost to using a new search engine.”

Unfortunately, defaults matter, and Google is spending billions of dollars yearly to make sure they are the default search engine wherever they can. Most people don’t switch from the default.


The way I see it that's only a problem if you want to be bigger than Google. But you can get to $1B in revenue with just the deliberate adopters (i.e. under 1% of the market).


search ads responsible for the rise of google search[1], content ads (seo spam) responsible for google search's fall?

my guess is the rate of spam content production far outpaces the rate of original content creation. so the power law concentrates even further in the tiny percentage of OC and a moat forms around them (highest ad $, highest authority/authenticity).

where do we end up 5 years from now? further consolidation and the continued return to aol style portals (telco/media giants and fast-lane to own content?) pay-to-access silos dominating the internet?

[1] oversimplifying a bit of course, there was a novel ranking method that was more than accurate enough, and it scaled, which allowed for the search ad business to go gangbusters.



Sadly the only way of fixing it is making search results unattractive for cracking. Ranking of a page in search results is a metric, and every metric is a hackable metric. Only way they won't be hacked is if there's no incentive to.

Sure a search engine that specialises on narrow area of knowledge without much money in it, can be very relevant and bullshit-free.

But there's no way to make it work for the general web search. People hack things. If they didn't we'd have Communism built by now (yes the "good" - classless, stateless one).


All of the amateur search quality experts forget to mention the regulatory environment. Obviously, Google could nuke Pinterest from orbit, dramatically improving image search results. Clearly, Google could effectively take down Statista, technically. But various Eurocracies have shown an extreme willingness to take the side of Yelp, Pinterest, and whatever other spam/scam mills are able to form a shadow alliance with Microsoft's astroturf campaigns like "fairsearch" and whatever.


big part of it. and the scrutiny is only going to get more intense


If this is a real concern they can side step it by simply allowing users to block specific domains, like they have in the past.


Being able to flag Fandom, Quora, and Pinterest results would bring me great joy.


I don't think this problem can be solved by another search engine as they currently exist. The problem must be solved by a new kind of search engine that exclusively searches Internet communities (HN, Reddit, etc.). The content must be community produced, since all other forms of writing have monetary incentive. You will find the best results in communities of enthusiasts with respected moderation teams.

So, a curated strategy where users can UP/DOWN sources they trust for particular topics. Relevancy and user ranking of answers determine the score. Of course, the user can search any sources they like, but a voting system would control the defaults.

I think this avoids slanted results as well, because topics are objective. The subjectivity will be in the comments, where they belong. That's in contrast to today where SEO scams can determine how high up results are.

So, to game my proposed search engine, you need to infiltrate the users. I believe this is harder to do across numerous sources compared to the current system, where you game the algorithm.
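A toy sketch of that scoring, with made-up numbers and names, just to pin down the idea:

    # Community votes on (topic, source) pairs scale ordinary relevance.
    from collections import defaultdict

    source_votes = defaultdict(int)  # net up/down votes per (topic, source)
    source_votes[("fitness", "reddit.com/r/running")] = 412
    source_votes[("fitness", "spam-blog.example")] = -95

    def score(topic, source, relevance):
        votes = source_votes[(topic, source)]
        trust = 1.0 + votes / (abs(votes) + 100.0)  # squashed into (0, 2)
        return relevance * trust

An unknown source gets trust 1.0, a heavily upvoted one approaches 2.0, and a heavily downvoted one approaches 0, so gaming it requires sustained vote manipulation across many users rather than on-page SEO tricks.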


Oh so it's not just me... Most recently I was trying to find a way to reset a printer and also fix a certain error code. I searched on Google and it's filled with irrelevant content unrelated to the model number I put in, scam websites, and ink sellers, even though I used the correct syntax such as the + sign, which is a joke if you think about it. A billion-dollar company and this is the best they can do. The advanced search is buried, and their syntax is explained by third parties or buried somewhere in the options.

I jumped onto YouTube and put in the model number; same thing, pretty much. I get unrelated videos mixed in with the model number I put in, and I'm pretty sure some videos with the same model number aren't being shown. Ironically, there's a video explaining a certain solution and warning people not to fall victim to another video scamming people; its dislike count has been removed and comments obviously deleted, so some people may be calling and getting scammed.


I have seen several alternative search engine projects posted on HN every other week: your privacy-focused, open-source Google alternative for "insert niche here", with big hopes of disruption.

I will give you my two cents. I have used DuckDuckGo, Bing, Searx, etc. for extended periods of time and hated every one of them. The problem is that what you search for is essentially the gateway to the wild west of the internet. I understand the proposition of spam control in search engines, but at least to me, the early days of Google, without DMCA takedowns and copyright bans, were Google at its best.

I fear SEO spam control will only bring the worst of the moderated internet. It would not be the first time big tech tried to douse a gasoline fire with more gasoline because they thought the new fire would suffocate the old one by starving it of oxygen(?). Rather than using "AI" as a crutch to solve SEO as a problem, I want to see an option that is true to 2005-era Google.


Go read any default subreddit on Reddit to see what this idea would look like long-term, especially the "amateur police" part.


The whole thing seems "simple" to me – graph of identities with url vetting/liking/approve-this-message-like actions, you don't need anything else.

Reputation, non-fakeness etc. can be derived from it for anybody - you just list identities you trust/follow (with weights?) and anything you look at can be scored.

Virtual identities can also be created, i.e. an identity listing all links mentioned on HN (with positive sentiment only?), links from Wikipedia, etc., so people can follow those to create their reality graphs.

The interesting part is that it doesn't claim universal truth: depending on who you follow, your results will be skewed towards their view of the world. I.e. if you follow MIT, Wikipedia and E. Musk you'll see a different version of truth than somebody following Fox News and the Flat Earth Society, for example.

It could be interesting to focus on "dislike" marking (only?) as it may be much more lightweight to approach it from blacklisting side.
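A minimal sketch of the scoring this implies; all data here is illustrative:

    # Each identity endorses (+1) or flags (-1) URLs. Your score for a
    # URL is the weighted sum over the identities you choose to follow.
    follows = {"me": {"wikipedia": 1.0, "hn-links": 0.5, "emusk": 0.3}}
    endorsements = {
        "wikipedia": {"https://example.org/deep-dive": +1},
        "hn-links":  {"https://example.org/deep-dive": +1,
                      "https://spam.example/listicle": -1},
        "emusk":     {"https://spam.example/listicle": +1},
    }

    def url_score(viewer, url):
        return sum(w * endorsements.get(ident, {}).get(url, 0)
                   for ident, w in follows.get(viewer, {}).items())

    print(url_score("me", "https://spam.example/listicle"))  # 0.3 - 0.5 = -0.2

Note that the same URL scores differently for different viewers, which is the point: no universal ranking, just your graph's opinion.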


Searching code is also impossible on Google. If there’s a competing search engine for that I’ll use it at least for this use case.


Give Neeva a try. We have improved ranking and some nice features around tech queries.


hot take: why would I need to enter my email to do a search online? You already lost me :o


* We're adding a code finder as a topic search engine on Breeze.
* publicwww.com is also a good source code search engine.


If I remember correctly, it used to be called About.com - with categorized and human curated links.

It was big during the dot com days, but withered after Google.

Interestingly, I do think that that model may need to be revisited.

Edit: I feel that Reddit is filling some of this need, at least for things like vacuum and espresso machines with dedicated spaces.


It's funny you mentioned About.com. Or as it's known today, Dotdash, AKA IAC.

It's easily one of the largest SEO players out there, and they've been on a buying spree recently with their purchase of Meredith. The quality of the content has gotten better, but it's still a monster of an SEO optimized content machine.


Yes, I know nothing about what they’re doing today. In fact my pihole blocks them, must be in some list. So I’m pretty sure they’re crap today.

I just remember the concept from way back.


What are some search categories that are so dominated by spam that they are unusable?

I'll start: "how to rent a car" [0]

[0] worth noting that I personally get somewhat reasonable results for this, with a 3rd result from nerdwallet.com and a 4th from wikihow.com, both of which seem to answer the question in an unbiased way


Nerdwallet and wikiHow are both SEO spam content farms. They just happen to have above-average quality content.

They don't exist without a search engine.


Not sure how you're differentiating "seo spam content farm" from "website"


That is part of the problem


Google is not important because it has all the information - it's important because it has hardly any.

A major complaint is that there used to be good free reviews of commercial products that could be easily found.

That is not "all the information". Information about the current round of commerically advertised products is something like 5-10% of all commerce (or less).

And we are entering a world where "all the information" is what we do all day, what we say, how we react to different stimuli.

Those are the real review sites: why do people take this train and not that one, why is that park safe and this one full of muggings.

We need to solve the Google problem not because we want blogging like it's 2009 but because epidemiology is about to open humanity's eyes. And it's going to hurt if we don't make it free and open.


Google’s job is to serve its customers, and it does that really, really well.

The problems being discussed today (and yesterday in the similar thread) come from the fact that for Google user != customer.

When you have incentives that are misaligned like this, you can only go so far! We seem to have reached that point with Google, where there is not much more that can be done on the search experience front without jeopardizing customer experience (ad revenue).

Disclosure: I’m working on a paid search engine to solve this problem on a fundamental level, by aligning the incentives and making user also the customer so we can best serve them and their needs. It is called Kagi and is currently in closed beta accepting beta-testers.

https://kagi.com


Big money in SEO, I had an acquaintance all the way back in the early 2000s that had tens of thousands of domains that he ran experiments on to reverse engineer how the algorithm worked. He also had tens of thousands he ran SEO/link networks on. He made a lot of money for a long time by being front page for a lot of terms.

Same thing is happening today, there are just more of these actors doing it. They just game the algorithm for terms related to products. Notice you still get decent serps in Google for terms that don’t relate to something that can be sold using an affiliate link.

Fairly easy problem to fix, but Google would have to hire a black hat to help solve it. But the good ones ain’t gonna work there.


Google is so spammy I now instinctively use other search methods at times. Which is very interesting, because doing so is high friction. But it's so spammy out there that pain(spam) > pain(friction).

It is not to be "un-Google" but because I get better results.

For example searching in a good subreddit can be more fruitful, giving answers from genuine people in moderated parts of the internet. If you get crap then try another subreddit - some mods are better than others.

Is this a business opportunity - I think so, although I have no idea how you would go about it. Maybe a decent search engine for programmers would be a good start! E.g. "Exception Message XYZ" + site with decent answers.


In 2013 I elaborated on this topic: http://blog.databigbang.com/letters-from-the-future-challeng... I would add that in 2021 we can easily do Natural Language Understanding (NLU) and Natural Language Generation (NLG) and can build zillions of web pages that don't follow the original page-ranking concept of Google. Probably important sites share fewer low-rank pages, and there are many more link rings and clusters. Decentralized blogs seem a thing of the past (expecting to be rebooted in the future).


Could this issue be related to Gmail's spam filtering? For approximately 2 years now it's been downright porous. I'm getting on average 1 obvious spam message in my inbox that is something like:

c0nGrats-You_HaVe_Won_ThE_Pr1ze!

..Or some silly variation of this that takes literally 0.1 ms for a human to discern that it's spam. Yet something happened to Gmail's spam algorithm in the last couple years that has been consistently letting these through. To be fair, it does catch most spam but it's only batting something like 75% and the spam it does catch is often times much less obvious to human eyes than the stuff it lets through.


A problem is that good SEO doesn’t meant good quality. And assessing quality is hard, so people lean on other people to assess quality (either by appending something like “Reddit” to their search queries or asking friends irl or on twitter/discord).

I wrote a bit more about search engines competition and problems/opportunities here - How Alternative Search Engines Can Win Users https://konaraddi.com/writing/2021/2021-08-05-on-search-comp...


People don't want to search anymore. they want to see well-categorized data. For example, instead of searching for cheap vacuum cleaners, I think they want a site that lists vendors that sell cheap vacuums.


I for one am optimistic what a "post-search" world might look like. Maybe a lot like the early web. I don't think affiliate links themselves are necessarily the problem. I'll gladly use a link from a high-quality reviewer to give them a little kickback. SEO seems to be the issue. Maybe we end up with trusted brands for reviewing specific things. For example, I trust outdoorgearlab.com for pretty much anything camping related, and no purchase comes to mind that I've regretted yet.


One example: the website called gitmemory, which crawls GitHub data regularly and has better SEO than GitHub, so you will usually find its results above the original GitHub links.


Hey, smart people: It's called CURATION BY HUMANS.


I'm not very well versed in SEO but isn't this just good old Goodhart's Law?

Come up with criteria to determine which websites are "better quality". Measure them, rank them, put the ones that fit the criteria best at the top.

On the other side, there's the people promoting their websites. Do what you can to get as close to Google's ideal as possible through whatever means. Profit.

At this point the criteria becomes useless for any real quality analysis.


Affiliate links are environmental toxic waste and it would be only logical to tax such affiliate payments to fund cleanup and mitigation efforts.


Who would write good or even decent content for free?


I guess Paul's definition of "beating Google" is "creating a startup without a clear revenue path, aiming to be acquired by Google or a competitor", as I can't think of any meaningful way a niche search engine would provide a good enough value proposition against existing Google competitors or embeddable search engines (as well as SaaS like Algolia).


I just switched to Duck Duck Go yesterday and was not impressed. When I searched "define hot take" Google gives me a canonical, prominent definition with bold-font title and button to hear the pronunciation. DDG gives me multiple definitions in a normal search results page with none prominent, and no way to hear the pronunciation.

I'll be switching back to Google


Key difference you seem to miss: Google doesn't want you to leave the search engine (unless it's ads), so whether you look for a translation, sport results, weather or definitions they hoard content and shove it on their page. I agree it's often convenient, but realize Google does more than provide links.


Good idea, Paul; I had a similar one, but there's no way you could do it manually. Machine-learning algorithms need to detect spam, not people, because otherwise the search engine can't scale. If people were marking what's good content and what's not, such a search engine would be reduced to content curation, not organic search and discovery.


A lot of the spam results just seem to be copy pasted content.

I wonder how difficult it is to compare the main body of text in search results, then, if it is over a 95% match with another site (i.e. it has been copy-pasted), demote it in the search results. If a site generates too many of these demotions then it gets blacklisted from the index.


I have experimented with using LSH (Locality-Sensitive Hashing) to identify similar documents, among 50k documents in total.

My LSH implementation is here: https://github.com/loda-lang/loda-rust/blob/develop/script/t...

Example of the 100 most similar documents: https://github.com/neoneye/loda-identify-similar-programs/bl...

There can be false positives, so after LSH then do a more in-depth comparison.
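For anyone curious, here is the same idea as a minimal sketch using the datasketch library (the 5-word shingles and 0.8 threshold are arbitrary choices):

    # Shingle the page text, MinHash it, and use LSH to pull candidate
    # near-duplicates cheaply; verify candidates with a full comparison.
    from datasketch import MinHash, MinHashLSH

    def minhash(text, k=5, num_perm=128):
        tokens = text.split()
        m = MinHash(num_perm=num_perm)
        for i in range(max(len(tokens) - k + 1, 1)):
            m.update(" ".join(tokens[i:i + k]).encode("utf8"))
        return m

    original = "how to make a white sauce melt butter add flour whisk in milk"
    copy = "how to make a white sauce melt butter add flour whisk in milk slowly"

    lsh = MinHashLSH(threshold=0.8, num_perm=128)  # ~80% Jaccard similarity
    lsh.insert("original", minhash(original))
    print(lsh.query(minhash(copy)))  # candidate dupes; may be false positives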


How would you avoid throwing the original site out with the bathwater?


Maybe try to timestamp the page; presumably the earliest page is the original source. Could also combine it with a site reputation rating or something similar.


I believe Google tracks click-throughs from search results pages, which in theory should provide plenty of insight into which links aren't really working for specific keywords and which are, thus helping improve or reduce the rankings of SEO-laden sites.

Wonder if someone can shed light on why this isn't effective.


> Maybe ultimately you open up spam fighting to your users. If you managed this well, you could harness a lot of energy.

Doesn't Google already consider that if a user returns to the results page (or clicks a second link) then the first link visited was not satisfactory. Seems like a pretty elegant solution.


That's to Google's benefit though, they get another chance to present some adverts to the user.


It'd be really interesting if Google allowed upvote/downvote on search results... but it'd be super hard to imagine them ever taking the votes into account much versus ad revenue.

And the upvote/downvote would be very tricky to implement in a way that the SEO crowd couldn't just game it horribly.


Clicking a result is essentially an upvote.

Immediately returning to the results page is essentially a downvote.

You can't really crowdsource this stuff, because the problem of brigading and other forms of abuse is way too high. Just imagine what the crowdsourced results for "trump won" or "trump lost" would look like, or hydroxychloroquine, or ivermectin, or to go with some older cults of personality, Hitler or Ataturk.
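For what it's worth, the implicit signal is easy to state; here is a toy version (the 30-second dwell cutoff is an arbitrary assumption):

    # Implicit vote from click logs: a quick "pogo-stick" back to the
    # results page counts against the result, a long stay counts for it.
    DWELL_CUTOFF_S = 30.0

    def implicit_vote(clicked, dwell_seconds, returned_to_serp):
        if not clicked:
            return 0
        if returned_to_serp and dwell_seconds < DWELL_CUTOFF_S:
            return -1  # bounced straight back
        return 1       # clicked and stayed

Even this is noisy: a good page can answer the question in five seconds, and as other commenters note, people reflexively mash the back button.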


I have wondered about this.

When I run web sites I frequently look at the log and find a large fraction of the traffic is from search engines. This is a problem because it costs me money to serve that traffic. It might not be initially obvious but it costs more than serving real users because the search engines will scan everything and break the cache.

Google sends a significant amount of traffic. Bing sends a detectable amount of traffic. Baidu's crawler might be more active than the two of those together but I never get hits from Baidu. Other crawlers deliver me trouble instead of value: even if I'm not interested in hosting pirate or plagiarized content, a crawler that is looking for trouble is only going to bring me trouble.

I hate doing it but I turn off crawlers other than Google and Bing both at the robots.txt and web server level because I just can't afford to serve Baidu queries.
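Concretely, the robots.txt side looks something like this; well-behaved bots honour it, while the troublesome ones need blocking at the web-server level too:

    # Admit only the crawlers that send traffic; an empty Disallow
    # means "allow everything" for that agent.
    User-agent: Googlebot
    Disallow:

    User-agent: Bingbot
    Disallow:

    User-agent: *
    Disallow: /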

I'd like to sign an exclusivity contract with a search engine such that they get exclusive access to crawl it and in turn I get a privileged position in search results. This would give the search engine and myself an incentive to deliver end-to-end quality results.


We still need a search engine that actually blacklists everything serving ads. Google beat altavista, now we need to beat google.

I mean no mincing about- recipe sites that are ads are blocked. Results with pixel tracker etc are blocked. Hell, results that are paywalled are blocked because they're useless.


Google's ranking algorithm shapes the web.

And the web now looks like a 1500-2000 word listicle with 3 images, because that is what the ranking algorithm favours.

If you find the info you need and leave quickly, that actually down-ranks the page. That is idiotic. Pages that give you what you want quickly are punished!


I think it will be hard to create a great search engine while the web works as it does today. Maybe there could be something like a sitemap, but for text content: content structured, indexed, and signed by a trusted party in a way that makes it easy to analyze for plagiarism and so on.


For developers, you can remove some spam websites from Google and other search engines, with these uBlock filters: https://github.com/quenhus/uBlock-Origin-dev-filter


Most of their complaints are not related to Google Ads, which means the poor results are not there because of profit motives.

Moreover, they mostly concern a specific type of search query, likely a result of broad-based ranking algorithms that are, loosely, the most efficient ranking system.


Google search results are garbage, at least from a developer's perspective.

Most of the results are poorly formatted content "gathered" from stackoverflow, github, quora, etc.

And from a "person who wants to see an image" perspective, Google is purely a gateway to Pinterest or Gettyimages.


How about a platform for curation. Curators who know a subject well can link to content that looks good to them. Search goes through the curators, people can favorite certain curators. Lots of people like to curate. This is a better idea than trying to go after spam.


301 Battlefield moved: curator spam.


Amazon’s search results and scammy third party sellers are a similar trust problem. When possible I try to purchase directly from the product manufacturer’s website. Similarly, I don’t search Google for product reviews, I go directly to trustworthy review websites.


Improve your search engine results with this one weird trick!

Just block any domain containing the word pinterest


Another thing that does not help is how some sites gate content from being scraped. Also, forums are not as popular today, again reducing the amount of indexable content. Consider how some sites have migrated from forums to something like Discord.


Good. In fact, if we want people to visit websites other than google.com (and then read the answer in the snippet or the box in the sidebar) then it's good that google results are crap. Use google less.


Is there a search engine for programming? One that not only searches stackoverflow, github, relevant subreddits and the other big sites, but also finds programming articles in personal blogs?

That would be valuable to me.


Here is a handy list of alternatives to google search

https://fabform.io/a/alternative-search-engines


If anyone has the skills to work on something like this please email me (email address in my profile.)

I can show you a demo. Just to show I am not screwing around: if you don't like the demo I will pay you $500.


What are the pros and cons of a user generated tagging system? If you have a community of dedicated individuals who maintain a group of tags, searching those tags should yield high quality results.


> And boy would Google find it hard to follow you down that road.

This is a good perspective. Where can Google not go? Places that don't lead to profit. They will try (cough Wave cough) but will give up.


But isn't profit the point of any company? If you're going down a path that doesn't lead to profit, you'll fail whether Google follows you or not.


No. That's why Non-Profits, 501(c) corps in the US, exist.

E.g., Linus wasn't looking for profit, and Linux ate the world.


> Lots of people want to be amateur police. And boy would Google find it hard to follow you down that road.

Kinda like they tried with YouTube Heroes?

But then, who’s to say you won’t get the same kind of backlash?


Ah, so everyone wanted to move from carefully crafted personal websites, where every detail counts and low-effort publications are harshly punished, to platforms with guaranteed readership, and now we have a curation problem?

Someone (who probably doesn't have a website) said that comment moderation on your own website is too much work. Perhaps the whole internet is too much work?

But I like the spam search engine, by and for spammers, as a way of finding the latest and greatest affiliate marketing and blockchain swindles.


Several people mention DuckDuckGo in that Twitter thread. I use DuckDuckGo for my main search engine, and it's not obviously any better than Google regarding SEO spam.


I don't think they apply much spam fighting to the results they get from the underlying search index (Bing), but I could be wrong.


If you were to start a search engine, what stack would you use?


Typesense looks easy to use.

But with 3x memory needed for the indexes, the server costs probably aren't going to be bootstrap'able.

Especially for a "small" crawl of a billion web pages, even at just 10 KB per page.


Eh, I run a 100M-index off consumer hardware in my living room. Very doable if you avoid bloated off the shelf solutions.


What search software do you run?

What sort of memory and space do you have on the single server?

What's the average document size that you index?

Genuinely curious on how doable a modern search engine is on modern hardware.


I run all custom software; I feel most off-the-shelf solutions aren't very resource-effective.

The server has 128 GB RAM and the index currently fits on a single 1 TB SSD plus a 480 GB Optane drive.

I find the average document clocks in at 7 KB, in terms of raw HTML. In the index that's, dunno, probably less than 1 KB/doc.


Google CSEs, Bing API, or our own YaCY instances are more or less what we do atm for various topic search engines as we bootstrap Breeze.


www.neeva.com and www.kagi.com: two privacy-oriented search engines with results and features surpassing Google's (did I mention that they're ad-free?)


Okay, given that we have pretty successful examples of Wikipedia as general crowdsourced information storage and Stack Overflow as a specialized-domain crowdsourced Q&A site, would it be impossible to build a crowdsourced search engine? Not even scraping the web: I would just type my search term, and if it had already been searched and the results voted on, I would see those. If it was a completely new search term, I would get no immediate results, but my search would be displayed on a "new searches" page, which some volunteers would be following and trying to add relevant results to.


“You might need to do a lot of manual spam fighting initially“

How would this be limited to "initially"? Wouldn't it be a lot, initially, and then only get worse?


... and the moment you gain some traction, the SEO monster will train its eye on you like Sauron; and without a billion-dollar budget, you will be toast.


It would work until you got big enough, then you'd end up following the same path as Google, as that's where the money is.


> Lots of people want to be amateur police. (pg)

This is very true. How many times have I clicked on a site met with ads so bad that the browser slows down, and after 10 seconds the page gets covered up by more and more crap and then a paywall shows up sometimes too. Now here's the thing - a competitor to Google might detect you clicking back and then pop-up a special set of controls near the search result that lets you say: "too many ads" or "paywall".

However, if such an engine were to start beating Google, I'm sure Google would implement it in their own way: automatically detect why you clicked back in such a short timespan.


> However, if such an engine were to start beating Google, I'm sure Google would implement it in their own way: automatically detect why you clicked back in such a short timespan.

Do you seriously believe that Google doesn't use that as a datapoint already?


Google already detects immediate returns and knocks that link off for you. What's problematic for me is that I tend to reflexively mash back and forward, and the link just disappears on me.


Sooner or later you’ll have to deal with ballot-stuffing - companies trying to bury their competitors by casting lots of negative votes.

Perhaps ML will help in detecting such campaigns.


The issue is and will always be monetizing. Anyone competing with Google will need to have a robust monetizing strategy to survive.


When the search engine is funded by ads, there is incentive to produce results that people who click ads like.


One approach would be to have moderators from the community who are allowed to make decisions about results.


> What would a paid version of Google Search results look like - where Google can just try to give me the best possible results and not be worried about generating revenue?

God please no. YouTube premium shows what Google would do, i.e., they would further ruin the free experience by ramping up the amount of ads you see to "incentivize" the premium search.


Premium offerings like that are amazing simply for the fact that you can 'return' to the days where you obtained services by paying for them directly, not by looking at ads and paying with your mindshare. Google and YT aren't free services, and it's a miracle they continue to be accessible with ad blockers enabled.


Orthogonal to my point. Deliberately worsening the free experience after you introduced a premium service is a dark pattern for UX.


Eventually search will become a decentralized activity (No, not a web3/crypto/coin type decentralization, I am talking about the useful type).

Is there any particular reason why internet search has to have a distorting gatekeeper to the global commons (one that pretends to play Maxwell's demon)? For chrissake, the stuff being indexed is public.


>Eventually search will become a decentralized activity (No, not a web3/crypto/coin type decentralization, I am talking about the useful type).

People care about UX, not about technology; remember that, unless people are willing to sacrifice good UX in order to have greater security and privacy. These things are tricky and there is no right formula.


Not all that Google indexes is public: paywalls/loginwalls allow Google's IPs to crawl information unhindered, but as a user you cannot, so a new search engine would have to get to a scale popular enough for others to open up. Quick example: Google can index many news sites, or LinkedIn profiles, that a regular user with no account cannot.


That's true, but it's probably something that can be tackled later, and in any case it would not be a show-stopper for creating a valuable alternative. (There are similar thorny issues around IP, e.g. for news sites.)


His suggestion is basically to become DMOZ.org, if you are old enough to remember it.


As I pointed out to paulg yesterday, this was exactly the business model/concept that Blekko was created to address. The idea being that one could use "slashtags" to curate web sites that were "good" on a topic (not spammy) and pull results from those rather than the general web. Guess what? It works great! Also, it doesn't make enough money to support the company using advertising.

For a couple of years, Blekko ran a "3 card monte" game where we white listed the results from Google, Bing, and our own index. For every "contested" query, Blekko consistently beat the others by a significant margin. If the query wasn't contested, Bing and Google did about the same, and if the query was obscure, typically Google did better than Bing or Blekko.

What is a "contested" query? That is one where there is a lot of money on the line. My favorite one was "best credit card" (which is search engine shorthand for "What is the best credit card?" because the stop words "What", "is", and "the" are removed).

Why is it contested? Because if you put an advertisement into the results of that query, and the person making it clicked on that link and signed up for a credit card, you could be paid $50 or more. For a single click. Other queries that advertisers would pay well for getting the traffic of the user were, car dealerships, hotel chains, jewelry retailers, and university "referral" services (like the one that was busted for getting people into Ivy League schools by faking academic records).

Extremely few people click on an ad put onto a page of search results for the query "what is shoe rubber made of?"[1]. However it is required to serve queries like that so that people will come back when they are looking to spend money on something.

So using the same exact idea that Paul proposed Blekko built an English language index which allowed you to curate the crap out of your search results and return much better data. The "value" of that was not considered to be high enough to insist on people logging in to use the engine. Knowing an id for the person making the query allowed for user specific blacklists of spammers (so if for example you never wanted to see a Pinterest link in your results you could make that happen).

Without sufficient traffic, feedback-loop ranking algorithms of the form "of these documents, which one was clicked as the 'best' answer?" fail to converge rapidly enough for decent ranking.
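
For illustration, here is roughly what that feedback loop looks like with Bayesian-style smoothing (all names and numbers are made up); the point is that low-traffic documents barely move away from the prior, which is the convergence problem:

  def smoothed_ctr(clicks: int, impressions: int,
                   prior_ctr: float = 0.1, prior_weight: int = 100) -> float:
      # With few impressions, the estimate stays pinned near the prior.
      return (clicks + prior_ctr * prior_weight) / (impressions + prior_weight)

  docs = {"doc_a": (3, 20), "doc_b": (40, 500), "doc_c": (0, 2)}
  ranked = sorted(docs, key=lambda d: smoothed_ctr(*docs[d]), reverse=True)
  # doc_c has zero clicks but only 2 impressions, so it still scores
  # near the prior: you need traffic before the click signal means anything.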

Without a credible threat that exclusion from your index will greatly reduce a site's traffic, it is difficult to negotiate with web sites to permit crawling rather than deny your crawler via their robots.txt file.
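
That gate is literally a few lines of code on the crawler's side, checked here with Python's standard urllib.robotparser (example.com and the bot names are placeholders):

  from urllib.robotparser import RobotFileParser

  rp = RobotFileParser()
  rp.set_url("https://example.com/robots.txt")
  rp.read()  # fetch and parse the site's robots.txt

  # An incumbent's bot is often allowed; an unknown upstart may not be.
  print(rp.can_fetch("Googlebot", "https://example.com/article"))
  print(rp.can_fetch("UpstartSearchBot", "https://example.com/article"))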

Blekko's best customers and most ardent fans? Reference librarians. Yup, people who needed web search to do their jobs, not to find the movie times for the latest feature. Blekko never did try to create a subscription service, but I think such a service, priced somewhere between free and the $$$ of LexisNexis, has a shot, at least as a lifestyle business. You still need to get rights to the data, and that gets harder and harder.

[1] Okay, bots do, but humans don't


We are working on exactly this problem.

If anyone wants to see a demo, please email me.


Why not Searx or YaCy?


How will these search engines interoperate?


I also consider paywalls to be spam. Clicking on a link and finding out that it is paywalled is a massive waste of time.


Agreed, that is annoying, but if you start excluding such results, then how does one find that type of content?


You clearly label paywalled content with a symbol or image.


I think the only issue there might be that Google may be unaware that content is paywalled, given how many sites allow crawlers access to content but not users (based on crawler IP ranges). Agreed, such a flag would save time when available, as would a search filter option to skip those results.
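
One rough heuristic a search engine could use to detect this, sketched under the assumption that the site keys on User-Agent (as noted, many sites key on crawler IP ranges instead, which this would miss):

  import urllib.request

  def fetch(url: str, user_agent: str) -> str:
      req = urllib.request.Request(url, headers={"User-Agent": user_agent})
      with urllib.request.urlopen(req, timeout=10) as resp:
          return resp.read().decode("utf-8", errors="replace")

  def looks_paywalled(url: str) -> bool:
      as_crawler = fetch(url, "ExampleCrawler/1.0")  # hypothetical bot UA
      as_user = fetch(url, "Mozilla/5.0")            # ordinary browser UA
      # If anonymous visitors get far less content than the crawler did,
      # flag the result with a paywall symbol in the UI.
      return len(as_user) < 0.5 * len(as_crawler)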


How about Bing?

Is it viable competition?


Do Google search engineers use ad blockers?


Ranking search results on popularity is flawed. It may improve search engine performance and the effectiveness of online advertising but it penalises users who can think critically and independently. There is an underserved market that has been left behind by Google and its "competitors".

PageRank seemed to borrow from the concept of citation count: the idea that "importance" could be measured by the number of times a webpage, like a paper published in a peer-reviewed academic journal, was referenced by other webpages, like other papers published in peer-reviewed academic journals. The initial name for the project, before "Google", was "Backrub", referring to the reliance on "backlinks" to quantify importance.
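
To make the citation-count analogy concrete, here is a minimal PageRank power-iteration sketch over a toy three-page web (illustrative only, not Google's production algorithm):

  def pagerank(links: dict[str, list[str]], damping: float = 0.85,
               iterations: int = 50) -> dict[str, float]:
      pages = list(links)
      rank = {p: 1.0 / len(pages) for p in pages}
      for _ in range(iterations):
          # Each page passes its score to its outlinks, split evenly,
          # damped toward a uniform prior.
          new_rank = {p: (1 - damping) / len(pages) for p in pages}
          for page, outlinks in links.items():
              for target in outlinks:
                  new_rank[target] += damping * rank[page] / len(outlinks)
          rank = new_rank
      return rank

  toy_web = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
  print(pagerank(toy_web))  # "c" has the most in-links and ranks highest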

An index of a commercially-oriented www full of sites supported by online advertising is nothing like Web of Science or some other database collection that allows ranking by citation count. The www has no peer-review and no limits on commercial activity.

Google succeeded in creating something highly profitable and sometimes useful, but the founders never delivered on their original promise. That was a search engine in the academic realm, where the technical details were public, and one that would be free from the influence of advertising.^1 Instead the project was turned into an online advertising business. A 180-degree pivot.

The moral/ethical debate went from the question of being advertising-supported to the question of invading the personal privacy of users for the benefit of advertising. Whatever ideas the founders held in 1998 regarding the influence of advertising on web search were overtaken by the lure of pure financial success. Once opposed to the idea of using cookies for advertising purposes, the founders were persuaded to purchase DoubleClick, a company with a terrible privacy record that uses cookies and purchasing data to profile users as ad targets,^2 for almost double what they paid for YouTube. Not sure what, if any, moral/ethical debate remains today. While the company is being sued simultaneously by hundreds of plaintiffs, including the US government, one of the founders is "hiding out" on a small island in the South Pacific. Whatever motivations he had to make an open, academic search engine free from the influence of advertising, they seem to be gone.

In sum, the world still needs a decent web search engine free from the influence of online advertising.

1. https://infolab.stanford.edu/~backrub/google.html

Excerpts:

"Up until now most search engine development has gone on at companies with little publication of technical details. This causes search engine technology to remain largely a black art and to be advertising oriented (see Appendix A). With Google, we have a strong goal to push more development and understanding into the academic realm.

Appendix A: Advertising and Mixed Motives

Currently, the predominant business model for commercial search engines is advertising. The goals of the advertising business model do not always correspond to providing quality search to users.

For this type of reason and historical experience with other media [Bagdikian 83], we expect that advertising funded search engines will be inherently biased towards the advertisers and away from the needs of the consumers.

Furthermore, advertising income often provides an incentive to provide poor quality search results.

[T]here will always be money from advertisers who want a customer to switch products, or have something that is genuinely new. But we believe the issue of advertising causes enough mixed incentives that it is crucial to have a competitive search engine that is transparent and in the academic realm."

2. https://www.nytimes.com/2000/02/17/technology/us-investigati...

https://slate.com/technology/2005/11/why-web-surfers-love-to...


What a great idea.


I'm honestly not trying to take a potshot against PG or YC here, but it's kinda funny to see him saying this after I worked for a YC-backed startup years ago that built its core revenue streams around generating SEO spam; we just marketed it as something else. Just to be clear, I don't think PG or YC are responsible for all or even most SEO spam, but I know firsthand that they've profited from it through at least one of their incubated companies.

I never considered the possibility that an incubator would support a specific product, then later on call for alternatives that would essentially freeze out the original product that they supported. I'm sure this very rarely happens, but it's interesting to see a real-world example in action.


I don't think it's as counterintuitive as it sounds - just because they're playing the game doesn't mean that they think it's right. If your options are to not be successful at SEO or to do the SEO spam thing, I don't think it's necessarily wrong to do the latter - it's your job to make your startup successful, not to make a stand against the way Google does things.

I view it as something like the rich folks who call for additional taxation of the rich. They're not going to just pay extra money that they don't have to under the current tax rules, both because it's not particularly fair and because one person paying extra taxes, even if they're very wealthy, isn't going to make a big impact. That doesn't mean they can't lobby to change the rules and be totally fine with it if everyone is paying additional taxes.


Don't know how rarely it happens. After all, weapons dealers frequently arm both sides of a conflict. They don't really need to care who wins - they're just making twice the amount of money.



