Behold the Mighty Pterodactyl: Meta Search Lives!

by

While researching this feature on Infospace’s recent contributions to the metasearch space, I decided to reacquaint myself with Metacrawler’s advanced features. Not a bad idea considering how things have changed. The current media obsession with corporate wrong-doing has found its way into the search field. If you believe everything you read, you’d probably think that there is absolutely no way to use a cool Internet search tool without being bowled over with paid advertising. Not so.

For example, I tried a search on queries like “paleontology” and “pterodactyl” on a few different versions of Infospace’s metasearch. First, I used Webcrawler, which offers a version of Metacrawler results. The results I saw were mostly non-commercial in nature. In addition, Webcrawler has been relaunched as a “no banners, no buttons, no pop-ups” search site. The reduction in clutter is welcome, but some won’t be happy with the fact that some of the search results are sponsored listings not clearly demarcated from non-sponsored listings.

I tried the same search on Metacrawler and got a good overview of the topic. Results which appeared in all of the Inktomi, Google, and Teoma/Ask Jeeves indexes were ranked high on the page. There was only one sponsored result at the top – but of course this illustrates the fact that so much keyword inventory has still gone unsold to advertisers. Only one advertiser is paying for the keyword “pterodactyl,” and only on Overture. A company called edinos is paying a mere one penny per click for this ad! I’d guess they’re doing well.

Metacrawler can do even better than this for an advanced user. Because we’ve been lulled into the above-mentioned media obsession with corporate wrongdoing, it’s been easy to assume that Metacrawler doesn’t offer its old strengths – but it does. If you want, for example, to customize your metasearch so that only Google and Teoma results appear, you can easily adjust and save your advanced settings. Or if you want to query all major directories and free-to-submit spider indexes plus all major paid inclusion indexes, but leave out sponsored listings from Overture and FindWhat, you can do this too. For me, the Teoma plus Google search was satisfying. It was a bit like just using Google, but coverage was slightly broader. Whenever you put several major indexes together, you’re likely to get additional coverage, which is why metasearch can offer an advantage over using a single engine like Google.

There is no question that the metasearch sector has faced big challenges. For one, there are seemingly fewer major indexes to include in a search. Excite Search, formerly a staple of most metasearch engines’ results, went the way of the dodo after the bankruptcy of Excite@Home. The brand was bought by Infospace, which discontinued the already-dead-anyway Excite Index and replaced the results with Overture sponsored listings. Shortly thereafter, Excite Search was relaunched as a metasearch engine, using results generated from a custom version of Metacrawler.

Essentially, then, Excite Search is now a more commercialized variant on Infospace’s flagship Metacrawler. Of course this means metasearch engines like Metacrawler (and, er, Excite Search) no longer have Excite Search to include in their results. The same goes for numerous other formerly vibrant search engines like Infoseek.

For those of you who still don’t know what metasearch is (86% of consumers do not currently know what it is, but when it is explained to them, 84% find it “valuable”), it’s a way of searching several different search engines or databases and presenting the results in a convenient format. The world leader in metasearch is Infospace, which has long been proprietor of Metacrawler. It also owns Dogpile, Webcrawler, and Excite Search. All four properties offer metasearch, presented in different formats and under different brand identities.

One of the drawbacks of metasearch in the past couple of years has been an increasing tendency to stuff the results with sponsored listings, a trend which, confesses Richard Pelly, Infospace’s VP and General Manager, Search, contributed to the decline in public confidence in the search experience on Metacrawler and Dogpile. Although paid listings are still a significant part of Infospace’s overall mix, they’ve reversed course to some extent, making a conscious effort to reduce clutter and show advertising only where it’s relevant. Advanced searchers will never want to be buried under Dogpile’s avalanche of sponsored listings. But at least Infospace has thrown them a bone with a banner-free Webcrawler and advanced settings on Metacrawler.

York Baur, the company’s Executive VP of Wireline & Broadband Services, laments that the press “and Ralph Nader” have constructed an “artificial divide between a paid result and a ‘spidered’ result.” Infospace believes that consumers should see relevant results whether they are paid or non-paid.

The technology behind Metacrawler has been beefed up partly in order to address search queries on either side of the “divide,” leaving one to question how artificial this divide really is. The first thing Metacrawler does before returning results, explains Tasha Irvine, Infospace’s Product Manager, Search, is to decide whether a query is more likely to be commercial or non-commercial in nature based on large proprietary keyword lists. This determines the “mix” of paid versus non-paid results. If an inquiry is for “real estate in denver,” more Overture and FindWhat results will be shown. If it’s for “paleontology,” the number sponsored listings will be less, and they’ll be shown further down the page.

Much has been made of the technological breakthroughs and “under the hood” computing power that has allowed spidering engines like Inktomi and Google to outdo their contemporaries. Irvine makes it clear that Metacrawler’s lead over competing metasearch engines is also partly buttressed by high-powered technology. Speed is a huge issue for metasearch engines; querying multiple databases and returning aggregated results takes time. And when users press their browser’s “back” button, typically the same page needs to be generated and it may not be from a cache. Slow metasearch services, therefore, aren’t very user-friendly.

Contractual relationships also set Infospace’s metasearch engines apart from the competition. According to Infospace’s Baur, Infospace is the only metasearch provider that has a contractual arrangement with Google to include the Google index in its metasearch. “There have been questions about whether metasearch is legal,” says Baur. “Our view is, as with any grey area relating to Internet copyright, it depends on how you do it. We have signed agreements with several major search companies.” This special access not only means that Infospace’s metasearch is authorized; it also allows it to work faster and more reliably.

The decline of early-adopters’ interest in metasearch – and if that trend continued, its potential extinction – has has been fueled not only by an excess of paid results in the mix, but also by the erosion of the former raison-d’être for metasearch: the premise that a number of distinct, vibrant, non-paid web search indexes exist and that metasearch can “query them all” to save time and to help in comparisons. The dominance of Google has led many consumers to assume that Google is all they need; in some way, that Google is search much as eBay is auctions and Amazon is books. Few consumers do a “meta-book-store” search. Little is heard anymore about services like AuctionRover which used to query several auction sites at once. Could a similar phenomenon be happening in the search space now that Google is ascendant?

According to Irvine, Metacrawler has had to change constantly to reflect the state of the search industry, and it will continue to do so. In recent months, it has added FAST Search, changed its relationship with Ask Jeeves, and, reluctantly, removed Wisenut. “We have a number of tests that we require our search partners to pass,” says Irvine. “One criterion is that they can handle the high volume of queries we send them. Wisenut, when we first partnered with them, was a small company and got overwhelmed. Their systems couldn’t handle the load.” In short, Infospace is always re-evaluating the mix of search results that go into its metasearch.

When Google’s halo effect wears off, the current environment of “single engine dominance” may change again. Metasearch could catch on again as consumers realize that one search engine can’t do everything for them.

The re-emergence of metasearch depends in large part on search enthusiasts finding it useful again. Tens of thousands of university librarians, elementary school teachers, and other research experts have a big influence on the adoption of cool tools by students seeking information to write papers, and ultimately, to perform other types of job-related and lifestyle-related searching. Thus the importance of managing the delicate balance amongst different forms of paid and non-paid listings: experts won’t recommend a tool if it’s too blatantly commercialized.

Stories claiming a 19th-century sighting of the once-mighty pterodactyl were probably hoaxes. Metasearch, on the other hand, has merely been hibernating but remains very much alive. As long as metasearch product developers have the genetic code of a variety of ingenious and powerful search tools at their disposal, it seems inevitable that future generations of metasearch will fly high.

You may also like