To get Wikipedia Miner up and running, you have to download and host a huge amount of information. We've created the services below to make things a bit more convenient.


The Search service allows you to treat Wikipedia as a gigantic thesaurus, for describing everything from nanotechnology to Barbie dolls. The Wikipedia articles this service locates provide a wide array of useful linguistic information, including definitions, synonyms, translations, and related topics. The search vocabulary is extensive (5 million or more terms and phrases), and encodes both synonymy and polysemy.


The Compare service allows you to compare terms and concepts to measure how strongly they relate to each other. From this you can tell that nanotechnology doesn't have much to do with Barbie dolls, but that it does have a lot in common with engineering.

The details of how this works (and an evaluation) can be found in this paper:


The Wikify service automatically augments either snippets of text or entire web pages with links to relevant Wikipedia topics. It doesn't just use Wikipedia as a source of information to link to, but also as training data for how best to do it. In other words, it has been trained to make the same decisions as the people who edit Wikipedia.

This paper describes how the wikifier was implemented and evaluated:

Note: All of the services above are machine-readable.

They can be made to return XML by appending &xml to the request.

Feel free to point a bot or a service here (via POST or GET, it doesn't matter). Bear in mind that we may restrict access if usage becomes excessive. You can always run these services yourself by installing your own version of Wikipedia Miner.