A directory just wouldn’t be a directory without categories and links. Most directories are based on Dmoz’s category structure and it’s usually the first place directory owners refer to when gathering categories and even links.
There are Dmoz extraction mods and software out there such as Nicecoder’s Dmoz Extractor which will extract the categories and links directory from Dmoz. So what you end up with is a directory that is a small clone of a portion of Dmoz. I highly recommend against using duplicate info from Dmoz. Using the exact categories is ok but not the links with descriptions written by Dmoz category editors.
It is so common to find directory scripts that will pull this info directly from Dmoz without one change being made. If you want your directory to last and not look like a bunch of duplicate content then you should spider each link for new titles and descriptions.
There is one piece of software out that that will do this for you and it was an absolute godsend when I stumbled upon it. It extracts data from Dmoz but after all the categories and links have been extracted it goes one step further and spiders each URL for a new title, description, keywords and email address. Yes, it goes to each site in your Dmoz extraction and pulls the Meta info and email address (if it can find one) and creates a lovely new database of unique content ready for importing.
I use this software in conjunction with Indexu to build directories fast. The reason I use Indexu is that it has an import feature in the admin that allows you to import categories and links from the file types that this Dmoz Extractor exports in.


Recent Comments