Home | Blog | Wikitastic | Computers | Movies | Music | TV Shows | Code | Sitemap | Contact | Linking | Search
The historical background of my Search Engine CGI program.
Note: The rest of this overview section is a carryover from the prerelease stage of this project. I have kept it because I like it and I don't want to change it.
I am working on a search engine, it is a CGI program built using Perl, C++, and Java. I decided to build this search engine after using CGISRCH from AnalogX on my home network site, I won't go into details, but suffice it to say it didn't do what I wanted it to do. So following the old adage, "if you want it done right, do it yourself," and set out to build my very own search engine. Of course, building a search engine has already been done right by Google and they even sell a Google Appliance, but I really don't need to buy one of those and I love programming (also this will look good on an application to grad school or on my résumé).
Since Google did build a search engine the right way, I decided to use it as my primary example. My search engine will not have all the features of Google, but hopefully it will work similarly on a much smaller scale. Unlike CGISRCH, my search engine uses an index of pages to be more efficient. My search engine is also more customizable in its output style. My design for the search engine consists of four files that make up the program:
Those files have Windows file extensions because I am developing the search engine on Windows, but it will be ported to Linux and FreeBSD, where the only difference in the file names will be some extensions (HttpConnectorHelper.dll will be HttpConnector.so). The search engine will also consist of five HTML files that make up the customizable templates for the output:
The actual index files produced by the main program when it is indexing will be XML files, the search engine will contain a Document Type Declaration file called index.dtd which will allow you to analyze the indices and check for errors with Internet Explorer or an XML Validator.
Note: As with the overview section, the rest of this download section is old.
Since I have not completed this project, it is not yet available to be downloaded. When it is ready, it will be available in binary for Windows, Linux, and FreeBSD all on Intel x86 processors. I will also release the source code with an Apache/BSD style license.
However, if you would like to be ready to run it here is a list of what you need:
Also, if you would like to build the search engine from the source code, either to port it to your platform (maybe Mac OS X, IBM OS/2, or Solaris) or just to say that you can compile it, here is a list of what you will need:
Copyright © 2002-2009, Douglas Thrift. All Rights Reserved.