mnoGoSearch is a full-featured SQL based web search engine. mnoGoSearch consists of two parts. The first part is an indexing mechanism (indexer). The indexer walks over html hypertext references and stores found words and new references into a database. The second part is a web CGI front-end to provide search using data collected by the indexer. A PHP and a Perl front-ends are also available from our site http://www.mnogosearch.org/.
mnoGoSearch first release took place in November 1998. The search engine was named UDMSearch until the project was acquired by Lavtech.Com Corp. in October 2000 and its name changed to mnoGoSearch.
The latest changelog can be found on our website.
Main mnoGoSearch features are as follow:
MySQL (libz library required), PostgreSQL, SQLite, iODBC, unixODBC, EasySoft ODBC-ODBC bridge, Mimer, Virtuoso, Interbase, Oracle (see the Section called Oracle notes in Chapter 5), MS SQL, DB2 (see the Section called IBM DB2 notes in Chapter 5), Sybase, InterSystems Cache back-ends support.
HTTP support.
HTTP proxy support.
HTTPS support.
FTP support.
NNTP support (both news:// and nntp:// URL schemes).
HTDB virtual URL scheme support. One may build an index and search through the big text fields/blobs of an SQL database.
Mirroring features.
text/html, text/xml and text/plain built-in support.
External parsers support for other document types.
Searching all of the word forms using ispell affixes and dictionaries
Basic authorization support. One may index password protected Intranet HTTP servers.
Proxy authorization support.
Reentry capability. One may use several indexing and searching processes at the same time even on the same database. Multi-threaded indexing support.
Stop-list support.
<META NAME="robots" content="..."> and robots.txt support.
C language CGI, PHP, Perl web front-ends.
Phrase search.
Boolean query language support.
Results sorting by relevancy and popularity rank.
Fuzzy search: word forms (stemming), synonyms, substrings, dehyphenation, accent insensitive search.
Best matching documents are found first.
Various character sets support
HTML templates to easily customize search results.
Advanced search options like time limits, category limits etc.
Phrases segmenting for Chinese and Japanese languages.
Smart excerpts.