SimpleScan Software, Inc.
Provides powerful, cost-effective, enterprise-wide document management software.
http://www.simplescan.com/
The Combine Harvesting Robot
Combine is an open system for harvesting and threshing (indexing) Internet resources.
http://www.lub.lu.se/combine/
ClusterClick
Full-text indexing of desktop documents for researchers, journalists, and historians, with a low indexing overhead of 13 percent beyond document space. Also displays the most important words from each document. [Windows 95/98]
http://www.clusterclick.com/
Cheshire II Project Home Page
Cheshire II is a "Next-Generation Online Catalog and Full-Text Information Retrieval System." It features advanced IR techniques, including support for Boolean and probabilistic "best match" ranked searching, SGML/XML as the primary database format, and a client/server architecture that uses the Z39.50 Information Retrieval Protocol.
http://cheshire.berkeley.edu/
ht://Dig
A complete World Wide Web indexing and searching system for a small domain or intranet. (C++) [GNU/Linux, Unix]
http://www.htdig.org/
Dieselpoint, Inc.
Search software written in 100% Java (J2EE) with parametric, natural language, and full-text search capabilities.
http://www.dieselpoint.com/
IB Search Engine
High-speed, fully featured, multilingual, fielded full-text engine. Available for many platforms, including Solaris, BSD, Linux, and Windows NT.
http://www.bsn.com/Z39.50
H5 Technologies
Develops enterprise software that intelligently processes text-based information using automated information indexing and tagging.
http://www.h5technologies.com/
Onix Full-Text Indexing and Retrieval Toolkit
Toolkit (SDK) for adding full-text indexing and searching capabilities to applications. Ported to a wide range of platforms and highly scalable. Designed for use in both large and small scale systems. Free evaluation download.
http://www.lextek.com/onix/
Ultraseek Server
The search tools Infoseek uses on its own site, offered for sale. Demo version available for download.
http://software.infoseek.com/products/ultr...
Megaputer Intelligence
Data, text, and web mining software. PolyAnalyst includes in-place mining and strong Microsoft integration.
http://www.megaputer.com
Building Task-Specific Interfaces to High Volume Conversational Data
A scholarly paper by Loren G. Terveen, William C. Hill, Brian Amento, David McDonald, and Josh Creter.
http://www.acm.org/sigchi/chi97/proceeding...
Zebra Z39.50 Search Engine
Zebra is a full-text and free-text indexing and retrieval system that conforms to the ANSI standard Z39.50. It is well suited to indexing and searching highly structured data such as MARC and GILS records. The Zebra server is freely available for noncommercial applications.
http://www.indexdata.dk/zebra
SearchExpress
Provides document scanning, optical character recognition and full-text searching.
http://www.searchexpress.com/
Web Search Engine Software
Create and maintain a search engine using a Perl script and a database management tool.
http://www.web-search.com/websoft.html
Thunderstone
Provides SQL-based relational full-text retrieval, dynamic publishing, object management, and web-indexing software.
http://www.thunderstone.com/
Glimpse
A Unix-based indexing and query system. It is good for indexing relatively small amounts of data. Different types of indexes allow you to trade off search speed for index size. It is the default search engine used in Harvest.
http://glimpse.cs.arizona.edu/
dtSearch
Searches all popular file types, with features including hit highlighting and natural language, fuzzy, phonic, Boolean, proximity, field, and numeric range searching.
http://www.dtsearch.com/
Managing Gigabytes
Information about, and selected sections of, a book on indexing and compression techniques for documents and images. Also provides information about the open source IR system released with the book.
http://www.cs.mu.oz.au/mg/
FreshStart
Bankruptcy software in WordPerfect and MS Word for legal professionals. Menu-driven data input and automatic form compilation in official typeset format for Chapters 7, 9, 11, 12, and 13. Electronic filing (ECF) compatible.
http://www.freshstart.com/
Lucene Search Engine
Jakarta Lucene is a full-featured text search engine written entirely in Java. It is an open source project available for free download from Apache Jakarta. The project's current goals are primarily to serve applications and also to provide a platform for research.
http://jakarta.apache.org/lucene/
CCR WinOcular
CCR, a software developer and integrator, specializes in document imaging, COLD report management, workflow software and Kodak scanner products for corporations, small business, all government entities, and schools.
http://www.winocular.com/
Simple Web Indexing System for Humans
SWISH-Enhanced is a fast, powerful, flexible, free, and easy-to-use system for indexing collections of Web pages or other text files.
http://swish-e.org/
MicroISIS by UNESCO
Non-numerical information storage and retrieval software developed to allow institutions, especially in developing countries, to streamline their information processing activities.
http://portal.unesco.org/ci/ev.php?URL_ID=...
OpenText
Supplier of information retrieval and collaborative software.
http://www.opentext.com
Dataware Technologies
Vendor of BRS/Search, a text-based core search product, and of web-enabled products.
http://www.dataware.com
ISYS
Suite of search software products that find information in multiple file formats and languages. The site features product descriptions, an evaluation version download, a company profile, and contact information.
http://www.isys.com.au/
KE Software Inc.
Developer of the KE Texpress database system, KE Texhtml WWW module, KE EMu (electronic museum management) and LifeData (vital statistics management).
http://www.kesoftware.com/
Isearch
Software for indexing and searching text documents, with full-text and field-based search, relevance-ranked results, Boolean queries, and support for heterogeneous databases. Supports document types such as HTML, SGML, mail folders, and USMARC.
http://www.etymon.com/Isearch/