Arch is an open source extension of Apache Nutch (a popular, highly scalable general purpose search engine) for intranet search. Not happy with your corporate search engine? Not surprising, very few people are. To the best of our knowledge, there are no intranet engines that work as well as the Google's global Web search does. There is a fundamental reason for this: the algorithms used by Google on the global Web (or similar) do not work nearly as well on intranets for the lack of statistical data. Arch (finally!) solves this problem. It uses a novel method to deliver high precision search results that works great. Don't believe it? Blind test evaluation tools are included. You can deploy Arch and compare its performance to your current search engine and/or Google (on the public part of your site) using a blind test methodology.   In addition to the excellent search quality, Arch has many features critical for corporate environments:  - Document level security. Users can find only documents that they are authorized to see. - Inexpensive index updates. Arch is able to keep indexes up to date and avoid regular complete site recrawling. - 24/7 availabilty. There is always a working index available, even if a crawl fails. - Support for simultaneous indexing and search of multiple web sites, with ability to search and administer any site separately, if needed. Dynamic adding and removal of web sites is easy. - An automatically generated site directory. - Low cost support once deployed. - Double interface (PHP and Java) for easy deployment and customization. Use the one that better matches your skills. - An extensive and extensible set of parsers for parsing a variety of file formats: HTML, PHP, PDF, MS Office, Open Office, etc. - A modular, plugin-based architecture that can be easily customized and extended. - The source code is included. - High performance and scalability. Arch can run on computer clusters to index very large data sets.
Home:New Listings:Top Listings:Add New Listing:My Account
Home : Tools and Utilities : Search Engines
Arch Search Engine
Report Listing Error
Arch is an open source extension of Apache Nutch (a popular, highly scalable general purpose search engine) for intranet search. Not happy with your corporate search engine? Not surprising, very few people are. To the best of our knowledge, there are no intranet engines that work as well as the Google's global Web search does. There is a fundamental reason for this: the algorithms used by Google on the global Web (or similar) do not work nearly as well on intranets for the lack of statistical data. Arch (finally!) solves this problem. It uses a novel method to deliver high precision search results that works great. Don't believe it? Blind test evaluation tools are included. You can deploy Arch and compare its performance to your current search engine and/or Google (on the public part of your site) using a blind test methodology.

In addition to the excellent search quality, Arch has many features critical for corporate environments:

- Document level security. Users can find only documents that they are authorized to see.
- Inexpensive index updates. Arch is able to keep indexes up to date and avoid regular complete site recrawling.
- 24/7 availabilty. There is always a working index available, even if a crawl fails.
- Support for simultaneous indexing and search of multiple web sites, with ability to search and administer any site separately, if needed. Dynamic adding and removal of web sites is easy.
- An automatically generated site directory.
- Low cost support once deployed.
- Double interface (PHP and Java) for easy deployment and customization. Use the one that better matches your skills.
- An extensive and extensible set of parsers for parsing a variety of file formats: HTML, PHP, PDF, MS Office, Open Office, etc.
- A modular, plugin-based architecture that can be easily customized and extended.
- The source code is included.
- High performance and scalability. Arch can run on computer clusters to index very large data sets.
Detailed Information:
Title Arch Search Engine
Added by Arkadi Kosmynin
Date Added Thursday, October 07, 2010
Date Updated Thursday, October 07, 2010
User Rating
PHP, ASP .NET, JSP Scripts, Resources, Reviews
Click to Read Reviews
User
Rating
Add this rating box to your site
It's a fantastic way to gain your visitor’s confidence in your products or services.
Hits 109
Votes 0
Reviews 0
Size Over 5MB
Version 1.2
Platform Linux / Windows / FreeBSD / MacOSX
Database None
Demo http://www.atnf.csiro.au/computing/software/arch/
Download http://www.atnf.csiro.au/computing/software/arch/arch-src.zip
Website Address http://www.atnf.csiro.au/computing/software/arch/
Listing ID 14925

Share:
Blink List del.icio.us Digg Facebook Furl Google Ma.gnolia Mixx Reddit Stumble Upon Technorati Twitter Windows Live Yahoo! MyWeb Slash Dot Squidoo ASK Netscape News Vine SupaScripts Bookmark

License Information:
Licence #1
Type: FreewarePrice: Free

Reviews:
Be the first to write a review.



Add Listing New Listings:Top Listings:Modify Listing:My Favorites:Getting Rated
Sign Up:Login:Link to us:Help/Support:Newsletter:Terms:Privacy:Contact us
Add Your Site
Link Partners

© 2012 SupaScripts.com


Online Gift Shop . Wholesale Souvenirs . Souvenirs from London . Indian Jewelry
Union Jack Shop . Muslim Directory . Baby Products . Indian Jewellery . Bangladesh News