DocFetcher

DocFetcher is an open source desktop search application. It is written in Java and runs on Windows, Mac OS X and Linux.[1] Its graphical user interface is built with the Standard Widget Toolkit (SWT),[2] and indexing and search are based on Apache Lucene,[2] a widely used open source search engine library.

DocFetcher
Developer(s): DocFetcher project
Stable release: 1.1.22 / July 30, 2018
Written in: Java
Operating system: MS Windows, Mac OS X, Linux
License: Eclipse Public License
Website: https://sourceforge.net/projects/docfetcher/

Features

  • Unicode support
  • Full text search for all major document file formats, including:
    • Office files (Microsoft Office, OpenOffice, Outlook (PST), ...)
    • EPUB, PDF
    • RTF, SVG and any other plain text files
    • Audio metadata (MP3, FLAC)
    • Picture metadata (JPEG)
    • Archive formats (ZIP, 7z, RAR, Tar). Also supports nested archive files
    • HTML with pair detection: DocFetcher detects when an HTML file and a folder containing the page's resource files (images, scripts, ...) belong together. Such resource folders are usually created when a web page is saved from a browser.
  • Optional automatic detection of file changes, with the index updated accordingly
  • Exclusion of files from indexing based on regular expressions
  • A query language supporting boolean operators (OR, AND, NOT), wildcards, phrase search, fuzzy search and proximity search
  • Translations into Chinese, Italian and Ukrainian, plus partial translations into French, Japanese, Spanish and German.[3]
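
The regular-expression exclusion mentioned in the feature list can be illustrated with a minimal Java sketch. This is not DocFetcher's actual implementation: the class name ExclusionFilter, its methods, and the choice to match each pattern against the whole file name (rather than the full path) are assumptions made purely for illustration, using only the standard java.util.regex API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not part of DocFetcher: skips files whose names
// match any of a set of user-supplied regular expressions.
public class ExclusionFilter {
    private final List<Pattern> patterns = new ArrayList<>();

    public ExclusionFilter(List<String> regexes) {
        for (String regex : regexes) {
            patterns.add(Pattern.compile(regex));
        }
    }

    // Assumption: a pattern must match the entire file name, not a substring.
    public boolean isExcluded(String fileName) {
        for (Pattern pattern : patterns) {
            if (pattern.matcher(fileName).matches()) {
                return true;
            }
        }
        return false;
    }

    // Returns the file names that would still be indexed, in input order.
    public List<String> filter(List<String> fileNames) {
        List<String> kept = new ArrayList<>();
        for (String name : fileNames) {
            if (!isExcluded(name)) {
                kept.add(name);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Exclude temporary files and Office lock files (names starting with "~$").
        ExclusionFilter filter = new ExclusionFilter(
                Arrays.asList(".*\\.tmp", "~\\$.*"));
        List<String> files = Arrays.asList(
                "report.docx", "notes.tmp", "~$report.docx", "paper.pdf");
        System.out.println(filter.filter(files));
    }
}
```

Matching against the whole name keeps patterns predictable: a pattern such as .*\.tmp excludes only files that end in .tmp, whereas substring matching would also exclude a name like a.tmp.bak.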

References

  1. "DocFetcher homepage". Retrieved 2016-12-04.
  2. "start - DocFetcher-Wiki". docfetcher.sourceforge.net. Retrieved 2020-05-18.
  3. "DocFetcher translation on Transifex". www.transifex.com. Retrieved 2020-02-11.