You can find the latest release on the download page. Please see the Getting Started page for more information on how to start using Tika. The Parser and Detector pages describe th
Apache Tika (TM) is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. Tika is a project of the Apa
Apache Tika uses the Bouncy Castle generic encryption libraries for extracting text content and metadata from encrypted PDF files. See https://www.bouncycastle.org/ for more detail