Apache Tika

Extract text and metadata from virtually any document type

Free, Open Source, Apache-2.0, CLI, API

Description

Content-analysis toolkit that takes virtually any document (PDF, Office, images, archives) and extracts its embedded text and metadata, along with the detected MIME type and language. Investigators use it to pull authorship, timestamps, and hidden metadata out of files at scale.
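
As a rough illustration of the API side, the sketch below sends one file to a Tika server over HTTP and collects its MIME type, metadata, and plain text. It assumes a tika-server instance is already listening on http://localhost:9998 and uses that server's `detect/stream`, `meta`, and `tika` endpoints; the helper names (`build_request`, `call`, `analyze`) are hypothetical and chosen only for this example.

```python
import json
import urllib.request

# Assumption: a tika-server instance is running locally on its default port.
TIKA_SERVER = "http://localhost:9998"


def build_request(endpoint, payload, accept, base_url=TIKA_SERVER):
    """Build (but do not send) a PUT request for one tika-server endpoint."""
    return urllib.request.Request(
        url=f"{base_url}/{endpoint}",
        data=payload,
        method="PUT",
        headers={
            "Accept": accept,
            # Generic type so the server detects the real format itself.
            "Content-Type": "application/octet-stream",
        },
    )


def call(endpoint, payload, accept, base_url=TIKA_SERVER):
    """Send the request and return the response body as text."""
    request = build_request(endpoint, payload, accept, base_url)
    with urllib.request.urlopen(request, timeout=60) as response:
        return response.read().decode("utf-8", errors="replace")


def analyze(path, base_url=TIKA_SERVER):
    """Return the detected MIME type, metadata, and extracted text for a file."""
    with open(path, "rb") as handle:
        payload = handle.read()
    return {
        "mime_type": call("detect/stream", payload, "text/plain", base_url).strip(),
        "metadata": json.loads(call("meta", payload, "application/json", base_url)),
        "text": call("tika", payload, "text/plain", base_url),
    }
```

With a server running, `analyze("report.pdf")` (a placeholder file name) would return a dictionary whose `metadata` entry holds whatever fields Tika found in the file, such as author or creation date when the format records them. Building the request separately from sending it keeps the network call easy to swap out or test.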
