CVE-2025-66516
Apache Tika core, Apache Tika parsers, Apache Tika PDF parser module: Update to CVE-2025-54988 to expand scope of artifacts affected
In short
Apache Tika fails to safely process PDF files containing malicious XML data, allowing attackers to read sensitive files or execute commands on affected systems. This flaw affects multiple Tika components across different versions.
Technical detail
XXE (XML External Entity) injection vulnerability in Apache Tika's PDF parsing logic triggered via crafted XFA (XML Forms Architecture) content within PDF files. Exploitation requires only a specially crafted PDF file and no authentication; impact includes arbitrary file disclosure and potential remote code execution. The vulnerability persists in tika-core across versions 1.13-3.2.1 even if tika-pdf-module is patched, and affects tika-parsers 1.x releases.
Summary generated and translated by AI from the official description.
Critical XXE in Apache Tika tika-core (1.13-3.2.1), tika-pdf-module (2.0.0-3.2.1) and tika-parsers (1.13-1.28.5) modules on all platforms allows an attacker to carry out XML External Entity injection via a crafted XFA file inside of a PDF.
This CVE covers the same vulnerability as in CVE-2025-54988. However, this CVE expands the scope of affected packages in two ways.
First, while the entrypoint for the vulnerability was the tika-parser-pdf-module as reported in CVE-2025-54988, the vulnerability and its fix were in tika-core. Users who upgraded the tika-parser-pdf-module but did not upgrade tika-core to >= 3.2.2 would still be vulnerable.
Second, the original report failed to mention that in the 1.x Tika releases, the PDFParser was in the "org.apache.tika:tika-parsers" module.
CVSS:3.1/AV:L/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H
Affected products
Apache Software Foundation · Apache Tika coreApache Software Foundation · Apache Tika parsersApache Software Foundation · Apache Tika PDF parser moduleWant to know if your infrastructure is exposed to this?
Talk to TrueHacking →